Home » Technical Topics » Data Science

An Introduction to Statistical Sampling

dreamstime_s_97267612_780x480

Image Source: Statistical Aid

Sampling is a statistical procedure of selecting some representative part from an existing population or study area. Specifically, draw a sample from the study population using some statistical methods. For example-
if we want to calculate the average age of Bangladeshi people then we can not deal with the whole population. In that time we must have to deal with some representative part of this population. This representative part is called sample and the procedure is called sampling.

Why need sampling

—  It makes possible the study of a large population which contains different characteristics.
—  It is for economy.
—  It is for speed.
—  It is for accuracy.
—  It saves the sources of data from being all consumed.
Sometimes we can€™t work with population such as blood test, in that situation sampling is must.
 

Types 

Probability Sampling

It is based on the concept of random selection where each population elements have a non-zero chance to occur as a sample. Sampling techniques can be divided into two categories: probability and non-probability. Randomization or chance is the core of probability sampling techniques.
For example, if a researcher is dealing with a population of 100 people, each person in the population would have the odds of 1 out of 100 for being chosen. This differs from non-probability sampling, in which each member of the population would not have the same odds of being selected.

Different types of probability sampling

 
Applications
·    In opinion poll, a relatively small number of persons are interviewed and their opinions on current issues are solicited in order to discover the attitude of the community as a whole.
·    At border stations, customs officers enforce the laws by checking the effects of only a small number of travelers crossing the border.
·    A departmental store wises to examine whether it is losing or gaining customers by drawing a sample from its lists of credit card holders by selecting every tenth name.
·    In a manufacturing company, a quality control officer take one sample from every lot and if any sample is damage then he reject that lot.
Advantages
—  Creates samples that are highly representative of the population.
—  Sampling bias is tens to zero.
—  Higher level of reliability of research findings.
—  Increased accuracy of sample error estimation.
—  The possibility to make inferences about the population.

Disadvantages

—  Higher complexity compared to non-probability sample.
—  More time consuming, especially when creating larger sample.
—  Usually more expensive.

Non-Probability sampling

The process of selecting a sample from a population without using statistical probability theory is called non-probability sampling.
Example
Lets say that the university has roughly 10000 students. These 10000 students are our population (N). Each of the 10000 students is known as a unit, but its hardly possible to get known and select every student randomly.
Here we can use Non-Random selection of sample to produce a result.
 

Applications

 
·   It can be used when demonstrating that a particular trait exist in the population.
·    It can also be useful when the researcher has limited budget, time and workforce.
 

Advantages

·        Select samples purposively
·        Enable researchers to reach difficult to identify members of the population.
·        Lower cost
·        Limited time.
 

Disadvantage

Difficult to make valid inference about the entire population because the sample selected is not representative.
We cannot calculate confidence interval.