Determining sample sizes is a challenging undertaking. For simplicity, I've limited this picture to the one of the most common testing situation: testing for differences in means. Some assumptions have been made (for example, normality and equal sample sizes). Additionally, the formulas shown are for one-tailed tests. Usually, a small tweak (e.g. replacing Z_{α} by Z_{α}/2) is all that's required for two-tailed tests.
Click on the picture to zoom in
The picture shows the traditional (frequentist) route for determining sample size. Another possible route is to use Bayesian methods. Although becoming more popular, none of the major software programs include those methods and--unlike the frequentist route—no standard Bayesian methods exist for determining sample size. If you're interested in Bayesian methods, refer to Chapter 13 in Chow, et al, 2008.
Chow et. al (2017). Sample Size Calculations in Clinical Research. CRC Press
Ryan, T. (2013). Sample Size Determination and Power. Wiley.
DSC Resources
Comment
@Frank Deruyck Ah, that's where it gets complicated! You can't solve for n directly (as you noted). If you use the equation above it (for known std dev) to find "n", you'd have an estimate, but it would be an underestimate. So at this point you'd have a couple of choices: use the calculated "Z" sample size equation as a baseline, and guess a but larger. Or, use software to get "n", then use the equation to check the software's solution. Either way, it's really a guesstimate scenario, because if your population standard deviation is unknown, you're entering the statistical unknown.
@John Williams
I can see why you might think that! However, my intent was merely to show the process with an example. It can be tweaked slightly for many more situations. For example, the process for comparing variances is very, very similar.
Hi Stephanie, in sample size calculation with unknow standard deviation a t value needs to be specified depending on n however this n is unknown and needs to be computed so how to specify the this t(alpha,n-1) value?
© 2020 Data Science Central ® Powered by
Badges | Report an Issue | Privacy Policy | Terms of Service
Most Popular Content on DSC
To not miss this type of content in the future, subscribe to our newsletter.
Other popular resources
Archives: 2008-2014 | 2015-2016 | 2017-2019 | Book 1 | Book 2 | More
Most popular articles
You need to be a member of Data Science Central to add comments!
Join Data Science Central