Microarray experiments are often performed to detect differently expressed genes among different clinical phenotypes. The method used to calculate the appropriate sample size for this purpose differs from the sample size calculation used for general clinical experiments, because microarrays include tens of thousands of genes. We proposed a sample size calculation method that considers variance among an entire gene set and used the Bonferroni correction to address the multiplicity problem. Specifically, by adjusting for the multiplicity problem, the existing equation for sample size calculation was modified based on the Bonferroni correction. By k-means cluster analysis, the variances across all genes can be divided into several groups with similar values, and the sample sizes for each group were subsequently calculated and weight-averaged. The results of this study show that the sample size was related to the number of genes on a chip. The weighted sample size, calculated by the proposed method, preserved the Type I error for selection of significant genes within a microarray data set.
Bibliographical noteFunding Information:
This study was supported by the Korea Research Foundation (KRF-2008-005-J00803) and the Korea Health 21 R&D Project, Ministry of Health & Welfare (0405-BC01-0604-0002).
All Science Journal Classification (ASJC) codes
- Applied Microbiology and Biotechnology