|
|
||||||||
1 Department of Family Practice and Community Medicine, University of Kentucky, Lexington, Ky
2 Department of Statistics, University of Kentucky, Lexington, Ky
CORRESPONDING AUTHOR: Shersten Killip, MD, MPH, K-302 Kentucky Clinic 0284, 740 S. Limestone, Lexington, KY 40536-0284, skill2{at}email.uky.edu
| ABSTRACT |
|---|
|
|
|---|
METHODS Using a case study with accompanying equations, we show that clustered samples are not as statistically efficient as simple random samples.
RESULTS Similarity among subjects within preexisting groups or clusters reduces the variability of responses in a clustered sample, which erodes the power to detect true differences between study arms. This similarity is expressed by the intracluster correlation coefficient, or
(rho), which compares the within-group variance with the between-group variance. Rho is used in equations along with the cluster size and the number of clusters to calculate the effective sample size (ESS) in a clustered design. The ESS should be used to calculate power in the design phase of a clustered study. Appropriate accounting for similarities among subjects in a cluster almost always results in a net loss of power, requiring increased total subject recruitment. Increasing the number of clusters enhances power more efficiently than does increasing the number of subjects within a cluster.
CONCLUSIONS Primary care research frequently uses clustered designs, whether consciously or unconsciously. Researchers must recognize and understand the implications of clusters to avoid costly sample size errors.
Key Words: Statistics cluster analysis data interpretation, research design primary care practice-based research methods/quantitative theory
| INTRODUCTION |
|---|
|
|
|---|
This article will use a case study to introduce the concepts involved in cluster sampling. It is intended as an introduction to the concepts and language of cluster sampling; researchers are encouraged to consult a statistician familiar with cluster sampling to help in the design and analysis phases of clustered studies. The goal is to raise the awareness of cluster sampling issues among primary care researchers and to help primary care researchers design and publish statistically rigorous findings.
| DEFINITION AND EXPLANATION OF CLUSTERED DESIGNS |
|---|
|
|
|---|
What happened? Most statistical methodologies were designed to analyze data that is both selected and analyzed on the same level. Clustered data result when some preexisting group structure is used to select study participants, but the researcher is interested in the individual level data. Clustered designs can be used for many reasons, but they always cause some loss of statistical efficiency as a result of the "relatedness" within the preexisting groups. Primary care research, which often studies patients from multiple private practices, can produce clustered results by selecting groups of patients at the practice (or practitioner) level, then analyzing the data at the individual patient level, as in our case study.
Why does clustering erode statistical power? Consider the nature of preexisting groups. Most groups form because of some kind of selection factors. Among patients who all see the same primary care physician, there can be many similarities that may include geographic, socioeconomic, racial, ethnic, sexual, religious, political, or age-related similarities, stemming from the propensity of patients to choose a physician with whom they identify. All of these factors can have some impact on the average response of one physicians patients compared with anothers.
The responses of persons selected by any or all of the factors mentioned above tend to be more similar to one anothers than the responses of a group of individuals selected truly at random. Because these responses are similar, they lead to a decrease in the variation among responses of persons in the same cluster, or the variance of the within-cluster responses. This similarity among responses within a group can magnify the apparent differences in outcomes or responses between groups, and they must be taken into account. Adjustment for clustering thus results in a reduction of the effective sample size.
In the case study, solo practitioners were chosen at the physician level to keep the discussion simple. Figure 1
illustrates the design of the study. Physicians who choose to work together, however, share similarities just as their patients do, and these similarities must be taken into account. Using physicians who worked together would have introduced a third level, the practice level, to our study. This concept is illustrated in Figure 2
. Multilevel clustering is termed nesting, and there are specific statistics to deal with that mathematical situation. Clustering is a specific term for the simplest type of nesting, using only 2 levels of data, as shown in Figure 1
.
|
|
| ADVANTAGES AND DISADVANTAGES OF CLUSTERED DESIGNS |
|---|
|
|
|---|
Clustering is the design of choice to avoid a phenomenon known as contamination. In our example, asking the same physician to present 2 entirely different counseling scripts to his or her patients is impractical; the physician will likely get confused. Also, patients of the same physician may be acquainted. Some patients could pass on a version of the counseling to other patients. As a result, the loss of efficiency from clustering is necessary to preserve the integrity of the intervention.
THE INTRACLUSTER CORRELATION COEFFICIENT, OR
|
|---|
|
|
|---|
(the Greek rho), is a measure of the relatedness of clustered data. It accounts for the relatedness of clustered data by comparing the variance within clusters with the variance between clusters. Mathematically, it is the between-cluster variability divided by the sum of the within-cluster and between-cluster variabilities. Equation 1: *
![]()
where sb2 = the variance between clusters, and sw2 = the variance within clusters.
Values of
range from 0 to 1 in human studies. From equation 1, as the within-cluster variance (sw2) moves toward 0,
gets closer and closer to 1. In the theoretical case where
= 1, all responses within a cluster are identical. In that case the effective sample size is reduced to the number of clusters.
A very small value for
implies that the within-cluster variance is much greater than the between-cluster variance, and a
of 0 shows that there is no correlation of responses within a cluster. Usually, values of r are between 0.01 and 0.02 in human studies.24 The calculation of
usually requires a pilot study. We encourage all investigators to publish their
values, which will (eventually) aid in being able to estimate
for a given type of population.
| EFFECTIVE SAMPLE SIZE AND THE DESIGN EFFECT |
|---|
|
|
|---|
To get the effective sample size, the total sample size (the number of patients per cluster times the number of clusters) is divided by a correction factor that includes
and the sample size per cluster (m). This correction factor is called the design effect. In the case study above, we created the special case of clustered data with all groups having the same number of subjects (each physician recruited 32 patients). In this special case:
Equation 2:
![]()
and Equation 3:
![]() |
where m = number of subjects in a cluster, k = number of clusters, mk = total number of subjects in a clustered study, ESS = effective sample size, DE = design effect, and
= intracluster correlation coefficient (see equation 1).
If
= 0, then the design effect = 1, and the sample size is unaffected. If
> 0, even if it is still very small, the design effect may be magnified by a large cluster size (m). This would then reduce the effective sample size of the study (see equation 2). If
= 1, the design effect (equation 2) is 1, and the effective sample size therefore reduces to k, the number of clusters.
These equations can be reversed in the planning phase to calculate correctly the total sample size needed for a clustered study. All power calculations and resultant sample size estimates can be calculated initially using usual formulas for a clustered study, which will give researchers the effective sample size. Equation 2 can be used to find mk, or the total required sample size, given the effective sample size and design effect
THE EFFECT OF AND THE DESIGN EFFECT ON POWER AND SAMPLE SIZE CALCULATIONS
|
|---|
|
|
|---|
and the design effect on sample size and power, we will do a sample calculation. Using our case study, we have 4 physicians recruiting 32 patients each. Let us say that
= 0.017 in this case. What is the effective sample size after adjusting for clustering?
If m = 32, k = 4, and
= 0.017:

Note that despite the small value for
, the design effect came out to 1.527. This reduced our effective sample size to 84 compared with the 128 subjects actually enrolled in the trial, which explains why the power was only 61%.
If we change the numbers for m and k, we can show that the magnitude of the design effect is highly dependent on m, the number of patients in a cluster. Table 1
illustrates the changes in effective sample size and power for our example as we vary m and k but hold the product mk constant. Table 2
shows the effect of increasing m while holding k constant. Note the increasing design effect as m increases and its effect on the effective sample size; the investigator would have needed to recruit almost 80 patients per physician (320 total subjects) to adequately power his study with only 4 physicians. Table 3
shows that by increasing the number of physicians he enrolled in his study to 16, he would only have needed a total of 160 subjects to reach 80% power.
|
|
|
| SUMMARY |
|---|
|
|
|---|
, is a measure of relatedness of responses within a cluster. In human studies it is usually small, but in the design effect it is magnified by the number of elements in the cluster (m). The smaller the design effect, the larger the effective sample size. A high k (number of clusters) and a low m (number of elements within a cluster) give the smallest design effect. When designing studies, increasing clusters (k) will increase the studys power more than increasing the elements in the clusters (m). Standard formulas can be used to calculate sample sizes in clustered situations, but the resulting effective sample size (ESS) must then be adjusted using the design effect (DE) to find the total required sample size.
| FOOTNOTES |
|---|
Funding support: This work was supported by grant # 1 D14 HP 00041 from the Health Resources and Services Administration.
* For equal cluster size, a weighted average is needed to adjust this formula.1 ![]()
Received for publication August 18, 2003. Revision received December 29, 2003. Accepted for publication January 20, 2004.
| REFERENCES |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
H Bentzen, A Bergland, and L Forsen Risk of hip fractures in soft protected, hard protected, and unprotected falls Inj. Prev., October 1, 2008; 14(5): 306 - 310. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Brown, M. D. Sobsey, and D. Loomis Local Drinking Water Filters Reduce Diarrheal Disease in Cambodia: A Randomized, Controlled Trial of the Ceramic Water Purifier Am J Trop Med Hyg, September 1, 2008; 79(3): 394 - 400. [Abstract] [Full Text] [PDF] |
||||
![]() |
C Brown, T Hofer, A Johal, R Thomson, J Nicholl, B D Franklin, and R J Lilford An epistemology of patient safety research: a framework for study design and interpretation. Part 2. Study design Qual. Saf. Health Care, June 1, 2008; 17(3): 163 - 169. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. van Weel, E. van Weel-Baumgarten, and J. Mold The Importance of Longitudinal Studies in Family Medicine: Experiences of Two Practice-based Research Networks J Am Board Fam Med, January 1, 2006; 19(1): 69 - 74. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. M. Chen, G. E. Fryer Jr., and T. E. Norris Effects of Comorbidity and Clustering upon Referrals in Primary Care J Am Board Fam Med, November 1, 2005; 18(6): 449 - 452. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. C. Stange and W. L. Miller In This Issue: The Patient Voice, Clinical Research, Clustered Data, and the Wonca Research Conference Ann. Fam. Med, May 1, 2004; 2(3): 194 - 197. [Full Text] [PDF] |
||||
![]() |
S. J. Zyzanski, S. A. Flocke, and L. M. Dickinson On the Nature and Analysis of Clustered Data Ann. Fam. Med, May 1, 2004; 2(3): 199 - 200. [Full Text] [PDF] |
||||
Read all TRACK Comments
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |