Annals of Family Medicine
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
 QUICK SEARCH:   [advanced]


     


Annals of Family Medicine 2:204-208 (2004)
© 2004 Annals of Family Medicine, Inc.
doi: 10.1370/afm.141

This Article
Right arrow Abstract
Right arrow Figures Only
Right arrow Full Text (PDF)
Right arrow Supplemental data: Appendix
Right arrow In Brief
Right arrow TRACK Discussion: Submit a Comment
Right arrow TRACK Discussion: View Comments
Right arrow Alert me when this article is cited
Right arrow Alert me when TRACK Comments are posted
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Killip, S.
Right arrow Articles by Pearce, K.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Killip, S.
Right arrow Articles by Pearce, K.

What Is an Intracluster Correlation Coefficient? Crucial Concepts for Primary Care Researchers

Shersten Killip, MD, MPH1, Ziyad Mahfoud, PhD2 and Kevin Pearce, MD, MPH1

1 Department of Family Practice and Community Medicine, University of Kentucky, Lexington, Ky
2 Department of Statistics, University of Kentucky, Lexington, Ky

CORRESPONDING AUTHOR: Shersten Killip, MD, MPH, K-302 Kentucky Clinic 0284, 740 S. Limestone, Lexington, KY 40536-0284, skill2{at}email.uky.edu


    ABSTRACT
 TOP
 ABSTRACT
 INTRODUCTION
 DEFINITION AND EXPLANATION OF...
 ADVANTAGES AND DISADVANTAGES OF...
 THE INTRACLUSTER CORRELATION...
 EFFECTIVE SAMPLE SIZE AND...
 THE EFFECT OF {rho}...
 SUMMARY
 REFERENCES
 For Further Reading:
 
BACKGROUND Primary care research often involves clustered samples in which subjects are randomized at a group level but analyzed at an individual level. Analyses that do not take this clustering into account may report significance where none exists. This article explores the causes, consequences, and implications of cluster data.

METHODS Using a case study with accompanying equations, we show that clustered samples are not as statistically efficient as simple random samples.

RESULTS Similarity among subjects within preexisting groups or clusters reduces the variability of responses in a clustered sample, which erodes the power to detect true differences between study arms. This similarity is expressed by the intracluster correlation coefficient, or {rho} (rho), which compares the within-group variance with the between-group variance. Rho is used in equations along with the cluster size and the number of clusters to calculate the effective sample size (ESS) in a clustered design. The ESS should be used to calculate power in the design phase of a clustered study. Appropriate accounting for similarities among subjects in a cluster almost always results in a net loss of power, requiring increased total subject recruitment. Increasing the number of clusters enhances power more efficiently than does increasing the number of subjects within a cluster.

CONCLUSIONS Primary care research frequently uses clustered designs, whether consciously or unconsciously. Researchers must recognize and understand the implications of clusters to avoid costly sample size errors.

Key Words: Statistics • cluster analysis • data interpretation, research design • primary care • practice-based research • methods/quantitative • theory


    INTRODUCTION
 TOP
 ABSTRACT
 INTRODUCTION
 DEFINITION AND EXPLANATION OF...
 ADVANTAGES AND DISADVANTAGES OF...
 THE INTRACLUSTER CORRELATION...
 EFFECTIVE SAMPLE SIZE AND...
 THE EFFECT OF {rho}...
 SUMMARY
 REFERENCES
 For Further Reading:
 
Clustered samples are not as statistically efficient as simple random samples. Similarities among subjects in clusters can reduce the variability of responses from a cluster compared with those expected from a simple random sample. If statistics meant for simple random samples are used to design and analyze clustered studies, they will result in overestimation of the effective sample size. This issue is important for primary care research, because the design of many primary care research studies creates clusters.

This article will use a case study to introduce the concepts involved in cluster sampling. It is intended as an introduction to the concepts and language of cluster sampling; researchers are encouraged to consult a statistician familiar with cluster sampling to help in the design and analysis phases of clustered studies. The goal is to raise the awareness of cluster sampling issues among primary care researchers and to help primary care researchers design and publish statistically rigorous findings.


    DEFINITION AND EXPLANATION OF CLUSTERED DESIGNS
 TOP
 ABSTRACT
 INTRODUCTION
 DEFINITION AND EXPLANATION OF...
 ADVANTAGES AND DISADVANTAGES OF...
 THE INTRACLUSTER CORRELATION...
 EFFECTIVE SAMPLE SIZE AND...
 THE EFFECT OF {rho}...
 SUMMARY
 REFERENCES
 For Further Reading:
 
Case Study
A clinical trial was designed to evaluate the impact of physician advice on condom use. The outcome was patient-reported use of condoms 6 months after randomization to a control group or a counseling group. The investigator avoided contamination by randomizing the physicians to be control physicians or counseling physicians (randomizing at the physician level) but wanted to analyze data at the patient level. Four physicians in solo practice were recruited; 2 to counsel, and 2 to be controls. A sample-size calculation was done, which suggested that for a 2-sample t test, a minimum effect size of 0.5, and a significance level of .05, 128 patients total would be necessary to achieve a power of 80%. Each physician was therefore asked to recruit 32 patients. When the paper was submitted to a journal, the paper was rejected for "erroneous statistics" and "inadequate power: 61%."

What happened? Most statistical methodologies were designed to analyze data that is both selected and analyzed on the same level. Clustered data result when some preexisting group structure is used to select study participants, but the researcher is interested in the individual level data. Clustered designs can be used for many reasons, but they always cause some loss of statistical efficiency as a result of the "relatedness" within the preexisting groups. Primary care research, which often studies patients from multiple private practices, can produce clustered results by selecting groups of patients at the practice (or practitioner) level, then analyzing the data at the individual patient level, as in our case study.

Why does clustering erode statistical power? Consider the nature of preexisting groups. Most groups form because of some kind of selection factors. Among patients who all see the same primary care physician, there can be many similarities that may include geographic, socioeconomic, racial, ethnic, sexual, religious, political, or age-related similarities, stemming from the propensity of patients to choose a physician with whom they identify. All of these factors can have some impact on the average response of one physician’s patients compared with another’s.

The responses of persons selected by any or all of the factors mentioned above tend to be more similar to one another’s than the responses of a group of individuals selected truly at random. Because these responses are similar, they lead to a decrease in the variation among responses of persons in the same cluster, or the variance of the within-cluster responses. This similarity among responses within a group can magnify the apparent differences in outcomes or responses between groups, and they must be taken into account. Adjustment for clustering thus results in a reduction of the effective sample size.

In the case study, solo practitioners were chosen at the physician level to keep the discussion simple. Figure 1Go illustrates the design of the study. Physicians who choose to work together, however, share similarities just as their patients do, and these similarities must be taken into account. Using physicians who worked together would have introduced a third level, the practice level, to our study. This concept is illustrated in Figure 2Go. Multilevel clustering is termed nesting, and there are specific statistics to deal with that mathematical situation. Clustering is a specific term for the simplest type of nesting, using only 2 levels of data, as shown in Figure 1Go.



View larger version (19K):
[in this window]
[in a new window]
 
Figure 1. Two-level nesting, or clustering.

 


View larger version (24K):
[in this window]
[in a new window]
 
Figure 2. Three-level nesting.

 

    ADVANTAGES AND DISADVANTAGES OF CLUSTERED DESIGNS
 TOP
 ABSTRACT
 INTRODUCTION
 DEFINITION AND EXPLANATION OF...
 ADVANTAGES AND DISADVANTAGES OF...
 THE INTRACLUSTER CORRELATION...
 EFFECTIVE SAMPLE SIZE AND...
 THE EFFECT OF {rho}...
 SUMMARY
 REFERENCES
 For Further Reading:
 
While the loss of statistical efficiency and the need to recruit more study participants are clear disadvantages of clustered studies, there are some advantages to clustering. Clustering is often used for practical reasons when a simple random sample would be unrealistic. For example, a random survey of all patients in a given area would be extremely difficult. A clustered survey of randomly chosen patients within primary care practices is much more practical.

Clustering is the design of choice to avoid a phenomenon known as contamination. In our example, asking the same physician to present 2 entirely different counseling scripts to his or her patients is impractical; the physician will likely get confused. Also, patients of the same physician may be acquainted. Some patients could pass on a version of the counseling to other patients. As a result, the loss of efficiency from clustering is necessary to preserve the integrity of the intervention.


    THE INTRACLUSTER CORRELATION COEFFICIENT, OR {rho}
 TOP
 ABSTRACT
 INTRODUCTION
 DEFINITION AND EXPLANATION OF...
 ADVANTAGES AND DISADVANTAGES OF...
 THE INTRACLUSTER CORRELATION...
 EFFECTIVE SAMPLE SIZE AND...
 THE EFFECT OF {rho}...
 SUMMARY
 REFERENCES
 For Further Reading:
 
The intracluster correlation coefficient (ICC) ,or {rho} (the Greek rho), is a measure of the relatedness of clustered data. It accounts for the relatedness of clustered data by comparing the variance within clusters with the variance between clusters. Mathematically, it is the between-cluster variability divided by the sum of the within-cluster and between-cluster variabilities.

Equation 1: *


where sb2 = the variance between clusters, and sw2 = the variance within clusters.

Values of {rho} range from 0 to 1 in human studies. From equation 1, as the within-cluster variance (sw2) moves toward 0, {rho} gets closer and closer to 1. In the theoretical case where {rho} = 1, all responses within a cluster are identical. In that case the effective sample size is reduced to the number of clusters.

A very small value for {rho} implies that the within-cluster variance is much greater than the between-cluster variance, and a {rho} of 0 shows that there is no correlation of responses within a cluster. Usually, values of r are between 0.01 and 0.02 in human studies.2–4 The calculation of {rho} usually requires a pilot study. We encourage all investigators to publish their {rho} values, which will (eventually) aid in being able to estimate {rho} for a given type of population.


    EFFECTIVE SAMPLE SIZE AND THE DESIGN EFFECT
 TOP
 ABSTRACT
 INTRODUCTION
 DEFINITION AND EXPLANATION OF...
 ADVANTAGES AND DISADVANTAGES OF...
 THE INTRACLUSTER CORRELATION...
 EFFECTIVE SAMPLE SIZE AND...
 THE EFFECT OF {rho}...
 SUMMARY
 REFERENCES
 For Further Reading:
 
In accounting for the similarities among clustered subjects, there is a net loss of independent data. The effective sample size is the term used to describe the sample size in clustered samples compared with the number of subjects actually enrolled in the study. For example, if you have 4 physicians’ offices (from the case study above) enrolling 32 patients each, you have 128 subjects in your study. Depending on the intracluster correlation coefficient and the design effect, however, you may effectively have far fewer subjects enrolled in your trial from a statistical perspective.

To get the effective sample size, the total sample size (the number of patients per cluster times the number of clusters) is divided by a correction factor that includes {rho} and the sample size per cluster (m). This correction factor is called the design effect. In the case study above, we created the special case of clustered data with all groups having the same number of subjects (each physician recruited 32 patients). In this special case:

Equation 2:


and Equation 3:


where m = number of subjects in a cluster, k = number of clusters, mk = total number of subjects in a clustered study, ESS = effective sample size, DE = design effect, and {rho} = intracluster correlation coefficient (see equation 1).

If {rho} = 0, then the design effect = 1, and the sample size is unaffected. If {rho} > 0, even if it is still very small, the design effect may be magnified by a large cluster size (m). This would then reduce the effective sample size of the study (see equation 2). If {rho} = 1, the design effect (equation 2) is 1, and the effective sample size therefore reduces to k, the number of clusters.

These equations can be reversed in the planning phase to calculate correctly the total sample size needed for a clustered study. All power calculations and resultant sample size estimates can be calculated initially using usual formulas for a clustered study, which will give researchers the effective sample size. Equation 2 can be used to find mk, or the total required sample size, given the effective sample size and design effect


    THE EFFECT OF {rho} AND THE DESIGN EFFECT ON POWER AND SAMPLE SIZE CALCULATIONS
 TOP
 ABSTRACT
 INTRODUCTION
 DEFINITION AND EXPLANATION OF...
 ADVANTAGES AND DISADVANTAGES OF...
 THE INTRACLUSTER CORRELATION...
 EFFECTIVE SAMPLE SIZE AND...
 THE EFFECT OF {rho}...
 SUMMARY
 REFERENCES
 For Further Reading:
 
To illustrate the effect of {rho} and the design effect on sample size and power, we will do a sample calculation. Using our case study, we have 4 physicians recruiting 32 patients each. Let us say that {rho} = 0.017 in this case. What is the effective sample size after adjusting for clustering?

If m = 32, k = 4, and {rho} = 0.017:


Note that despite the small value for {rho}, the design effect came out to 1.527. This reduced our effective sample size to 84 compared with the 128 subjects actually enrolled in the trial, which explains why the power was only 61%.

If we change the numbers for m and k, we can show that the magnitude of the design effect is highly dependent on m, the number of patients in a cluster. Table 1Go illustrates the changes in effective sample size and power for our example as we vary m and k but hold the product mk constant. Table 2Go shows the effect of increasing m while holding k constant. Note the increasing design effect as m increases and its effect on the effective sample size; the investigator would have needed to recruit almost 80 patients per physician (320 total subjects) to adequately power his study with only 4 physicians. Table 3Go shows that by increasing the number of physicians he enrolled in his study to 16, he would only have needed a total of 160 subjects to reach 80% power.


View this table:
[in this window]
[in a new window]
 
Table 1. Effective Sample Size and Power Holding mk Constant
 

View this table:
[in this window]
[in a new window]
 
Table 2. Effective Sample Size and Power Holding k Constant
 

View this table:
[in this window]
[in a new window]
 
Table 3. Effective Sample Size and Power Holding m Constant
 

    SUMMARY
 TOP
 ABSTRACT
 INTRODUCTION
 DEFINITION AND EXPLANATION OF...
 ADVANTAGES AND DISADVANTAGES OF...
 THE INTRACLUSTER CORRELATION...
 EFFECTIVE SAMPLE SIZE AND...
 THE EFFECT OF {rho}...
 SUMMARY
 REFERENCES
 For Further Reading:
 
The intracluster correlation coefficient, or {rho}, is a measure of relatedness of responses within a cluster. In human studies it is usually small, but in the design effect it is magnified by the number of elements in the cluster (m). The smaller the design effect, the larger the effective sample size. A high k (number of clusters) and a low m (number of elements within a cluster) give the smallest design effect. When designing studies, increasing clusters (k) will increase the study’s power more than increasing the elements in the clusters (m). Standard formulas can be used to calculate sample sizes in clustered situations, but the resulting effective sample size (ESS) must then be adjusted using the design effect (DE) to find the total required sample size.


    FOOTNOTES
 
Conflict of interest: none reported

Funding support: This work was supported by grant # 1 D14 HP 00041 from the Health Resources and Services Administration.

* For equal cluster size, a weighted average is needed to adjust this formula.1 Back

Received for publication August 18, 2003. Revision received December 29, 2003. Accepted for publication January 20, 2004.


    REFERENCES
 TOP
 ABSTRACT
 INTRODUCTION
 DEFINITION AND EXPLANATION OF...
 ADVANTAGES AND DISADVANTAGES OF...
 THE INTRACLUSTER CORRELATION...
 EFFECTIVE SAMPLE SIZE AND...
 THE EFFECT OF {rho}...
 SUMMARY
 REFERENCES
 For Further Reading:
 

  1. Donner A, Klar N. Design and Analysis of Cluster Randomization Trials in Health Research. American ed. New York, NY: Oxford University Press; 2000:9,112–113.
  2. Murray DM, Rooney BL, Hannan PJ, et al. Intraclass correlation among common measures of adolescent smoking. Am J Epidemiol. 1992;140:1038–1050.
  3. Murray DM, Short BJ. Intraclass correlation among measures related to alcohol use by young adults. J Studies Alcohol. 1995;56: 681–694.[Medline]
  4. Murray DM, Short BJ. Intraclass correlation among measures related to alcohol use by adolescents Add Behav. 1997;22:1–12.



This article has been cited by other articles:


Home page
Inj. Prev.Home page
H Bentzen, A Bergland, and L Forsen
Risk of hip fractures in soft protected, hard protected, and unprotected falls
Inj. Prev., October 1, 2008; 14(5): 306 - 310.
[Abstract] [Full Text] [PDF]


Home page
Am J Trop Med HygHome page
J. Brown, M. D. Sobsey, and D. Loomis
Local Drinking Water Filters Reduce Diarrheal Disease in Cambodia: A Randomized, Controlled Trial of the Ceramic Water Purifier
Am J Trop Med Hyg, September 1, 2008; 79(3): 394 - 400.
[Abstract] [Full Text] [PDF]


Home page
Qual Saf Health CareHome page
C Brown, T Hofer, A Johal, R Thomson, J Nicholl, B D Franklin, and R J Lilford
An epistemology of patient safety research: a framework for study design and interpretation. Part 2. Study design
Qual. Saf. Health Care, June 1, 2008; 17(3): 163 - 169.
[Abstract] [Full Text] [PDF]


Home page
J Am Board Fam MedHome page
C. van Weel, E. van Weel-Baumgarten, and J. Mold
The Importance of Longitudinal Studies in Family Medicine: Experiences of Two Practice-based Research Networks
J Am Board Fam Med, January 1, 2006; 19(1): 69 - 74.
[Abstract] [Full Text] [PDF]


Home page
J Am Board Fam MedHome page
F. M. Chen, G. E. Fryer Jr., and T. E. Norris
Effects of Comorbidity and Clustering upon Referrals in Primary Care
J Am Board Fam Med, November 1, 2005; 18(6): 449 - 452.
[Abstract] [Full Text] [PDF]


Home page
Ann Fam MedHome page
K. C. Stange and W. L. Miller
In This Issue: The Patient Voice, Clinical Research, Clustered Data, and the Wonca Research Conference
Ann. Fam. Med, May 1, 2004; 2(3): 194 - 197.
[Full Text] [PDF]


Home page
Ann Fam MedHome page
S. J. Zyzanski, S. A. Flocke, and L. M. Dickinson
On the Nature and Analysis of Clustered Data
Ann. Fam. Med, May 1, 2004; 2(3): 199 - 200.
[Full Text] [PDF]

TRACK Comments:

Read all TRACK Comments

GRTs: a Timely and Useful Tool in Family Practice Research
Richard J Kryscio
Annals of Family Medicine, 27 May 2004 [Full text]
More on Clusters
James J. Diamond
Annals of Family Medicine, 3 Sep 2004 [Full text]

This Article
Right arrow Abstract
Right arrow Figures Only
Right arrow Full Text (PDF)
Right arrow Supplemental data: Appendix
Right arrow In Brief
Right arrow TRACK Discussion: Submit a Comment
Right arrow TRACK Discussion: View Comments
Right arrow Alert me when this article is cited
Right arrow Alert me when TRACK Comments are posted
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Killip, S.
Right arrow Articles by Pearce, K.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Killip, S.
Right arrow Articles by Pearce, K.


HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS