Abstract
PURPOSE We wanted to determine whether the severity and duration of illness caused by the common cold are influenced by randomized assignment to open-label pills, compared with conventional double-blind allocation to active and placebo pills, compared with no pills at all.
METHODS We undertook a randomized controlled trial among a population with new-onset common cold. Study participants were allocated to 4 parallel groups: (1) those receiving no pills, (2) those blinded to placebo, (3) those blinded to echinacea, and (4) those given open-label echinacea. Primary outcomes were illness duration and area-under-the-curve global severity. Secondary outcomes included neutrophil count and interleukin 8 levels from nasal wash at intake and 2 days later.
RESULTS Of 719 randomized study participants, 2 were lost and 4 exited early. Participants were 64% female, 88% white, and aged 12 to 80 years. Mean illness duration for each group was 7.03 days for those in the no-pill group, 6.87 days for those blinded to placebo, 6.34 days for those blinded to echinacea, and 6.76 days for those in the open-label echinacea group. Mean global severity scores for the 4 groups were no pills, 286; blinded to placebo, 264; blinded to echinacea, 236; and open-label echinacea, 258. Between-group differences were not statistically significant. Comparing the no-pill with blinded to placebo groups, differences (95% confidence interval [CI]) were −0.16 days (95% CI, −0.90 to 0.58 days) for illness duration and −22 severity points (95% CI, −70 to 26 points) for global severity. Comparing the group blinded to echinacea with the open-label echinacea group, differences were 0.42 days (95% CI, −0.28 to 1.12 days) and 22 severity points (95% CI, −19 to 63 points). Median change in interleukin 8 concentration and neutrophil cell count, respectively by group, were 30 pg/mL and 1 cell for the no-pill group, 39 pg/mL and 1 cell for the group binded to placebo, 58 pg/mL and 2 cells for the group blinded to echinacea, and 70 pg/mL and 1 cell for the group with open-label echinacea, also not statistically significant. Among the 120 participants who at intake rated echinacea’s effectiveness as greater than 50 on a 100-point scale for which 100 is extremely effective, illness duration was 2.58 days shorter (95% CI, −4.47 to −0.68 days) in those blinded to placebo rather than no pill, and mean global severity score was 26% lower but not significantly different (−97.0, 95% CI, −249.8 to 55.8 points). In this subgroup, neither duration nor severity differed significantly between the group blinded to echinacea and the open-label echinacea group.
CONCLUSIONS Participants randomized to the no-pill group tended to have longer and more severe illnesses than those who received pills. For the subgroup who believed in echinacea and received pills, illnesses were substantively shorter and less severe, regardless of whether the pills contained echinacea. These findings support the general idea that beliefs and feelings about treatments may be important and perhaps should be taken into consideration when making medical decisions.
- Double-blind method
- clinical trial
- common cold
- expectancy
- placebo effect
- upper respiratory tract infection
INTRODUCTION
In 1955 Beecher, in his article “The Powerful Placebo,” looked at “15 studies…involving 1,082 patients.”1 Averaging results across these studies, Beecher famously claimed that “placebos are found to have an average significant effectiveness of 35.2% ± 2.2%.”1 Since then, placebo effects,2–7 expectancy,8–10 and context effects11,12 have been studied extensively and occasionally touted as therapeutic.13,14 The evidence of placebo effects is strongest for pain15–20 and depression,21–24 where multiple well-designed trials have compared placebo with no treatment. Although the magnitude and universality of placebo effects have been questioned,25–27 blinding (concealment, masking) of matched active and placebo control groups has remained a fundamental component of clinical trials. Even so, the question of whether the results of blinded trials could be generalized to clinical practice has received little scrutiny. Few trials have included both open-label and no-treatment groups. The underlying quandary is as follows: If blinding is needed because awareness of treatment can influence outcomes, then the results of blinded trials may not generalize to real-life situations, precisely because a person is aware of his or her treatment in real life but is intentionally kept unaware in blinded trials.28
The common cold provides an excellent but under-utilized opportunity to address these questions. In the United States, some 500 million noninfluenza viral respiratory infections cost society approximately $40 billion each year.29 High incidence of the common cold provides an accessible population for investigating placebo effects. Outcomes are measured in terms of symptoms, function, and impact on quality of life, all considered more responsive to placebo mechanisms than biomarkers of physiological function. Perhaps more importantly, there are no proven effective treatments for colds; hence, there are no serious ethical issues in terms of randomizing to placebo or no-treatment groups.
Evidence regarding placebo effects in the common cold is limited. Eccles has reported “a review of 8 clinical trials on the effects of antitussive medicines on cough associated with acute upper respiratory tract infection [showing] that 85% of the reduction in cough is related to treatment with placebo, and only 15% attributable to the active ingredient.”30 The only large common cold trial reporting placebo effects randomized 955 participants with new-onset common cold to 5 groups: no treatment, nasal saline (placebo), or 3 different doses of nasal ipratropium.31 Although the authors stressed results suggesting dose-dependent benefits of ipratropium, the difference between the placebo and no-treatment groups was greater than the difference between the placebo and any one of the active groups. In terms of nasal mucus weight, placebo beat no treatment by 27% (P <.01). Symptom severity rated on visual analogue scales was 40% lower for placebo than for no treatment (P <.001). Interpretation of results is limited in that the saline nasal spray used as the placebo was not blinded, and that nasal saline may have its own physiological benefits.32
A 2002 request for applications by National Center for Complementary and Alternative Medicine at the US National Institutes of Health regarding placebo effects prompted us to propose research aimed at investigating how simulated real-life conditions (not taking a pill vs taking a named known pill) compared with conventional double-blind randomized controlled trial conditions. We chose a design that would assess 2 kinds of pill-related placebo effects (no pill vs blinded to placebo, and blinded to vs open-label treatment) and would also provide evidence regarding potential effects of physician-patient interaction. Our review of the relevant literature,33 detailed methods,34 and results related to echinacea35 and physician-patient interaction36 are available elsewhere.
METHODS
This trial used a 2-way factorial design in which study participants with new-onset common cold were randomized in 2 directions: (1) pill-related groups, and (2) clinical interaction–related groups. Clinical interaction effects were investigated by randomization to (1) no physician visit, (2) a standard physician visit, or (3) an enhanced patient-oriented visit. A priori power estimates suggested that even if we found positive effects of both clinical interaction and pill-related interventions, we would not be able to draw firm conclusions regarding relative magnitude or possible interactions.
In the pill direction, each participant had an equal 25% chance of being randomized to 1 of 4 groups: no pill, blinded to placebo, blinded to echinacea, or open-label echinacea. Echinacea was selected as the active treatment because its popularity and name recognition provided reasonable potential of expectancy-related placebo effects. At the time the current study was designed, several published trials of echinacea had reported benefit,37–40 whereas one, our own,41 had not. The study design reported here provided the advantage of being able to test for specific effects of echinacea in addition to the main focus on effects related to placebo and expectancy. The specific a priori hypotheses relevant to the current report were that (1) participants assigned pills would have shorter illnesses duration and lower global severity scores than those not assigned pills, (2) those getting open-label echinacea would do better than those the blinded to echinacea, and (3) those who believed in the positive effects of echinacea would do better than those who did not if they were assigned pills, especially if the pills were open-label echinacea.
Inclusion Criteria
Participants older than 12 years were eligible; if they were younger than 18 years, parental permission was obtained. To be enrolled, participants had to answer yes to either “Do you think that you have a cold?” or “Do you think you are coming down with a cold?” At least 2 points on the Jackson scale were required, with Jackson’s 8 symptoms (sneezing, nasal discharge, nasal obstruction, sore throat, headache, malaise, chilliness, and cough) each rated as absent (0), mild (1), moderate (2), or severe (3).42 One of the first 4 Jackson’s cold-specific symptoms was required, and none could have begun more than 36 hours before enrollment. Timing of onset of first symptom was determined by a telephone screening interview, after which the participant was interviewed in person according to a structured format. Anyone with a history of allergic rhinitis or asthma was excluded if he or she reported itchy eyes, sneezing, cough, or shortness of breath. Cold remedies other than those specified by the study protocol were disallowed. Pregnant women and those with immune system disorders were excluded.
Recruitment, Enrollment, Monitoring
We recruited participants directly from the community, using newspaper advertising, direct mailings, promotional items, and e-mail to University of Wisconsin (UW) students, staff, and alumni. Participants were enrolled at 2 sites: a UW family medicine teaching clinic in Verona, Wisconsin, and at the UW Department of Family Medicine, Madison, Wisconsin. Participants were screened by telephone, then met in person for consent, enrollment, and randomization. Nasal wash samples were obtained at enrollment, and again 2 days later. Participants completed questionnaires twice daily until their colds resolved, to a maximum of 14 days. Once colds had resolved, an exit interview was completed, and questionnaire booklets were collected.
Randomization, Allocation, and Blinding
SAS 8 (SAS Institute, Cary, North Carolina) was used to generate a single block of 804 numbers so that each of 12 cells (3 clinician groups by 4 pill groups) was represented equally. An envelope-within-envelope technique was used, with envelopes prepared by the UW Hospitals Pharmaceutical Research Center. Directly after consent, the larger outer envelope was opened to reveal allocation to no pill (25%), blinded to a pill (50%), or open-label pill (25%). For the 66.6% of the participants randomized to a clinical visit, a second smaller envelope was opened by the clinician before entering the examination room. The randomized allocation key was not shared with investigators until after all data were collected, entered, and cleaned, and analysis strategies were determined. Blinding was tested at an exit interview by asking participants to which group they thought they had been assigned.
Interventions
Matching echinacea and placebo tablets were manufactured by MediHerb (MediHerb, Warwick, Australia). Echinacea tablets included root extracts of Echinacea purpurea and E angustifolia.35 Placebo and echinacea tablets measured 9 mm by 19 mm and were covered by reddish brown digestible coatings. Investigators and colleagues found it impossible to distinguish between the placebo and echinacea pills in terms of appearance, smell, or taste. Twenty-four tablets were dispensed in conventional plastic pill bottles. Instructions were to take 2 pills 4 times in the first 24 hours, then continue with 1 pill 4 times daily for the next 4 days.
Outcomes, Data Collection, and Monitoring
Illness duration and severity were selected as primary outcomes. Duration was defined as total time elapsed from enrollment until the last time answering yes to the question, “Do you think you still have a cold?” Times and dates were recorded, so duration was measured in terms of hours and minutes, and then converted to decimalized days.
Illness severity was rated twice daily on the validated 21-item Wisconsin Upper Respiratory Symptom Survey (WURSS-21).43–46 The first and last items are assessed separately, so the total score is a simple sum of 19 items. Ten items rate cold symptoms, and 9 evaluate quality-of-life functional impairment. Each item is rated on a conventional Likert scale, in which 0 = none, 1 = very mild, 3 = mild, 5 = moderate, and 7 = severe. Area-under-the-curve global severity was calculated by first averaging morning and evening WURSS scores, then applying trapezoidal approximations.47
Biomarkers of immune response and inflammation (interleukin 8 and neutrophil counts) were measured from nasal wash drawn at baseline (day 1) and again 2 days later (day 3). General health was assessed daily using both the 24-hour recall SF-8 Health Survey48 and the EuroQol’s Feeling Thermometer.49 The SF-8 yields separate physical and mental health scores.48 Stress was measured with Cohen’s Perceived Stress Scale-4 (PSS-4),50,51 self-reported at baseline, day 3, and exit.
Belief in the effects of echinacea (expectancy) was assessed at intake by asking, “Have you ever taken echinacea before?” Participants answering yes were then asked, “How effective do you think that echinacea is?” Participants scored responses on a 100-mm visual analogue scale ranging from 0 (totally ineffective) to 100 (extremely effective).
Potential side effects were assessed with open-ended questions during telephone monitoring and again at the exit interview when we asked participants whether they had experienced any of the following during their cold: bad taste, diarrhea, headache, nausea, rash, or stomach upset.
The protocol was approved and monitored by the Human Subjects Committee of the University of Wisconsin Institutional Review Board. A committee that monitored data and safety met annually to review enrollment, retention, and adverse effects.
Statistical Analysis
The target sample size of 720 participants who would finish the protocol was based on 80% power to detect 20% differences in severity-weighted days of illness (global severity) between allocation groups, assuming α = .05, proportionally stable standard deviation, and one-sided comparison. The decision to use one-sided testing was justified by the generally positive literature on placebo effects and was approved by multiple scientific reviewers before the trial began. Power estimates were informed by data collected during our previously published echinacea trial41 and by validation studies on the WURSS-21 instrument.43–46
The effects of assignment to placebo-related and clinician-related treatment groups were tested using analysis of variance (ANOVA), analysis of covariance, and multivariate regression. Assessment and response to missing data followed Schafer’s approach.52 The final analysis structure testing for treatment effects was a general linear model53,54 implemented using NCSS statistical software.55 Models included the following covariates, all specified before data collection began: duration of symptoms before enrollment, symptom severity at enrollment, age, sex, ethnicity, education, income, smoking status, general health, and assignment to clinician intervention groups.
RESULTS
We began enrolling participants in January 2004 and concluded in August 2008. Of 3,321 persons screened, 719 were enrolled and randomized. Figure 1⇓ shows major reasons for nonenrollment. Of the 719 enrolled, 2 were lost to follow-up, and 4 exited early. Duration and global severity could not be calculated for these 6 participants. Overall, there were few missing data (less than 3%); however, 33 participants either refused or failed to show up within time limits for the nasal wash visit on day 3, limiting sample size for interleukin 8 concentration determination and neutrophil count. Our sample was 64% female and 88% white, with 84% reporting at least some college education, and 36% reporting household incomes of $25,000 per year or less. Only 13% were current smokers. There were no substantive differences in the comparison groups for these baseline measures (Table 1⇓).
As predicted, the average duration (7.03 days) and global severity scores (286 points) were greatest among those assigned to no pills (Tables 2⇓ and 3⇓). Compared with the no-pill group, those blinded to placebo averaged colds that were slightly shorter (−0.16 days; 95% CI, −0.90 to 0.58 days) and less severe (−22 points; 95% CI, −70 to 26), differences that were not statistically significant. Comparing groups blinded to echinacea with those getting open-label echinacea, differences were 0.42 days (95% CI, −0.28 to 1.12 days) and 22 severity points (95% CI, −19 to 63 points), with trends favoring the group blinded to echinacea. Multivariate models adjusting for potential confounders yielded nonsignificant trends toward pill benefit, with 0.35, 0.62 and 0.26 days reduced duration for the 3 pill groups compared with the no-pill group. Because the global severity measure was skewed, we carried out Box Cox transformation, with or without multivariate adjustment; these analyses did not change results or conclusions.
Secondary outcomes are shown in Table 4⇓. Changes from enrollment to day 3 in neutrophil count and interleukin 8 concentration (pg/mL) were lowest in the no-pill group, and highest in the open-label echinacea group. Because interleukin 8 concentrations and neutrophil counts were highly skewed, median, geometric mean, and mean rank difference are shown instead of arithmetic mean. Figure 2⇓ shows Cohen’s d standardized effect size and 95% confidence intervals for global severity, illness duration, interleukin 8 concentration, and nasal neutrophil counts for the 3 pill groups compared with the no-pill group. Physical health, mental health, and stress were similar among the 4 groups. Four of the 6 assessed side effects were reported most frequently in the no-pill group. Headache was reported by 62% of those without pills compared with less than 50% in the other 3 groups. A χ2 test suggests this discrepancy is not due to chance (χ2=22.3; P <.001). Other between-group differences of outcomes shown in Table 4⇓ were not statistically significant.
For the 120 participants who at baseline rated echinacea’s effectiveness greater than 50 on the 100-mm expectancy scale, apparent placebo effects were more pronounced. Mean duration and global severity among the 32 participants randomly assigned no pills was 8.41 days and 374 points, compared with 5.83 days and 277 points for the participants blinded to placebo (n = 25), 7.10 days and 275 points for the participants blinded to echinacea (n = 33), and 7.07 days and 264 points for the participants in the open-label echinacea group (n = 30). Comparing participants with no pills with those who were blinded to placebo, illnesses were 2.58 days shorter (95% CI, −4.47 to −0.68 days), with mean global severity 97 points (26%) lower, but not statistically significant (−97.0 points; 95% CI, −249.8 to 55.8 points). For this subsample of those who believed in the benefits of echinacea, those assigned placebo appeared to do as well or better than those assigned echinacea, regardless of whether they were in the group blinded to placebo or the open-label group (Tables 2⇑ and 3⇑). Comparing duration and global severity in the no-pill group with the other 3 groups combined (2-sample t test) yielded P = .022 for duration and P = .055 for global severity. The overall F test for the one-way ANOVA with 4 groups was nonsignificant for severity (P = .30), but close to significant for duration (P = .06).
Adherence was assessed by asking participants, “Did you take all your pills as directed?” and by pill-counts in returned bottles. Of the 545 participants given pills, 518 (95%) said that they took them as directed. There were 524 bottles returned, of which 486 (93%) were empty and 38 (7%) had pills left. There were no significant differences in pill counts among those groups assigned to blinded to placebo, blinded to echinacea, and open-label echinacea.
Adequacy of blinding was tested at an exit interview by asking, “Do you believe that you were given echinacea or placebo?” Of the 363 participants blinded to which pills they were given, 141 (39%) answered correctly, 110 (30%) answered incorrectly, and 107 (29%) answered “don’t know” or “won’t guess.” Of 179 assigned placebo, 72 participants (40%) answered correctly compared with 69 (38%) who were blinded to the echinacea pills they received. These differences were not statistically significant.
DISCUSSION
Results of this trial do not support a powerful placebo effect in the common cold. The data, however, are consistent with modest placebo effects attributable to receiving pills, regardless of the content of those pills. Those who were randomly assigned pills reported illness duration that was on average 0.16 to 0.69 days shorter and 8% to 17% less severe than those assigned no pills. These observed trends were in hypothesized directions, but were not statistically significant. Contrary to our a priori hypotheses, those who were assigned open-label echinacea did not appear to fare better than those who were blinded to echinacea. Consistent with a priori hypotheses, trends toward apparent placebo benefit widened among those who rated echinacea’s effectiveness highly, with illnesses 1.31 to 2.58 days shorter and 26% to 29% less severe for those assigned pills compared with those assigned no pills. These differences were statistically significant for duration (P = .022), but not for global severity (P = .055). Among secondary outcomes, only headache seemed to be influenced by pill assignment, with 62% of those assigned no pills reporting a headache, compared with less than 50% for each of the groups assigned pills (P <.001). Because these analyses were secondary, caution should be used with interpretation.
Overall, this trial could be interpreted either as an appropriately powered trial that failed to conclusively show placebo effects, or as a trial suggesting small but perhaps meaningful effects related to expectation and pill-allocation. With respect to interpretation, at least 2 issues deserve consideration; both are crucial to understanding clinical trials, yet are often given inadequate attention.
The first concerns the interpretation of multiple outcomes. It is generally agreed that the statistical testing and interpretation of multiple outcomes should be undertaken with caution, as the chances of finding statistical significance increases with the number of tests done. What is not generally agreed upon is how to interpret multiple trends toward positive results when few or none of the individual tests reach significance. Some trialists have chosen to use composite endpoints, a choice that for theoretical and practical reasons is controversial.56,57 For composite endpoints to be meaningful, they should be theoretically justified and specified in advance. Trials that fail to achieve statistical significance in the primary outcome but that display consistent patterns among primary and secondary outcomes are considered by some experts to be more convincing than trials that achieve statistical significance in one measure but no overall pattern.58,59 For the results displayed here, we suspect that patterns reflect real underlying causal associations, and that the trial was underpowered. Whether these possible causal associations are due to placebo mechanisms alone or whether echinacea may be contributing pharmacologically cannot be determined with the data at hand.
The second issue relates to the question of how much of a difference matters. Addressing this question, a minimal clinically important difference (MCID) was defined in 1989 to be “the smallest difference in score in the domain of interest which patients perceive as beneficial and which would mandate, in the absence of troubling side effects and excessive cost, a change in the patient’s management.”60 In 2005 we proposed the alternative concept of sufficiently important difference (SID), with SID defined as “the smallest amount of patient-valued benefit that an intervention would require in order to justify associated costs, risks, and other harms.”61 Using benefit-harm trade-off structured interview methods to estimate SID for the common cold, we found that most persons would want illness duration to be reduced by at least 24 hours,62 or overall severity to be reduced by at least 30%,63 to justify the inconvenience, cost, and possible side effects of conventional cold treatments. But average MCID or SID is only part of the story, as health values vary tremendously across patients.61–63 For low-risk interventions, such as vitamins or herbal medicine, our data suggest that more than one-quarter of the population would consider worthwhile an intervention that would reduce overall severity by 20%,63 or the length of a cold by 12 hours,62 degrees of benefit that are well within the plausible benefit range suggested by the data shown here.
One additional issue deserves mention. In 2002, when we designed this trial, nearly all published echinacea trials had reported benefit,37–40 and societal expectations were high. Then came 4 highly publicized negative echinacea trials, our own in Annals of Internal Medicine (2002), followed by the Taylor et al trial in the Journal of the American Medical Association (2003),64 Yale and Liu’s in Archives of Internal Medicine (2004),65 and the Turner et al trial in the New England Journal of Medicine (2005).66 It is possible that the societal message from these negative trials reduced echinacea-related expectancy and therefore contributed to the modest effects seen here. It is also quite possible that random assignment to no-pill or known-pill conditions poorly simulates real life, where a person is not only aware of a treatment but has actively participated in the decision to take it. The data shown here suggest that those who believe in echinacea’s effectiveness may gain the most benefit from being randomly assigned to echinacea (or a pill that might be echinacea). We suspect that larger placebo effects are derived when a person chooses his or her own treatment rather than have it randomly assigned. Looking at placebo effects across a wide range of expectancies may dilute effects in participants with positive expectancy by averaging them with results of participants with neutral or even negative expectancies.
Most experts agree that placebo effects2–7 derive not from sham interventions per se, but from the beliefs, expectancies, and health-related values8–10 that are embedded in a social, cultural, and clinical context.11,12 As such, placebo effects may change with time. Perhaps the best evidence in this regard relates to depression, where most of the benefits attributable to antidepressants appear to be placebo effects,21–23 and where placebo response is “variable, substantial, and growing” over time.24 This makes sense. When antidepressants were first developed, there was a great deal of skepticism about whether pharmacologically active drugs could influence depression, long considered a problem of mental, not physical, health. As acceptance of antidepressants grew, so did the societal expectation that fuels placebo response in depression. In our case, the reverse may be true, with expectation regarding echinacea lower now than a decade ago. Speculations aside, we interpret the results reported here as suggestive of placebo effects in common cold, especially among those who believe in a therapy, but conclude that these effects are not large; instead, they are limited and nuanced.
Acknowledgments
The Australian firm MediHerb provided the echinacea and placebo tablets used in this trial, but did not provide monetary funding, and did not review or approve this paper. Leadership at St Mary’s Hospital allowed us to use their employee health clinic room for examinations and nasal wash collection. Graduate student Tola Ewers contributed to statistical analysis and table construction. The University of Wisconsin Department of Family Medicine provided invaluable institutional support. Terry Little assisted with formatting and submission. Finally, our greatest thanks go to the many research participants who volunteered their attention, energy, and good will during a time of illness.
Footnotes
-
Conflicts of interest: authors report none.
-
Funding support: The National Center for Complementary and Alternative Medicine (NCCAM) at the National Institutes of Health was primary sponsor of this research project (1-R01-AT-1428). During development of the proposal, NCCAM supported Dr Barrett with a patient-oriented career development grant (K23 AT00051). During conduct of this trial, Dr Barrett was supported by the Robert Wood Johnson Foundation Generalist Physician Faculty Scholars Program. When funds ran low before the trial had finished, leadership at the University of Wisconsin School of Medicine and Public Health provided resources to complete the project.
- Received for publication March 24, 2010.
- Revision received December 20, 2010.
- Accepted for publication January 18, 2011.
- © 2011 Annals of Family Medicine, Inc.