Abstract
PURPOSE Indicators of prognosis should be considered to fully inform clinical decision making in the treatment of depression. This study examines whether self-rated health predicts long-term depression outcomes in primary care.
METHODS Our analysis was based on the first 5 years of a prospective 10-year cohort study underway since January 2005 conducted in 30 randomly selected Australian primary care practices. Participants were 789 adult patients with a history of depressive symptoms. Main outcome measures include risks, risk differences, and risk ratios of major depressive syndrome (MDS) on the Patient Health Questionnaire.
RESULTS Retention rates during the 5 years were 660 (84%), 586 (74%), 560 (71%), 533 (68%), and 517 (66%). At baseline, MDS was present in 27% (95% CI, 23%–30%). Cross-sectional analysis of baseline data showed participants reporting poor or fair self-rated health had greater odds of chronic illness, MDS, and lower socioeconomic status than those reporting good to excellent self-rated health. For participants rating their health as poor to fair compared with those rating it good to excellent, risk ratios of MDS were 2.10 (95% CI, 1.60–2.76), 2.38 (95% CI, 1.77–3.20), 2.22 (95% CI, 1.70–2.89), 1.73 (95% CI, 1.30–2.28), and 2.15 (95% CI, 1.59–2.90) at 1, 2, 3, 4, and 5 years, after accounting for missing data using multiple imputation. After adjusting for age, sex, multimorbidity, and depression status and severity, self-rated health remained a predictor of MDS up to 5 years.
CONCLUSIONS Self-rated health offers family physicians an efficient and simple way to identify patients at risk of poor long-term depression outcomes and to inform therapeutic decision making.
INTRODUCTION
Depression is the largest contributor to the burden of disease in high-income countries, and further increases are expected.1,2 Of those who seek professional help for depression, 80% turn to a primary care physician.3 The high prevalence of depressive symptoms in primary care patients highlights the crucial role of primary care in the detection and treatment of depression.4,5 In clinical practice understanding the prognosis for an individual is central to fully inform treatment decisions,6 yet prognosis has received limited attention compared with therapeutic and etiological research.6,7 Validated prognostic indices provide estimates of the probability of an outcome for those with a particular condition7 and offer improved accuracy in prognostic assumptions that influence clinical decisions.8
The accuracy of family physicians in predicting the course of depression is described as fair,9 although as in other conditions it has been studied relatively little. Useful predictors of poor depression outcome in primary care have been identified in 3 main areas: individual characteristics (age,10 high neuroticism levels,11 perceived need for care,12 low mastery13); socioeconomic status (low education level,12 low income,12 unemployment12); loneliness,12 low social support,12,14 limitations of social activities due to health14; and health, both physical (medical comorbidity,11,12,14,15 chronic pain,11 daily smoking13) and mental (comorbid anxiety,11,12,15 double depression,11 severity of depressive symptoms,11,12,14,16 somatic symptoms of depression,16 history of recurrent depression,15 remission status at 3 months,15 appropriate use of antidepressant medications,14 greater mental dysfunction16). Reliable assessment of the predictors listed above requires time and a multitude of validated instruments. In addition, physicians tend to prefer their clinical judgment when deciding about treatment.17 Many are ambivalent about questionnaire use17,18 and may disregard such scores.19 Considering the everyday caseload of physicians, methods that take more than a few minutes have low acceptability in clinical practice,20–22 highlighting the need for practical, simple, reliable, and easily interpretable predictive tools23 that translate smoothly into daily clinical practice.24
Self-rated health may be a candidate for a simple and transportable prognostic index. It provides a single-question assessment of one’s subjective health, encompassing physical functioning, as well as psychosocial aspects of health.25,26 Self-rated health has been used as a reliable, quick assessment for population health monitoring.27–29 Two large population studies and one large international multisite study in primary care found that self-rated health is associated with depression.4,30,31
Although there is sound literature on the association between depression and self-rated health, we lack prognostic studies investigating the link between it and depression outcomes over time. Two studies suggest that self-rated health may be an important predictor of poor depression outcome.32,33 An association between worse self-rated health and poor depression outcome 6 years later was noted in a study of 585 people with major depression drawn from a population-based longitudinal health survey.32 Self-rated health was not the main focus of the study and was not included in the final prognostic model. In the longitudinal US Cardiovascular Health Study of 5,888 Medicare recipients, self-rated health discriminated older adults most at risk for becoming or remaining depressed within a year.33 Nonetheless, it is unknown whether self-rated health can predict risk of long-term depression outcomes in primary care. We hypothesized, a priori, that people who rated their health as poor or fair at baseline would be at greater risk of poor depression outcomes over a 5-year follow-up.
METHODS
Participants
The participants were part of the Diagnosis, Management and Outcomes of Depression in Primary Care (diamond) study, an ongoing (2005–2016) prospective, longitudinal cohort study of patients with depressive symptoms based on the Centre for Epidemiologic Studies Depression Scale (CES-D ≥16)34 from 30 randomly selected Australian general practices. Study design, methods, and sample size calculations have been reported earlier.35 Participants completed annually a wide range of validated and study-specific measures that included demographic and socioeconomic characteristics, chronic conditions, major depression, and self-rated health.
Measures
Patient Health Questionnaire
The 9-item Patient Health Questionnaire (PHQ-936) is a widely used self-report measure of depression that is brief, easy to administer, and well validated.37 It consists of 9 criteria assessed over the last 2 weeks on which the diagnosis of the Diagnostic and Statistical Manual of Mental Disorders (Fourth Edition) (DSM-IV) depressive disorders is based. The PHQ-9 can establish provisional depressive disorder diagnoses or assess depression severity.38 The outcome was current major depressive syndrome (MDS) based on the PHQ-9 findings. Participants were classified as those experiencing MDS if they answered that 5 or more of the 9 depressive symptom criteria had been present at least “more than one-half the days” in the past 2 weeks and 1 of the symptoms was depressed mood or anhedonia, and those who were not.38 Depression severity was a continuous measure (range = 0 to 27) calculated by summing PHQ-9 items.
Self-Rated Health
Health status during the preceding month was assessed using the general health question of the Short-Form 12 Health Questionnaire.39 Participants rated their health by answering the question: “In general, would you say your health is excellent, very good, good, fair, or poor.” These responses were dichotomized into poor to fair (self-rating of fair or poor) and good to excellent (self-rating of good, very good, or excellent). This assessment has shown to be consistent with previously validated single-item measures of general subjective health, which have shown good reliability and validity.40
Statistical Analysis
Data analyses were conducted using Stata 12 (Stata-Corp LP). Data were summarized using counts and percentages. Statistical analyses were undertaken in 2 stages. The first stage sought to understand the components of self-rated health (cross-sectional analyses). Marginal logistic regression was used to examine the association between self-rated health and participant characteristics in the diamond screening sample (N = 7,667)41 and the diamond baseline sample (N = 789),35 and results were reported as odds ratios with 95 percent confidence intervals and P values. Generalized estimating equations with robust standard errors were used to allow for the clustering of patients in practices.42 Using backwards elimination stepwise logistic regression, patient characteristics were selected to build the multivariable model if they exhibited a moderate level of association (P <. 1) with self-rated health. For participant characteristics highly correlated with each other (eg, long-term illness and multimorbidity), only 1 of the characteristics was included in the multivariable model.23
The second stage of analysis (longitudinal analyses) examined whether poor to fair self-rated health was predictive of risk of major depressive syndrome (outcome) at 1, 2, 3, 4, and 5 years (time) from enrolment in the diamond study (baseline), addressing the primary aim of the article. Generalized linear models using generalized estimating equations specifying the identity and log link functions were used to estimate risk differences and risk ratios, respectively. Robust standard errors were used to adjust for the correlated outcomes of individuals within practices and repeated outcomes measured within individuals. The outcome measured at each year of follow-up was arranged as a single variable, and the time variable identified the year. The model included self-rated health (poor to fair vs good to excellent) and time (treated as a categorical variable) as fixed effects, and it fitted an interaction between self-rated health and time to obtain separate estimates of the parameter at each follow-up year.
Multivariable regression was used to adjust the risk ratio for self-rated health for baseline depression, fitted as either a binary variable or continuous variable, and for potential confounders (sex, age, and multimorbidity).
Prognostic performance was assessed in terms of calibration (difference between predicted and observed rates of MDS8) and discrimination (C stastistics43). Calibration refers to how closely the predicted risk of MDS agrees with the observed risk of MDS at each year of follow-up. Discrimination is the ability of a model to differentiate between those participants who do and those who do not experience MDS during the follow-up.
Multiple imputation was used to handle missing data.44 Given that the largest percentage of incomplete cases was 46% at 5 years, 50 data sets were imputed45 using the “mi impute mvn” command in Stata 12. The imputation model included all the variables of the multivariable model described above and auxiliary variables correlated with missing responses at follow-up times, which included measures of socioeconomic status, abuse, and mental and physical health. Continuous variables included in the imputation model were checked for deviations from the normal distribution and skewed variable, and they were transformed to a symmetric distribution. The imputed outcomes were dichotomized using adaptive rounding.46 The estimates of the parameters for each imputed data set were combined using Rubin’s rules.47 Results of the longitudinal analyses are reported as risks of MDS, risk differences, and risk ratios for poor to fair compared with good to excellent self-rated health at 1, 2, 3, 4, and 5 years. Estimates of the risk differences and risk ratios are reported with corresponding 95% confidence intervals and P values.
Ethics approval was granted by the University of Melbourne’s Human Research Ethics Committee (ID 030613X).
RESULTS
Seven hundred eighty-nine participants who scored 16 or more on the CES-D at screening were recruited into the study. Retention rates ranged from 84% at 1 year to 66% at 5 years (Figure 1). Complete baseline data (N = 789) were available for self-rated health, CES-D, PHQ-9, multimorbidity, medication, and demographic variables, such as sex, age, and physician location. Less than 2% of data was missing on other variables at baseline, except for first episode of depression, which had 9% (74) of missing values.
At baseline, 41% (95% CI, 36%–45%) of participants rated their health as poor to fair, and MDS was present in 27% (95% CI, 23%–30%). For 211 participants with MDS, 130 participants (62%, 95% CI, 54%–69%) rated their health as poor to fair, compared with 192 of 578 participants with no MDS (33%, 95% CI, 29%–38%). Table 1 shows a comparison of the participants’ characteristics by self-rated health at baseline. Poor to fair self-rated health was strongly associated with MDS, long-term illness that limited daily activity, and lower socioeconomic status at baseline. The association between participants’ characteristics and self-rated health was similar when examined in the 7,667 participants who took part in the screening phase of the diamond study (details available from the authors).
Tables 2 and 3 report the risks, risk differences, and risk ratios of MDS for poor to fair self-rated health compared with good to excellent at each year of follow-up for the complete case and multiple imputation analyses. Prevalence of MDS ranged from 22% to 27% across the 5 years of follow-up using the multiply imputed data. Across the 5 years, risk of MDS ranged between 15% and 19% for participants reporting good to excellent health and 34% and 39% for those with poor to fair health. In comparison, the risks of MDS were lower when estimated using the complete cases. The difference between the estimated risks for the complete cases and the imputed data indicates that multiple imputation corrects for a downward bias in the complete-cases estimates, assuming the data are missing at random.
Self-rated health was a strong and consistent predictor of risk of MDS during the 5 years of follow-up. Generally, there was a twofold increase in the risk of MDS for participants rating their health as poor to fair compared with those rating it as good to excellent. Risk differences and risk ratios estimated for the complete cases were more variable over time and estimated with slightly lower precision than those estimated using multiple imputation.
When adjusted for baseline MDS (Table 2), the risk ratios for self-rated health were attenuated, reflecting the positive correlation between self-rated health and MDS at baseline. These risk ratios remained relatively unchanged after adjusting for multimorbidity, age and sex (Supplemental Table). Predicted risks for MDS were lowest for participants rating their health as good to excellent and no MDS at baseline and highest for those with MDS rating their health as poor to fair (Figure 2). Poor to fair self-rated health predicted MDS even when adjusting for depression severity at baseline (Supplemental Table).
When self-rated health was fitted as a single predictor, discrimination was 0.68 (95% CI, 0.65–0.70) for the complete cases and 0.64 (95% CI, 0.61–0.66) for the combined multiply imputed data. Differences between observed and estimated risks of MDS were 1% or less. Discrimination was 0.74 (95% CI, 0.72–0.77) after adjusting for MDS at baseline; 0.75 (95% CI, 0.73–0.78) after adjusting for MDS at baseline, multimorbidity, age, and sex; and 0.80 (95% CI, 0.78–0.82) after adjusting for depression severity (complete case analyses).
DISCUSSION
This study found that self-rated health predicts the risk of future depression in a primary care cohort with depressive symptoms. Compared with those rating their health as good to excellent, those rating their health as poor to fair had a twofold greater risk of MDS up to 5 years later. The association between self-rated health and future depressive status remained strong even after adjusting for age, sex, multimorbidity, and baseline depression status or severity.
These findings build upon a previous study that noted an association between lower levels of self-rated health and future depression in a population-based cohort32 and a study of older adults in which self-rated health was found to identify those most at risk of depression.33 Participants rating their health as good to excellent had a risk of future depression similar to that of the general primary care patient population,4,48 whereas those rating their health as poor to fair had a risk of future depression similar to that of the population seen in subspecialty clinics, such as oncology and cardiology.49 Rating one’s health as poor or fair is a subjective summary measure of health that incorporates disability, depression, and financial difficulties consistent with the findings of Mavadatt et al.26 Poor to fair self-rated health may discriminate a subgroup of patients with complex and difficult-to-treat forms of depression complicated by physical ill health and social disadvantage.
Few prognostic tools exist to assist physicians in determining those at increased risk of persistent or recurring depression. One example is the Diagnostic Prognostic Index, developed and validated (in a sample of 1,471 primary care patients with major depression) to predict risk of depression at 6 months.14 The Diagnostic Prognostic Index is composed of several items—number of depression symptoms lasting 2 years or more, anxiety symptoms, having completed 3 months of antidepressants at sample entry, common physical symptoms, health interfering with social activities, and social support. The discrimination performance of the single self-rated health question compares reasonably well with the Diagnostic Prognostic Index; based on its quartiles, the calculated discrimination performance using multiple imputation was 0.69 (95% CI, 0.66–0.72) vs the self-rated health performance of 0.64 (95% CI, 0.61–0.66). Compared with the Diagnostic Prognostic Index, self-rated health has the advantage of simplicity; it has 1 question vs 7 items. In addition discrimination for self-rated health is stable for 5 years, and its use is applicable in a broader population of patients. When self-rated health and MDS categories are combined, discrimination of the model is good8 and superior to the Diagnostic Prognostic Index.
Strengths
Longitudinal cohort studies are the best design to answer questions about prognosis.7 The diamond cohort is representative of patients with depressive symptoms in Australian primary care35,41 and is similar to primary care populations in other developed countries.4 This cohort represents the heterogeneous and variable nature of depression seen in primary care.11,14 Data are of good quality, with recruitment and attrition rates in the range of those reported in longitudinal studies of depression in primary care.50,51 Missing data have been dealt with rigorously using multiple imputation methods to overcome possible biases related to complete case analysis. Depression outcome was assessed with the PHQ-9 diagnostic criteria, which have good levels of agreement with diagnoses of independent mental health professionals.52 We have assessed predictive accuracy (ie, discrimination and calibration) as proposed in the current literature on prognosis.8,23,53
Limitations
A single predictor rarely gives an adequate estimate of prognosis7 and cannot be considered a prognostic model; rather, it represents one potential predictor that is easy to implement in primary care. Self-rated health may not be the single best prognostic variable for depression-prone cohorts, and future research could investigate other potential indicators. This study focused on patients with recent experience of depressive symptoms. Further research is needed to determine whether self-rated health successfully predicts outcome beyond the study population.54
Implications for Physicians
Family physicians, who are often the first contact for health care, are required to differentiate a wide range of symptoms from a complex narrative in a short period of time. The goal of prognosis is to improve clinical decision making, and the findings described here have important clinical relevance. Self-rated health provides sufficient prognostic information to indicate long-term increased risk of poor depression outcome for primary care patients with recent experience of depressive symptoms. Simply assessing self-rated health using the single question presented here and assessing depression status using the PHQ-9 (which is widely used in primary care38) could fit well within routine clinical care. Such information will provide reliable information about risk of future depression.
Although a prognostic model including self-rated health and depression severity was more accurate, it is more complex to interpret and would require more effort to implement into routine care. Interestingly, the findings suggest that incorporating age, sex, and multimorbidity into the model adds little to the prognostic information provided by assessment of self-rated health and MDS. Accuracy could be increased by building more comprehensive models that include other risk factors for depression based on specific validated questionnaires. There is a trade-off, however, between validity and clinical acceptability of prognostic tools, especially in busy practice.6
When a patient with a recent experience of depressive symptoms rates his or her health as poor to fair, it should prompt a thorough biopsychosocial evaluation leading to an intensive and comprehensive treatment plan. Patients with depressive symptoms who report poor to fair self-rated health may benefit from being fast-tracked to proactive follow-up and collaborative care models rather than a stepped-care approach.
Acknowledgments
We acknowledge the 30 dedicated general practitioners, their patients, and practice staff for making this research possible. We thank the cohort participants for their ongoing involvement in the study. We also thank the diamond project team and associate investigators involved in the study: Lena Sanci, Maria Potiriadis, Konstancja Densley, Aves Middleton, Melanie Charity, and the casual research staff.
Footnotes
-
Conflicts of interest: none reported.
-
Funding support: The diamond study was initiated with pilot funding from the Beyondblue Victorian Centre of Excellence and the main cohort has received project grant funding from the National Health and Medical Research Council (ID 299869, 454463, 566511, and 1002908). The one year Computer Assisted Telephone Interview was funded by a Stream 3 grant from the Australian Primary Health Care Research Institute (APHCRI). Research fellowship of Ambresin was supported by a scholarship from the Centre Hospitalier Universitaire Vaudois and the Department of Psychiatry of the University of Lausanne, Switzerland.
-
Disclaimer: No funding body had a role in study design; the collection, analysis, and interpretation of data; the preparation, review, or approval of the manuscript; or the decision to submit this manuscript for publication.
-
diamond Study investigators: Jane M. Gunn, Helen Herrman, Mike Kyrios, Kelsey Hegarty, Christopher Dowrick, Gail Gilchrist, Grant Blashki, Dimity Pond, Patty Chondros, Renata Kokanovic, and Victoria Palmer.
-
Supplementary materials: Available at www.annfammed.org/content/12/1/57/suppl/DC1/
- Received for publication August 16, 2012.
- Revision received March 29, 2013.
- Accepted for publication April 8, 2013.
- © 2014 Annals of Family Medicine, Inc.