Abstract
PURPOSE We wanted to compare the male osteoporosis risk estimation score (MORES) with the fracture risk assessment tool (FRAX) in screening men for osteoporosis.
METHODS This study reports analysis of data from the Third National Health and Nutrition Examination Survey (NHANES III), a nationally representative sample of the US population, comparing the operating characteristics of FRAX and MORES to identify men at risk for osteoporosis using a subset of 1,498 men, aged 50 years and older, with a valid dual-energy x-ray absorptiometry (DXA) scan. DXA-derived bone mineral density using a T score of −2.5 or lower at either the femoral neck or total hip defined the diagnosis of osteoporosis. Outcomes included the operating characteristics, area under the receiver-operator characteristic curve, and agreement of the FRAX and MORES.
RESULTS Sixty-seven (4.5%) of the 1,498 men had osteoporosis of the hip. The sensitivity, specificity, and area under the curve (AUC) for the MORES were 0.96 (95% CI, 0.87–0.99), 0.61 (95% CI, 0.58–0.63), and 0.87 (95% CI, 0.84–0.91), respectively. The sensitivity, specificity, and AUC for the FRAX were 0.39 (95% CI, 0.27–0.51), 0.89 (95% CI, 0.88–0.91), and 0.79 (95% CI, 0.75–0.84) respectively. Agreement was poor.
CONCLUSIONS Compared with the MORES, the FRAX underperformed as a screening strategy for osteoporosis using the threshold score suggested by the US Preventive Services Task Force (USPSTF). An integrated approach that uses the MORES to determine which men should have a DXA scan and the FRAX to guide treatment decisions, based on the risk of a future fracture, identified 82% of men who were candidates for treatments based on National Osteoporosis Foundation guidelines.
INTRODUCTION
Osteoporosis leads to bone fragility and an increased risk of fractures. The World Health Organization (WHO) considers osteoporosis a critical health problem1 and a major cause of morbidity and mortality in the aging population.2,3 Men and women with osteoporosis experience hip fractures at a similar rate,4 but men experience a worse 1-year mortality.5,6 Evidence indicates the benefits of bisphosphonate therapy in men are similar to those in women.7–10 Despite treatment benefits, screening recommendations for men lag behind those for women.
In 2011 the US Preventive Services Task Force (USPSTF) concluded, “The current evidence is insufficient to assess the balance of benefits and harms of screening for osteoporosis in men,” but went on to state, “the men most likely to benefit from screening would have a 10-year risk of an osteoporotic fracture equal to or greater than the risk of an osteoporotic fracture in 65-year-old white women without additional risk factors for fracture,” which equates to a risk of 9.3%.11 The USPSTF correctly noted that the World Health Organization’s fracture risk assessment tool (FRAX)12,13 is designed to predict the 10-year risk of a hip fracture or a major osteoporotic fracture (hip, humerus, forearm, or spine) but failed to cite any references to support using the FRAX as a screening instrument for osteoporosis for men or women.
The purpose of this study is to compare the operating characteristics and level of agreement of the Male Osteoporosis Risk Estimation Score (MORES)14 and the FRAX12,13 in determining which men should be referred for a diagnostic dual-energy x-ray absorptiometry (DXA) scan.
METHODS
We compared the MORES with the FRAX using a cross-sectional sample of men, representative of the US population, included in the National Health and Nutrition Examination Survey III (NHANES III). The MORES and FRAX outcomes were derived blinded to the results of the dual-energy x-ray absorptiometry (DXA) scans used to measure bone mineral density and the value of the alternate instrument.
When we developed the MORES, we randomly assigned the 2,995 men, aged 50 years and older, from the NHANES III data set to either a development or a validation cohort.14 For this study, we used the 1,498 men from the validation cohort. We defined osteoporosis based on the bone mineral density of the femoral neck or total hip using the non-Hispanic white female, aged 20 to 29 years, the NHANES III reference cohort.15 We included the bone mineral density of the total hip because it predicts future hip as well as femoral neck16 fractures and is used commonly in clinical practice.
The WHO, when it developed the FRAX, used this same reference cohort to define osteoporosis based on the bone mineral density of the femoral neck.12,13,17 By using both the femoral neck and total hip bone density measures, we were able to compare the MORES with the FRAX directly. Vertebral (spinal) bone density measurements were not available in NHANES III.
The NHANES III data set contains the necessary data to define or approximate the variables used in the MORES and the FRAX. The MORES and FRAX were used to classify men into to screen and do-not-screen categories.
Instruments
The MORES uses age, weight, and history of chronic obstructive pulmonary disease (COPD) to stratify the risk for osteoporosis in men aged 50 years and older. A score of 6 points or higher yielded a sensitivity of 0.93, a specificity of 0.59, and an area under the curve (AUC) of 0.832.
The FRAX, a Web-based calculator, uses age, sex, race/ethnicity, body mass index (BMI), paternal history of hip fracture, personal history of a low-impact fracture, smoking, alcohol consumption, use of glucocorticoids, history of rheumatoid arthritis, and history of secondary causes of osteoporosis to predict risk of a hip fracture or a major osteoporotic fracture.
The NHANES III data set also includes bone mineral density results for the femoral neck and total hip for men aged 50 years and older. NHANES III bone mineral density results were transformed to T scores (scores of −2.5 or lower at either the femoral neck or total hip defined the diagnosis of osteoporosis) based the bone mineral density of the femoral neck and total hip from the reference cohort.
Analysis
We calculated the sensitivity and specificity for the MORES and FRAX from 2 × 2 contingency tables.
We computed the MORES according to the algorithm developed in the initial MORES study. Table 1 summarizes the scoring algorithm derived from the regression coefficients of the MORES logistic regression model. A score of 6 or more points defined the screen group; a score of less than 6 points defined the do-not-screen group.
We calculated the FRAX online by entering the required clinical data. A 10-year risk of a major osteoporotic fracture of 9.3% or greater, as suggested by the USPSTF, defined the screen group; a score of less than 9.3% defined the do-not-screen group.11
We used the range of observed values for both instruments to construct receiver operator characteristic curves and obtained estimates of the AUC for both instruments.18
We calculated 2 statistics from a 2 × 2 contingency table of paired responses for the MORES and FRAX. We used the McNemar χ2 statistic to test for the null hypothesis of marginal homogeneity that the discordant cells contain equal proportions.19 We used the κ statistic to test for the level of agreement, taking onto account the agreement occurring by chance.20
We estimated that 75 (5%) men in the cohort of 1,498 men aged 50 years and older will have osteoporosis. This group determined the width of the 95% confidence intervals around an estimate of the sensitivity of the MORES and FRAX. With a sample size of 75, a 2-sided 95% CI for the sensitivity, using the large sample normal approximation, will extend 0.09 from the observed sensitivity, with an expected sensitivity of 0.80, and 0.11 from the observed sensitivity, with an expected sensitivity of 0.50.
A sample size of 1,462 pairs (1,498 expected) will have 90% power to detect a difference in proportions of discordant pairs of 0.05 when the proportion of discordant pairs is expected to be 0.35 and the method of analysis is a McNemar test of equality of paired proportions with an alpha of 0.05. Calculations were done using nQuery Advisor 6.0 (Statistical Solutions LTD).
The Institutional Review Board for Human Subjects Research granted approval for this study. All analyses used weighted data based on sampling of participants in the NHANES III data set.
RESULTS
The mean age of men in the sample population was 64.2 ± 9.7 years; 88.5% were non-Hispanic white, 8.5% were African-American, and 2.9% were Mexican-American. Sixty-seven men (4.5%) had osteoporosis of the femoral neck and/or total hip. Table 2 summarizes the distribution of the risk factors from the MORES and FRAX.
The MORES and FRAX have different operating characteristics for identifying men with osteoporosis (Table 3). Based on the MORES, 41.8% (n = 626) of the men would be referred for DXA, of whom 10.2% (n = 64) had osteoporosis. In comparison, based on the FRAX, 12.3% (n = 185) of the men would be referred for DXA, of whom 14.1% (n = 26) had osteoporosis. Compared with the FRAX, the MORES had greater sensitivity (0.96 vs 0.39) but had lower specificity (0.61 vs 0.89). Given the low prevalence of osteoporosis, neither instrument served as a rule-in test; however, both tools were reasonably effective as a rule-out test. The MORES had a higher AUC, which approached statistical significance (Figure 1).
The McNemar test was highly significant, P <.001, indicating nonequivalence of the tests. The κ statistic was 0.16, P <.001, which indicates a level of agreement barely more than chance.
DISCUSSION
The 2011 USPSTF osteoporosis screening guidelines made no recommendation for or against screening men, leaving the decision to clinicians and the men for whom they provide care. By comparing the FRAX and the MORES as screening tools to aide clinicians in identifying men at risk for osteoporosis, we found the MORES is more effective based on the significantly higher sensitivity.
The MORES incorporates age and weight, 2 major risk factors for osteoporosis, and a history of COPD, to predict men at increased risk of osteoporosis. The FRAX uses a complex set of risk factors to predict future fractures. In the NHANES III data set, the FRAX did not perform well as a screening test for osteoporosis at the suggested threshold. The FRAX may indirectly predict osteoporosis given the association of osteoporosis and fragility fractures and by sharing age and weight as risk factors with the MORES.
The strengths of this study include use of a sample of men representative of the US population, and that it is the first study we found that evaluated the USPSTF’s reliance on the FRAX to identify men at risk for osteoporosis.11 The NHANES III data set also had some limitations in that it derives from data from 1988 to 1994. The distribution of risk factors associated with osteoporosis and risk of fracture may have changed with time, but the impact of these factors on osteoporosis and fracture risk is likely unaffected. The data set did not allow for perfect modeling of the FRAX variables, ie, the data did not contain information for paternal hip fracture. This study did not address vertebral osteoporosis, because data are lacking in NHANES III, and vertebral osteoporosis is not used to calculate the FRAX. The MORES, however, does identify men at risk of vertebral osteoporosis.21
By 2030, osteoporotic fractures and the resultant functional impairment is expected to increase at a greater rate in men than in women.22–27 For clinicians and patients who are concerned about osteoporosis, the MORES provides a simple clinical approach to identifying men at greatest risk for osteoporosis and for whom a DXA scan is reasonable. The MORES can be readily administered at the time of an office encounter. The FRAX requires a computer program and clinical information less easily obtained, but electronic health records can simplify access through Internet links.
An integrated approach that uses the MORES to identify men who should have a diagnostic DXA scan and then uses the FRAX to guide treatment decisions may be the optimal approach. For example, 178 (11.9%) men in the sample were candidates for treatment based on the National Osteoporosis Foundation guidelines.28 Sixty-seven men had osteoporosis and 111 men had osteopenia of the hip associated with a 10-year risk greater than 3% for a hip fracture or greater than 20% for a major osteoporotic fracture. Integrating the MORES and FRAX identified 82% (n = 145) of the men who were candidates for treatment. In contrast, the FRAX, as the sole approach, identified 50% (n = 89) of candidates for treatment. Clinicians who refer men for DXA testing based on the MORES could use the bone mineral density results to calculate the FRAX and engage in a shared decision about treatment during a follow-up encounter.
We suggest that the USPSTF’s reliance on the FRAX to identify men who could be screened for osteoporosis is an application for which FRAX was not designed. We believe the MORES is a better screening tool for osteoporosis in men than the FRAX, whereas the FRAX is a valuable tool to guide treatment decisions.
Footnotes
Conflicts of interest: authors report none.
Funding support: Funded in part by the Texas Academy of Family Physicians Foundation grant “A Comparative Analysis of the FRAX and the Male Osteoporosis Risk Estimation Score.”
Previous presentations: Our findings were presented at the 41st Annual North American Primary Care Research Group Meeting, November 9–13, 2013; Ottawa, Ontario; and the Annual Scientific Meeting of the American Geriatrics Society; May 15–18, 2014; Orlando, Florida.
- Received for publication September 1, 2015.
- Revision received February 4, 2016.
- Accepted for publication February 22, 2016.
- © 2016 Annals of Family Medicine, Inc.