Abstract
PURPOSE Chest pain frequently poses a diagnostic challenge for general practitioners (GPs). Utilizing risk stratification tools might help GPs to rule out acute coronary syndrome (ACS) and make appropriate referral decisions. We conducted a systematic review of studies evaluating risk stratification tools for chest pain in primary care settings, both with and without troponin assays. Our aims were to assess the performance of tools for ruling out ACS and to provide a comprehensive review of the current evidence.
METHODS We searched PubMed and Embase for articles up to October 9, 2023 concerning adult patients with acute chest pain in primary care settings, for whom risk stratification tools (clinical decision rules [CDRs] and/or single biomarker tests) were used. To identify eligible studies, a combination of active learning and backward snowballing was applied. Screening, data extraction, and quality assessment (following the Quality Assessment of Diagnostic Accuracy Studies-2 tool) were performed independently by 2 researchers.
RESULTS Of the 1,204 studies screened, 14 were included in the final review. Nine studies validated 7 different CDRs without troponin. Sensitivities ranged from 75.0% to 97.0%, and negative predictive values (NPV) ranged from 82.4% to 99.7%. None of the CDRs outperformed the unaided judgment of GP’s. Five studies reported on strategies using troponin measurements. Studies using high-sensitivity troponin showed highest diagnostic accuracy with sensitivity 83.3% to 100% and NPV 98.8% to 100%.
CONCLUSION Clinical decision rules without troponin and the use of conventional troponin showed insufficient sensitivity to rule out ACS in primary care and are not recommended as standalone tools. High-sensitivity troponin strategies are promising, but studies are limited. Further prospective validation in primary care is needed before implementation.
INTRODUCTION
Chest pain often poses a diagnostic challenge for general practitioners (GPs). While chest pain is the hallmark symptom of acute coronary syndrome (ACS), it is often caused by less urgent conditions. It is difficult for GPs to differentiate possible ACS from other causes based on symptoms and physical examination alone. This, unfortunately, results in a low threshold for referral, leading to additional burden and cost on the health care system.1-3
Only 1.5% to 3.6% of patients with chest pain in primary care are diagnosed with ACS.3,4 More often, chest pain is caused by non-cardiac conditions, such as musculoskeletal complaints or gastrointestinal disease.2 It is thought that risk stratification tools, such as clinical decision rules (CDRs) or troponin point-of-care tests (POCT), could help GPs to make more informed referral decisions when evaluating patients with acute chest pain.
Due to different ACS risks in primary and secondary care, the use of risk stratification tools may differ in these settings, and results from studies in secondary care cannot be assumed to apply to primary care.5 A systematic review from 2019 that assessed CDRs for coronary artery disease and ACS in primary care, concluded that no CDR demonstrated sufficient sensitivity to rule out ACS.6 Since then new studies have emerged, and high-sensitivity troponin POCTs became available to pre-hospital settings. In this systematic review, our aim was to assess the performance of risk stratification tools for ruling out ACS in patients with chest pain in primary care and to provide a comprehensive review of the current evidence.
METHODS
This study is reported in accordance with the 2018 Preferred Reporting Items for a Systematic review and Meta-Analysis of diagnostic test accuracy studies statement.7 The protocol is available upon request.
Search Strategy and Eligibility Criteria
We searched the electronic databases PubMed and Embase for original articles published in English, up to October 9, 2023. A complete overview of the search is presented in Supplemental Appendix 1. Two authors (S.B., A.M.) identified articles for potential inclusion. A third author (R.E.H.) was available to resolve any disagreements. Eligibility criteria were: (1) adults (aged 18 years or more); (2) acute chest pain; (3) enrollment in primary care; (4) available outcome data for ACS, acute myocardial infarction (AMI), or the composite endpoint of major adverse cardiac events (MACE); and (5) use of diagnostic tools. Diagnostic tools were defined as risk stratification tools constructed by a set of condition-specific findings that are applicable in a primary care setting. Exclusion criteria were: studies conducted in ambulance care or emergency department settings or requiring advanced diagnostic testing (eg, coronary angiography, serial biomarker testing); or cardiac monitoring.
Outcomes of Interest
Outcomes of interest were the diagnostic test characteristics of the risk stratification tool: sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV), and overall discriminative properties (C-statistic). Reference diagnoses of interest were ACS, AMI, and MACE. Major adverse cardiac events are typically defined as a composite of death from any cause, ACS, and urgent coronary interventions. When applicable, we reported the performance of risk stratification tools compared with unaided clinical judgement.
Selection Process
For the selection process, we used an open-source machine learning-aided software (ASReview, Utrecht University) for the initial screening of the titles and abstracts. We followed the principles of the SYstematic review Methodology Blending Active Learning and Snowballing (SYMBALS).8,9 A more complete description of ASReview’s methods and evidence, including clarification of our stopping criteria can be found in Supplemental Appendix 2. Selected articles were imported into the Covidence systematic review software (Veritas Health Innovation), for full text screening, quality assessment, and data extraction. After reaching the final subset of articles, those reference lists were then screened for possible additional inclusions.
Quality Assessment and Data Extraction
Two researchers (S.B., A.M.) separately extracted data elements from each study. The quality of the included articles was assessed by S.B. and A.M. using the 4 domains of the Quality Assessment of Diagnostic Accuracy Studies-2 tool: patient selection, index test, reference standard, and flow and timing.10 Risk of bias and applicability concerns were assessed separately. A third author (R.E.H.) independently reviewed the extracted data and quality assessment for accuracy.
Data Synthesis and Analysis
Extracted data on study and patient characteristics, outcome measures, and follow-up information are presented in Table 1. Diagnostic test characteristics and discriminative properties are shown in Table 2.
Study and Patient Characteristics of Included Studies (N = 14)
Diagnostic Accuracy of Risk Stratification Tools for ACS/MAC
RESULTS
Search Results
After deduplication, the initial search yielded 1,204 titles and abstracts eligible for inclusion. Following further screening, 53 articles remained, of which 40 were excluded after full text screening. An update of the search resulted in 1 additional article (Figure 1). Backward snowballing yielded no additional inclusions.
Flowchart of systematic review of the literature.
ASR = Active learning for Systematic Reviews.
Study and Patient Characteristics
Table 1 displays the characteristics of the 14 included studies. Nine studies derived or validated a CDR and 5 studies validated troponin assays. Study populations ranged from 115 to 2,192 patients, with follow-up periods from 3 weeks to 6 months. An overview of the components of the included CDRs can be found in Supplemental Table 1. Supplemental Table 2 shows troponin assay characteristics and threshold values.
Quality Assessment
Five of the 14 (35.7%) studies showed a high risk of bias in at least 1 of the quality domains. High risk of bias was most common in the index test and reference standard domains.15,16,18-20 Applicability concerns were most frequent in patient selection and index test.11,12,14 A detailed overview of the risk of bias and applicability assessment can be found in Supplemental Appendix 3.
Risk Stratification Tools
A complete overview of diagnostic test characteristics can be found in Table 2.
Grijseels/Bruins Slot Rule
Grijseels et al developed an algorithm in 1995 which consists of a combination of patient and symptom characteristics and electrocardiogram (ECG) findings.11,12 Discriminative properties showed C-statistics ranging from 0.70 to 0.72. In 2011, Bruins Slot et al updated the algorithm to a point-based rule with 3 risk categories for ACS.1 Validation of the updated rule resulted in C-statistics of 0.66 to 0.72, sensitivity of 84.4% to 97.0%, and NPV of 91.7% to 98.2%. When comparing the CDR to unaided clinical judgment, GPs categorized patients with and without ACS better.1,15
Willemsen CDR
In 2019, Willemsen et al evaluated a heart-type fatty acid–binding protein (H-FABP) POCT.13 The diagnostic accuracy of H-FABP as standalone test to rule out ACS had a sensitivity of 25.8% and NPV of 91.6%. Using H-FABP with a CDR increased the sensitivity to 87.5% and NPV to 97.2%. Compared with unaided clinical judgment, fewer ACS cases were missed (1.3% vs 2.6%), but there were 13.9% more referrals.
Marburg Heart Score
The Marburg Heart Score (MHS) was developed in 2010 to rule out coronary artery disease.24 Later, it was evaluated to rule out ACS or MACE in 1 prospective and 2 retrospective studies, with different rule-out thresholds (Table 1).14,15,16 C-statistics ranged from 0.64 to 0.77, with sensitivity of 75.0% to 81.3%, and NPV of 88.0% to 98.6%. The MHS did not outperform unaided clinical judgement.14
INTERCHEST
Like the MHS, the 2017 International Chest pain prediction (INTERCHEST) score was developed to rule out coronary artery disease, and later tested to rule out ACS or MACE. The rule was derived from pooled individual patient data of 5 studies (3,099 patients total).25 Twofold retrospective validation resulted in C-statistics of 0.77 to 0.85, sensitivity of 87.5% to 88.8%, and NPV of 98.6% to 99.1%.15,16 Discriminative power was similar to unaided clinical judgement.
Gencer Rule
The Gencer rule was developed in 2010 to rule out coronary artery disease and consists of 7 components.26 It was validated to rule out MACE in 1 study with a C-statistic of 0.72, sensitivity of 84.4%, and NPV of 98.0%.15 The performance of the rule was equivalent to unaided clinical judgement.
Safety First
Wouters et al in 2022 developed a computerized risk stratification tool for the triage of acute chest pain patients in out-of-hours primary care settings.17 The tool includes 7 predictors of ACS. After derivation, an internal-external validation technique was used, resulting in a C-statistic of 0.77 to 0.79. The authors did not recommend a cut-off point; however, diagnostic accuracy for various risk thresholds (0.1% to 20%), showed a sensitivity of 46% to 98% and NPV of 93% to 99%.
HEART-Score Variants
The original HEART-score tool (History, ECG, Age, Risk factors, and Troponin), was developed to identify patients at low risk for short-term MACE among patients with acute chest pain in the emergency department.27 For use in primary care, the original HEART-score was modified.18 The first variation (simplified HEART-score) simply omits troponin. The second variation, HEART-GP, replaces troponin with the GP’s sense of alarm. The modified scores had a C-statistic of 0.86 and 0.90, respectively, sensitivity of 96.9%, and NPV of 99.7% for both versions. Compared with unaided clinical judgment, both scores improved safety, at the cost of additional referrals (simplified HEART had 26% more, and HEART-GP had 4% more).
Conventional Troponin
In a study by Planer et al in 2006, a qualitative cardiac troponin T (cTnT) test was combined with the GP’s clinical assessment.19 The combined strategy resulted in a sensitivity of 95.8% and NPV of 99.6%. The cTnT test as a standalone test showed poor sensitivity (20.8%) and did not outperform unaided clinical judgment.
Tomonaga et al in 2011 compared the use of a 3-in-1 POCT for cTnT, N-terminal pro-B-type natriuretic peptide (NT-proBNP), and D-dimer to usual care in patients with chest pain, dyspnea, or symptoms suggestive for thromboembolic events.20 The use of this combined POCT test led to more accurate working diagnoses (76% vs 60%) and fewer referrals (30% vs 55% false positives). For ACS, however, the sensitivity was 89.5% compared with 100% in the usual care group, with a NPV of 98.9%, compared with 100%. Sensitivity for troponin as standalone test to rule out ACS was 58.8%, missing 7 out of 17 (41.2%) patients with ACS.
A Swedish study by Nilsson et al in 2013 compared 3 primary care practices that already incorporated the use of cTnT-POCT to 4 practices that did not use cTnT-POCT.21 Use of cTnT-POCT by GPs reduced hospital referrals (25% vs 43%), but at the expense of sensitivity (71.4% in the practices using cTnT-POCT vs 100% in the control group). Troponin as standalone test showed a sensitivity of 28.6% and NPV of 95.9%, missing 5 out of 7 (71%) patients with ACS.
High-Sensitivity Troponin
In 2015, Andersson et al used the original (stored) samples from the study by Nilsson et al in 2013 to measure high-sensitivity (hs) cTnT and compare the diagnostic outcomes with the outcomes for conventional troponin.22 They found a sensitivity of 83.3% compared with 33.3%, missing 1 out of 6 (17%) patients with ACS. This came at the expense of specificity (76.2% vs 97.5%) and thus led to additional referrals (23% vs 2% false positives).
Lastly, in 2021, Johannessen et al validated 3 strategies to rule out AMI using hs-cTnT in a primary care emergency outpatient clinic in Norway.23 The first was a single hs-cTnT rule-out strategy. High sensitivity cTnT was measured at presentation and AMI was ruled out if hs-cTnT was below the limit of detection (ie, very low). No AMI were missed and sensitivity and NPV were 100% with a C-statistic of 0.85. The second and third strategies were the original HEART-score and a modified HEART-score with lower hs-cTnT thresholds. The modified HEART-score outperformed the original HEART-score (sensitivity 98.4% vs 91.8% and NPV 99.8% vs 99.4%), but not the single hs-cTnT rule out strategy.
DISCUSSION
This systematic review provides an overview of the evidence on risk stratification tools for acute chest pain in primary care. Seven CDRs with variable diagnostic properties have been evaluated. Some may improve safety (ie, modifications of the HEART-score), but at the expense of more referrals. None of the CDRs demonstrated superiority over unaided GP assessment. In studies evaluating strategies using troponin we found promising results among those that used high sensitivity troponin assays, either as a standalone tool, or incorporated into a CDR, however, further study is needed.
Strengths and Limitations
This systematic review is an extensive and up-to-date overview of risk stratification tools for acute chest pain in primary care. It will aid GPs in their decision for referral and additional testing. A strength of this review is the inclusion of relatively new strategies using hs-troponin. A limitation is the considerable heterogeneity of the included studies, particularly in patient selection, (a delayed-type) reference standard, and follow-up intervals. Furthermore, several of the included studies were limited by small sample sizes (7 had fewer than 500 patients) and low ACS prevalence. This automatically generates high NPVs and might cause overestimation of the tool’s performance if the sensitivity is not taken into consideration. Finally, we only included articles in English in the review, which might introduce language bias.
Interpretation of Results
Prior international research indicates that a 1% miss rate is considered acceptable for ACS and MACE.28,29 Additionally, a risk stratification tool also needs to be efficient. Other research found that Dutch GPs would accept 25-50 unnecessary referrals for every patient with ACS.28 Assuming an ACS prevalence of 5%, a specificity of at least 50% is needed. Among the studied CDRs, none met the desired sensitivity and specificity. Only the modified HEART-scores may approach these requirements, but they need further validation. Hence, we cannot recommend currently available CDRs as standalone tools to rule out ACS.
Conventional troponin as a standalone test did not meet the desired diagnostic accuracy, and its use seems to be outdated since the introduction of hs-troponin assays. The studies using hs-troponin measurement show the most promising results with sensitivities up to 100%. Although the study by Andersson reported a lower sensitivity (83%) for their single measurement rule-out strategy, this might be due to using a hs-cTnT threshold below the 99th percentile.22 In contrast, the study by Johannessen used a much lower threshold below the limit of detection with 100% sensitivity.23 The European Society of Cardiology guidelines underline the necessity of a very low threshold to rule out ACS by a single hs-troponin measurement and report assay specific thresholds.30
The One-hoUr Troponin in a low-prevalence population of Acute Coronary Syndrome (OUT-ACS) study (published in 2020), prospectively validated the European Society of Cardiology 0/1-hr algorithm, including hs-troponin, to rule out AMI in primary care.30,31 Although this strategy approaches both desired sensitivity and specificity (98.4% and 79.4%), the algorithm requires serial troponin tests and observation of patients, which is often not possible in primary care settings.
Finding a tool that exceeds the GP’s accuracy might be challenging, since unaided clinical judgment is already reasonably good with a sensitivity 75% to 100% and specificity 18% to 79%.1,13-15,19,20 Best diagnostic accuracy might therefore be achieved by combining the GP’s risk assessment with a stratification tool instead of risk stratification tools as standalone tests.
Future Research
Results for risk stratification strategies using hs-troponin are promising, but require further prospective validation, since studies in primary care are limited.22,23 Although hs-troponin strategies are extensively researched and found to be safe in emergency department settings, results cannot be automatically transferred to primary care due to the lower pre-test probability for ACS.5 While the CDRs did not meet the desired test accuracy to rule out ACS, a combination of a CDR with hs-troponin measurement might offer optimal results. Such strategies are currently under research in primary care practices (Pijn op de borst-HELP32) and out-of-hours primary care (HEART-GP33).
Variations in health care systems across countries may influence the applicability of risk stratification tools. All but 1 of the included studies were conducted in Europe, where GPs often function as gatekeepers to secondary care. These rule-out strategies, however, could also benefit countries with other primary care structures, such as the United States or countries with large rural regions, where specialized care is scarce and diagnostic options are limited, especially since cardiac monitoring is not required. Such changes, however, require cultural and organizational changes.
Finally, pre-hospital care comprises both primary care and ambulance services and we advocate for standardized care across the entire continuum. Future research should focus on risk stratification tools that are applicable in the entire pre-hospital care chain, and preferably in line with hospital care. The latter is especially important for strategies using troponin since different assays and thresholds hamper communication between health care providers.
CONCLUSION
Clinical decision rules without troponin or with conventional troponin do not show sufficient sensitivity to rule out ACS in primary care and are not recommended as standalone tools. The first hs-troponin studies in primary care show promising results, but further prospective validation in primary care is needed before recommending its implementation.
Acknowledgments
F.S. Jamaludin (clinical librarian) for support on the construction and execution of the literature search.
Footnotes
Conflicts of interest: authors report none.
Funding support: This work is supported by 3 grants: ZonMw-HGOG (Healthcare research the Netherlands, project ID 839150004), ZonMw (Healthcare research the Netherlands, project ID 852001907), Dutch Heart Foundation (project ID: 03 to 003 to 2022 to 0029).
- Received for publication January 17, 2024.
- Revision received April 29, 2024.
- Accepted for publication May 7, 2024.
- © 2024 Annals of Family Medicine, Inc.