Abstract
PURPOSE To evaluate the accuracy of signs and symptoms for the diagnosis of acute rhinosinusitis (ARS).
METHODS We searched Medline to identify studies of outpatients with clinically suspected ARS and sufficient data reported to calculate the sensitivity and specificity. Of 1,649 studies initially identified, 17 met our inclusion criteria. Acute rhinosinusitis was diagnosed by any valid reference standard, whereas acute bacterial rhinosinusitis (ABRS) was diagnosed by purulence on antral puncture or positive bacterial culture. We used bivariate meta-analysis to calculate summary estimates of test accuracy.
RESULTS Among patients with clinically suspected ARS, the prevalence of imaging confirmed ARS is 51% and ABRS is 31%. Clinical findings that best rule in ARS are purulent secretions in the middle meatus (positive likelihood ratio [LR+] 3.2) and the overall clinical impression (LR+ 3.0). The findings that best rule out ARS are the overall clinical impression (negative likelihood ratio [LR−] 0.37), normal transillumination (LR− 0.55), the absence of preceding respiratory tract infection (LR− 0.48), any nasal discharge (LR− 0.49), and purulent nasal discharge (LR− 0.54). Based on limited data, the overall clinical impression (LR+ 3.8, LR− 0.34), cacosmia (fetid odor on the breath) (LR+ 4.3, LR− 0.86) and pain in the teeth (LR+ 2.0, LR− 0.77) are the best predictors of ABRS. While several clinical decision rules have been proposed, none have been prospectively validated.
CONCLUSIONS Among patients with clinically suspected ARS, only about one-third have ABRS. The overall clinical impression, cacosmia, and pain in the teeth are the best predictors of ABRS. Clinical decision rules, including those incorporating C-reactive protein, and use of urine dipsticks are promising, but require prospective validation.
INTRODUCTION
Acute rhinosinusitis (ARS) is defined as inflammation of the paranasal sinuses, most often the maxillary sinuses, that is caused by viruses or bacteria and has a duration of less than 6 weeks.1 Acute rhinosinusitis is a common outpatient infection, responsible for over 3 million outpatient visits annually in the United States; the symptoms overlap considerably with that of other upper respiratory tract infections, making accurate diagnosis challenging.2 While 75% of patients with ARS receive an antibiotic, and it is the most common reason for outpatient prescription of antibiotics,2,3 only about one-third with sinus symptoms have a confirmed bacterial pathogen when sinus fluid is cultured.4,5
Helping physicians more accurately identify which patients with clinically suspected sinusitis actually have acute bacterial rhinosinusitis (ABRS) could reduce harm from inappropriate antibiotic use. A systematic review found a 5% absolute increase in the rate of cure with antibiotics for clinically diagnosed ARS, compared with an 11% increase in rate of cure with imaging-diagnosed ARS; more accurate clinical diagnosis could identify the patients most likely to benefit from antibiotics.6 Previous systematic reviews of the clinical diagnosis of ARS are all more than 15 years old and did not use modern analytic techniques such as bivariate meta-analysis.7–10 The goal of the current study is therefore to perform a comprehensive meta-analysis of the clinical diagnosis of ARS and ABRS.
METHODS
Search Strategy
PubMed and Embase were searched using terms for acute rhinosinusitis and diagnostic testing (see Supplemental Table 1, available at http://www.AnnFamMed.org/content/17/2/164/suppl/DC1/, for search strategy).11 A separate search was performed to identify studies that assessed inter-rater agreement of signs and symptoms of sinusitis. PubMed was searched using the search terms (“inter-rater” OR “interrater” OR “kappa”) AND (“sinusitis” OR “sinus”). The primary searches occurred in 2017, and were updated in April, 2018.
Inclusion Criteria And Quality Assessment
We included studies of adults and children with suspected ARS or acute respiratory tract infection that reported data for the accuracy of at least 1 sign or symptom. One study12 included patients with a clinical diagnosis of ARS for whom their physician recommended antibiotics, but the results were similar to those of studies with broader inclusion criteria. All studies took place in the outpatient setting and used radiography, ultrasound, computed tomography (CT), or antral puncture as the reference standard. We included studies in which all patients received the same reference standard. Studies involving highly specialized patient populations (ie, patients with HIV or odontogenic sinusitis, children with brain cancer, or inpatients) were excluded. There were no limits by date of publication or language.
Where studies reported findings separately by maxillary, frontal, or ethmoid sinus, only maxillary sinus findings are shown. Where individual sinuses as well as results by person are reported, results by person are shown. When it was possible to use different cut points (definitions of abnormal) for a test or reference standard, the cut point that yielded the highest diagnostic odds ratio (DOR) was selected.13,14 Where data were reported for 2 time periods (ie, any fever or fever in the last 24 hours) we included the most recent time period in relation to the visit. Where results for more than 1 reference standard were reported for the same set of patients, data for the highest quality reference standard are reported (in descending order: positive bacterial culture of antral puncture fluid, antral puncture revealing purulent fluid, magnetic resonance imaging [MRI], CT, ultrasound, and finally radiography).4,15–18 Acute rhinosinusitis was diagnosed when any reference standard was abnormal, and acute bacterial rhinosinusitis when inspection of antral puncture fluid or culture of puncture fluid were consistent with bacterial infection.4,5
To evaluate study quality, we used the Quality Assessment of Diagnostic Accuracy Studies 2 criteria, adapted for the diagnosis of acute rhinosinusitis (Supplemental Table 2).19 Studies were classified as low risk of bias if all 4 domains were all judged to be low risk of bias. Studies with only a single domain at high risk of bias were classified as moderate risk of bias, and all other studies were classified as high risk of bias.
Data Extraction and Analysis
Each included study was reviewed by 2 investigators, who extracted data regarding study quality and the accuracy of signs and symptoms. Any discrepancies were resolved by discussion to achieve consensus, involving a third investigator if necessary. We used the MADA package in R version 3.2.2 (R Project for Statistical Computing) to perform bivariate meta-analysis for each clinical sign or symptom, and the META package to calculate summary estimates of prevalence. We determined the prevalence of sinusitis for subgroups by age and reference standard using a random effects summary measure. Summary measures of accuracy are reported for each sign or symptom. The positive likelihood hood ratios (LR+) and negative likelihood ratios (LR−) were the primary measures of diagnostic accuracy. A likelihood ratio (LR) near 1.0 means that the test adds little diagnostic information, LRs greater than 1 increase the likelihood of disease, and LRs less than 1 decrease the likelihood of disease.20 The DOR (LR+ divided by LR−) was chosen as an overall measure of discrimination because the small numbers of studies made receiver operating characteristic curves unstable and difficult to interpret in many cases.
RESULTS
We identified 1,638 studies after removing duplicates. We also searched the reference lists of previous meta-analyses, review articles, and practice guidelines for additional articles, finding an additional 11 studies. A total of 1,649 abstracts were screened by 2 reviewers for relevance, 182 full-text articles were accessed, and a final total of 17 studies met our inclusion criteria (Supplemental Figure 1).
Study Characteristics
The characteristics of the 17 included studies are summarized in Table 1. Six were small (fewer than 100 participants), with a range of 30 to 400 participants. The Quality Assessment of Diagnostic Accuracy Studies 2 framework is summarized in Supplemental Table 2. Overall, 4 studies were at low risk of bias, 7 at moderate risk of bias, and 6 at high risk of bias. Common threats to validity included failure to use a high-quality reference standard, nonconsecutive sample of patients, and mask the person performing the reference standard to results of the index test. All of the studies at low risk of bias used purulent fluid or bacterial culture as the reference standard, so those results are reported separately.33
Prevalence of ARS and ABRS
The prevalence of acute sinusitis for patients presenting with sinus symptoms is summarized in Table 2, stratified by age group and reference standard. Prevalence ranged from 19% to 63% for adults, and from 57% to 79% for children.
Imaging studies had the highest prevalence and was similar for studies using plain film radiography or CT as the reference standard (59% vs 56%, P = .70). For the diagnosis of ABRS, studies using the presence of purulence from antral puncture had a higher prevalence than those using positive culture of antral puncture fluid (49% vs 31%, P <.01). Comparing the studies of adults only (Table 2), the prevalence of ARS was similar for studies using any imaging vs purulence on antral puncture as the reference standard (51% vs 49%), but was lower for those using bacterial culture (31%).
In the subset of all primary care, urgent care, or emergency department studies (n = 10; 1,632 patients) the prevalence of ARS was 49% (95% CI, 39-59). In the subset of studies using antral puncture or culture as the reference standard (n = 4; 411 patients), the prevalence of ABRS was 42% (95% CI, 31-55).
All studies enrolled patients with symptoms compatible with sinusitis. In a systematic review of blood tests and imaging for ARS,11 2 studies that recruited patients with broader inclusion criteria of cold18 or runny nose34 found lower ARS prevalence at 16%and 28%, respectively.
The Interrater Reliability of Signs and Symptoms
Three studies reported the interrater agreement of signs and symptoms for ARS among adult patients.26,30,35 A value of κ from 0.0-0.2 represents slight agreement, 0.2-0.4 is fair agreement, 0.4-0.6 is good agreement, 0.6-0.8 substantial agreement, and 0.8-1.0 shows near perfect agreement. There was substantial agreement for history of cough (κ = 0.70),35 red streak in lateral recess of oropharynx (κ = 0.70),30 colored nasal discharge (κ = 0.68),26 and maxillary toothache (κ = 0.60).26 There was good agreement for sinus tenderness (κ = 0.59)26 and history of fever (κ = 0.53).35 There was only slight agreement for purulence on nasal inspection (κ = 0.14),26 and considerable heterogeneity from 2 studies regarding sinus transillumination (κ values of 0.22 and 0.80).26,30
Accuracy of Individual Signs and Symptoms Acute Rhinosinuitis
The individual signs and symptoms that best ruled in ARS when present were purulent secretions observed in the middle meatus, nasal speech, patient report of pain in the teeth, and purulent secretions observed in the posterior pharynx or postnasal region. The individual findings that best ruled out ARS were absence of a preceding respiratory tract infection, absence of any nasal discharge, absence of purulent nasal discharge as a symptom, and normal transillumination. The accuracy of individual signs and symptoms for the diagnosis of ARS is summarized in Table 3 (see Supplemental Table 3, for complete individual study data). Few positive findings had a LR ≥2.0 and few findings when absent had a LR ≤0.5.
Four studies reported data for the overall clinical impression as a diagnostic test for ARS, including 3 that used antral puncture as the reference standard.15,22,25,26 The accuracy of the overall clinical impression in this study was good (LR+ 3.0, LR− 0.37), with the highest diagnostic odds ratio of any finding (DOR 8.3).
Three studies reported data for prolonged duration of symptoms and the likelihood of ARS, using cutoffs of 5, 10, and 21 days.25,29,30 In our study, there was no clear pattern, with sensitivity ranging from 25% to 70% and specificity from 15% to 75%.
Acute Bacterial Rhinosinusitis
Six studies used purulent antral puncture fluid or positive bacterial culture as the reference standard and are summarized in Table 4.4,5,15,25,28,32 The 3 findings that significantly increased or decreased the likelihood of ABRS were the overall clinical impression, cacosmia (fetid odor of the breath), and pain in the teeth. Individual study data using these reference standards is shown in Supplemental Table 4. No conclusions can be drawn due to the limited number of studies reporting data for each sign or symptoms.
Only 2 studies reported the accuracy of signs and symptoms for ABRS using bacterial culture of antral fluid as the reference standard.4,32 Pain in teeth as a symptom (LR+ 2.8, LR− 0.76) and maxillary tenderness on exam (LR+ 1.8, LR− 0.71) significantly changed the likelihood of ABRS.
Accuracy of Combinations of Signs and Symptoms
Acute Rhinosinusitis
Four studies reported combinations of findings for the diagnosis of ARS (Table 5).12,24,26,27 Lindbaek and colleagues proposed a 4-item score; patients with all 4 findings present had a LR of 25 for ARS, those with 2 or 3 findings a LR of 1.2, and those with 0 or 1 finding had a LR of 0.16.12 Williams and colleagues developed a 5-point clinical score using maxillary toothache, abnormal transillumination, poor response to nasal decongestants or antihistamines, colored nasal discharge, or mucopurulence on examination for the diagnosis of ARS using radiography as the reference standard; likelihood ratios ranged from 6.4 for 4 or 5 points to 0.16 for 0 points.26
Huang and colleagues used a 4-item score for results of a urine test strip measuring the protein, pH, leukocyte esterase, and nitrite as measures of an inflammatory response in nasal discharge.24 The samples were collected by having patients blow their nose into a piece of plastic wrap, and the scorer assigned either 0 to 2 or 0 to 3 points per item for a total range of 0 to 11 points. Using a reference standard of radiography, the highest strata of scores (≥4) had a LR of 127.24 Van Diujn and colleagues applied a 5-item score based on a logistic regression equation to estimate the probability of sinusitis compared to ultrasonography of the sinuses as the reference standard.27 This study is limited, like all of the others, by lack of prospective validation and the use of an imaging reference standard that is not specific for ABS.
Acute Bacterial Rhinosinusitis
Two studies reported the accuracy of combinations of signs and symptoms for the diagnosis of ABRS (Table 5).25,36 Berg proposed a 4-item score using a reference standard of purulent antral puncture fluid. The score discriminated well: those with a score or 3 or 4 findings present had a LR of 7 for ABRS, while those with ≤1 findings present had a LR of 0.06.25
The most recently reported clinical decision rule used 5 signs or symptoms plus C-reactive protein (see Table 5 for scoring details) to identify patients at low risk (n = 77, 16%), moderate risk (n = 75, 49%), or high risk (n = 23, 73%) for ABRS using positive bacterial culture of antral puncture fluid as the reference standard.36 Unfortunately, none of the above scores have been prospectively validated.
DISCUSSION
Most individual signs and symptoms have limited accuracy for the diagnosis of ARS and ABRS. Purulent secretions seen in the middle meatus and the overall clinical impression are most useful for ruling in ARS when present, while overall clinical impression, absence of a recent or preceding RTI, absence of any nasal discharge, and absence of purulent nasal discharge reduced the likelihood of ARS (Table 3). Since ARS may have a viral etiology, the more important clinical question is how to best diagnose ABRS. We found that the overall clinical impression, cacosmia (an uncommon but highly specific finding), and pain in the teeth were the best predictors of ABRS (Table 4). Other individual signs and symptoms had positive and negative likelihood ratios between 0.5 and 2.0, indicating little diagnostic value.
Clinical decision rules have been developed for a range of other respiratory conditions, including pneumonia,37 streptococcal pharyngitis,38 and influenza.39 We identified 6 clinical rules for the diagnosis of ARS or ABRS, and they have promising accuracy, but none has been prospectively validated. In some cases, including 1 of the clinical decision rules for ABRS, they incorporate point-of-care tests such as C-reactive protein as well as signs or symptoms to increase the accuracy of prediction.37 Another potentially useful point-of-care test is use of a urine dipstick to detect leukocyte esterase or nitrites in nasal discharge (Table 5).24 While it could be incorporated into a clinical decision rule or even used alone, its accuracy requires further validation. While C-reactive protein is accurate and widely used in some countries at the point of care to identify patients more or less likely to have a bacterial respiratory infection,40,41 it is not currently available in most US outpatient settings.11 Finally, a recent systematic review by the author found that older studies of ultrasound found that it to be approximately 80% sensitive for ARS.11 Patients who do not have any sinus fluid detected are therefore at low risk for ARS. Thus, studies that evaluate the ability of modern handheld ultrasound devices to detect sinus fluid are needed.
An important limitation of our findings is the variety of reference standards used to define both ARS and ABRS. Because imaging may detect fluid associated with a viral upper respiratory infection, and cannot distinguish purulent from non-purulent fluid, it is likely to lead to over diagnosis of ABRS. Similarly, some patients with fluid that appears purulent may have a viral infection. Relying on these tests could lead to overtreatment with antibiotics. On the other hand, bacterial culture of antral puncture fluid is likely to be more specific but may lack sensitivity if organisms do not grow in culture. In addition, puncturing the antrum is painful and invasive, making it impractical for use in clinical practice. Ultimately, the true prevalence of ABRS among patients with clinically suspected ARS is likely to be between 31% (prevalence using bacterial culture) and 50% (prevalence using imaging) in adults.
An important question is whether detection of a bacterial pathogen in sinus fluid means that the patient will benefit from antibiotics. Most clinical trials have enrolled patients with clinically suspected sinusitis and found a small benefit (5 additional cures per 100 persons receiving an antibiotic). Trials using imaging (3 radiography, 1 CT) found a larger benefit, suggesting some validity for the concept of imaging as a reference standard.6 To date no randomized trials of antibiotics or other interventions have enrolled patients with ABRS diagnosed by inspection of fluid.
While many studies have attempted to identify combinations of signs or symptoms that diagnose ARS or ABRS, it is equally important to determine which patients are at low likelihood of ABRS (would be unlikely to benefit from an antibiotic), as it is to identify which patients are likely to have an uncomplicated course. Thus, research to determine low risk criteria that help rule out ABRS is needed.
CONCLUSION
Only about one-third of patients with clinically suspected ARS have a positive bacterial culture of antral puncture fluid. Acute rhinosinusitis as diagnosed by any reference standard is significantly less likely in patients without any nasal discharge, without a complaint of purulent nasal discharge, and with normal transillumination. The overall clinical impression is also useful for both ruling in and ruling out ARS. Evidence regarding diagnosis of ABRS is limited, but we conclude that the overall clinical impression, pain in the teeth, and cacosmia are the most useful findings for clinicians trying to identify patients most likely to benefit from antibiotics. Clinical decision rules, including those incorporating C-reactive protein, and the use of urine dipsticks to test the nasal discharge are promising, but all require prospective validation.
Footnotes
Conflicts of interest: authors report none.
Supplementary materials: Available at http://www.AnnFamMed.org/content/17/2/164/suppl/DC1/.
- Received for publication August 4, 2018.
- Revision received November 22, 2018.
- Accepted for publication December 13, 2018.
- © 2019 Annals of Family Medicine, Inc.