Pragmatic strategies that enhance the reliability of data abstracted from medical records

doi:10.1016/j.apnr.2004.04.005

Applied Nursing Research

Volume 18, Issue 1, February 2005, Pages 50-54

https://doi.org/10.1016/j.apnr.2004.04.005 Get rights and content

Abstract

The processes and procedures used to promote interrater reliability in the abstraction of data from medical records are described. Several proactive strategies that serve the purpose of leading to standard interpretations of clinical data are discussed. These include (a) establishment of priorities for the sources of information; (b) creation of orders of value for the likeliness of validity of recorded data; (c) standardization of terminology; and (d) reaffirmation of decisions, based on an evolving body of evidence. Lessons learned from this project can assist nurse researchers to develop high-quality information retrieval methods, when multiple observers (or abstractors) are used during a medical record abstraction data collection process.

Introduction

Retrospective research designs often rely on the abstraction of patients' medical records to obtain research data. This valid approach to data collection is, however, based on several assumptions about the validity of the data. These include (a) the data needed for the research will be present in the record; (b) the data in the record will be in a form that can be abstracted, or manipulated, for research purposes (e.g., grams converted to ounces); (c) the data in the record will accurately represent what was, in fact, the case; (d) data addressing any single item that is recorded in more than one place in the medical record will be consistently recorded by one or more individuals who enter that data; and (e) medical record entries will be interpretable in a manner common to all those who access the record (Allison et al., 2000). A source of data variance (validity error) is introduced into the data abstraction process in every instance when one of these assumptions proves to be invalid.

The reliability of these data is an equally compelling concern. The potential for interrater variance is an additional concern when data are to be abstracted by more than one individual. This challenge increases when information is to be obtained from a large number of providers, in multiple locations, each of whom may use a different medical record form and format, and also when data are collected over a long period.

This article describes the process and procedure used to promote interrater reliability (IRR) in the abstraction of data for a California statewide study of the quality of breast and cervical cancer screening services. San Diego State University's Cancer Clinical Quality Assurance Project funded by and in collaboration with the Cancer Detection Section of the California Department of Health Services conducted the abstraction and developed the processes and procedures described here (http://www.qap.sdsu.edu). Several proactive strategies that serve the purpose of leading to standard interpretations of clinical data are discussed. Lessons learned from this project can assist nurse researchers to develop high-quality information retrieval methods, when multiple observers (or abstractors) are used during the data collection process.

Section snippets

Review of the literature

The validity of information contained in the medical record has been tested in a number of recent investigations. Clegg et al. (2001) noted that the decentralization of the health-care system, with the result that therapies are being implemented in a wider variety of settings, has made the process of collection of outcomes data using chart review and abstraction both more difficult and more expensive. They compared the results of using two methods of self-report (a mailed questionnaire vs. an

Instruments

This project developed two computerized database applications for the abstraction of breast and cervical cancer screening and diagnostic information from patient medical records. The researchers recognized and acknowledged concerns about the validity of certain types of data contained in the medical record that are documented in the literature cited above. Certain design elements were strategically incorporated into the computerized medical record abstraction tool in the interest of promoting

Results

The overall concordance (P_o) computed during the combined data abstraction phases ranged from 96% to 100% for six of nine major demographic outcome variables and from 90% to 94% for two additional items. The overall degree of agreement computed during this same timeframe for the documentation of procedures and their results ranged from 89% to 99% for seven variables. Assuming an expected proportion of agreement between two individual reviewers to be 50%, the upper and lower bounds of kappa over

Discussion

Two of the abstracted data elements were associated with lower computed interrater agreements. This finding was likely associated with the more subjective nature of these data sources. The patient self-report of breast symptoms was the first of these subjective sources. The second source of subjective data was the lexicon of terminology used by clinical providers when describing the findings of a clinical examination. Whether the provider translated that clinical finding into a predictive

Summary and conclusion

The IRR assessment process designed for this project provided the following benefits: (a) facilitated consistency in the data collection process; (b) allowed for continuous quality improvement by providing feedback and suggestions for improvement throughout the abstraction process; (c) provided a forum for resolving questions encountered in field abstraction; and (d) established benchmarks to evaluate data consistency and the IRR process itself. The processes and procedures established for the

References (14)

J.J. Allison et al.
The art and science of chart review
Joint Commission Journal on Quality Improvement
(2000)
J. Luck et al.
How well does chart abstraction measure quality? A prospective comparison of standardized patients with the medical record
American Journal of Medicine
(2000)
L.X. Clegg et al.
Comparison of self-reported initial treatment with medical records: Results from the prostate cancer outcomes study
American Journal of Epidemiology
(2001)
M. Cragie et al.
Reliability of health information on the net: An examination of experts' ratings
Journal of Medical Internet Research
(2002)
P. Eccleston et al.
Accounting for overlap? An application of Mezzich's kappa statistic to test interrater reliability of interview data on parental accident and emergency attendance
Journal of Advanced Nursing
(2001)
H.E. Harris et al.
Methodological considerations in the design of an obstetric database abstracted from medical records
Methods of Information in Medicine
(1997)
R.A. Hayward et al.
Estimating hospital deaths due to errors: Preventability is in the eye of the reviewer
Journal of the Medical Association
(2001)

There are more references available in the full text version of this article.

Cited by (26)

Researching the Appropriateness of Care in the Complementary and Integrative Health Professions Part 5: Using Patient Records: Selection, Protection, and Abstraction
2019, Journal of Manipulative and Physiological Therapeutics
Citation Excerpt :
When the data source is the patient record, the complexity of data can be challenging for abstractors to interpret in a standardized manner. Therefore, a comprehensive abstraction guide is essential, including priorities of source for data elements, standardization of terminology, definitions for unstructured data elements, and a process for guideline revision during data collection.9,10 The utility of the abstraction tool used is maximized by logically organizing the content to be user friendly and approximate the organization of the medical record.
The purpose of this paper is to describe the 4-step process (consent, selection, protection, and abstraction) of acquiring a large sample of chiropractic patient records from multiple practices and subsequent data abstraction.
From April 2017 to December 2017, RAND acquired patient records from 99 chiropractic practices across the United States. The records included patients enrolled in a survey e-study (prospective sample) and a random sample of all clinic patients (retrospective sample) with chronic back or neck pain. Clinic staff were trained to collect the sample, scan, and transfer the records. We designed an online data collection tool for abstraction. Protocols were instituted to protect patient confidentiality. Doctors of chiropractic were selected and trained as abstractors, and a system was established to monitor data collection.
In compliance with data protection protocols, 3603 patient records were scanned, including 1475 in the prospective sample and 2128 in the random sample. A total of 1716 patients (prospective sample) consented to having their records scanned, but only 1475 could be retrieved. Of records scanned, 19% were unusable owing to illegibility, no care during the period of interest, or poor scanning. The abstractor interrater reliability for appropriateness of care decisions was fair to moderate (κ .38-.48).
The acquisition, handling, and abstraction of a large sample of chiropractic records was a complex task with challenges that necessitated adapting planned approaches. Of the records abstracted, many revealed incomplete provider documentation regarding the details of and rationale for care. Better documentation and more standardized record keeping would facilitate future research using patient records.
Reliability of a Canadian database for primary care nursing services' clinical and administrative data
2018, International Journal of Medical Informatics
The use of electronic clinical and administrative data can be an advantageous source of information for assessing nursing performance in primary care. In Québec (Canada), the I-CLSC electronic database could be used to measure performance indicators. However, little is known about the reliability of the data contained in this database. The objective of this study was to assess the reliability of the clinical and administrative data contained in the I-CLSC electronic database based on the data entered in medical records.
We used a longitudinal design for this study. A sample of 100 patients who had experienced 107 episodes of wound care were randomly selected from all patients who had two or more consultations during the year 2015. The paper records were used as reference. We collected data regarding eight nursing sensitive indicators from both sources. We assessed the concordance between the electronic data and the paper records by measuring inter-rater agreement.
Six of the eight indicators showed a percentage agreement ≥ 85%, and kappa scores between 0.7 and 1.00 (p < 0.001), indicating high to perfect levels of agreement between the two data sources. Two indicators presented fair kappa scores.
This database provides reliable data relating to the organization of care but shows lower reliability for specific acts performed by nurses in primary care. This existing database can be used to assess, manage and improve certain dimensions of nursing performance in primary care.
Looking through the retrospectoscope: Reducing bias in emergency medicine chart review studies
2014, Annals of Emergency Medicine
Citation Excerpt :
Thus, the sensitivity and specificity of the medical record is low because there may be errors and idiosyncrasies in the reading, interpreting, coding, and transcribing of the data. Solution: The variables to be collected from the chart, as well as how these variables are defined, should be determined a priori and documented in a coding guide for abstractors.6,7,11,12 When the methods are reported, the coding rules for each abstracted element should be provided.
Research strategies that result in optimal data collection from the patient medical record
2012, Applied Nursing Research
Data obtained from the patient medical record are often a component of clinical research led by nurse investigators. The rigor of the data collection methods correlates to the reliability of the data and, ultimately, the analytical outcome of the study. Research strategies for reliable data collection from the patient medical record include the development of a precise data collection tool, the use of a coding manual, and ongoing communication with research staff.
Medical record review to recover missing data in a Portuguese birth cohort: Agreement with self-reported data collected by questionnaire and inter-rater variability
2011, Gaceta Sanitaria
Citation Excerpt :
The interobserver variability was low and did not threaten data precision. Standardized training of abstractors and rigorous quality assurance were proposed as critical criteria to improve the quality and accuracy of clinical record review16–18,34,35. When research involves data collection by distinct observers, the extent to which different observers perceive and record the same information should be evaluated.
To assess the yield of medical record review to recover missing data originally collected by questionnaire, to analyze the agreement between these two data sources and to determine interobserver variability in clinical record review.
We analyzed data from a birth cohort of 8,127 women who were consecutively recruited after giving birth from 2005-2006. Recruitment was conducted at all public maternity units of Porto, Portugal. We reviewed the medical records of 3,657 women with missing data in the baseline questionnaire and assessed agreement between these two sources by using information from participants with data from both sources. Interobserver variability was assessed by using 400 randomly selected clinical records.
Data on pregnancy complications and maternal anthropometric parameters were successfully recovered. Agreement between the questionnaire and records in family history data was fair, particularly for cardiovascular disease [k = 0.27; 95% confidence interval (95%CI): 0.23-0.32]. The highest agreement was observed for personal history of diabetes (k = 0.82; 95%CI 0.70-0.93), while agreement for hypertension was moderate (k = 0.60; 95%CI 0.50-0.69). Discrepancies in prepregnancy body mass index classes were observed in 10.3% women. Data were highly consistent between the two reviewers, with the highest agreement found for gestational diabetes (k = 1.00) and birth weight (99.5% concordance).
Data from the medical records and questionnaire were concordant with regard to pregnancy and well-known risk factors. The low interobserver variability did not threaten the precision of our data.
Evaluar el rendimiento de la revisión de registros médicos para completar datos originalmente recogidos por cuestionario, y analizar la concordancia entre ambas fuentes de datos y la variabilidad interobservador en la revisión de registros médicos.
Cohorte de nacimiento con 8.127 mujeres reclutadas de forma consecutiva después del parto en todas las maternidades públicas de Porto, Portugal (2005-2006). Se revisaron los registros médicos de 3.657 mujeres con datos incompletos en el cuestionario inicial, y se evaluó la concordancia entre ambas fuentes. La variabilidad interobservador se evaluó en 400 historias clínicas seleccionadas aleatoriamente.
La información sobre complicaciones patológicas del embarazo y la antropometría de las madres se recuperó con éxito. La concordancia entre el cuestionario y los registros con respecto a los antecedentes familiares era débil, especialmente para las enfermedades cardiovasculares (k = 0,27, intervalo de confianza del 95% [IC95%]: 0,23-0,32). La concordancia máxima se observó en los antecedentes personales de diabetes (k = 0,82, IC95%: 0,70-0,93), mientras que para la hipertensión fue moderada (k = 0,60, IC95%: 0,50-0,69). Se observaron discrepancias en las categorías de índice de masa corporal antes del embarazo en el 10,3% de las mujeres. Los datos fueron muy concordantes entre los revisores, con el máximo nivel de concordancia para la diabetes gestacional (k = 1,00), seguida del peso al nacer (99,5% concordantes).
Los registros médicos y la información del cuestionario fueron concordantes para los datos relacionados con el embarazo y los factores de riesgo conocidos. La baja variabilidad interobservador no pone en peligro la precisión de los datos.
Factors Associated With Non-Normal Birth Outcomes for Low-Risk Women in an Inner-City Hospital
2010, Journal of Midwifery and Women's Health
The purpose of this study was to examine factors associated with normal versus non-normal birth outcomes for low-risk women who were admitted for care in spontaneous labor.
The birth records of 93 women were reviewed.
At the completion of the fourth stage of labor, 61% of births (n = 57) met the criteria for normal, while 39% of births (n = 36) had non-normal outcomes. On bivariate analysis, variables associated with non-normal outcomes included nulliparity (odds ratio [OR], 9.10; 95% confidence interval [CI], 3–28; P < .0001), lower average centimeters of dilation at admission (t-score 4.422; P < .001), use of pharmacologic pain relief, including narcotics and epidural anesthesia (OR, 5.03; 95% CI, 2–16; P = .005), and birth attended by a physician versus a certified nurse-midwife (OR, 3.60; 95% CI, 2–9; P = .004). In a multivariate analysis, nulliparity (OR, 6.07; 95% CI, 2–19; P = .002) and lower average centimeters of dilation at admission (OR, 0.63; 95% CI, 0.5–0.9; P = .005) were independently associated with non-normal outcome.
The development of clinical guidelines aimed at reducing admissions of women in early labor may reduce non-normal outcomes, particularly for nulliparous women.

View all citing articles on Scopus

^☆: Supported in part by funds received from the State of California, Department of Health Services, Cancer Detection Section. All analysis, interpretations, and conclusions presented in this article are those of the authors and not the State of California. There are no known biases in the data presented that would affect the results.

View full text

Clinical methodPragmatic strategies that enhance the reliability of data abstracted from medical records☆

Abstract

Introduction

Section snippets

Review of the literature

Instruments

Results

Discussion

Summary and conclusion

Joint Commission Journal on Quality Improvement

American Journal of Medicine

Comparison of self-reported initial treatment with medical records: Results from the prostate cancer outcomes study

American Journal of Epidemiology

Reliability of health information on the net: An examination of experts' ratings

Journal of Medical Internet Research

Accounting for overlap? An application of Mezzich's kappa statistic to test interrater reliability of interview data on parental accident and emergency attendance

Journal of Advanced Nursing

Methodological considerations in the design of an obstetric database abstracted from medical records

Methods of Information in Medicine

Estimating hospital deaths due to errors: Preventability is in the eye of the reviewer

Journal of the Medical Association

Clinical method
Pragmatic strategies that enhance the reliability of data abstracted from medical records☆