Skip to main content

Main menu

  • Home
  • Current Issue
  • Content
    • Current Issue
    • Early Access
    • Multimedia
    • Podcast
    • Collections
    • Past Issues
    • Articles by Subject
    • Articles by Type
    • Supplements
    • Plain Language Summaries
    • Calls for Papers
  • Info for
    • Authors
    • Reviewers
    • Job Seekers
    • Media
  • About
    • Annals of Family Medicine
    • Editorial Staff & Boards
    • Sponsoring Organizations
    • Copyrights & Permissions
    • Announcements
  • Engage
    • Engage
    • e-Letters (Comments)
    • Subscribe
    • Podcast
    • E-mail Alerts
    • Journal Club
    • RSS
    • Annals Forum (Archive)
  • Contact
    • Contact Us
  • Careers

User menu

  • My alerts

Search

  • Advanced search
Annals of Family Medicine
  • My alerts
Annals of Family Medicine

Advanced Search

  • Home
  • Current Issue
  • Content
    • Current Issue
    • Early Access
    • Multimedia
    • Podcast
    • Collections
    • Past Issues
    • Articles by Subject
    • Articles by Type
    • Supplements
    • Plain Language Summaries
    • Calls for Papers
  • Info for
    • Authors
    • Reviewers
    • Job Seekers
    • Media
  • About
    • Annals of Family Medicine
    • Editorial Staff & Boards
    • Sponsoring Organizations
    • Copyrights & Permissions
    • Announcements
  • Engage
    • Engage
    • e-Letters (Comments)
    • Subscribe
    • Podcast
    • E-mail Alerts
    • Journal Club
    • RSS
    • Annals Forum (Archive)
  • Contact
    • Contact Us
  • Careers
  • Follow annalsfm on Twitter
  • Visit annalsfm on Facebook
NewsFamily Medicine UpdatesF

UNDERSTANDING THE “SUM OF SUBTEST TO OVERALL SCORE DISCREPANCY” ON THE MC-FP EXAMINATION

Kenneth D. Royal and James C. Puffer
The Annals of Family Medicine January 2012, 10 (1) 81-82; DOI: https://doi.org/10.1370/afm.1358
Kenneth D. Royal
PhD
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
James C. Puffer
MD
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • Article
  • eLetters
  • Info & Metrics
  • PDF
Loading

When high-stakes examinations, such as the American Board of Family Medicine’s (ABFM) Maintenance of Certification-in Family Practice (MC-FP) examination are administered, candidates and diplomates are keenly interested in the accuracy of their test scores, especially when their scores are close to, but below, the pass/fail cut point. In some instances, candidates will attempt to reverse engineer their scores using the information provided on the score report in an effort to verify the “weighted sum of the subtest scores” is congruent with the overall test score. Any discrepancy might become alarming to the candidate, providing a seemingly legitimate reason to believe the overall score was inaccurate, thus prompting a phone call to the ABFM for further investigation and clarification. Historically, such a mistake in scoring has never been found; however, a statistical phenomenon that we will describe below could make it appear so. We would like to explain this phenomenon so that examinees who attempt to reverse engineer their score reports will better understand the “sum of subtest to overall score discrepancy” phenomenon.

Sum of Subtest to Overall Score Discrepancy

Sometimes when examinees attempt to reverse engineer their score reports, the weighted sum of the sub-test scores will be higher than the total score. When the reverse happens we generally do not receive a phone call. For example, some candidates may find the weighted subtests add up to a scaled score of 400 when the overall scaled score was 380. Because the current minimum passing standard (MPS) is 390, candidates who experience this phenomenon may question the validity of the overall score, and ultimately the pass/fail decision. Below, we will attempt to explain (albeit briefly) this rather technical statistical phenomenon.

Diplomates typically view scores as quantities that have additive properties. For instance, in the past the ABFM presented raw scores on the score report. If one were to add the weighted raw scores of the subtests, the scores would certainly equal the raw score of the total test. Unfortunately, raw scores are not measures. Although raw scores are useful for descriptive purposes, they lack generality because they are specific to the particular test that was taken. Raw scores are counts and are deterministic and exact, but the measures they imply are probabilistic and have some degree of imprecision. The ABFM employs the Rasch model, 1 a 1-parameter Item Response Theory (IRT) measurement model, to score examinations. The Rasch model converts raw scores to linear measures and controls for the difficulty of the test version one received.

In some instances, the weighted sum of the subtests scores (as determined by the IRT scoring method) will be greater than the overall score. The primary reasons for this are twofold: first, score exchanges have asymmetric non-linearity. That is, within-person variation increases on subtest areas, making the distribution of subtest measures wider than the distribution for the overall test. This can often make mean measures appear larger. Second, there is an increase in measurement error due to the small number of items available in each subtest area. Consequently, the increase in measurement error also inflates measure variance, thus causing even more inferential instability. It is for these reasons that we report a standard error with each measure on the MC-FP score report, as it assists the examinee in understanding the stability of each particular measure. For a more detailed discussion on the topic of summing sub-test measures, readers are encouraged to see Wright.2

Additional Insights and Recommendations to Test Takers

What does the statistical phenomenon presented above mean to persons who take the MC-FP examination? First, examinees should know that only the overall scaled score is used to determine the pass/fail decision. This score is based on one’s cumulative performance on 350 items, thus the results will be both highly precise and statistically stable. Therefore, subtest scores should be viewed simply as good approximations for one’s performance in a particular clinical content area, as these scores are often highly unstable due to the limited number of items and larger standard errors.

Next, test takers should be aware that extreme subtest scores are not uncommon because there are a limited number of items in each subtest area. This may cause additional problems with regard to interpretation. For example, the ABFM reported range for scores is 200 to 800. It is possible that scores actually may be well below 200 or far greater than 800, but in such instances scores are rounded back to fit the range of the scale. Therefore, in most instances where candidates find their weighted sum of subtest scores do not equal that of the total test, extreme scores likely are the primary culprit. Examinees who attempt to reverse engineer the score report should be particularly mindful of extreme scores and how scores of 200 or 800 are not necessarily indicative of a true 200 or 800 score.

Examinees also need to be aware that some granularity exists with the reporting of scores. The MC-FP examination provides truncated scores that are reported in increments of 10. For example, an examinee that truly scored a 507 would see a reported score of 500. Although detailed scaled scores are used in the calculation of scores, only truncated scores are reported. This largely is for purposes of clarity and simplicity. However, in no instance is an examinee’s score rounded up, as all test takers are expected to meet or exceed a particular passing threshold. Subtle nuances such as these also can have some bearing on the impact of subtest score summations.

Conclusion

It is important to emphasize that only the overall scaled score is used to determine the pass/fail decision on the ABFM MC-FP examination. Subtest scores are less stable due to the fewer number of items and the larger standard errors. Despite the instability of subtest scores, a good bit of inferential value can be gleaned from this information, as subtest scores serve as useful approximations for one’s performance in various clinical categories on the MC-FP examination. In instances where examinees attempt to reverse engineer their scores based on the information presented in the blueprint, it likely will prove to be unproductive because of the statistical phenomenon discussed above. Rather than attempting to make the case that one’s score result should be corrected because the sum of one’s weighted subtest scores is not congruent with the overall score, candidates instead are encouraged to use their subtest scores to improve their medical content knowledge by developing an improved self-directed learning plan, thereby increasing their likelihood of future success on the examination.

  • © 2012 Annals of Family Medicine, Inc.

References

  1. ↵
    1. Rasch G
    . Probabilistic Models for Some Intelligence and Attainment Tests. Chicago, IL: The University of Chicago Press; 1960.
  2. ↵
    1. Wright B
    . Combining part-test vs whole-test measures. Rasch Measurement Transactions. 1994;8(3):376.
    OpenUrl
PreviousNext
Back to top

In this issue

The Annals of Family Medicine: 10 (1)
The Annals of Family Medicine: 10 (1)
Vol. 10, Issue 1
January/February 2012
  • Table of Contents
  • Index by author
  • In Brief
Print
Download PDF
Article Alerts
Sign In to Email Alerts with your Email Address
Email Article

Thank you for your interest in spreading the word on Annals of Family Medicine.

NOTE: We only request your email address so that the person you are recommending the page to knows that you wanted them to see it, and that it is not junk mail. We do not capture any email address.

Enter multiple addresses on separate lines or separate them with commas.
UNDERSTANDING THE “SUM OF SUBTEST TO OVERALL SCORE DISCREPANCY” ON THE MC-FP EXAMINATION
(Your Name) has sent you a message from Annals of Family Medicine
(Your Name) thought you would like to see the Annals of Family Medicine web site.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
10 + 7 =
Solve this simple math problem and enter the result. E.g. for 1+3, enter 4.
Citation Tools
UNDERSTANDING THE “SUM OF SUBTEST TO OVERALL SCORE DISCREPANCY” ON THE MC-FP EXAMINATION
Kenneth D. Royal, James C. Puffer
The Annals of Family Medicine Jan 2012, 10 (1) 81-82; DOI: 10.1370/afm.1358

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
Get Permissions
Share
UNDERSTANDING THE “SUM OF SUBTEST TO OVERALL SCORE DISCREPANCY” ON THE MC-FP EXAMINATION
Kenneth D. Royal, James C. Puffer
The Annals of Family Medicine Jan 2012, 10 (1) 81-82; DOI: 10.1370/afm.1358
Twitter logo Facebook logo Mendeley logo
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Jump to section

  • Article
    • Sum of Subtest to Overall Score Discrepancy
    • Additional Insights and Recommendations to Test Takers
    • Conclusion
    • References
  • eLetters
  • Info & Metrics
  • PDF

Related Articles

  • No related articles found.
  • Google Scholar

Cited By...

  • No citing articles found.
  • Google Scholar

More in this TOC Section

  • Support for the WHO Resolution on Social Participation
  • Resident Leadership Roles and Selection
  • New Advocacy Ambassadors Program Helps AAFP Members Engage With Their Legislators
Show more Family Medicine Updates

Similar Articles

Content

  • Current Issue
  • Past Issues
  • Early Access
  • Plain-Language Summaries
  • Multimedia
  • Podcast
  • Articles by Type
  • Articles by Subject
  • Supplements
  • Calls for Papers

Info for

  • Authors
  • Reviewers
  • Job Seekers
  • Media

Engage

  • E-mail Alerts
  • e-Letters (Comments)
  • RSS
  • Journal Club
  • Submit a Manuscript
  • Subscribe
  • Family Medicine Careers

About

  • About Us
  • Editorial Board & Staff
  • Sponsoring Organizations
  • Copyrights & Permissions
  • Contact Us
  • eLetter/Comments Policy

© 2025 Annals of Family Medicine