Skip to main content

Main menu

  • Home
  • Current Issue
  • Content
    • Current Issue
    • Early Access
    • Multimedia
    • Podcast
    • Collections
    • Past Issues
    • Articles by Subject
    • Articles by Type
    • Supplements
    • Plain Language Summaries
    • Calls for Papers
  • Info for
    • Authors
    • Reviewers
    • Job Seekers
    • Media
  • About
    • Annals of Family Medicine
    • Editorial Staff & Boards
    • Sponsoring Organizations
    • Copyrights & Permissions
    • Announcements
  • Engage
    • Engage
    • e-Letters (Comments)
    • Subscribe
    • Podcast
    • E-mail Alerts
    • Journal Club
    • RSS
    • Annals Forum (Archive)
  • Contact
    • Contact Us
  • Careers

User menu

  • My alerts

Search

  • Advanced search
Annals of Family Medicine
  • My alerts
Annals of Family Medicine

Advanced Search

  • Home
  • Current Issue
  • Content
    • Current Issue
    • Early Access
    • Multimedia
    • Podcast
    • Collections
    • Past Issues
    • Articles by Subject
    • Articles by Type
    • Supplements
    • Plain Language Summaries
    • Calls for Papers
  • Info for
    • Authors
    • Reviewers
    • Job Seekers
    • Media
  • About
    • Annals of Family Medicine
    • Editorial Staff & Boards
    • Sponsoring Organizations
    • Copyrights & Permissions
    • Announcements
  • Engage
    • Engage
    • e-Letters (Comments)
    • Subscribe
    • Podcast
    • E-mail Alerts
    • Journal Club
    • RSS
    • Annals Forum (Archive)
  • Contact
    • Contact Us
  • Careers
  • Follow annalsfm on Twitter
  • Visit annalsfm on Facebook
Research ArticleOriginal Research

Using Machine Learning to Predict Primary Care and Advance Workforce Research

Peter Wingrove, Winston Liaw, Jeremy Weiss, Stephen Petterson, John Maier and Andrew Bazemore
The Annals of Family Medicine July 2020, 18 (4) 334-340; DOI: https://doi.org/10.1370/afm.2550
Peter Wingrove
1University of Pittsburgh, School of Medicine, Pittsburgh, Pennsylvania
2Robert Graham Center, Washington, DC
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: pmw27@pitt.edu
Winston Liaw
2Robert Graham Center, Washington, DC
3University of Houston, College of Medicine, Department of Health Systems and Population Health Sciences, Houston, Texas
MD, MPH
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Jeremy Weiss
4Carnegie Mellon University, Pittsburgh, Pennsylvania
MD, PhD
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Stephen Petterson
2Robert Graham Center, Washington, DC
PhD
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
John Maier
5University of Pittsburgh, Department of Biomedical Informatics, Pittsburgh, Pennsylvania
MD, PhD
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Andrew Bazemore
2Robert Graham Center, Washington, DC
MD, MPH
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • Article
  • Figures & Data
  • eLetters
  • Info & Metrics
  • PDF
Loading

Article Figures & Data

Tables

  • Additional Files
    • View popup
    Table 1

    Prescriptions and Procedures and Comparison of Train and Test Data Sets, by Specialty

    SpecialtyDrugs Prescribed, Mean No.Procedure Codes, Mean No.Train, n (%)Test, n (%)
    Allergy/immunology12.98.71,611 (0.6)1,625 (0.6)
    Anesthesiology16.47.416,087 (5.7)16,110 (5.7)
    Cardiology38.121.311,465 (4.1)11,170 (4.0)
    Dermatology12.817.35,609 (2.0)5,498 (1.9)
    Emergency medicine8.65.518,689 (6.6)18,663 (6.6)
    Endocrinology31.79.72,376 (0.8)2,497 (0.9)
    Gastroenterology16.213.25,999 (2.1)5,960 (2.1)
    Hematology-oncology19.218.55,638 (2.0)5,572 (2.0)
    Infectious disease21.87.02,337 (0.8)2,328 (0.8)
    Nephrology36.613.33,735 (1.3)3,691 (1.3)
    Neurology27.49.46,431 (2.3)6,217 (2.2)
    Neurosurgery5.18.81,976 (0.7)2,008 (0.7)
    Obstetrics and gynecology5.44.611,361 (4.0)11,505 (4.1)
    Ophthalmology13.713.28,837 (3.1)8,755 (3.1)
    Orthopedic surgery5.913.510,980 (3.9)11,095 (3.9)
    Otolaryngology9.910.54,322 (1.5)4,262 (1.5)
    Pathology5.610.24,682 (1.7)4,831 (1.7)
    Physical medicine and rehabilitation14.510.63,610 (1.3)3,438 (1.2)
    Plastic surgery3.15.71,864 (0.7)1,795 (0.6)
    Primary care61.411.7100,682 (35.6)101,498 (35.9)
    Psychiatry20.94.015,075 (5.3)14,974 (5.3)
    Pulmonology22.812.75,282 (1.9)5,395 (1.9)
    Radiation oncology3.514.51,926 (0.7)1,903 (0.7)
    Radiology4.335.711,840 (4.2)11,816 (4.2)
    Rheumatology33.415.11,975 (0.7)2,030 (0.7)
    Surgery5.19.213,536 (4.8)13,278 (4.7)
    Urology17.320.14,568 (1.6)4,579 (1.6)
    Total282,493 (100)282,493 (100)
    • Note: Prescribing data are from the 2014-2016 Centers for Medicare and Medicaid Services (CMS) Medicare Fee-For-Service Provider Utilization and Payment Data: Part D Prescriber Public Use Files.19 Procedure data are from 2014-2016 CMS Medicare Fee-For-Service Provider Utilization and Payment Data: Physician and Other Supplier Public Use Files.20

    • View popup
    Table 2

    F1 Scores for Random Forests, by Specialty and Type of Training Data

    SpecialtyF1 Score, Mean (95% CI)
    CombinedPrescription OnlyProcedure Only
    Allergy/immunology0.9120.860a0.903a
    (0.906-0.917)(0.855-0.865)(0.901-0.905)
    Anesthesiology0.9510.624a0.963a
    (0.950-0.952)(0.615-0.632)(0.962-0.964)
    Cardiology0.9380.917a0.929a
    (0.935-0.940)(0.913-0.919)(0.927-0.932)
    Dermatology0.9660.940a0.964b
    (0.964-0.968)(0.937-0.943)(0.963-0.965)
    Emergency medicine0.8970.716a0.914a
    (0.894-0.899)(0.708-0.723)(0.911-0.916)
    Endocrinology0.8650.869a0.623a
    (0.858-0.871)(0.863-0.875)(0.615-0.630)
    Gastroenterology0.9230.901a0.900a
    (0.922-0.924)(0.897-0.905)(0.897-0.903)
    Hematology-oncology0.8740.8720.677a
    (0.872-0.876)(0.869-0.876)(0.673-0.680)
    Infectious disease0.7450.758a0.474a
    (0.740-0.750)(0.754-0.763)(0.462-0.486)
    Nephrology0.8850.866a0.882a
    (0.882-0.887)(0.864-0.868)(0.879-0.884)
    Neurology0.8850.895a0.732a
    (0.883-0.887)(0.892-0.897)(0.725-0.739)
    Neurosurgery0.6500.377a0.631a
    (0.645-0.656)(0.364-0.389)(0.626-0.635)
    Obstetrics and gynecology0.9200.928a0.868a
    (0.917-0.923)(0.927-0.929)(0.865-0.870)
    Ophthalmology0.9820.975a0.987a
    (0.982-0.982)(0.974-0.976)(0.987-0.987)
    Orthopedic surgery0.8840.760a0.901a
    (0.880-0.888)(0.756-0.765)(0.900-0.903)
    Otolaryngology0.9320.874a0.951a
    (0.927-0.937)(0.868-0.880)(0.949-0.952)
    Pathology0.9870.005a0.990a
    (0.986-0.987)(0-0.012)(0.990-0.990)
    Physical medicine and rehabilitation0.5860.380a0.492a
    (0.583-0.589)(0.374-0.387)(0.483-0.500)
    Plastic surgery0.5330.314a0.383a
    (0.527-0.539)(0.287-0.341)(0.378-0.388)
    Primary care0.9200.911a0.878a
    (0.917-0.922)(0.910-0.912)(0.876-0.880)
    Psychiatry0.9300.938a0.740a
    (0.929-0.932)(0.936-0.940)(0.734-0.745)
    Pulmonology0.8360.843a0.818a
    (0.834-0.837)(0.839-0.848)(0.814-0.822)
    Radiation oncology0.9390.691a0.976a
    (0.933-0.945)(0.680-0.702)(0.975-0.977)
    Radiology0.9790.275a0.984a
    (0.977-0.980)(0.272-0.279)(0.983-0.985)
    Rheumatology0.9160.9160.726a
    (0.913-0.919)(0.913-0.918)(0.719-0.733)
    Surgery0.7740.624a0.735a
    (0.767-0.781)(0.614-0.634)(0.731-0.740)
    Urology0.9620.950a0.962
    (0.958-0.965)(0.947-0.953)(0.961-0.963)
    Macro F10.8760.745a0.821a
    (0.874-0.878)(0.741-0.748)(0.820-0.823)
    • Note: Prescribing data are from 2014-2016 Centers for Medicare and Medicaid Services (CMS) Medicare Fee-For-Service Provider Utilization and Payment Data: Part D Prescriber Public Use Files.19 Procedure data from 2014-2016 CMS Medicare Fee-For-Service Provider Utilization and Payment Data: Physician and Other Supplier Public Use Files.20

    • ↵a P <.01 (paired t test vs combined).

    • ↵b P <.05 (paired t test vs combined).

    • View popup
    Table 3

    Predicted vs Actual Counts of Physicians, by Specialty, for Combined Random Forest Models

    SpecialtyPredictedActualPredicted/Actual, %
    Allergy/immunology1,5901,62597.8
    Anesthesiology16,10016,11099.9
    Cardiology10,79011,17096.6
    Dermatology5,5695,498101.3
    Emergency medicine17,88618,66395.8
    Endocrinology2,3302,49793.3
    Gastroenterology6,0645,960101.7
    Hematology-oncology5,4685,57298.1
    Infectious disease1,7042,32873.2
    Nephrology3,7023,691100.3
    Neurology5,7476,21792.4
    Neurosurgery1,3422,00866.8
    Obstetrics and gynecology11,42511,50599.3
    Ophthalmology8,7588,755100.0
    Orthopedic surgery11,00811,09599.2
    Otolaryngology4,0214,26294.3
    Pathology4,7904,83199.2
    Physical medicine and rehabilitation2,1543,43862.7
    Plastic surgery1,4191,79579.1
    Primary care105,225101,498103.7
    Psychiatry14,91214,97499.6
    Pulmonology5,5255,395102.4
    Radiation oncology1,8811,90398.8
    Radiology11,54711,81697.7
    Rheumatology2,0862,030102.8
    Surgery14,94913,278112.6
    Urology4,4984,57998.2
    • Note: The predicted counts are based on the combined models and averaged all 9 sets of predictions. The actual counts are the number of physicians by specialty in the Test data set. Values are rounded to the nearest integer.

    • View popup
    Table 4

    Model Agreement and Specialty Match Using 2016 Data

    SpecialtyCountModels Predicting the Same Specialty, %Specialty Match, %aSpecialty Mismatch, %b
    Allergy/immunology1,62597.189.67.5
    Anesthesiology16,11097.994.33.6
    Cardiology11,17096.990.46.5
    Dermatology5,49898.896.72.1
    Emergency medicine18,66398.387.011.3
    Endocrinology2,49795.883.312.5
    Gastroenterology5,96097.292.44.8
    Hematology-oncology5,57294.984.910.0
    Infectious disease2,32891.161.229.9
    Nephrology3,69196.786.99.8
    Neurology6,21794.583.111.4
    Neurosurgery2,00880.648.332.3
    Obstetrics and gynecology11,50596.790.66.1
    Ophthalmology8,75599.197.91.2
    Orthopedic surgery11,09594.686.18.5
    Otolaryngology4,26296.889.57.3
    Pathology4,83199.397.81.5
    Physical medicine and rehabilitation3,43883.241.641.6
    Plastic surgery1,79580.742.238.5
    Primary care101,49898.392.65.7
    Psychiatry14,97497.992.15.8
    Pulmonology5,39596.183.212.9
    Radiation Oncology1,90395.991.04.9
    Radiology11,81699.196.42.7
    Rheumatology2,03097.691.75.9
    Surgery13,27891.777.714.0
    Urology4,57997.394.52.8
    Overall282,49397.0c89.4c7.6c
    • For this analysis, we applied the 2014, 2015, and 2016 combined random forests to 2016 Test data, for a total of 3 predictions based on prescribing and procedure data for a single year. Model agreement is defined as all 3 models predicting the same specialty.

    • ↵a All 3 models predicted the self-reported specialty.

    • ↵b All 3 models predicted a specialty that differed from the self-reported category.

    • ↵c Mean across all specialties weighted by number in each specialty.

Additional Files

  • Tables
  • Supplemental Tables

    Supplemental Tables

    Files in this Data Supplement:

    • Supplemental data: Tables - PDF file
PreviousNext
Back to top

In this issue

The Annals of Family Medicine: 18 (4)
The Annals of Family Medicine: 18 (4)
Vol. 18, Issue 4
July/August 2020
  • Table of Contents
  • Index by author
  • Back Matter (PDF)
  • Front Matter (PDF)
  • The Issue in Brief
Print
Download PDF
Article Alerts
Sign In to Email Alerts with your Email Address
Email Article

Thank you for your interest in spreading the word on Annals of Family Medicine.

NOTE: We only request your email address so that the person you are recommending the page to knows that you wanted them to see it, and that it is not junk mail. We do not capture any email address.

Enter multiple addresses on separate lines or separate them with commas.
Using Machine Learning to Predict Primary Care and Advance Workforce Research
(Your Name) has sent you a message from Annals of Family Medicine
(Your Name) thought you would like to see the Annals of Family Medicine web site.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
15 + 5 =
Solve this simple math problem and enter the result. E.g. for 1+3, enter 4.
Citation Tools
Using Machine Learning to Predict Primary Care and Advance Workforce Research
Peter Wingrove, Winston Liaw, Jeremy Weiss, Stephen Petterson, John Maier, Andrew Bazemore
The Annals of Family Medicine Jul 2020, 18 (4) 334-340; DOI: 10.1370/afm.2550

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
Get Permissions
Share
Using Machine Learning to Predict Primary Care and Advance Workforce Research
Peter Wingrove, Winston Liaw, Jeremy Weiss, Stephen Petterson, John Maier, Andrew Bazemore
The Annals of Family Medicine Jul 2020, 18 (4) 334-340; DOI: 10.1370/afm.2550
Twitter logo Facebook logo Mendeley logo
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Jump to section

  • Article
    • Abstract
    • INTRODUCTION
    • METHODS
    • RESULTS
    • DISCUSSION
    • Footnotes
    • References
  • Figures & Data
  • eLetters
  • Info & Metrics
  • PDF

Related Articles

  • PubMed
  • Google Scholar

Cited By...

  • No citing articles found.
  • Google Scholar

More in this TOC Section

  • Neighborhood Determinants of Primary Care Access in Virginia
  • Proactive Deprescribing Among Older Adults With Polypharmacy: Barriers and Enablers
  • Artificial Intelligence Tools for Preconception Cardiomyopathy Screening Among Women of Reproductive Age
Show more Original Research

Similar Articles

Subjects

  • Methods:
    • Cross-disciplinary
  • Other topics:
    • Health informatics
    • Research capacity building

Keywords

  • biostatistical methods
  • workforce
  • Medicare

Content

  • Current Issue
  • Past Issues
  • Early Access
  • Plain-Language Summaries
  • Multimedia
  • Podcast
  • Articles by Type
  • Articles by Subject
  • Supplements
  • Calls for Papers

Info for

  • Authors
  • Reviewers
  • Job Seekers
  • Media

Engage

  • E-mail Alerts
  • e-Letters (Comments)
  • RSS
  • Journal Club
  • Submit a Manuscript
  • Subscribe
  • Family Medicine Careers

About

  • About Us
  • Editorial Board & Staff
  • Sponsoring Organizations
  • Copyrights & Permissions
  • Contact Us
  • eLetter/Comments Policy

© 2025 Annals of Family Medicine