Skip to main content

Main menu

  • Home
  • Current Issue
  • Content
    • Current Issue
    • Early Access
    • Multimedia
    • Podcast
    • Collections
    • Past Issues
    • Articles by Subject
    • Articles by Type
    • Supplements
    • Plain Language Summaries
    • Calls for Papers
  • Info for
    • Authors
    • Reviewers
    • Job Seekers
    • Media
  • About
    • Annals of Family Medicine
    • Editorial Staff & Boards
    • Sponsoring Organizations
    • Copyrights & Permissions
    • Announcements
  • Engage
    • Engage
    • e-Letters (Comments)
    • Subscribe
    • Podcast
    • E-mail Alerts
    • Journal Club
    • RSS
    • Annals Forum (Archive)
  • Contact
    • Contact Us
  • Careers

User menu

  • My alerts

Search

  • Advanced search
Annals of Family Medicine
  • My alerts
Annals of Family Medicine

Advanced Search

  • Home
  • Current Issue
  • Content
    • Current Issue
    • Early Access
    • Multimedia
    • Podcast
    • Collections
    • Past Issues
    • Articles by Subject
    • Articles by Type
    • Supplements
    • Plain Language Summaries
    • Calls for Papers
  • Info for
    • Authors
    • Reviewers
    • Job Seekers
    • Media
  • About
    • Annals of Family Medicine
    • Editorial Staff & Boards
    • Sponsoring Organizations
    • Copyrights & Permissions
    • Announcements
  • Engage
    • Engage
    • e-Letters (Comments)
    • Subscribe
    • Podcast
    • E-mail Alerts
    • Journal Club
    • RSS
    • Annals Forum (Archive)
  • Contact
    • Contact Us
  • Careers
  • Follow annalsfm on Twitter
  • Visit annalsfm on Facebook
Meeting ReportResearch capacity building

Crowdsourcing Dermatology Images with Google Search Ads: Creating a Real-World Skin Condition Dataset for AI Development

Yejin Jeong, Mike Schaekermann and Steven Lin
The Annals of Family Medicine November 2024, 22 (Supplement 1) 6186; DOI: https://doi.org/10.1370/afm.22.s1.6186
Yejin Jeong
BA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Mike Schaekermann
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Steven Lin
MD
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • Article
  • eLetters
  • Info & Metrics
  • PDF
Loading

Abstract

Context: Traditional clinical health datasets often fail to represent the diversity and breadth of real-world disease, influencing research and AI tools developed on those datasets. Dermatology, easily documentable through patient-contributed images, presents an ideal case for innovative dataset creation.

Objective: To develop a scalable, representative dermatology dataset using crowdsourcing via internet search ads, enhancing medical education and AI application development.

Study Design and Analysis: We employed Google Search ads to gather dermatology images from the public, with informed consent, between March and November 2023. Images were curated, de-identified, and labeled by dermatologists, providing a dataset of 10,408 images from 5,033 contributors across the United States.

Setting or Dataset: The dataset aggregated includes diverse skin conditions and demographic data, now publicly accessible on GitHub.

Population Studied: Internet users across the United States, representing various demographics and skin types, contributed to the dataset.

Intervention/Instrument: Google Search ads targeted individuals searching for skin-related terms, inviting them to contribute images and related information via a web platform.

Outcome Measures: Measures included the volume of contributions, demographic diversity, and the diagnostic usability of images as evaluated by dermatologists.

Results: The study achieved a median of 22 submissions per day, with a significant representation of diverse skin conditions and demographics. Over 97.5% of contributions were usable, with high dermatologist confidence correlated with the completeness of accompanying data.

Conclusions: Crowdsourcing via search ads effectively generates diverse, representative dermatological datasets. The SCIN dataset can significantly enhance dermatological research, education, and AI tool accuracy, particularly in underrepresented communities.

  • © 2024 Annals of Family Medicine, Inc. For the private, noncommercial use of one individual user of the Web site. All other rights reserved.
Previous
Back to top

In this issue

The Annals of Family Medicine: 22 (Supplement 1)
The Annals of Family Medicine: 22 (Supplement 1)
Vol. 22, Issue Supplement 1
20 Nov 2024
  • Table of Contents
  • Index by author
Download PDF
Article Alerts
Sign In to Email Alerts with your Email Address
Email Article

Thank you for your interest in spreading the word on Annals of Family Medicine.

NOTE: We only request your email address so that the person you are recommending the page to knows that you wanted them to see it, and that it is not junk mail. We do not capture any email address.

Enter multiple addresses on separate lines or separate them with commas.
Crowdsourcing Dermatology Images with Google Search Ads: Creating a Real-World Skin Condition Dataset for AI Development
(Your Name) has sent you a message from Annals of Family Medicine
(Your Name) thought you would like to see the Annals of Family Medicine web site.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
1 + 3 =
Solve this simple math problem and enter the result. E.g. for 1+3, enter 4.
Citation Tools
Crowdsourcing Dermatology Images with Google Search Ads: Creating a Real-World Skin Condition Dataset for AI Development
Yejin Jeong, Mike Schaekermann, Steven Lin
The Annals of Family Medicine Nov 2024, 22 (Supplement 1) 6186; DOI: 10.1370/afm.22.s1.6186

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
Get Permissions
Share
Crowdsourcing Dermatology Images with Google Search Ads: Creating a Real-World Skin Condition Dataset for AI Development
Yejin Jeong, Mike Schaekermann, Steven Lin
The Annals of Family Medicine Nov 2024, 22 (Supplement 1) 6186; DOI: 10.1370/afm.22.s1.6186
Twitter logo Facebook logo Mendeley logo
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Jump to section

  • Article
  • eLetters
  • Info & Metrics
  • PDF

Related Articles

  • No related articles found.
  • Google Scholar

Cited By...

  • No citing articles found.
  • Google Scholar

More in this TOC Section

  • ‘It takes a lot of twisting’: doing research in structurally vulnerable spaces
  • Combining evidence and experience for research recruitment and participation among patients in US community health centers
  • Flipping the Script: A Design thinking Approach to Enhancing Interprofessional Collaboration in Primary Care
Show more Research capacity building

Similar Articles

Content

  • Current Issue
  • Past Issues
  • Early Access
  • Plain-Language Summaries
  • Multimedia
  • Podcast
  • Articles by Type
  • Articles by Subject
  • Supplements
  • Calls for Papers

Info for

  • Authors
  • Reviewers
  • Job Seekers
  • Media

Engage

  • E-mail Alerts
  • e-Letters (Comments)
  • RSS
  • Journal Club
  • Submit a Manuscript
  • Subscribe
  • Family Medicine Careers

About

  • About Us
  • Editorial Board & Staff
  • Sponsoring Organizations
  • Copyrights & Permissions
  • Contact Us
  • eLetter/Comments Policy

© 2025 Annals of Family Medicine