Skip to Main Content
Brown University
The Warren Alpert Medical School

Department of Neurosurgery

Secondary Navigation Navigation

  • Give Now
Search Menu

Site Navigation

  • Home
  • About
    • History
    • Diversity
    • News
    • Facilities & Locations
    • Norman Prince Neurosciences Institute
  • People
  • Divisions
    • Brain Tumor Surgery
    • Cerebrovascular/Skull Base Surgery/Endovascular Neurosurgery
    • Functional and Epilepsy Neurosurgery Division
    • Neuro-Trauma and Critical Care
    • Pediatric Neurosurgery
    • Peripheral Nerve Surgery
    • Spinal Surgery
    • Stereotactic Radiosurgery
  • Centers
    • Center for Endoscopic Skull Base and Pituitary Surgery
    • Center for Surgical Treatment of the Developing Brain and Spine
    • Comprehensive Brain Tumor Center
    • Comprehensive Movement Disorders Center
    • Comprehensive Stroke Center
    • Epilepsy Surgery Program
    • Minimally-Invasive Endoscopic Spine Surgery
    • Neuroplastic Center
    • Norman Prince Spine Institute
    • Psychiatric Neurosurgery Program
    • Spine Health and Bone Metabolism Center
  • Research
    • Clinical Trials
    • Publications
    • Research Labs
    • Basic & Translational Science Research
  • Education
    • Residency Program
    • Fellowship Programs
    • Conferences & Lectures
    • Medical Student
  • For Patients
    • Conditions
    • Technology
    • Schedule a Visit
Search
Department of Neurosurgery
June 12, 2023
PubMed

Performance of ChatGPT, GPT-4, and Google Bard on a Neurosurgery Oral Boards Preparation Question Bank.

Publication

Ali R, Tang OY, Connolly ID, Fridley JS, Shin JH, Zadnik Sullivan PL, Cielo D, Oyelese AA, Doberstein CE, Telfeian AE, Gokaslan ZL, Asaad WF. Neurosurgery. 2023 Jun 12. doi:10.1227/neu.0000000000002551. Epub ahead of print. PMID: 37306460.

Large language models (LLMs) like GPT-3.5, GPT-4, and Google Bard were tested on a neurosurgery exam with complex questions. GPT-4 showed the highest accuracy, getting 82.6% of questions right, while GPT-3.5 scored 62.4%, and Bard got 44.2% correct. GPT-4 excelled in various categories, especially spine-related questions. Questions that required higher-order problem solving were harder for GPT-3.5 and Bard, but not for GPT-4. GPT-4 performed well on imaging questions, even outperforming GPT-3.5 and Bard, and had fewer instances of incorrect “hallucination” in responses. This study highlights GPT-4’s effectiveness in answering complex neurosurgery questions and its potential for medical applications.

Brown University
Providence RI 02912 401-863-1000

Quick Navigation

  • Division of Biology and Medicine
  • Program in Biology
  • Affiliated Hospitals

Footer Navigation

  • Events
  • Maps and Directions
  • Contact Us
  • Accessibility
Give To Brown

© Brown University

The Warren Alpert Medical School
For You
Search Menu

Mobile Site Navigation

    Mobile Site Navigation

    • Home
    • About
      • History
      • Diversity
      • News
      • Facilities & Locations
      • Norman Prince Neurosciences Institute
    • People
    • Divisions
      • Brain Tumor Surgery
      • Cerebrovascular/Skull Base Surgery/Endovascular Neurosurgery
      • Functional and Epilepsy Neurosurgery Division
      • Neuro-Trauma and Critical Care
      • Pediatric Neurosurgery
      • Peripheral Nerve Surgery
      • Spinal Surgery
      • Stereotactic Radiosurgery
    • Centers
      • Center for Endoscopic Skull Base and Pituitary Surgery
      • Center for Surgical Treatment of the Developing Brain and Spine
      • Comprehensive Brain Tumor Center
      • Comprehensive Movement Disorders Center
      • Comprehensive Stroke Center
      • Epilepsy Surgery Program
      • Minimally-Invasive Endoscopic Spine Surgery
      • Neuroplastic Center
      • Norman Prince Spine Institute
      • Psychiatric Neurosurgery Program
      • Spine Health and Bone Metabolism Center
    • Research
      • Clinical Trials
      • Publications
      • Research Labs
      • Basic & Translational Science Research
    • Education
      • Residency Program
      • Fellowship Programs
      • Conferences & Lectures
      • Medical Student
    • For Patients
      • Conditions
      • Technology
      • Schedule a Visit

Mobile Secondary Navigation Navigation

  • Give Now
All of Brown.edu People
Close Search

Performance of ChatGPT, GPT-4, and Google Bard on a Neurosurgery Oral Boards Preparation Question Bank.