Evidence
Surg Endosc. 2024 Sep 25. doi: 10.1007/s00464-024-11267-y. Online ahead of print.
ABSTRACT
BACKGROUND: Artificial intelligence models such as ChatGPT (Open AI) have performed well on the exams of various medical and surgical fields. It is not yet known how ChatGPT performs on similar metabolic and bariatric surgery (MBS) questions.
OBJECTIVE: Assess the performance of ChatGPT on Focused Practice Designation in Metabolic and Bariatric Surgery board-style questions.
SETTING: United States.
METHODS: Questions obtained from the largest commercially available bank of FPD-MBS practice questions were entered into ChatGPT-4, as is, without prior training. We assessed the overall percentage correct as well as the percentage correct within each of the five American Board of Surgery (ABS) question categories. One-way ANOVA was used to determine if the frequency of correct answers differed between categories.
RESULTS: Out of 255 questions, ChatGPT-4 correctly answered 189 (74.1%). Between the five question categories there was no difference between the frequency of correct answers (p = 0.22). It did not matter if questions were entered individually or in groups of up to 10.
CONCLUSION: Without prior training, ChatGPT-4 scored highly when evaluated on the largest practice question bank for the FPD-MBS exam.
PMID:39317906 | DOI:10.1007/s00464-024-11267-y
Estimated reading time: 4 minute(s)
Latest: Psychiatryai.com #RAISR4D Evidence
Cool Evidence: Engaging Young People and Students in Real-World Evidence ☀️
Real-Time Evidence Search [Psychiatry]
AI Research
Artificial intelligence large language model scores highly on focused practice designation in metabolic and bariatric surgery board practice questions
🌐 90 Days
AI Virtual Reality Related Evidence Matrix
- Assessing knowledge about medical physics in language-generative AI with large language model: using the medical physicist exam
- ChatGPT-4 Surpasses Residents: A Study of Artificial Intelligence Competency in Plastic Surgery In-service Examinations and Its Advancements from ChatGPT-3.5
- Analysis of Responses of GPT-4 V to the Japanese National Clinical Engineer Licensing Examination
- Assessing ChatGPT's theoretical knowledge and prescriptive accuracy in bacterial infections: a comparative study with infectious diseases residents and specialists
- Assessing ChatGPT's theoretical knowledge and prescriptive accuracy in bacterial infections: a comparative study with infectious diseases residents and specialists
- Comparison of the Performance of Artificial Intelligence Versus Medical Professionals in the Polish Final Medical Examination
- Assessing ChatGPT as a Medical Consultation Assistant for Chronic Hepatitis B: Cross-Language Study of English and Chinese
- Comparing ChatGPT and a Single Anesthesiologist's Responses to Common Patient Questions: An Exploratory Cross-Sectional Survey of a Panel of Anesthesiologists
- Appropriateness of ChatGPT as a resource for medication-related questions
- Basal knowledge in the field of pediatric nephrology and its enhancement following specific training of ChatGPT-4 "omni" and Gemini 1.5 Flash
- Lumbar disc herniation with radiculopathy: a comparison of NASS guidelines and ChatGPT
- Evaluation of the accuracy and readability of ChatGPT-4 and Google Gemini in providing information on retinal detachment: a multicenter expert comparative study
- Bariatric-Metabolic Surgery is the Most Effective Intervention in Reducing Food Addiction Symptoms: A Systematic Review and Meta-Analysis
- The performance of OpenAI ChatGPT-4 and Google Gemini in virology multiple-choice questions: a comparative analysis of English and Arabic responses
- Caution Regarding ChatGPT's Appropriateness and Reliability Regarding Surgery for Wrist Arthritis
- Appraisal of ChatGPT's Aptitude for Medical Education: Comparative Analysis With Third-Year Medical Students in a Pulmonology Examination
- AI-generated text in otolaryngology publications: a comparative analysis before and after the release of ChatGPT
- ChatGPT compared to national guidelines for management of ovarian cancer: Did ChatGPT get it right? - A Memorial Sloan Kettering Cancer Center Team Ovary study
- Comparative Assessment of Otolaryngology Knowledge Among Large Language Models
- An assessment of ChatGPT's responses to frequently asked questions about cervical and breast cancer
- Gemini AI vs. ChatGPT: A comprehensive examination alongside ophthalmology residents in medical knowledge
- Integrating ChatGPT in Orthopedic Education for Medical Undergraduates: Randomized Controlled Trial
- Can artificial intelligence models serve as patient information consultants in orthodontics?
- Comparative Analysis of Artificial Intelligence Platforms: ChatGPT-3.5 and GoogleBard in Identifying Red Flags of Low Back Pain
- Comparative accuracy of ChatGPT-4, Microsoft Copilot and Google Gemini in the Italian entrance test for healthcare sciences degrees: a cross-sectional study
- Amplifying Chinese physicians' emphasis on patients' psychological states beyond urologic diagnoses with ChatGPT-A multi-center cross-sectional study
- Assessing the appropriateness and completeness of ChatGPT-4's AI-generated responses for queries related to diabetic retinopathy
- Bariatric Surgery in Obesity: Metabolic Quality Analysis and Comparison of Surgical Options
- ChatGPT can help guide and empower patients after prostate cancer diagnosis
- Assessing ChatGPT's Capability for Multiple Choice Questions Using RaschOnline: Observational Study
- Digital Ink and Surgical Dreams: Perceptions of Artificial Intelligence-Generated Essays in Residency Applications
- Digital Ink and Surgical Dreams: Perceptions of Artificial Intelligence-Generated Essays in Residency Applications
- Digital Ink and Surgical Dreams: Perceptions of Artificial Intelligence-Generated Essays in Residency Applications
- Assessment of the information provided by ChatGPT regarding exercise for patients with type 2 diabetes: a pilot study
- Language discrepancies in the performance of generative artificial intelligence models: an examination of infectious disease queries in English and Arabic
- QUALITY OF LIFE AND PSYCHOLOGICAL CHANGES IN BARIATRIC SURGERY: AN OBSERVATIONAL STUDY
- QUALITY OF LIFE AND PSYCHOLOGICAL CHANGES IN BARIATRIC SURGERY: AN OBSERVATIONAL STUDY
- The Use of Generative Artificial Intelligence for Improving Health Literacy in Reproductive Health: A Case Study
- Research progress of probiotics regulating intestinal micro-ecological environment in obese patients after bariatric surgery
- The conversational AI "ChatGPT" outperforms medical students on a physiology university examination
- Evaluating ChatGPT as a Patient Education Tool for COVID-19-Induced Olfactory Dysfunction
- Exploring the potential of artificial intelligence to enhance the writing of english academic papers by non-native english-speaking medical students - the educational application of ChatGPT
- The Advent of Artificial Intelligence into Cardiac Surgery: A Systematic Review of Our Understanding
- "Hospice Care Could Be a Compassionate Choice": ChatGPT Responses to Questions About Decision Making in Advanced Cancer
- Parental concerns about oral health of children: Is ChatGPT helpful in finding appropriate answers?
- Evaluating generative AI responses to real-world drug-related questions
- Comparison of ChatGPT versions in informing patients with rotator cuff injuries
- Evaluation of online chat-based artificial intelligence responses about inflammatory bowel disease and diet
- Can ChatGPT make surgical decisions with confidence similar to experienced knee surgeons?
- Artificial Intelligence-Supported Development of Health Guideline Questions
- ChatGPT in medicine: A cross-disciplinary systematic review of ChatGPT's (artificial intelligence) role in research, clinical practice, education, and patient interaction
- ChatGPT in medicine: A cross-disciplinary systematic review of ChatGPT's (artificial intelligence) role in research, clinical practice, education, and patient interaction
- Quality and Accountability of ChatGPT in Health Care in Low- and Middle-Income Countries: Simulated Patient Study
- Prioritising patients for publicly funded bariatric surgery in Queensland, Australia
- Prioritising patients for publicly funded bariatric surgery in Queensland, Australia
- ChatGPT-4 Consistency in Interpreting Laryngeal Clinical Images of Common Lesions and Disorders
- Generative artificial intelligence in primary care: an online survey of UK general practitioners
- Attention-deficit/hyperactivity disorder in adolescent and adult candidates for metabolic and bariatric surgery: A systematic review and meta-analysis
- Assessing Generative Pretrained Transformers (GPT) in Clinical Decision-Making: Comparative Analysis of GPT-3.5 and GPT-4
- Exploring Affective Representations in Emotional Narratives: An Exploratory Study Comparing ChatGPT and Human Responses
- Characterizing the Adoption and Experiences of Users of Artificial Intelligence-Generated Health Information in the United States: Cross-Sectional Questionnaire Study
- Artificial intelligence (AI) in diagnostic and therapeutic decision-making-a tool or communication partner?
- Current Status of ChatGPT Use in Medical Education: Potentials, Challenges, and Strategies
- Do ChatGPT and Gemini Provide Appropriate Recommendations for Pediatric Orthopaedic Conditions?
Evidence Blueprint
Artificial intelligence large language model scores highly on focused practice designation in metabolic and bariatric surgery board practice questions
☊ AI-Driven Related Evidence Nodes
(recent articles with at least 5 words in title)
More Evidence
Artificial intelligence large language model scores highly on focused practice designation in metabolic and bariatric surgery board practice questions
🌐 365 Days
AI Virtual Reality Related Evidence Matrix
- A Comparison Between GPT-3.5, GPT-4, and GPT-4V: Can the Large Language Model (ChatGPT) Pass the Japanese Board of Orthopaedic Surgery Examination?
- Comparison of Artificial Intelligence to Resident Performance on Upper-Extremity Orthopaedic In-Training Examination Questions
- Assessing ChatGPT 4.0's test performance and clinical diagnostic accuracy on USMLE STEP 2 CK and clinical case reports
- Evaluation of the Impact of ChatGPT on the Selection of Surgical Technique in Bariatric Surgery
- Evaluating the Performance of ChatGPT in Urology: A Comparative Study of Knowledge Interpretation and Patient Guidance
- Assessing knowledge about medical physics in language-generative AI with large language model: using the medical physicist exam
- ChatGPT and the German board examination for ophthalmology: an evaluation
- ChatGPT-4 Surpasses Residents: A Study of Artificial Intelligence Competency in Plastic Surgery In-service Examinations and Its Advancements from ChatGPT-3.5
- Quality of ChatGPT Responses to Frequently Asked Questions in Carpal Tunnel Release Surgery
- ChatGPT's performance in dentistry and allergyimmunology assessments: a comparative study
- ChatGPT/GPT-4 (large language models): Opportunities and challenges of perspective in bariatric healthcare professionals
- A comparative analysis of ChatGPT, ChatGPT-4 and Google Bard performances at the Advanced Burn Life Support Exam
- Analysis of Responses of GPT-4 V to the Japanese National Clinical Engineer Licensing Examination
- Assessing ChatGPT's theoretical knowledge and prescriptive accuracy in bacterial infections: a comparative study with infectious diseases residents and specialists
- Assessing ChatGPT's theoretical knowledge and prescriptive accuracy in bacterial infections: a comparative study with infectious diseases residents and specialists
- Performance of Large Language Models on a Neurology Board-Style Examination
- Comparison of the Performance of Artificial Intelligence Versus Medical Professionals in the Polish Final Medical Examination
- Assessing ChatGPT as a Medical Consultation Assistant for Chronic Hepatitis B: Cross-Language Study of English and Chinese
- Comparing ChatGPT and a Single Anesthesiologist's Responses to Common Patient Questions: An Exploratory Cross-Sectional Survey of a Panel of Anesthesiologists
- Performance of ChatGPT on Chinese Master's Degree Entrance Examination in Clinical Medicine
- Exploring the Performance of ChatGPT Versions 3.5, 4, and 4 With Vision in the Chilean Medical Licensing Examination: Observational Study
- Google Gemini and Bard artificial intelligence chatbot performance in ophthalmology knowledge assessment
- ChatGPT-3.5 passes Poland's medical final examination-Is it possible for ChatGPT to become a doctor in Poland?
- ChatGPT-3.5 passes Poland's medical final examination-Is it possible for ChatGPT to become a doctor in Poland?
- Educational Limitations of ChatGPT in Neurosurgery Board Preparation
- Appropriateness of ChatGPT as a resource for medication-related questions
- Dr. Google vs. Dr. ChatGPT: Exploring the Use of Artificial Intelligence in Ophthalmology by Comparing the Accuracy, Safety, and Readability of Responses to Frequently Asked Patient Questions Regarding Cataracts and Cataract Surgery
- From Bytes to Best Practices: Tracing ChatGPT-3.5's Evolution and Alignment With the National Comprehensive Cancer Network® Guidelines in Pancreatic Adenocarcinoma Management
- Language-adaptive artificial intelligence: assessing CHATGPT'S answer to frequently asked questions on total hip arthroplasty questions
- Evaluating the accuracy of Chat Generative Pre-trained Transformer version 4 (ChatGPT-4) responses to United States Food and Drug Administration (FDA) frequently asked questions about dental amalgam
- Is ChatGPT an Accurate and Reliable Source of Information for Patients with Vaccine and Statin Hesitancy?
- Charting new AI education in gastroenterology: Cross-sectional evaluation of ChatGPT and perplexity AI in medical residency exam
- Assessing ChatGPT's Responses to Otolaryngology Patient Questions
- Assessing ChatGPT vs. Standard Medical Resources for Endoscopic Sleeve Gastroplasty Education: A Medical Professional Evaluation Study
- Basal knowledge in the field of pediatric nephrology and its enhancement following specific training of ChatGPT-4 "omni" and Gemini 1.5 Flash
- Performance Comparison of ChatGPT-4 and Japanese Medical Residents in the General Medicine In-Training Examination: Comparison Study
- Comparing ChatGPT's and surgeon's responses to thyroid-related questions from patients
- Lumbar disc herniation with radiculopathy: a comparison of NASS guidelines and ChatGPT
- ChatGPT is an above-average student at the Faculty of Medicine of the University of Zaragoza and an excellent collaborator in the development of teaching materials
- Evaluation of the accuracy and readability of ChatGPT-4 and Google Gemini in providing information on retinal detachment: a multicenter expert comparative study
- Geriatrics and artificial intelligence in Spain (Ger-IA project): talking to ChatGPT, a nationwide survey
- Evaluating The Role of ChatGPT as a Study Aid in Medical Education in Surgery
- Bariatric-Metabolic Surgery is the Most Effective Intervention in Reducing Food Addiction Symptoms: A Systematic Review and Meta-Analysis
- Comparison of the Performance of GPT-3.5 and GPT-4 With That of Medical Students on the Written German Medical Licensing Examination: Observational Study
- Frequently asked questions on erectile dysfunction: evaluating artificial intelligence answers with expert mentorship
- Appropriateness of Frequently Asked Patient Questions Following Total Hip Arthroplasty From ChatGPT Compared to Arthroplasty-Trained Nurses
- Comparing ChatGPT and clinical nurses' performances on tracheostomy care: A cross-sectional study
- Comparing the Performance of Artificial Intelligence Learning Models to Medical Students in Solving Histology and Embryology Multiple Choice Questions
- The performance of OpenAI ChatGPT-4 and Google Gemini in virology multiple-choice questions: a comparative analysis of English and Arabic responses
- Caution Regarding ChatGPT's Appropriateness and Reliability Regarding Surgery for Wrist Arthritis
- Evaluating ChatGPT's Performance in Answering Questions About Allergic Rhinitis and Chronic Rhinosinusitis
- The performance of artificial intelligence large language model-linked chatbots in surgical decision-making for gastroesophageal reflux disease
- Utility of Large Language Models for Health Care Professionals and Patients in Navigating Hematopoietic Stem Cell Transplantation: Comparison of the Performance of ChatGPT-3.5, ChatGPT-4, and Bard
- To trust or not to trust: evaluating the reliability and safety of AI responses to laryngeal cancer queries
- How best to combine liver transplantation and bariatric surgery?-Results from a global, web-based survey
- Accuracy and consistency of online large language model-based artificial intelligence chat platforms in answering patients' questions about heart failure
- Appraisal of ChatGPT's Aptitude for Medical Education: Comparative Analysis With Third-Year Medical Students in a Pulmonology Examination
- Artificial Intelligence and ChatGPT in Abdominopelvic Surgery: A Systematic Review of Applications and Impact
- AI-generated text in otolaryngology publications: a comparative analysis before and after the release of ChatGPT
- How artificial intelligence can provide information about subdural hematoma: Assessment of readability, reliability, and quality of ChatGPT, BARD, and perplexity responses
- ChatGPT compared to national guidelines for management of ovarian cancer: Did ChatGPT get it right? - A Memorial Sloan Kettering Cancer Center Team Ovary study
- Comparing the Performance of ChatGPT-4 and Medical Students on MCQs at Varied Levels of Bloom's Taxonomy
- Intrapersonal coping predicts greater weight loss 24 months after bariatric surgery
- Assessing the Quality of ChatGPT Responses to Dementia Caregivers' Questions: Qualitative Analysis