Evidence
![](https://psychiatryai.com/wp-content/uploads/2023/04/psychiatryai_com.webp)
Cas Lek Cesk. 2024;162(7-8):294-297.
ABSTRACT
The advent of large language models (LLMs) based on neural networks marks a significant shift in academic writing, particularly in medical sciences. These models, including OpenAI’s GPT-4, Google’s Bard, and Anthropic’s Claude, enable more efficient text processing through transformer architecture and attention mechanisms. LLMs can generate coherent texts that are indistinguishable from human-written content. In medicine, they can contribute to the automation of literature reviews, data extraction, and hypothesis formulation. However, ethical concerns arise regarding the quality and integrity of scientific publications and the risk of generating misleading content. This article provides an overview of how LLMs are changing medical writing, the ethical dilemmas they bring, and the possibilities for detecting AI-generated text. It concludes with a focus on the potential future of LLMs in academic publishing and their impact on the medical community.
PMID:38981715
![Google](https://www.google.com/images/branding/googlelogo/2x/googlelogo_light_color_92x30dp.png)
![Google Keep](https://www.gstatic.com/images/branding/product/1x/keep_48dp.png)
![Share on Linkedin](https://psychiatryai.com/wp-content/uploads/2023/10/linkedin-logo-png-2048-1.png)
Estimated reading time: 3 minute(s)
Latest: Psychiatryai.com #RAISR4D Evidence
![](/wp-content/uploads/2024/04/bd462cc11bcf0bd0d0d6f1d0f8b7cd04-modified-1.png)
Cool Evidence: Engaging Young People and Students in Real-World Evidence
![](/wp-content/uploads/2024/04/bd462cc11bcf0bd0d0d6f1d0f8b7cd04-modified-1.png)
Real-Time Evidence Search [Psychiatry]
![](/wp-content/uploads/2024/04/pubmed.png)
AI Research
![](/wp-content/uploads/2024/05/Il5nR_nf_400x400-modified-1.png)
Large language models are changing landscape of academic publications. A positive transformation?
![](https://psychiatryai.com/wp-content/uploads/2023/04/psychiatryai_com.webp)
🌐 90 Days
AI Virtual Reality Related Evidence Matrix
Clinical Accuracy, Relevance, Clarity, and Emotional Sensitivity of Large Language Models to Surgical Patient Questions: Cross-Sectional Study Hallucination Rates and Reference Accuracy of ChatGPT and Bard for Systematic Reviews: Comparative Analysis Performance of two large language models for data extraction in evidence synthesis Large language models from OpenAI, Google, Meta, X and Co. : The role of "closed" and "open" models in radiology Maximizing Large Language Model Utility in Cardiovascular Care: A Practical Guide Assessing the risk of takeover catastrophe from large language models Potential of Large Language Models in Health Care: Delphi Study Large language models reshaping molecular biology and drug development Large language models in psychiatry: Opportunities and challenges The application of large language models in medicine: A scoping review Large Language Models and User Trust: Consequence of Self-Referential Learning Loop and the Deskilling of Health Care Professionals Enhancing Care for Older Adults and Dementia Patients with Large Language Models: Proceedings of the National Institute on Aging -Artificial Intelligence & Technology Collaboratory for Aging Research Symposium Fine-tuning large language models for chemical text mining ChatGPT's ability to generate realistic experimental images poses a new challenge to academic integrity Urology consultants versus large language models: Potentials and hazards for medical advice in urology Using large language model to guide patients to create efficient and comprehensive clinical care message AI-generated text in otolaryngology publications: a comparative analysis before and after the release of ChatGPT An overview of diagnostics and therapeutics using large language models Utility of Large Language Models for Health Care Professionals and Patients in Navigating Hematopoietic Stem Cell Transplantation: Comparison of the Performance of ChatGPT-3.5, ChatGPT-4, and Bard Leveraging Large Language Models for Improved Patient Access and Self-Management: Assessor-Blinded Comparison Between Expert- and AI-Generated Content A comparative analysis of ChatGPT, ChatGPT-4 and Google Bard performances at the Advanced Burn Life Support Exam Deception abilities emerged in large language models Implications of Large Language Models for Quality and Efficiency of Neurologic Care: Emerging Issues in Neurology The Role of Large Language Models in Transforming Emergency Medicine: Scoping Review Can AI Answer My Questions? Utilizing Artificial Intelligence in the Perioperative Assessment for Abdominoplasty Patients Can Artificial Intelligence Mitigate Missed Diagnoses by Generating Differential Diagnoses for Neurosurgeons? Evaluation and mitigation of the limitations of large language models in clinical decision-making Large Language Models for Inorganic Synthesis Predictions The Performance of ChatGPT-4 and Gemini Ultra 1.0 for Quality Assurance Review in Emergency Medical Services Chest Pain Calls To trust or not to trust: evaluating the reliability and safety of AI responses to laryngeal cancer queries Current Concepts Review: Large Language Models in Orthopaedics: Definitions, Uses, and Limitations Exploring the potential of artificial intelligence to enhance the writing of english academic papers by non-native english-speaking medical students - the educational application of ChatGPT Large Language Model-Based AI Agent for Organic Semiconductor Devices Research Triage Performance Across Large Language Models, ChatGPT, and Untrained Doctors in Emergency Medicine: Comparative Study Applications of large language models in psychiatry: a systematic review Testing theory of mind in large language models and humans ChatGPT's performance in dentistry and allergyimmunology assessments: a comparative study Assessing the Responses of Large Language Models (ChatGPT-4, Gemini, and Microsoft Copilot) to Frequently Asked Questions in Breast Imaging: A Study on Readability and Accuracy FFA-GPT: an automated pipeline for fundus fluorescein angiography interpretation and question-answer The potential and pitfalls of using a large language model such as ChatGPT, GPT-4, or LLaMA as a clinical assistant Enhancing readability of USFDA patient communications through large language models: a proof-of-concept study The consequences of generative AI for online knowledge communities Emerging opportunities of using large language models for translation between drug molecules and indications Physician Versus Large Language Model Chatbot Responses to Web-Based Questions From Autistic Patients in Chinese: Cross-Sectional Comparative Analysis RefAI: a GPT-powered retrieval-augmented generative tool for biomedical literature recommendation and summarization Reliability of large language models for advanced head and neck malignancies management: a comparison between ChatGPT 4 and Gemini Advanced From explainable to interpretable deep learning for natural language processing in healthcare: How far from reality? Assessing the Application of Large Language Models in Generating Dermatologic Patient Education Materials According to Reading Level: Qualitative Study A Comparative Study of Responses to Retina Questions from Either Experts, Expert-Edited Large Language Models, or Expert-Edited Large Language Models Alone Evolution of publicly available large language models for complex decision-making in breast cancer care Using Large Language Models to Support Content Analysis: A Case Study of ChatGPT for Adverse Event Detection Evidence-Based Learning Strategies in Medicine Using AI Harnessing Artificial Intelligence in Multimodal Omics Data Integration: Paving the Path for the Next Frontier in Precision Medicine Artificial intelligence classifies primary progressive aphasia from connected speech Brief Review and Primer of Key Terminology for Artificial Intelligence and Machine Learning in Hypertension Evaluating Artificial Intelligence's Role in Teaching the Reporting and Interpretation of Computed Tomographic Angiography for Preoperative Planning of the Deep Inferior Epigastric Artery Perforator Flap ChatGPT as a Tool for Medical Education and Clinical Decision-Making on the Wards: Case Study A practical guide to the implementation of artificial intelligence in orthopaedic research-Part 2: A technical introduction Applications of Artificial Intelligence in Prostate Cancer Care: A Path to Enhanced Efficiency and Outcomes Mapping of specialized metabolite terms onto a plant phylogeny using text mining and large language models RELATIONAL DIMENSION VERSUS ARTIFICIAL INTELLIGENCE Assessing and Optimizing Large Language Models on Spondyloarthritis Multi-Choice Question Answering: Protocol for Enhancement and Assessment Knowledge graphs in psychiatric research: Potential applications and future perspectives Artificial intelligence in medicine and healthcare: Opportunity and/or threat
Evidence Blueprint
Large language models are changing landscape of academic publications. A positive transformation?
![](https://psychiatryai.com/wp-content/uploads/2023/04/psychiatryai_com.webp)
☊ AI-Driven Related Evidence Nodes
(recent articles with at least 5 words in title)
More Evidence
![](https://psychiatryai.com/wp-content/uploads/2023/04/psychiatryai_com.webp)
Large language models are changing landscape of academic publications. A positive transformation?
🌐 365 Days
AI Virtual Reality Related Evidence Matrix
Clinical Accuracy, Relevance, Clarity, and Emotional Sensitivity of Large Language Models to Surgical Patient Questions: Cross-Sectional Study Adapted large language models can outperform medical experts in clinical text summarization Hallucination Rates and Reference Accuracy of ChatGPT and Bard for Systematic Reviews: Comparative Analysis Performance of two large language models for data extraction in evidence synthesis Large language models from OpenAI, Google, Meta, X and Co. : The role of "closed" and "open" models in radiology The impact of large language models on radiology: a guide for radiologists on the latest innovations in AI Comparing the Performance of Popular Large Language Models on the National Board of Medical Examiners Sample Questions Maximizing Large Language Model Utility in Cardiovascular Care: A Practical Guide Assessing the risk of takeover catastrophe from large language models Potential of Large Language Models in Health Care: Delphi Study Current safeguards, risk mitigation, and transparency measures of large language models against the generation of health disinformation: repeated cross sectional analysis Large language models reshaping molecular biology and drug development Large language models in psychiatry: Opportunities and challenges The application of large language models in medicine: A scoping review From Bench to Bedside With Large Language Models: AJR Expert Panel Narrative Review Large Language Models and User Trust: Consequence of Self-Referential Learning Loop and the Deskilling of Health Care Professionals AE-GPT: Using Large Language Models to extract adverse events from surveillance reports-A use case with influenza vaccine adverse events Enhancing Care for Older Adults and Dementia Patients with Large Language Models: Proceedings of the National Institute on Aging -Artificial Intelligence & Technology Collaboratory for Aging Research Symposium Large language models for generating medical examinations: systematic review MedChatZH: A tuning LLM for traditional Chinese medicine consultations Fine-tuning large language models for chemical text mining AI vs academia: Experimental study on AI text detectors' accuracy in behavioral health academic writing Evaluating the performance of large language models: ChatGPT and Google Bard in generating differential diagnoses in clinicopathological conferences of neurodegenerative disorders Automated Category and Trend Analysis of Scientific Articles on Ophthalmology Using Large Language Models: Development and Usability Study Large language models could change the future of behavioral healthcare: a proposal for responsible development and evaluation Assessing the Alignment of Large Language Models With Human Values for Mental Health Integration: Cross-Sectional Study Using Schwartz's Theory of Basic Values Large Language Models pose risk to science with false answers, says Oxford study ChatGPT's ability to generate realistic experimental images poses a new challenge to academic integrity Urology consultants versus large language models: Potentials and hazards for medical advice in urology Using large language model to guide patients to create efficient and comprehensive clinical care message AI-generated text in otolaryngology publications: a comparative analysis before and after the release of ChatGPT An overview of diagnostics and therapeutics using large language models Utility of Large Language Models for Health Care Professionals and Patients in Navigating Hematopoietic Stem Cell Transplantation: Comparison of the Performance of ChatGPT-3.5, ChatGPT-4, and Bard ChatGPT and Bard exhibit spontaneous citation fabrication during psychiatry literature search Leveraging Large Language Models for Improved Patient Access and Self-Management: Assessor-Blinded Comparison Between Expert- and AI-Generated Content Assessing prognosis in depression: comparing perspectives of AI models, mental health professionals and the general public A comparative analysis of ChatGPT, ChatGPT-4 and Google Bard performances at the Advanced Burn Life Support Exam Medical Sciences Education Forum: AI in practice Deception abilities emerged in large language models Leveraging large language models to foster equity in healthcare Implications of Large Language Models for Quality and Efficiency of Neurologic Care: Emerging Issues in Neurology The Role of Large Language Models in Transforming Emergency Medicine: Scoping Review Can AI Answer My Questions? Utilizing Artificial Intelligence in the Perioperative Assessment for Abdominoplasty Patients An Empirical Evaluation of Prompting Strategies for Large Language Models in Zero-Shot Clinical Natural Language Processing: Algorithm Development and Validation Study Large Language Models Facilitate the Generation of Electronic Health Record Phenotyping Algorithms Can Artificial Intelligence Mitigate Missed Diagnoses by Generating Differential Diagnoses for Neurosurgeons? Evaluation and mitigation of the limitations of large language models in clinical decision-making Large Language Models for Inorganic Synthesis Predictions The Performance of ChatGPT-4 and Gemini Ultra 1.0 for Quality Assurance Review in Emergency Medical Services Chest Pain Calls Inductive reasoning with large language models: a simulated randomized controlled trial for epilepsy To trust or not to trust: evaluating the reliability and safety of AI responses to laryngeal cancer queries Current Concepts Review: Large Language Models in Orthopaedics: Definitions, Uses, and Limitations Evaluation of Large Language Model Performance and Reliability for Citations and References in Scholarly Writing: Cross-Disciplinary Study Framework-based qualitative analysis of free responses of Large Language Models: Algorithmic fidelity Exploring the potential of artificial intelligence to enhance the writing of english academic papers by non-native english-speaking medical students - the educational application of ChatGPT Large Language Model-Based AI Agent for Organic Semiconductor Devices Research Triage Performance Across Large Language Models, ChatGPT, and Untrained Doctors in Emergency Medicine: Comparative Study Applications of large language models in psychiatry: a systematic review Testing theory of mind in large language models and humans Quality of Answers of Generative Large Language Models Versus Peer Users for Interpreting Laboratory Test Results for Lay Patients: Evaluation Study Performance of Large Language Models on a Neurology Board-Style Examination ChatGPT's performance in dentistry and allergyimmunology assessments: a comparative study Assessing the Responses of Large Language Models (ChatGPT-4, Gemini, and Microsoft Copilot) to Frequently Asked Questions in Breast Imaging: A Study on Readability and Accuracy FFA-GPT: an automated pipeline for fundus fluorescein angiography interpretation and question-answer