Stud Health Technol Inform. 2025 May 15;327:874-875. doi: 10.3233/SHTI250486.
ABSTRACT
This study explores the utility of Large Language Models (LLMs) to support finding rare patient record details that could make a patient identifiable. Whilst most research has focused on what we call direct patient identifiers, indirect patient identifiers are not widely addressed. Our evaluation of patient records with mentions of indirect risks predicted by our LLM shows the potential to find these risks automatically. However, many risks highlighted were false positives or did not constitute identifiable risk. More work is needed to understand how we can harness the potential of LLMs as part of our de-identification pipelines for patient health records. Better de-identification of health records is important for safely improving data access and advancing research without compromising confidentiality.
PMID:40380594 | DOI:10.3233/SHTI250486
AI-Assisted Evidence Search
Share Evidence Blueprint
Search Google Scholar