Artif Intell Med. 2026 Apr 28. pii: S0933-3657(26)00093-X. [Epub ahead of print]178
103441
Medical text records serve as essential repositories of patient information, providing a foundation for informed clinical decision-making, accurate diagnosis, reliable prognosis, and effective treatment planning. Recent advancements in Artificial Intelligence (AI), particularly in Natural Language Processing (NLP) and Machine Learning (ML), have positioned AI-driven language models as powerful tools for analyzing, classifying, and generating medical textual data. In this systematic literature review, an initial search retrieved 548 records published between 1 January 2000 and 1 July 2024. After rigorous screening based on predefined inclusion and exclusion criteria, 22 original research articles were included. The review highlights substantial progress in applying advanced architectures such as Bidirectional Encoder Representations from Transformers (BERT) and Generative Pre-trained Transformers (GPT) to medical text processing tasks. These models consistently outperform conventional NLP and ML approaches, achieving superior results in disease classification, automated clinical documentation, and predictive analytics. However, critical challenges persist, including the limited availability of clinically validated datasets, variability in data preprocessing protocols, insufficient external validation, and the lack of interpretable AI frameworks, all of which collectively hinder clinical trust and large-scale adoption. Future research should prioritize the development of hybrid AI systems that integrate multimodal data sources (text, imaging, and structured records), incorporate explainable AI mechanisms, and adhere to standardized reporting frameworks. Addressing these methodological gaps will be pivotal in enhancing the reliability, clinical applicability, and impact of AI language models, thereby advancing evidence-based medicine, personalized treatment strategies, and overall patient care.
Keywords: Artificial intelligence; Language models; Medical text; Natural language processing