Title: Instruction Tuned Large Language Models for Assisting Brain Surgery Through Procedural Alignment and Decision Support Using PPO Reinforcement Learning
Authors: G Ramana Murthy, A. Jansi Rani, Lavanya S, P Ravi Kumar, K. Karunambiga
Journal: Journal of Neonatal Surgery
Publisher: EL-MED-Pub Publishers
Country: Pakistan
Year: 2025
Volume: 14
Issue: 5
Language: en
Keywords: Procedural alignment
Abstract: The integration of large language models (LLMs) into clinical decision-making remains a critical challenge, especially in high-risk domains such as neurosurgery. This study presents a novel framework that leverages instruction-tuned LLMs optimized using Proximal Policy Optimization (PPO) reinforcement learning to assist brain surgery through procedural alignment and decision support. We begin by fine-tuning a transformer-based LLM on domain-specific surgical protocols and neurosurgical dialogue datasets using supervised instruction tuning. To further enhance procedural adherence and mitigate hallucinations, we introduce a reward model guided by expert-annotated signals such as factual accuracy, stepwise protocol fidelity, and relevance to surgical context. PPO is employed to iteratively refine the model's responses through a feedback loop, optimizing both language coherence and domain-specific reliability. Experimental evaluations on simulated neurosurgical benchmarks demonstrate that our model outperforms both instruction-tuned and PPO-only baselines in terms of procedural accuracy and decision support relevance. The results indicate that reinforcement learning from human feedback, when tailored to surgical requirements, significantly improves trustworthiness and alignment in LLM outputs. This research contributes a critical step toward the deployment of explainable, reliable AI assistants for neurosurgical procedures.
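The abstract describes a reward model built from expert-annotated signals (factual accuracy, protocol fidelity, contextual relevance) that drives PPO optimization. The paper does not publish its implementation; the sketch below is a minimal illustration of the two core pieces such a pipeline would contain: a weighted composite reward over the three annotated signals, and the standard PPO clipped surrogate objective. The function names and the reward weights are assumptions for illustration, not the authors' code.

```python
import numpy as np

def composite_reward(factual, protocol_fidelity, relevance,
                     weights=(0.4, 0.4, 0.2)):
    """Combine expert-annotated signals (each in [0, 1]) into a scalar reward.

    The three signals mirror those named in the abstract; the weights are
    hypothetical and would be tuned against expert preference data.
    """
    w_f, w_p, w_r = weights
    return w_f * factual + w_p * protocol_fidelity + w_r * relevance

def ppo_clipped_objective(logp_new, logp_old, advantage, clip_eps=0.2):
    """PPO clipped surrogate objective for one action (token) to maximize.

    Clipping the probability ratio to [1 - eps, 1 + eps] limits how far a
    single policy update can move from the old (instruction-tuned) policy,
    which is what keeps PPO updates stable in RLHF-style fine-tuning.
    """
    ratio = np.exp(logp_new - logp_old)           # pi_new / pi_old
    unclipped = ratio * advantage
    clipped = np.clip(ratio, 1.0 - clip_eps, 1.0 + clip_eps) * advantage
    return np.minimum(unclipped, clipped)          # pessimistic bound
```

In a full RLHF loop, a learned reward model would replace `composite_reward`, the advantage would come from a value head (e.g. via GAE), and the objective would be averaged over sampled responses before a gradient step.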