FROM SYNTHETIC MEDIUM TO REAL-WORLD APPLICATION: FINE-TUNING A MEDICAL LLM FOR DDX

Submitted by Anonymous on Mon, 09/02/2026 - 12:18

MSc Thesis📅 23.01.2026 — 10:00

👤 Speaker:

EZGI CAVAS

🎓 Supervisor(s):

PRFO.DR.NIHAN KESIM CICEKLI, DR.AYSENUR BIRTURK

📍 Location:

A101

⏲ Duration:

90 min.

📝 Abstract:

Access to large-scale, annotated EHR is limited by privacy rules. This creates a major setback for training strong clinical NLP models. Synthetic data provides a way to protect privacy, but how well synthetic text works for fine-tuning LLMs in real-world tasks is still an important issue to explore. This thesis presents a framework that uses synthetic patient summaries to fine-tune a medical LLM model for multi-label disease diagnosis. This approach offers a cost-effective and privacy-focused method for creating clinical diagnostic tools with minimal use of sensitive real-world data. The results show that synthetic data can successfully reshape the medical models. This also helps the hospitals that are struggling with triage and the overcrowding of patients.

Time - Location

2026-01-23 10:00:00