Unsupervised Neural Network for Automated Classification of Surgical Urgency Levels in Medical Transcriptions

arXiv cs.CL / 4/9/2026

💬 OpinionIdeas & Deep AnalysisModels & Research

Key Points

  • The paper proposes an unsupervised neural-network pipeline to classify surgical urgency levels (immediate, urgent, elective) directly from medical transcription text.
  • It uses BioClinicalBERT to convert transcripts into semantic embeddings, then applies clustering with K-means and Deep Embedding Clustering (DEC), with DEC producing better-formed clusters.
  • To keep outputs clinically meaningful, clustering is validated via the Modified Delphi Method with expert review and iterative refinement.
  • After validation, the work trains a BiLSTM-based neural classifier on top of BioClinicalBERT embeddings and evaluates it with cross-validation using accuracy, precision, recall, and F1-score.
  • The authors argue the approach reduces reliance on labeled data and supports scalable, potentially real-time surgical prioritization to improve operational efficiency and patient outcomes.

Abstract

Efficient classification of surgical procedures by urgency is paramount to optimize patient care and resource allocation within healthcare systems. This study introduces an unsupervised neural network approach to automatically categorize surgical transcriptions into three urgency levels: immediate, urgent, and elective. Leveraging BioClinicalBERT, a domain-specific language model, surgical transcripts are transformed into high-dimensional embeddings that capture their semantic nuances. These embeddings are subsequently clustered using both K-means and Deep Embedding Clustering (DEC) algorithms, in which DEC demonstrates superior performance in the formation of cohesive and well-separated clusters. To ensure clinical relevance and accuracy, the clustering results undergo validation through the Modified Delphi Method, which involves expert review and refinement. Following validation, a neural network that integrates Bidirectional Long Short-Term Memory (BiLSTM) layers with BioClinicalBERT embeddings is developed for classification tasks. The model is rigorously evaluated using cross-validation and metrics such as accuracy, precision, recall, and F1-score, which achieve robust performance and demonstrate strong generalization capabilities on unseen data. This unsupervised framework not only addresses the challenge of limited labeled data but also provides a scalable and reliable solution for real-time surgical prioritization, which ultimately enhances operational efficiency and patient outcomes in dynamic medical environments.