Large Language Models Meet Biomedical Knowledge Graphs for Mechanistically Grounded Therapeutic Prioritization

arXiv cs.AI / 4/23/2026

📰 NewsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper introduces DrugKLM, a hybrid framework that combines biomedical knowledge graphs with large-language-model mechanistic reasoning for therapeutic (especially drug repurposing) prioritization.
  • On benchmark datasets, DrugKLM achieves higher performance than knowledge-graph-only and language-model-only baselines, including TxGNN.
  • DrugKLM’s confidence scores are reported to be functionally aligned with molecular phenotypes, linking higher scores to transcriptional signatures associated with improved survival across 12 TCGA cancers.
  • The method is claimed to favor biologically perturbational evidence over historically well-connected indication patterns, and expert curation across five cancers suggests DrugKLM’s prioritization behavior varies in a disease-specific, context-aware way.
  • Overall, the work positions DrugKLM as an evidence-integrative approach that turns heterogeneous biomedical data into mechanistically interpretable and clinically grounded therapeutic hypotheses.

Abstract

Drug repurposing is often framed as a candidate identification task, but existing approaches provide limited guidance for distinguishing biologically plausible candidates from historically well-connected ones. Here we introduce DrugKLM, a hybrid framework that integrates biomedical knowledge graph structure with large language model-based mechanistic reasoning to enable mechanistically grounded therapeutic prioritization. Across benchmark datasets, DrugKLM outperforms knowledge graph-only and language model-only baselines, including TxGNN. Beyond improved recall, DrugKLM confidence scores exhibit functional alignment with molecular phenotypes: higher scores are associated with transcriptional signatures linked to improved survival across 12 TCGA cancers. The scoring framework preferentially captures biologically perturbational signals rather than historical indication patterns. Expert curation across five cancers further reveals systematic differences in prioritization behavior, with DrugKLM elevating candidates supported by coherent mechanistic rationale and disease-specific clinical context. Together, these results establish DrugKLM as an evidence-integrative framework that translates heterogeneous biomedical data into mechanistically interpretable and clinically grounded therapeutic hypotheses.