Coding-Free and Privacy-Preserving MCP Framework for Clinical Agentic Research Intelligence System

arXiv cs.CL / 4/15/2026

💬 OpinionIdeas & Deep AnalysisTools & Practical UsageModels & Research

Key Points

  • The paper proposes CARIS, a “coding-free” Clinical Agentic Research Intelligence System that automates major clinical research steps (planning, literature search, cohort building, IRB documentation, modeling, and report writing) using agentic workflows.
  • CARIS uses Large Language Models integrated with modular tools via the Model Context Protocol (MCP), so orchestration is driven by natural language intent rather than requiring users to write code.
  • Privacy is preserved by keeping clinical databases inside the MCP server and exposing only derived outputs and final research reports to users, avoiding direct access to raw sensitive data.
  • In evaluations across three heterogeneous clinical datasets, CARIS finalized research plans and IRB documents within 3–4 iterations and supported Vibe ML by exploring feature–model combinations, ranking the top 10 models, and generating performance visualizations.
  • The resulting reports were assessed as highly complete using a TRIPOD+AI-derived checklist, achieving 96% coverage in LLM evaluation and 82% in human evaluation.

Abstract

Clinical research involves labor-intensive processes such as study design, cohort construction, model development, and documentation, requiring domain expertise, programming skills, and access to sensitive patient data. These demands create barriers for clinicians and external researchers conducting data-driven studies. To overcome these limitations, we developed a Clinical Agentic Research Intelligence System (CARIS) that automates the clinical research workflow while preserving data privacy, enabling comprehensive studies without direct access to raw data. CARIS integrates Large Language Models (LLMs) with modular tools via the Model Context Protocol (MCP), enabling natural language-driven orchestration of appropriate tools. Databases remain securely within the MCP server, and users access only the outputs and final research reports. Based on user intent, CARIS automatically executes the full pipeline: research planning, literature search, cohort construction, Institutional Review Board (IRB) documentation, Vibe Machine Learning (ML), and report generation, with iterative human-in-the-loop refinement. We evaluated CARIS on three heterogeneous datasets with distinct clinical tasks. Research plans and IRB documents were finalized within three to four iterations, using evidence from literature and data. The system supported Vibe ML by exploring feature-model combinations, ranking the top ten models, and generating performance visualizations. Final reports showed high completeness based on a checklist derived from the TRIPOD+AI framework, achieving 96% coverage in LLM evaluation and 82% in human evaluation. CARIS demonstrates that agentic AI can transform clinical hypotheses into executable research workflows across heterogeneous datasets. By eliminating the need for coding and direct data access, the system lowers barriers and bridges public and private clinical data environments.