PRIME-CVD: A Parametrically Rendered Informatics Medical Environment for Education in Cardiovascular Risk Modelling
arXiv cs.LG / 3/23/2026
📰 NewsDeveloper Stack & InfrastructureTools & Practical UsageModels & Research
Key Points
- PRIME-CVD introduces two openly accessible synthetic data assets representing a cohort of 50,000 adults undergoing primary prevention for cardiovascular disease, enabling education and methodological development without exposing real EMR data.
- The datasets are generated entirely from a user-specified causal directed acyclic graph parameterised using public statistics and published epidemiologic estimates, rather than from patient-level EMR data or trained generative models, preserving privacy and interpretability.
- Data Asset 1 provides a clean, analysis-ready cohort for exploratory analysis, stratification, and survival modelling, while Data Asset 2 restructures the same cohort into a relational, EMR-style database with realistic heterogeneity, supporting data cleaning and policy-relevant risk modelling.
- The work is released under a Creative Commons Attribution 4.0 licence to support reproducible research and scalable medical education.
Related Articles
The Moonwell Oracle Exploit: How AI-Assisted 'Vibe Coding' Turned cbETH Into a $1.12 Token and Cost $1.78M
Dev.to
How CVE-2026-25253 exposed every OpenClaw user to RCE — and how to fix it in one command
Dev.to
Day 10: An AI Agent's Revenue Report — $29, 25 Products, 160 Tweets
Dev.to
Does Synthetic Data Generation of LLMs Help Clinical Text Mining?
Dev.to
What CVE-2026-25253 Taught Me About Building Safe AI Assistants
Dev.to