Predicting Blastocyst Formation in IVF: Integrating DINOv2 and Attention-Based LSTM on Time-Lapse Embryo Images
arXiv cs.CV / 4/21/2026
📰 NewsIdeas & Deep AnalysisModels & Research
Key Points
- The study addresses a key IVF challenge: selecting embryos for transfer by predicting blastocyst formation from limited time-lapse images rather than full video sequences.
- It introduces a hybrid approach that combines DINOv2 (a transformer-based vision model) for feature extraction with an enhanced LSTM augmented with multi-head attention to model temporal embryo development.
- Experiments on a real dataset of 704 embryo videos show the proposed model reaches 96.4% accuracy, outperforming prior methods.
- The method is designed to remain effective even when time-lapse frames are missing, which makes it potentially practical for IVF clinics that lack complete imaging systems.
- The authors suggest the system could support embryologists by improving the speed and confidence of embryo selection during IVF.
- categories-research
Related Articles

Every time a new model comes out, the old one is obsolete of course
Reddit r/LocalLLaMA

We built it during the NVIDIA DGX Spark Full-Stack AI Hackathon — and it ended up winning 1st place overall 🏆
Dev.to

Stop Losing Progress: Setting Up a Pro Jupyter Workflow in VS Code (No More Colab Timeouts!)
Dev.to

Building AgentOS: Why I’m Building the AWS Lambda for Insurance Claims
Dev.to

Where we are. In a year, everything has changed. Kimi - Minimax - Qwen - Gemma - GLM
Reddit r/LocalLLaMA