Evaluating Interactive 2D Visualization as a Sample Selection Strategy for Biomedical Time-Series Data Annotation
arXiv cs.LG / 3/30/2026
💬 OpinionIdeas & Deep AnalysisModels & Research
Key Points
- The study evaluates three sample selection strategies for biomedical time-series annotation—random sampling (RND), farthest-first traversal (FAFT), and a 2D-visualization user interface method (2DV)—using real human annotators under limited annotation budgets.
- Across four classification tasks (infant motility assessment and speech emotion recognition), 2DV delivers the best overall results when labels are aggregated across annotators.
- For infant motility assessment, 2DV is especially effective at capturing rare classes, but its label distribution variability can reduce model performance when training on individual annotators’ labels, where FAFT performs better.
- In speech emotion recognition, 2DV outperforms other methods for expert annotators and achieves similar performance to experts even when considering individual-annotator label sets for non-experts.
- Risk analysis suggests RND is the safest option when annotator number or expertise is uncertain, while 2DV carries the highest failure risk due to higher variability; interviews also found 2DV makes annotation more engaging.
- The authors conclude that 2DV-based sampling is promising for biomedical time-series labeling, particularly when annotation budgets are not extremely tight.
Related Articles

Knowledge Governance For The Agentic Economy.
Dev.to

AI server farms heat up the neighborhood for miles around, paper finds
The Register
Does the Claude “leak” actually change anything in practice?
Reddit r/LocalLLaMA

87.4% of My Agent's Decisions Run on a 0.8B Model
Dev.to

AIエージェントをソフトウェアチームに変える無料ツール「Paperclip」
Dev.to