MOSAIC: Multi-Objective Slice-Aware Iterative Curation for Alignment
arXiv cs.CL / 3/20/2026
💬 OpinionIdeas & Deep AnalysisModels & Research
Key Points
- MOSAIC introduces a multi-objective framework for slice-aware iterative curation to balance safety, benign-overrefusal, and instruction-following under a fixed 1M-token budget across five rounds of fine-tuning.
- It uses slice-level failure profiles to derive executable data actions, including dataset-level mixture ratios, bucket-level weights, and focus criteria.
- The approach achieves improvements on XGuard (2.76->4.67), OrBench (4.41), and IFEval (3.65) and shows better generalization than a random static LoRA baseline on attacks, over-refusal, and capability tests.
- The method suggests structured failure diagnosis can serve as a practical control signal for budgeted data construction, with code available at GitHub.
- This work provides a framework for data-centric alignment under constraints and could inform future budget-aware fine-tuning pipelines.
Related Articles
How political censorship actually works inside Qwen, DeepSeek, GLM, and Yi: Ablation and behavioral results across 9 models
Reddit r/LocalLLaMA
Engenharia de Prompt: Por Que a Forma Como Você Pergunta Muda Tudo(Um guia introdutório)
Dev.to
The Obligor
Dev.to
The Markup
Dev.to
2026 年 AI 部落格變現完整攻略:從第一篇文章到月收入 $1000
Dev.to