MOSAIC: Multi-Objective Slice-Aware Iterative Curation for Alignment
arXiv cs.CL / 3/20/2026
Key Points
- MOSAIC is a multi-objective framework for slice-aware iterative data curation that balances safety, benign over-refusal, and instruction following under a fixed 1M-token budget across five rounds of fine-tuning.
- It uses slice-level failure profiles to derive executable data actions, including dataset-level mixture ratios, bucket-level weights, and focus criteria.
- The approach reports XGuard rising from 2.76 to 4.67, with OrBench at 4.41 and IFEval at 3.65, and generalizes better than a random static LoRA baseline on attack, over-refusal, and capability tests.
- The results suggest that structured failure diagnosis can serve as a practical control signal for budgeted data construction; the code is available on GitHub.
- This work provides a framework for data-centric alignment under constraints and could inform future budget-aware fine-tuning pipelines.
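
To make the second key point concrete, here is a minimal Python sketch of how slice-level failure rates might be turned into the three kinds of data actions the summary names: dataset-level mixture ratios, bucket-level weights, and focus criteria. It is not the authors' implementation. The names `SliceProfile` and `plan_round`, the proportional-to-failure rule, the `floor` parameter, and the even split of the 1M-token budget over five rounds are all assumptions made for illustration.

```python
from dataclasses import dataclass

TOTAL_TOKEN_BUDGET = 1_000_000  # fixed budget stated in the summary
NUM_ROUNDS = 5                  # five fine-tuning rounds stated in the summary


@dataclass
class SliceProfile:
    """Hypothetical record: failure rate of one evaluation slice."""
    dataset: str         # e.g. "safety", "over_refusal", "instruction"
    bucket: str          # slice name inside that dataset
    failure_rate: float  # fraction of failed items, in [0, 1]


def plan_round(profiles, round_budget, floor=0.05):
    """Map slice failure rates to data actions (assumed proportional rule)."""
    datasets = sorted({p.dataset for p in profiles})

    # Dataset-level mixture: failure mass plus a floor so no objective is starved.
    mass = {d: floor for d in datasets}
    for p in profiles:
        mass[p.dataset] += p.failure_rate
    total = sum(mass.values())
    mixture = {d: mass[d] / total for d in datasets}

    bucket_weights = {}
    focus = {}
    for d in datasets:
        rows = [p for p in profiles if p.dataset == d]
        denom = sum(p.failure_rate for p in rows)
        # Bucket-level weights: normalise failure rates within the dataset,
        # falling back to uniform weights when nothing failed.
        for p in rows:
            weight = p.failure_rate / denom if denom > 0 else 1.0 / len(rows)
            bucket_weights[(d, p.bucket)] = weight
        # Focus criterion: the worst-performing bucket in the dataset.
        focus[d] = max(rows, key=lambda p: p.failure_rate).bucket

    # Token allocation for this round; rounding down keeps it within budget.
    token_alloc = {d: int(round_budget * mixture[d]) for d in datasets}
    return mixture, bucket_weights, focus, token_alloc


if __name__ == "__main__":
    # Made-up failure rates, for illustration only.
    profiles = [
        SliceProfile("safety", "jailbreak", 0.6),
        SliceProfile("safety", "roleplay", 0.2),
        SliceProfile("over_refusal", "medical", 0.1),
        SliceProfile("instruction", "format", 0.1),
    ]
    round_budget = TOTAL_TOKEN_BUDGET // NUM_ROUNDS  # assumed even split
    mixture, weights, focus, tokens = plan_round(profiles, round_budget)
    print(mixture, focus, tokens)
```

In an iterative setup, a plan like this would be recomputed after each round from fresh evaluation results, so the mixture shifts toward whichever slices are still failing.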