MGDA-Decoupled: Geometry-Aware Multi-Objective Optimisation for DPO-based LLM Alignment
arXiv cs.LG / 4/23/2026
📰 NewsIdeas & Deep AnalysisTools & Practical UsageModels & Research
Key Points
- The paper addresses LLM alignment as a multi-objective optimization problem involving potentially conflicting goals like helpfulness, truthfulness, and harmlessness.
- It argues that using a fixed scalarization in many alignment pipelines can create procedural unfairness by under-weighting harder or minority objectives.
- It proposes MGDA-Decoupled, a geometry-aware multi-objective algorithm that computes a shared descent direction while accounting for each objective’s convergence dynamics.
- Unlike prior approaches that use reinforcement learning or explicit reward models, MGDA-Decoupled is designed to work entirely within the lightweight DPO (Direct Preference Optimization) framework.
- Experiments on the UltraFeedback dataset indicate that geometry-aware methods, especially MGDA-Decoupled, achieve the highest win rates against golden responses overall and across individual objectives.




