Directing the Narrative: A Finetuning Method for Controlling Coherence and Style in Story Generation
arXiv cs.CV / 3/19/2026
📰 NewsIdeas & Deep AnalysisModels & Research
Key Points
- It proposes a two-stage framework for story generation that combines Group-Shared Attention (GSA) and Direct Preference Optimization (DPO) to improve coherence and style.
- Group-Shared Attention enables lossless cross-sample information flow within attention layers to encode identity consistency across frames without relying on external encoders.
- Direct Preference Optimization aligns generated outputs with human aesthetic and narrative standards by learning from holistic preference data rather than conflicting auxiliary losses.
- On ViStoryBench, the approach achieves state-of-the-art results with +10.0 gains in Character Identity (CIDS) and +18.7 gains in Style Consistency (CSD) while preserving high-fidelity generation.




