PlotTwist: A Creative Plot Generation Framework with Small Language Models

arXiv cs.CL / 3/18/2026

💬 OpinionIdeas & Deep AnalysisModels & Research

共有:

Key Points

PlotTwist presents a framework that enables small language models (≤5B parameters) to generate premise-conditioned plots that rival much larger frontier models, addressing alignment efficiency and accessibility concerns.
The approach decomposes generation into three components: a Narrative Quality Dimensions reward model trained via a novel Positive-Negative prompting strategy, a Mixture-of-Experts plot generator aligned via Direct Preference Optimization, and an Agentic Evaluation module that mimics human critical judgment for post-hoc assessment.
Experiments show PlotTwist consistently outperforms frontier models across multiple Narrative Quality Dimensions and can distinguish plots derived from acclaimed versus paned screenplays, indicating robust alignment with high-quality storytelling.
The work argues that structured, preference-based alignment is a resource-efficient path to high-quality creative plot generation, potentially reducing the computational barriers of large-scale alignment.

Abstract

Creative plot generation presents a fundamental challenge for language models: transforming a concise premise into a coherent narrative that sustains global structure, character development, and emotional resonance. Although recent Large Language Models (LLMs) demonstrate strong fluency across general-purpose tasks, they typically require preference alignment to perform well on specialized domains such as creative plot generation. However, conducting such alignment at the scale of frontier LLMs is computationally prohibitive, significantly limiting accessibility and practical deployment. To address this, we present PlotTwist, a structured framework that enables Small Language Models (SLMs) with

\leq

5B active parameters to generate high-quality, premise-conditioned plots competitive with frontier systems up to

200\times

larger. Our approach decomposes generation into three specialized components: (1) an Aspect Rating Reward Model trained via a novel Positive-Negative prompting strategy to deliver structured narratives across five Narrative Quality Dimensions (NQDs); (2) a Mixture-of-Experts (MoE) plot generator aligned via Direct Preference Optimization on high-confidence preference pairs; and (3) an Agentic Evaluation module that emulates human critical judgment for unbiased post-hoc assessment. Extensive experiments demonstrate that PlotTwist consistently outperforms frontier models across multiple NQDs despite substantially tighter capacity constraints. Further validation confirms strong sensitivity to narrative quality, as the framework reliably distinguishes plots derived from critically acclaimed versus widely panned screenplays. Together, these results establish structured, preference-based alignment as a resource-efficient approach to high-quality creative plot generation.