AI Navigate

ExPosST: Explicit Positioning with Adaptive Masking for LLM-Based Simultaneous Machine Translation

arXiv cs.CL / 3/17/2026

📰 NewsModels & Research

Key Points

  • ExPosST proposes explicit position allocation to resolve the positional mismatch when using decoder-only LLMs for simultaneous machine translation (SimulMT).
  • It reserves fixed positional slots for incoming source tokens to enable efficient decoding with KV cache across different positional encoding schemes.
  • The authors introduce a policy-consistent fine-tuning strategy that aligns training with inference-time decoding behavior, bridging fine-tuning and inference.
  • Experiments on multiple language pairs show that ExPosST enables simultaneous translation under diverse policies and improves compatibility with various encoding methods.
  • The framework aims to improve inference efficiency, positional consistency, and broad model compatibility in LLM-based SimulMT.

Abstract

Large language models (LLMs) have recently demonstrated promising performance in simultaneous machine translation (SimulMT). However, applying decoder-only LLMs to SimulMT introduces a positional mismatch, which leads to a dilemma between decoding efficiency and positional consistency. Existing approaches often rely on specific positional encodings or carefully designed prompting schemes, and thus fail to simultaneously achieve inference efficiency, positional consistency, and broad model compatibility. In this work, we propose ExPosST, a general framework that resolves this dilemma through explicit position allocation. ExPosST reserves fixed positional slots for incoming source tokens, enabling efficient decoding with KV cache across different positional encoding methods. To further bridge the gap between fine-tuning and inference, we introduce a policy-consistent fine-tuning strategy that aligns training with inference-time decoding behavior. Experiments across multiple language pairs demonstrate that ExPosST effectively supports simultaneous translation under diverse policies.