How to make the most of your masked language model for protein engineering
arXiv cs.LG / 3/12/2026
📰 NewsTools & Practical UsageModels & Research
Key Points
- The paper introduces a stochastic beam search sampling method for masked language models to optimize protein properties during design.
- It leverages MLMs’ efficiency in evaluating the pseudo-perplexity of the entire 1-edit neighborhood to guide generation with multiple objectives.
- It reframes generation as entire-sequence evaluation, enabling flexible multi-objective optimization during protein design.
- In vitro head-to-head experiments on antibody engineering campaigns show that the choice of sampling method can be as impactful as the model itself, underscoring a crucial area for future research.




