Boltzmann Machine Learning with a Parallel, Persistent Markov chain Monte Carlo method for Estimating Evolutionary Fields and Couplings from a Protein Multiple Sequence Alignment
arXiv stat.ML / 4/21/2026
💬 OpinionDeveloper Stack & InfrastructureModels & Research
Key Points
- The paper tackles the inverse Potts problem by using a Boltzmann machine approach to estimate evolutionary single-site fields and pairwise couplings from amino-acid frequency statistics in protein multiple sequence alignments.
- To address the method’s high computational cost, the authors introduce a parallel, persistent Markov chain Monte Carlo (MCMC) scheme for estimating marginal distributions at each learning step, alongside stochastic gradient descent to speed up training.
- Hyperparameter tuning is improved by specifically adjusting two regularization parameters (for fields and couplings) using a condition intended to better match protein conformational requirements, rather than relying on contact-pair prediction precision alone.
- The proposed workflow is evaluated on eight protein families, supporting the practicality of the approach for deriving reproducible evolutionary parameters relevant to protein structure and evolution studies.
Related Articles

A practical guide to getting comfortable with AI coding tools
Dev.to

We built it during the NVIDIA DGX Spark Full-Stack AI Hackathon — and it ended up winning 1st place overall 🏆
Dev.to

Stop Losing Progress: Setting Up a Pro Jupyter Workflow in VS Code (No More Colab Timeouts!)
Dev.to

🚀 Major BrowserAct CLI Update
Dev.to

Building AgentOS: Why I’m Building the AWS Lambda for Insurance Claims
Dev.to