Reducing MP3 compression bias in music datasets via codec-aware reconstruction

Reddit r/LocalLLaMA / 5/6/2026

💬 OpinionDeveloper Stack & InfrastructureSignals & Early TrendsTools & Practical UsageModels & Research

Key Points

  • The article describes an open-source tool that improves MP3 (LAME-encoded) decoding by reducing systematic codec-induced bias in audio datasets.
  • Instead of treating the task as denoising, it frames MP3 reconstruction as a disambiguation/Bayesian inference problem, because MP3 encoding is non-injective and maps many plausible originals to the same compressed signal.
  • The method selects a coherent reconstruction consistent with both the codec structure and musical priors, aiming to better preserve details like hi-hats/cymbals and transient clarity.
  • The author reports improved performance over a stock decoder on unseen data, with the largest gains at higher (e.g., around 96 kbps CBR) bitrates, and notes it works best for consistent medium-bitrate MP3s (about 96–224 kbps CBR).
  • The tool includes a (slow) web demo and an implementation repo, but it is not intended as “magic restoration” for arbitrary or heavily re-encoded audio.
  • The article provides an evaluation table comparing NMSE between original vs. compressed and original vs. reconstructed signals, showing substantial error reduction from the reconstruction approach.
Reducing MP3 compression bias in music datasets via codec-aware reconstruction

I built a tool to improve decoding of MP3 files (LAME encoded) reducing systematic codec induced bias in audio datasets.

Rather than denoising, it treats reconstruction as a disambiguation problem: MP3 encoding is non-injective, so the observed signal corresponds to a distribution of plausible originals. The model approximates this as a Bayesian inference problem induced by the compression process itself, selecting a coherent signal consistent with both codec structure and musical priors.

What it can help with?

  • clearer hi-hats / cymbals
  • sharper transients (less “smear”)
  • reducing typical MP3 artifacts (swishy / pre-echo stuff)

What it’s not?

  • not magic “restore the original track”
  • not really meant for random YouTube rips or heavily re-encoded audio
  • works best on consistent medium-bitrate MP3s (like 96-224 kbps CBR)

I put up:

  • a web demo (kinda slow 😅)
  • fully open-source repo (you can (and should) run it locally)

👉 Demo: https://audiode.theivanr.duckdns.org/
👉 Repo: https://github.com/theIvanR/ADE-MP3

** Performance vs stock decoder on unseen data **

CBR Bitrate (kbit/sec) nmse(orig, comp) nmse(orig, rec) Delta %
32 4.47E-02 4.10E-02 8.28%
40 3.28E-02 2.92E-02 10.98%
48 2.52E-02 2.21E-02 12.30%
56 1.99E-02 1.67E-02 16.08%
64 1.63E-02 1.33E-02 18.40%
80 9.59E-03 7.18E-03 25.13%
96 6.14E-03 3.75E-03 38.93%
112 4.62E-03 2.20E-03 52.38%
128 3.83E-03 1.40E-03 63.45%
160 3.07E-03 6.25E-04 79.64%
192 1.18E-03 2.83E-04 76.02%
224 5.50E-04 1.49E-04 72.91%
submitted by /u/TheSpicyBoi123
[link] [comments]