Why Is Table Extraction with VLM Models Still Challenging? [D]

Reddit r/MachineLearning / 5/1/2026


Key Points

  • The post asks for a reliable open-source method to convert PDFs into Markdown with an emphasis on accurately extracting financial tables.
  • The author reports that borderless tables and tables with more than about 5–6 columns are especially difficult to handle with existing approaches.
  • They have tried several tools and pipelines (including Docling, Granite-Docling, and Marker) but have not found a consistently solid open-source solution.
  • The only approach that works well so far is LandingAI, but it is paid, motivating a search for alternatives.
  • The request includes example images and seeks community recommendations for open-source tooling or workflows that better support complex table layouts using VLM-style extraction.

Hey everyone, I’m struggling to find a good approach for converting PDFs to Markdown (especially for financial data). The main challenge is handling borderless tables and tables with more than 5–6 columns. I’ve tried docling, granite-docling, marker, etc., but haven’t found a solid open-source solution. The only thing that works well so far is LandingAI (but it’s paid).

Does anyone know of a good open-source alternative? TIA!

Sample:

https://preview.redd.it/tajjcvjt5jyg1.png?width=959&format=png&auto=webp&s=8d04c5e946ab361bfef08021f79d106ab62a07cd

https://preview.redd.it/lhpwnbty5jyg1.png?width=630&format=png&auto=webp&s=8dc0475a32b89ce7f8107f3940fd3eb6b0896a3a
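
For anyone comparing tools on wide tables like the ones above, here is a minimal sketch of a sanity check on the Markdown output: it flags rows whose cell count differs from the header, which is a common symptom when columns get merged or dropped. This is not part of docling, marker, or LandingAI. The function names (`split_row`, `ragged_rows`) and the sample table are made up for illustration, and the sketch assumes pipe-style tables with no escaped `\|` inside cells.

```python
def split_row(line: str) -> list[str]:
    """Split one pipe-style Markdown table row into stripped cell strings."""
    body = line.strip()
    # Drop the optional leading and trailing pipes before splitting.
    if body.startswith("|"):
        body = body[1:]
    if body.endswith("|"):
        body = body[:-1]
    return [cell.strip() for cell in body.split("|")]


def ragged_rows(markdown_table: str) -> list[tuple[int, int]]:
    """Return (row_index, cell_count) for every row narrower or wider than the header.

    Row 0 is the header; the separator row (| --- | --- |) counts as row 1.
    """
    rows = [split_row(line) for line in markdown_table.splitlines() if line.strip()]
    if not rows:
        return []
    expected = len(rows[0])
    return [(i, len(cells)) for i, cells in enumerate(rows) if len(cells) != expected]


# Illustrative (made-up) extraction result: the last row lost one column.
sample = """\
| Item    | Q1 | Q2 | Q3 | Q4 | FY |
| ---     | ---| ---| ---| ---| ---|
| Revenue | 10 | 12 | 11 | 13 | 46 |
| Costs   | 7  | 8  | 9  | 31 |
"""

print(ragged_rows(sample))  # [(3, 5)]
```

A check like this only catches structural damage. It says nothing about whether the values landed in the right columns, so it is a first filter rather than a full accuracy metric.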

submitted by /u/No_Stretch_5809