Why Is Table Extraction with VLM Models Still Challenging? [D]

Reddit r/MachineLearning / 5/1/2026

💬 OpinionDeveloper Stack & InfrastructureTools & Practical Usage

共有:

Key Points

The post asks for a reliable open-source method to convert PDFs into Markdown with an emphasis on accurately extracting financial tables.
The author reports that borderless tables and tables with more than about 5–6 columns are especially difficult to handle with existing approaches.
They have tried several tools and pipelines (including Docling, Graphite-Docling, and Marker) but have not found a consistently solid open-source solution.
The only approach that works well so far is LandingAI, but it is paid, motivating a search for alternatives.
The request includes example images and seeks community recommendations for open-source tooling or workflows that better support complex table layouts using VLM-style extraction.

Why Is Table Extraction with VLM Models Still Challenging? [D]

Hey everyone, I’m struggling to find a good approach for converting PDFs to Markdown (especially for financial data). The main challenge is handling borderless tables and tables with more than 5–6 columns. I’ve tried docling, graphite-docling, marker, etc., but haven’t found a solid open-source solution. The only thing that works well so far is LandingAI (but it’s paid).

Does anyone know of a good open-source alternative? TIA!

Sample:

https://preview.redd.it/tajjcvjt5jyg1.png?width=959&format=png&auto=webp&s=8d04c5e946ab361bfef08021f79d106ab62a07cd

https://preview.redd.it/lhpwnbty5jyg1.png?width=630&format=png&auto=webp&s=8dc0475a32b89ce7f8107f3940fd3eb6b0896a3a

submitted by /u/No_Stretch_5809
[link] [comments]

Black Hat USA

AI Business

I deployed AI agents across AWS, GCP, and Azure without a VPN. Here is how it works.

Dev.to

Panduan Lengkap TestSprite MCP Server — Dokumentasi Getting Started dalam Bahasa Indonesia

Dev.to

AI made learning fun again

Dev.to

Every Telegram conversation becomes a qualified lead. BizNode captures name, email, and business details automatically while...

Dev.to

Why Is Table Extraction with VLM Models Still Challenging? [D]

Key Points

Related Articles

Black Hat USA

I deployed AI agents across AWS, GCP, and Azure without a VPN. Here is how it works.

Panduan Lengkap TestSprite MCP Server — Dokumentasi Getting Started dalam Bahasa Indonesia

AI made learning fun again

Every Telegram conversation becomes a qualified lead. BizNode captures name, email, and business details automatically while...

関連おすすめサービス

Notta搭載AI議事録イヤホン ZENCHORD1

AI搭載ボイスレコーダー Plaud

画像高画質化AIツール Aiarty Image Enhancer