ZTab: Domain-based Zero-shot Annotation for Table Columns
arXiv cs.LG / 3/13/2026
Key Points
- ZTab proposes a domain-based zero-shot framework to automatically annotate semantic column types in relational tables without requiring user-provided labeled data, addressing privacy concerns and labeling costs.
- It generates pseudo-tables from sample schemas and fine-tunes an annotation LLM on them to enable domain-aware zero-shot labeling.
- The domain configuration trades zero-shot breadth against annotation performance: a universal domain approaches pure zero-shot, while a specialized domain achieves better accuracy within a given application.
- The approach aims to reduce reliance on high-performance closed-source LLMs and to operate at test time without retraining on similar domains; code and datasets are available on GitHub for reproducibility.
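
The pseudo-table idea in the key points can be illustrated with a minimal sketch. This is not ZTab's implementation: the value pools, the function names `make_pseudo_table` and `to_training_examples`, and the prompt format are all hypothetical, chosen only to show how a sample schema might be turned into labeled fine-tuning examples without any user-provided data.

```python
import random

# Hypothetical domain configuration: each semantic column type maps to a
# small pool of example values. The pools stand in for whatever value
# generator is actually used; they are an assumption for illustration.
DOMAIN_VALUE_POOLS = {
    "country": ["France", "Japan", "Brazil", "Kenya"],
    "currency_code": ["EUR", "JPY", "BRL", "KES"],
    "year": ["1998", "2005", "2014", "2021"],
}


def make_pseudo_table(schema, n_rows, rng):
    """Build one pseudo-table: a dict from column type to sampled cell values."""
    return {
        col_type: [rng.choice(DOMAIN_VALUE_POOLS[col_type]) for _ in range(n_rows)]
        for col_type in schema
    }


def to_training_examples(table):
    """Turn each column into a (prompt, label) pair for supervised fine-tuning."""
    examples = []
    for col_type, cells in table.items():
        # The prompt format is an assumption, not the one used by ZTab.
        prompt = "Column values: " + ", ".join(cells) + "\nSemantic type:"
        examples.append((prompt, col_type))
    return examples


rng = random.Random(0)  # fixed seed so the sketch is reproducible
schema = ["country", "year"]  # a sample schema drawn from the domain
table = make_pseudo_table(schema, n_rows=3, rng=rng)
examples = to_training_examples(table)
```

In this sketch, a broader `DOMAIN_VALUE_POOLS` would correspond to a more universal domain, and a narrower one to a specialized domain; the resulting `examples` list is what a fine-tuning step would consume.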




