EnterpriseLab: A Full-Stack Platform for developing and deploying agents in Enterprises
arXiv cs.AI / 2026/3/24
📰 ニュースDeveloper Stack & InfrastructureSignals & Early TrendsTools & Practical UsageModels & Research
要点
- The paper argues that deploying AI agents in enterprise settings requires balancing performance with data sovereignty and inference cost, and that existing small-model pipelines are too fragmented to reach frontier-like specialization.
- It introduces EnterpriseLab, a full-stack closed-loop platform that unifies tool integration (via Model Context Protocol), automated trajectory/training-data synthesis from environment schemas, and continuous evaluation in the training pipeline.
- EnterpriseLab is validated through EnterpriseArena, which connects 15 enterprise applications and 140+ tools spanning IT, HR, sales, and engineering.
- The results claim that 8B-parameter models trained with EnterpriseLab can match GPT-4o on complex enterprise workflows while cutting inference costs by 8–10x.
- The paper reports robustness across multiple enterprise benchmarks, including EnterpriseBench (+10%) and CRMArena (+10%), positioning EnterpriseLab as a practical route to privacy-preserving agent deployment.

