AutoPKG: An Automated Framework for Dynamic E-commerce Product-Attribute Knowledge Graph Construction

arXiv cs.AI / 4/21/2026

📰 NewsSignals & Early TrendsIndustry & Market MovesModels & Research

Key Points

  • AutoPKG is a multi-agent LLM framework designed to automatically build and maintain a Product-attribute Knowledge Graph (PKG) from multimodal e-commerce product content.
  • It addresses inconsistent and costly ontology maintenance by inducing product types and type-specific attribute keys on demand, extracting attribute values from text and images, and consolidating changes into a globally consistent canonical graph via a central decision agent.
  • The paper introduces an evaluation protocol for dynamic PKGs that assesses type/key validity, consolidation quality, and edge-level accuracy for value assertions after canonicalization.
  • Experiments on a Lazada (Alibaba) marketplace catalog dataset show strong gains in knowledge efficiency and multimodal extraction accuracy, including improvements in edge-level exact-match F1 and application precision across public benchmarks.
  • Online A/B tests indicate measurable business impact, with AutoPKG-derived attributes increasing GMV across Badge, Search, and Recommendation by 3.81%, 5.32%, and 7.89%, respectively.

Abstract

Product attribute extraction in e-commerce is bottlenecked by ontologies that are inconsistent, incomplete, and costly to maintain. We present AutoPKG, a multi-agent Large Language Model (LLM) framework that automatically constructs a Product-attribute Knowledge Graph (PKG) from multimodal product content. AutoPKG induces product types and type-specific attribute keys on demand, extracts attribute values from text and images, and consolidates updates through a centralized decision agent that maintains a globally consistent canonical graph. We also propose an evaluation protocol for dynamic PKGs that measures type and key validity, consolidation quality, and edge-level accuracy for value assertions after canonicalization. On a large real-world marketplace catalog dataset from Lazada (Alibaba), AutoPKG achieves up to 0.953 Weighted Knowledge Efficiency (WKE) for product types, 0.724 WKE for attribute keys, and 0.531 edge-level F1 for multimodal value extraction. Across three public benchmarks, our method improves edge-level exact-match F1 by 0.152 and yields a precision gain of 0.208 on the attribute extraction application. Online A/B tests show that AutoPKG-derived attributes increase Gross Merchandise Value (GMV) in Badge by 3.81 percent, in Search by 5.32 percent, and in Recommendation by 7.89 percent, supporting the practical value of AutoPKG in production.