AI Navigate

SIA: A Synthesize-Inject-Align Framework for Knowledge-Grounded and Secure E-commerce Search LLMs with Industrial Deployment

arXiv cs.CL / 3/18/2026

📰 NewsIndustry & Market MovesModels & Research

Key Points

  • The paper proposes a Synthesize-Inject-Align (SI) framework to build knowledgeable and secure e-commerce search LLMs, addressing both knowledge hallucination and jailbreak security vulnerabilities.
  • It synthesizes a high-quality corpus by merging structured knowledge graphs with unstructured behavioral logs, augmented with reasoning chains and safety-aware data.
  • It introduces a parameter-efficient pre-training strategy named Depth Up-Scaling to inject domain knowledge while preserving general capabilities.
  • It employs a dual-path alignment approach via multi-task instruction tuning and adversarial training to strengthen both task performance and safety robustness.
  • The framework has been deployed at JD.com, with A/B tests across five core search scenarios showing significant improvements in key business metrics and validating industrial effectiveness and scalability.

Abstract

Large language models offer transformative potential for e-commerce search by enabling intent-aware recommendations. However, their industrial deployment is hindered by two critical challenges: (1) knowledge hallucination due to insufficient encoding of dynamic, fine-grained product knowledge, and (2) security vulnerabilities under jailbreak attacks that threaten compliance. To address these issues, we propose SI--a Synthesize-Inject-Align framework for building knowledgeable and secure e-commerce search LLMs. Our approach first synthesizes high-quality natural language corpus by combining structured knowledge graphs with unstructured behavioral logs, augmented with reasoning chains and safety-aware data.We then introduce a parameter-efficient pre-training strategy based on Depth Up-Scaling to inject domain knowledge while preserving general capabilities. Finally, a dual-path alignment method via multi-task instruction tuning and adversarial training strengthens both task performance and safety robustness. The framework has been deployed at JD.com, China's largest self-operated e-commerce platform, where A/B tests across five core search scenarios demonstrate significant improvements in key business metrics, validating its industrial effectiveness and scalability.