Call-Chain-Aware LLM-Based Test Generation for Java Projects

arXiv cs.AI / 4/27/2026

💬 OpinionTools & Practical UsageModels & Research

共有:

Key Points

The paper introduces CAT, a call-chain-aware LLM-based method for generating Java unit tests using static analysis to add call-chain and dependency context to prompts.
CAT goes beyond execution-path-only prompting by modeling caller–callee relationships, object constructors, and third-party dependencies to help produce executable and semantically valid test contexts.
It includes an iterative test-fixing mechanism to recover from generation failures, improving robustness when tests initially cannot run.
On the Defects4J benchmark, CAT raises line coverage by 18.04% and branch coverage by 21.74% compared with the state-of-the-art approach PANTA.
CAT also performs better than the prior approach on four real-world GitHub projects released after the LLM cutoff date, and an ablation study confirms the value of call-chain and dependency contexts.

Abstract

Large language models (LLMs) have recently shown strong potential for generating project-level unit tests. However, existing state-of-the-art approaches primarily rely on execution-path information to guide prompt construction, which is often insufficient for complex software systems with rich inter-class dependencies, deep call chains, and intricate object initialization requirements. In this paper, we present CAT, a novel call-chain-aware LLM-based test generation approach that explicitly incorporates call-chain and dependency contexts into prompts through dedicated static analysis. To construct executable, semantically valid test contexts, CAT systematically models caller--callee relationships, object constructors, and third-party dependencies, and supports iterative test fixing when generation failures occur. We evaluate CAT on the widely used Defects4J benchmark and on four real-world GitHub projects released after the LLM's cut-off date. The results show that, across projects in Defects4J, CAT improves line and branch coverage by 18.04% and 21.74%, respectively, over the state-of-the-art approach PANTA, while consistently achieving superior performance on post-cutoff real-world projects. An ablation study further demonstrates the importance of call-chain and dependency contexts in CAT.

Black Hat USA

AI Business

Legal Insight Transformation: 7 Mistakes to Avoid When Adopting AI Tools

Dev.to

Legal Insight Transformation: A Beginner's Guide to Modern Research

Dev.to

The Open Source AI Studio That Nobody's Talking About

Dev.to

How I Built a 10-Language Sports Analytics Platform with FastAPI, SQLite, and Claude AI (As a Solo Non-Technical Founder)

Dev.to

Call-Chain-Aware LLM-Based Test Generation for Java Projects

Key Points

Abstract

Related Articles

Black Hat USA

Legal Insight Transformation: 7 Mistakes to Avoid When Adopting AI Tools

Legal Insight Transformation: A Beginner's Guide to Modern Research

The Open Source AI Studio That Nobody's Talking About

How I Built a 10-Language Sports Analytics Platform with FastAPI, SQLite, and Claude AI (As a Solo Non-Technical Founder)

関連おすすめサービス

Notta搭載AI議事録イヤホン ZENCHORD1

AI搭載ボイスレコーダー Plaud

画像高画質化AIツール Aiarty Image Enhancer