DataEvolver: Let Your Data Build and Improve Itself via Goal-Driven Loop Agents

arXiv cs.AI / 5/5/2026


Key Points

  • The paper introduces DataEvolver, a closed-loop “visual data engine” that uses explicit goals and iterative generation–inspection–correction–filtering–export to create controllable training data for image editing and multimodal understanding.
  • DataEvolver is designed to manage multiple persistent artifact types, including RGB images, masks, depth/normal maps, meshes, poses, trajectories, and review traces.
  • The system’s current release uses two coupled loops: in-sample self-correction during generation and cross-round self-expansion during dataset validation (a minimal sketch of the per-sample loop follows this list).
  • Experiments on an image-level object-rotation task show that the full Ours+DualGate configuration, evaluated with a fixed Qwen-Edit LoRA probe, outperforms an unadapted base model and a public multi-angle LoRA on both SpatialEdit and a held-out evaluation set.
  • Ablation results indicate a consistent performance improvement path from scene-aware generation to feedback-driven correction and dual-gated validation, with the core contribution framed as a reusable dataset-building framework.
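
To make the generate–inspect–correct–accept cycle concrete, here is a minimal, hypothetical Python sketch of the per-sample loop. None of the names below (`Sample`, `Goal`, `generate`, `correct`, `build_sample`) or the toy "angle" artifact come from the paper; they stand in for whatever scene-aware generator, bounded corrective action, and inspection check DataEvolver actually uses.

```python
import random
from dataclasses import dataclass, field
from typing import Callable, Optional, Tuple

# Hypothetical sketch of a generation-time self-correction loop; the toy "angle"
# stands in for the paper's richer artifacts (RGB, masks, depth, meshes, poses).

@dataclass
class Sample:
    angle: float                                      # stand-in artifact
    review_trace: list = field(default_factory=list)  # persisted inspection feedback

@dataclass
class Goal:
    description: str
    check: Callable[[Sample], Tuple[bool, float]]     # returns (accepted, signed error)

def generate(target: float) -> Sample:
    """Toy stand-in for scene-aware generation: a noisy first attempt."""
    return Sample(angle=target + random.uniform(-15, 15))

def correct(sample: Sample, error: float) -> Sample:
    """Toy stand-in for a bounded corrective action driven by inspection feedback."""
    sample.angle -= 0.8 * error                       # move partway toward the goal
    return sample

def build_sample(goal: Goal, target: float, max_corrections: int = 3) -> Optional[Sample]:
    """Generation-time self-correction: inspect, correct, and re-check within a budget."""
    sample = generate(target)
    for _ in range(max_corrections + 1):
        accepted, error = goal.check(sample)
        sample.review_trace.append(error)
        if accepted:
            return sample                             # passes inspection -> exportable
        sample = correct(sample, error)
    return None                                       # budget exhausted -> filtered out

if __name__ == "__main__":
    target = 30.0
    goal = Goal("rotate object to ~30 degrees",
                check=lambda s: (abs(s.angle - target) < 2.0, s.angle - target))
    print(build_sample(goal, target))
```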

Abstract

Constructing controllable visual data is a major bottleneck for image editing and multimodal understanding. Useful supervision is rarely produced by a single rendering pass; instead it emerges through iterative generation, inspection, correction, filtering, and export. We present DataEvolver, a closed-loop visual data engine that organizes this process around explicit goals, persistent artifacts, bounded corrective actions, and acceptance decisions. DataEvolver supports multiple artifact types, including RGB images, masks, depth maps, normal maps, meshes, poses, trajectories, and review traces. In the current release, the system operates through two coupled loops: generation-time self-correction within each sample and validation-time self-expansion across dataset rounds. We validate the framework in an image-level object-rotation setting. With a fixed Qwen-Edit LoRA probe, our final Ours+DualGate model outperforms both the unadapted base model and a public multi-angle LoRA on SpatialEdit and a held-out evaluation set. Ablations show a consistent improvement path from scene-aware generation to feedback-driven correction and dual-gated validation. Beyond the released rotation data, our main contribution is a reusable framework for building visual datasets through explicit goal tracking, review, correction, and acceptance loops.
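
The abstract's second loop, validation-time self-expansion with dual-gated acceptance, could be driven by something like the sketch below. This is speculative: the summary does not say what the two gates measure, so `geometry_gate` and `semantic_gate` are placeholder names, and the rule for carrying unmet goals into the next round is an assumption, not the paper's documented behavior.

```python
from typing import Callable, Iterable, List, Optional

# Speculative sketch of a cross-round self-expansion loop; gate names and the
# goal-retry rule are assumptions, not DataEvolver's actual interfaces.

def dual_gate(sample: object,
              geometry_gate: Callable[[object], bool],
              semantic_gate: Callable[[object], bool]) -> bool:
    """Accept a candidate only if both (hypothetical) gates pass it."""
    return geometry_gate(sample) and semantic_gate(sample)

def evolve_dataset(goals: Iterable[object],
                   build_sample: Callable[[object], Optional[object]],
                   geometry_gate: Callable[[object], bool],
                   semantic_gate: Callable[[object], bool],
                   rounds: int = 3) -> List[object]:
    """Validation-time self-expansion: build, gate, and revisit unmet goals each round."""
    dataset: List[object] = []
    pending = list(goals)
    for _ in range(rounds):
        still_unmet = []
        for goal in pending:
            sample = build_sample(goal)
            if sample is not None and dual_gate(sample, geometry_gate, semantic_gate):
                dataset.append(sample)       # export accepted artifacts
            else:
                still_unmet.append(goal)     # assumption: retry unmet goals next round
        if not still_unmet:
            break
        pending = still_unmet
    return dataset
```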