STaR-DRO: Stateful Tsallis Reweighting for Group-Robust Structured Prediction

arXiv cs.LG / 4/14/2026


Key Points

  • The paper proposes a two-part framework for structured prediction under ambiguity, label skew, and group heterogeneity, combining task-agnostic prompting (XML-structured instructions, disambiguation/verification, schema constraints, and self-validation) with robust fine-tuning.
  • It introduces STaR-DRO, a stateful robust optimization method that uses Tsallis mirror descent with momentum-smoothed, centered group-loss signals and bounded excess-only reweighting to upweight only persistently hard groups.
  • The approach is evaluated on EPPC Miner, a benchmark for extracting hierarchical labels and evidence spans from secure patient-provider messages, targeting both correctness and evidence groundedness.
  • Results show prompting improves zero-shot structured extraction by +15.44 average F1 across multiple Llama models, and adding STaR-DRO to supervised fine-tuning improves hardest semantic decisions (e.g., Llama-3.3-70B-Instruct Code F1 79.24→81.47; Sub-code 67.78→69.30) while preserving span performance and reducing group-wise validation cross-entropy by up to 29.6%.
  • The authors argue the gains matter for real-world communication mining reliability in patient-centered care due to better handling of rare and clinically consequential categories.
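The prompting ingredients named above (XML-structured instructions, disambiguation rules, verification-style reasoning, schema constraints, self-validation) can be illustrated with a minimal template. This is an invented sketch, not the paper's actual prompt: the tag names, rules, and schema fields are all hypothetical.

```python
# Hypothetical XML-structured extraction prompt in the spirit of the paper's
# strategy. Tag names, rules, and schema fields are illustrative assumptions.
PROMPT_TEMPLATE = """\
<task>Extract hierarchical labels and evidence spans from the message below.</task>
<schema>
  <code>one top-level code from the ontology</code>
  <subcode>a child of the chosen code</subcode>
  <span>evidence text copied verbatim from the message</span>
</schema>
<rules>
  <disambiguation>If two codes both apply, prefer the more specific one.</disambiguation>
  <verification>Before answering, confirm the span appears verbatim in the message.</verification>
</rules>
<self_validation>Re-read your answer and emit only well-formed XML matching the schema.</self_validation>
<message>{message}</message>
"""

def build_prompt(message: str) -> str:
    """Fill the structured template with one patient-provider message."""
    return PROMPT_TEMPLATE.format(message=message)
```

The point of the XML scaffolding is that each failure mode named in the paper (format drift, label ambiguity, evidence hallucination) gets its own explicit block the model must satisfy.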

Abstract

Structured prediction requires models to generate ontology-constrained labels, grounded evidence, and valid structure under ambiguity, label skew, and heterogeneous group difficulty. We present a two-part framework for controllable inference and robust fine-tuning. First, we introduce a task-agnostic prompting strategy that combines XML-based instruction structure, disambiguation rules, verification-style reasoning, schema constraints, and self-validation to address format drift, label ambiguity, evidence hallucination, and metadata-conditioned confusion in in-context structured generation. Second, we introduce STaR-DRO, a stateful robust optimization method for group heterogeneity. It combines Tsallis mirror descent with momentum-smoothed, centered group-loss signals and bounded excess-only multipliers so that only persistently hard groups above a neutral baseline are upweighted, concentrating learning where it is most needed while avoiding volatile, dense exponentiated-gradient reweighting and the unnecessary loss incurred by downweighting easier groups. We evaluate the combined framework on EPPC Miner, a benchmark for extracting hierarchical labels and evidence spans from patient-provider secure messages. Prompt engineering improves zero-shot performance by +15.44 average F1 across Code, Sub-code, and Span over four Llama models. Building on supervised fine-tuning, STaR-DRO further improves the hardest semantic decisions: on Llama-3.3-70B-Instruct, Code F1 rises from 79.24 to 81.47 and Sub-code F1 from 67.78 to 69.30, while preserving Span performance and reducing group-wise validation cross-entropy by up to 29.6% on the most difficult clinical categories. Because these rare and difficult groups correspond to clinically consequential communication behaviors, these gains are not merely statistical improvements: they directly strengthen communication mining reliability for patient-centered care analysis.
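The STaR-DRO update described in the abstract can be sketched as a single reweighting step. This is a reconstruction from the prose only, not the paper's code: the exact q-exponential form, the EMA state layout, and every hyperparameter (`beta`, `eta`, `q`, `cap`) are assumptions.

```python
import numpy as np

def star_dro_step(group_losses, ema, weights,
                  beta=0.9, eta=0.1, q=1.5, cap=2.0):
    """One STaR-DRO-style stateful reweighting step (hypothetical sketch).

    group_losses : per-group losses at the current step
    ema          : momentum-smoothed loss state carried across steps
    weights      : current group weights (a probability vector)
    """
    # Momentum-smooth the noisy per-step group losses (the "stateful" part).
    ema = beta * ema + (1.0 - beta) * group_losses
    # Center against the mean baseline and keep only the positive excess,
    # so groups at or below the baseline generate no reweighting signal.
    excess = np.maximum(ema - ema.mean(), 0.0)
    # Tsallis q-exponential, mirror-descent-style multiplicative update:
    # exp_q(x) = (1 + (1 - q) x)^(1 / (1 - q)) for q != 1.
    base = np.maximum(1.0 + (1.0 - q) * eta * excess, 1e-12)
    mult = np.minimum(base ** (1.0 / (1.0 - q)), cap)  # bounded multipliers
    w = weights * mult
    return w / w.sum(), ema
```

Under this sketch, only groups whose smoothed loss persistently exceeds the baseline get upweighted, and the `cap` keeps the update from becoming the volatile, dense exponentiated-gradient reweighting the abstract contrasts against.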