Pseudo-Labeling for Unsupervised Domain Adaptation with Kernel GLMs

arXiv stat.ML / 3/23/2026


Key Points

  • Proposes a principled framework for unsupervised domain adaptation under covariate shift in kernel GLMs, covering kernelized linear, logistic, and Poisson regression with ridge regularization.
  • Splits labeled source data into two batches: one to train a family of candidate models and one to build an imputation model that generates pseudo-labels for the target data, enabling robust model selection.
  • Establishes non-asymptotic excess-risk bounds characterized by an "effective labeled sample size" that accounts for unknown covariate shift, providing theoretical guarantees.
  • Demonstrates empirical gains over source-only baselines on synthetic and real datasets, validating the approach.

Abstract

We propose a principled framework for unsupervised domain adaptation under covariate shift in kernel Generalized Linear Models (GLMs), encompassing kernelized linear, logistic, and Poisson regression with ridge regularization. Our goal is to minimize prediction error in the target domain by leveraging labeled source data and unlabeled target data, despite differences in covariate distributions. We partition the labeled source data into two batches: one for training a family of candidate models, and the other for building an imputation model. This imputation model generates pseudo-labels for the target data, enabling robust model selection. We establish non-asymptotic excess-risk bounds that characterize adaptation performance through an "effective labeled sample size", explicitly accounting for the unknown covariate shift. Experiments on synthetic and real datasets demonstrate consistent performance gains over source-only baselines.
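To make the two-batch procedure concrete, here is a minimal sketch of the pipeline for the kernelized linear (squared-loss) case: split the labeled source data, fit a family of candidate kernel ridge models on one batch, fit an imputation model on the other, impute pseudo-labels on the unlabeled target covariates, and select the candidate with the smallest pseudo-labeled target risk. All function names, the RBF kernel choice, and the fixed ridge parameter for the imputation model are illustrative assumptions, not details prescribed by the paper.

```python
import numpy as np

def rbf_kernel(A, B, gamma=1.0):
    # Gaussian RBF kernel matrix between the rows of A and B (illustrative kernel choice).
    sq_dists = ((A[:, None, :] - B[None, :, :]) ** 2).sum(axis=-1)
    return np.exp(-gamma * sq_dists)

def fit_krr(K, y, lam):
    # Kernel ridge regression: solve (K + lam * n * I) alpha = y.
    n = K.shape[0]
    return np.linalg.solve(K + lam * n * np.eye(n), y)

def select_by_pseudo_labels(Xs, ys, Xt, lams, gamma=1.0, seed=0):
    """Two-batch pseudo-labeling for model selection under covariate shift.

    Xs, ys: labeled source data; Xt: unlabeled target covariates;
    lams: candidate ridge parameters. Returns the selected (lam, alpha, X_train).
    """
    rng = np.random.default_rng(seed)
    n = len(Xs)
    idx = rng.permutation(n)
    i1, i2 = idx[: n // 2], idx[n // 2:]
    X1, y1 = Xs[i1], ys[i1]   # batch 1: candidate models
    X2, y2 = Xs[i2], ys[i2]   # batch 2: imputation model

    # Batch 1: one candidate model per ridge parameter.
    K11 = rbf_kernel(X1, X1, gamma)
    candidates = [fit_krr(K11, y1, lam) for lam in lams]

    # Batch 2: imputation model with a fixed small ridge parameter
    # (an assumed heuristic here, not the paper's tuning rule).
    K22 = rbf_kernel(X2, X2, gamma)
    alpha_imp = fit_krr(K22, y2, lam=min(lams))

    # Pseudo-labels on the target covariates from the imputation model.
    y_pseudo = rbf_kernel(Xt, X2, gamma) @ alpha_imp

    # Select the candidate minimizing squared error against the pseudo-labels.
    K_t1 = rbf_kernel(Xt, X1, gamma)
    risks = [np.mean((K_t1 @ a - y_pseudo) ** 2) for a in candidates]
    best = int(np.argmin(risks))
    return lams[best], candidates[best], X1
```

The same skeleton extends to the logistic and Poisson cases by swapping the closed-form ridge solve for an iteratively reweighted fit and scoring candidates with the matching GLM loss on the pseudo-labeled target data.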