Partially Recentralization Softmax Loss for Vision-Language Models Robustness

arXiv cs.CL / 3/13/2026

💬 OpinionIdeas & Deep AnalysisModels & Research

共有:

Key Points

The paper investigates adversarial robustness in multimodal vision-language models by restricting the top-K softmax outputs through a modified loss function.
After fine-tuning, the approach significantly improves robustness against common adversarial attacks on pre-trained multimodal models.
The authors highlight future research directions, including output diversity, generalization, and the robustness-performance trade-off of this loss formulation.
They plan to release their code after the paper is accepted.

Abstract

As Large Language Models make a breakthrough in natural language processing tasks (NLP), multimodal technique becomes extremely popular. However, it has been shown that multimodal NLP are vulnerable to adversarial attacks, where the outputs of a model can be dramatically changed by a perturbation to the input. While several defense techniques have been proposed both in computer vision and NLP models, the multimodal robustness of models have not been fully explored. In this paper, we study the adversarial robustness provided by modifying loss function of pre-trained multimodal models, by restricting top K softmax outputs. Based on the evaluation and scoring, our experiments show that after a fine-tuning, adversarial robustness of pre-trained models can be significantly improved, against popular attacks. Further research should be studying, such as output diversity, generalization and the robustness-performance trade-off of this kind of loss functions. Our code will be available after this paper is accepted

AI's Economic Impact Falls Short: Addressing the Gap Between Investment and Measurable Growth

Dev.to

The Inception Loop: A Month in the Life of a Self-Improving AI Sidekick

Dev.to

The Editing Tax: Why AI 'Saves Time' Until It Doesn't — And How to Reduce Rework

Dev.to

AI Can Write Your Code. Who's Testing Your Thinking?

Dev.to

[R] Weekly digest: arXiv AI security papers translated for practitioners -- Cascade (cross-stack CVE+Rowhammer attacks on compound AI), LAMLAD (dual-LLM adversarial ML, 97% evasion), OpenClaw (4 vuln classes in agent frameworks)

Reddit r/MachineLearning

Partially Recentralization Softmax Loss for Vision-Language Models Robustness

Key Points

Abstract

Related Articles

AI's Economic Impact Falls Short: Addressing the Gap Between Investment and Measurable Growth

The Inception Loop: A Month in the Life of a Self-Improving AI Sidekick

The Editing Tax: Why AI 'Saves Time' Until It Doesn't — And How to Reduce Rework

AI Can Write Your Code. Who's Testing Your Thinking?

[R] Weekly digest: arXiv AI security papers translated for practitioners -- Cascade (cross-stack CVE+Rowhammer attacks on compound AI), LAMLAD (dual-LLM adversarial ML, 97% evasion), OpenClaw (4 vuln classes in agent frameworks)

関連おすすめサービス

Notta搭載AI議事録イヤホン ZENCHORD1

AI搭載ボイスレコーダー Plaud

画像高画質化AIツール Aiarty Image Enhancer