Agent-Based User-Adaptive Filtering for Categorized Harassing Communication

arXiv cs.AI / 3/17/2026

Key Points

The paper proposes an agent-based framework for personalized filtering of categorized harassing content in online social networks.
Agents learn from user feedback and adapt filtering thresholds across harassment categories (offensive, abusive, hateful) to reflect individual tolerance levels.
The authors implement and evaluate the approach using supervised classification techniques and simulated user interactions, showing improved precision and user satisfaction over static models.
The work highlights how agent-based personalization can enhance content moderation while preserving user autonomy in digital social environments.

Abstract

We propose an agent-based framework for personalized filtering of categorized harassing communication in online social networks. Unlike global moderation systems that apply uniform filtering rules, our approach models user-specific tolerance levels and preferences through adaptive filtering agents. These agents learn from user feedback and dynamically adjust filtering thresholds across multiple harassment categories, including offensive, abusive, and hateful content. We implement and evaluate the framework using supervised classification techniques and simulated user interaction data. Experimental results demonstrate that adaptive agents improve filtering precision and user satisfaction compared to static models. The proposed system illustrates how agent-based personalization can enhance content moderation while preserving user autonomy in digital social environments.
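The abstract does not spell out how an agent updates its thresholds, so the following is only a minimal sketch of the general idea, under stated assumptions. The class name `FilteringAgent`, the fixed step size, the starting threshold of 0.5, and the update rule itself are all hypothetical and are not taken from the paper. The per-category scores are assumed to come from some upstream supervised classifier and to lie in [0, 1].

```python
from dataclasses import dataclass, field

# Category names follow the abstract; everything else here is an assumption.
CATEGORIES = ("offensive", "abusive", "hateful")


@dataclass
class FilteringAgent:
    """Hypothetical per-user agent holding one filtering threshold per category."""

    thresholds: dict = field(default_factory=lambda: {c: 0.5 for c in CATEGORIES})
    step: float = 0.05  # assumed fixed adjustment size per feedback event

    def should_filter(self, scores):
        """Hide a message if any category score reaches that category's threshold."""
        return any(scores[c] >= self.thresholds[c] for c in CATEGORIES)

    def feedback(self, scores, wants_hidden):
        """Nudge thresholds when the user disagrees with the filtering decision."""
        hidden = self.should_filter(scores)
        if hidden and not wants_hidden:
            # False positive: raise every threshold that triggered the filter.
            for c in CATEGORIES:
                if scores[c] >= self.thresholds[c]:
                    self.thresholds[c] = min(1.0, self.thresholds[c] + self.step)
        elif wants_hidden and not hidden:
            # False negative: lower the threshold of the highest-scoring category.
            top = max(CATEGORIES, key=lambda c: scores[c])
            self.thresholds[top] = max(0.0, self.thresholds[top] - self.step)


if __name__ == "__main__":
    agent = FilteringAgent()
    message = {"offensive": 0.58, "abusive": 0.10, "hateful": 0.05}
    print(agent.should_filter(message))   # filtered under the default thresholds
    agent.feedback(message, wants_hidden=False)
    agent.feedback(message, wants_hidden=False)
    print(agent.should_filter(message))   # the user's tolerance has been learned
    print(agent.thresholds)
```

In this sketch a static model corresponds to never calling `feedback`, so every user keeps the same thresholds. The adaptive agent lets a user who tolerates mildly offensive content see more of it without loosening the thresholds for abusive or hateful content.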
