Red Skills or Blue Skills? A Dive Into Skills Published on ClawHub
arXiv cs.CL / 4/16/2026
💬 OpinionSignals & Early TrendsIdeas & Deep AnalysisModels & Research
Key Points
- The paper studies ClawHub, a large public registry of LLM agent “skills,” by building and normalizing a dataset of 26,502 skills and analyzing language, organization, popularity, and security-related signals.
- It finds strong cross-lingual patterns: English skills skew toward infrastructure and technical capabilities (e.g., APIs, automation, memory), while Chinese skills cluster more around application scenarios such as media generation, social content, and finance services.
- The authors report that over 30% of crawled skills show suspicious or malicious labeling via available platform signals, and many skills still lack complete safety observability.
- They propose an early risk-assessment approach using only submission-time information and evaluate a balanced benchmark of 11,010 skills, with the best Logistic Regression reaching 72.62% accuracy and 78.95% AUROC.
- Documentation quality is identified as the most informative submission-time signal for predicting skill risk, highlighting public registries as both an enabler for reuse and a new security risk surface.
Related Articles

Black Hat Asia
AI Business

Introducing Claude Opus 4.7
Anthropic News

AI traffic to US retailers rose 393% in Q1, and it’s boosting their revenue too
TechCrunch

Who Audits the Auditors? Building an LLM-as-a-Judge for Agentic Reliability
Dev.to

"Enterprise AI Cost Optimization: How Companies Are Cutting AI Infrastructure Sp
Dev.to