GPT-Image-2 in the Wild: A Twitter Dataset of Self-Reported AI-Generated Images from the First Week of Deployment
arXiv cs.CV / 4/29/2026
Key Points
- OpenAI’s GPT-Image-2 release is presented as a major inflection point, making it increasingly difficult to distinguish synthetic AI imagery from photographic reality.
- Researchers introduce and publish the GPT-Image-2 Twitter Dataset, containing 10,217 confirmed GPT-Image-2 images collected from public Twitter/X posts during the first week after the April 21, 2026 deployment.
- The dataset creation combines Twitter API v2 collection with multilingual text heuristics, automated “Made with AI” badge verification, and model-name variant matching, using a six-day curation window.
- The paper analyzes the images using a CLIP-based taxonomy, OCR (82.0% of images contain detectable text), face detection (faces found in 59.2% of images, 22,583 faces in total), and semantic clustering (137 clusters).
- A major finding is that C2PA content credentials are stripped by Twitter’s CDN on upload, preventing cryptographic provenance verification for images sourced from social media.
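
The model-name variant matching mentioned in the curation step can be illustrated with a small sketch. The summary does not list which spellings the authors matched, so the pattern and the `mentions_model` helper below are assumptions for illustration only, not the paper's actual filter.

```python
import re

# Hypothetical pattern: accepts "gpt", "image" and "2" joined by at most one
# space, hyphen, or underscore each (or nothing), case-insensitively.
# The negative lookahead rejects longer version numbers such as "20".
MODEL_NAME_RE = re.compile(r"gpt[\s\-_]?image[\s\-_]?2(?!\d)", re.IGNORECASE)


def mentions_model(text: str) -> bool:
    """Return True if the text contains a spelling variant of the model name."""
    return MODEL_NAME_RE.search(text) is not None


if __name__ == "__main__":
    samples = [
        "Made with GPT-Image-2",
        "gpt image 2 is wild",
        "#gptimage2",
        "still using gpt-image-1",
    ]
    for post in samples:
        print(post, "->", mentions_model(post))
```

A text match like this only flags candidate posts; according to the summary, the dataset also relies on multilingual text heuristics and "Made with AI" badge verification before an image counts as confirmed.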