LLMs edited 86 human essays toward a semantic cluster not occupied by any human writer [D]

Reddit r/MachineLearning / 5/5/2026

💬 Opinion · Ideas & Deep Analysis · Industry & Market Moves · Models & Research

Key Points

  • Researchers compared 86 human essays revised by GPT-5-mini, Gemini 2.5 Flash, and Claude Haiku, finding that the LLM-edited versions cluster tightly in embedding space in a semantic region not occupied by any human-written essay.
  • The study reports that even “fix grammar only” prompts produce the same directional shift, effectively overwriting each writer’s unique lexical fingerprint with the model’s preferred vocabulary.
  • LLM assistance measurably changes writing style and stance: essays become more neutral, more formal, and more statistical (more nouns and adjectives, fewer pronouns), and personal-experience arguments are replaced with statistical and expert-citation reasoning.
  • Institutional evidence from ICLR 2026 peer reviews suggests the criteria being rewarded are shifting: 21% of reviews were AI-generated, and AI reviewers scored papers 10% higher, were 136% more likely to emphasize reproducibility, and 84% more likely to emphasize scalability.
  • A user-study paradox emerged: heavy LLM users recognized a loss of individual voice but still reported similar satisfaction, implying immediate efficiency gains with a more diffuse cultural cost.

Researchers from Berkeley, UCSD, UW, and Google DeepMind studied three datasets: a 100-person controlled user study, 86 pre-LLM human-written essays (collected in 2021) revised by GPT-5-mini, Gemini 2.5 Flash, and Claude Haiku, and 18,000 ICLR 2026 peer reviews.

The clearest finding: when mapped in embedding space, LLM-revised essays cluster tightly in a region occupied by none of the human-written essays, which are spread broadly. The LLM pushes every essay in the same direction regardless of the revision instruction; even "fix grammar only" produces the shift. Each writer's unique lexical fingerprint is overwritten by the LLM's preferred vocabulary.
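A minimal sketch of how such a "shared direction" can be measured, using synthetic vectors in place of real essay embeddings (the study's actual embedding model and data are not specified here; everything below is illustrative):

```python
import numpy as np

rng = np.random.default_rng(0)

# Synthetic stand-ins for essay embeddings: 86 "human" essays spread
# broadly, then an "edited" version shifted along one shared direction
# plus a little per-essay noise. A real analysis would embed the actual
# essay texts with a sentence-embedding model.
n_essays, dim = 86, 64
human = rng.normal(size=(n_essays, dim))
shared_shift = rng.normal(size=dim)        # model-preferred direction
edited = human + 2.0 * shared_shift + 0.3 * rng.normal(size=(n_essays, dim))

# Per-essay edit vectors, normalized, and their agreement with the
# mean edit direction. Cosines near 1 mean every essay was pushed
# the same way regardless of its starting point.
deltas = edited - human
unit = deltas / np.linalg.norm(deltas, axis=1, keepdims=True)
mean_dir = unit.mean(axis=0)
mean_dir /= np.linalg.norm(mean_dir)
cosines = unit @ mean_dir

print(f"mean cosine with shared direction: {cosines.mean():.2f}")
```

With real data, a mean cosine close to 1 across essays would be the quantitative version of the "same directional shift" claim, while human-to-human edit vectors would show much lower agreement.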

The stance shift is measurable: users given LLM assistance wrote significantly more neutral essays and avoided definitive positions. LLMs increased nouns and adjectives and decreased pronouns, making the prose more formal and statistical, and less personal. Arguments from personal experience were replaced with statistical and expert-citation arguments.
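A crude proxy for this kind of stylistic measurement: compare pronoun rates before and after editing. This uses a small hand-listed pronoun set rather than a real POS tagger (which the study presumably used), and the example texts are invented:

```python
import re

# Tiny illustrative pronoun list; a real analysis would POS-tag the
# full text with a tool like spaCy or NLTK.
PRONOUNS = {"i", "me", "my", "we", "our", "you", "your", "he", "she",
            "it", "they", "them", "his", "her", "its", "their"}

def pronoun_rate(text: str) -> float:
    """Fraction of word tokens that are pronouns."""
    tokens = re.findall(r"[a-z']+", text.lower())
    if not tokens:
        return 0.0
    return sum(t in PRONOUNS for t in tokens) / len(tokens)

# Invented before/after pair showing the personal-to-statistical shift.
before = "I think my city should ban cars because I hate the noise near my house."
after = ("Urban vehicle restrictions reduce noise pollution, "
         "according to municipal transportation studies.")

print(f"pronoun rate before: {pronoun_rate(before):.2f}")
print(f"pronoun rate after:  {pronoun_rate(after):.2f}")
```

Run over the 86 essay pairs, a consistent drop in this rate (and a rise in noun/adjective rates) would be the kind of aggregate signal the study reports.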

The ICLR 2026 finding is the sharpest institutional data point: 21% of peer reviews were AI-generated. AI reviewers scored papers 10% higher, were 136% more likely to emphasize reproducibility, and 84% more likely to emphasize scalability. Humans were more likely to comment on clarity as both a strength and a weakness. The criteria being rewarded in peer review are already shifting.

The user study paradox: heavy LLM users recognized the loss of voice but reported equivalent satisfaction. The efficiency gain is immediate; the cultural cost is diffuse.

Is there a writing task where you've noticed LLM revision consistently pulling you away from what you actually wanted to say?

submitted by /u/jimmytoan