If AI is about to get 10x smarter, how do we prevent the internet from collapsing under synthetic noise?

Reddit r/artificial / 4/28/2026

💬 OpinionSignals & Early TrendsIdeas & Deep Analysis

Key Points

  • The author argues that rapid progress toward AGI is constrained by “training data,” warning that large portions of online content are already synthetic and risk driving model collapse during future training cycles.
  • They claim synthetic-noise feedback loops can make model outputs blander and less useful, citing concerns similar to what has been observed with image generation systems.
  • To counter this, the piece proposes labeling or filtering human-generated data, emphasizing that diversity in training signals helps prevent collapse.
  • The proposed approach is “proof of personhood” (e.g., Face ID/Touch ID–level verification) to let platforms confirm humans without resorting to a full surveillance state.
  • The author frames proof-of-personhood as potential infrastructure for next-generation AI and asks whether it should be treated as a regulatory speed bump or core technical layer.

Im all for acceleration. I think the faster we hit AGI the better. but theres a bottleneck nobody here talks about enough-training data.

right now we are quietly poisoning the well. More than half of online content is already synthetic. bots talking to bots, articles written by AI, reddit threads generated by LLMs. when the next generation of models trains on this they eat their own tail. model collapse is real. we saw it with image generators. Outputs get blander, weirder, less useful.we need a way to label or filter human-generated data. not because humans are better but because diversity prevents collapse.

I know the standard solution sounds like a dystopian meme. biometric scanners, iris codes, hardware verification. and yeah maybe it is dystopian. but so is a dead internet where nothing can be trusted.Reddit CEO Steve Huffman put it simply recently - platforms need to know you're human without knowing your name. Face ID / Touch ID level stuff.

im not saying that specific device is the answer. but the category of solution - proof of human that doesnt create a surveillance state - seems necessary if we want to keep scaling past the cliff.what do you think? Is proof-of-personhood just a regulatory speed bump, or is it infrastructure for the next generation of AI?curious where this sub lands.

submitted by /u/jcveloso8
[link] [comments]