Perceptual misalignment of texture representations in convolutional neural networks
arXiv cs.CV / 4/3/2026
💬 OpinionSignals & Early TrendsIdeas & Deep AnalysisModels & Research
Key Points
- The paper studies whether CNN-based texture representations—specifically Gram matrices of convolutional features—match human perceptual texture content.
- By evaluating many CNNs and comparing their texture-feature correlations with human perceptual alignment using Brain-Score, the authors find no relationship between standard CNN “visual system model” quality metrics and human-like texture representation.
- The findings suggest that human texture perception relies on mechanisms different from those captured by common CNN object-recognition–trained approaches.
- The authors hypothesize that contextual integration may play a key role in human texture perception that is not adequately reflected by current CNN feature-correlation texture models.




