AI Navigate

Computational Analysis of Semantic Connections Between Herman Melville Reading and Writing

arXiv cs.CL / 3/17/2026

📰 NewsIdeas & Deep AnalysisModels & Research

Key Points

  • The study investigates the potential influence of Herman Melville's reading on his own writing using computational semantic similarity analysis.
  • It uses documented records of books Melville owned or read, segments texts at sentence and non-overlapping 5-gram levels, and computes similarity with BERTScore.
  • Rather than applying fixed thresholds, it interprets precision, recall, and F1 scores as indicators of possible semantic alignment that may suggest literary influence.
  • Experimental results show the approach captures expert-identified instances and highlights additional passages warranting further qualitative examination, supporting semantic similarity methods as a framework for source and influence studies in literary scholarship.

Abstract

This study investigates the potential influence of Herman Melville reading on his own writings through computational semantic similarity analysis. Using documented records of books known to have been owned or read by Melville, we compare selected passages from his works with texts from his library. The methodology involves segmenting texts at both sentence level and non-overlapping 5-gram level, followed by similarity computation using BERTScore. Rather than applying fixed thresholds to determine reuse, we interpret precision, recall, and F1 scores as indicators of possible semantic alignment that may suggest literary influence. Experimental results demonstrate that the approach successfully captures expert-identified instances of similarity and highlights additional passages warranting further qualitative examination. The findings suggest that semantic similarity methods provide a useful computational framework for supporting source and influence studies in literary scholarship.