Hey everyone,
I recently finished building a Model Context Protocol (MCP) index containing roughly 3 million arXiv papers. My goal was to make it easier to connect local and cloud LLMs directly to a massive corpus of ML and STEM research to help reduce hallucinated citations and improve research workflows.
The index is live, but before I open it up broadly, I want to make sure the retrieval quality actually holds up against highly niche, complex queries (especially for obscure math, hyper-specific domains, or newer architectures).
I’m looking for a small group of folks (around 20) to try it out, do their best to break the retrieval system, and give me brutal feedback on the relevance of the fetched papers.
If you want to stress-test it with your own LLM setup and see how it performs with your daily research queries, let me know in the comments or shoot me a DM and I’ll send you the connection details!
Thanks!



