Aiki, my local Wikipedia Retrieval-Augmented Generation system [R]

Reddit r/MachineLearning / 5/26/2026

Key Points

  • Aiki is a lightweight tool that enables users to chat with Wikipedia locally by downloading, chunking, and indexing Wikipedia content on their own machine.
  • It uses a custom-built TF-IDF plus cosine-similarity retriever to find relevant passages, with options to select specific articles or download similar topics.
  • The system includes query expansion based on Wikipedia links and redirects to improve retrieval quality.
  • It also offers optional answer generation using an LLM while maintaining a fully local, minimal-dependency setup.
  • The project is shared via a GitHub repository and invites user feedback to iterate on the tool.
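
To make the query-expansion point concrete, here is a minimal sketch of the general idea, not code from the Aiki repo. The function name `expand_query`, the `redirects` and `links` dictionaries, and the `max_extra` cap are all hypothetical; it assumes redirects map an alias to a canonical title and links map a title to the titles it links to.

```python
def expand_query(query, redirects, links, max_extra=5):
    """Append canonical and linked titles for aliases found in the query.

    redirects: dict of alias -> canonical article title (assumed shape)
    links:     dict of canonical title -> list of linked titles (assumed shape)
    """
    # Normalise whitespace and pad with spaces so aliases match whole words only.
    padded = " " + " ".join(query.lower().split()) + " "
    extra = []
    for alias, canonical in redirects.items():
        if f" {alias.lower()} " not in padded:
            continue
        # Add the redirect target first, then the articles it links to.
        for title in [canonical] + links.get(canonical, []):
            if title not in extra and len(extra) < max_extra:
                extra.append(title)
    return " ".join([query] + extra)


redirects = {"nyc": "New York City"}
links = {"New York City": ["Manhattan", "Brooklyn"]}
expanded = expand_query("population of NYC", redirects, links)
```

The expanded string can then be passed to the retriever in place of the raw query. Matching here is plain whitespace-based, so an alias followed by punctuation (for example "NYC?") would not be picked up in this sketch.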

Hey

I built Aiki, a lightweight tool that lets you chat with Wikipedia locally.

What it does:

- Downloads and chunks Wikipedia articles (you can choose articles by name, and optionally download similar topics as well)
- Uses a custom TF-IDF + cosine similarity retriever (built from scratch)
- Supports query expansion using Wikipedia links/redirects
- Optional answer generation with an LLM
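
For anyone unfamiliar with the retrieval step, a from-scratch TF-IDF + cosine similarity retriever can be sketched roughly like this. This is an illustration of the general technique using only the standard library, not the code in the repo; the class name `TfidfRetriever`, the `tokenize` helper, and the smoothed IDF formula are assumptions made for the example.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


class TfidfRetriever:
    """Toy TF-IDF + cosine-similarity retriever over text chunks."""

    def __init__(self, chunks):
        self.chunks = chunks
        tokenized = [tokenize(c) for c in chunks]
        n = len(chunks)
        # Document frequency: number of chunks that contain each term.
        df = Counter(term for tokens in tokenized for term in set(tokens))
        # Smoothed IDF, so terms that appear in every chunk keep a positive weight.
        self.idf = {t: math.log((1 + n) / (1 + d)) + 1.0 for t, d in df.items()}
        self.vectors = [self._vectorize(tokens) for tokens in tokenized]

    def _vectorize(self, tokens):
        # Terms never seen in the indexed chunks are dropped.
        counts = Counter(tokens)
        vec = {t: c * self.idf[t] for t, c in counts.items() if t in self.idf}
        norm = math.sqrt(sum(w * w for w in vec.values()))
        # L2-normalise so a plain dot product equals cosine similarity.
        return {t: w / norm for t, w in vec.items()} if norm else {}

    def search(self, query, k=3):
        """Return up to k (score, chunk) pairs, best match first."""
        q = self._vectorize(tokenize(query))
        scored = []
        for i, vec in enumerate(self.vectors):
            score = sum(w * vec.get(t, 0.0) for t, w in q.items())
            if score > 0:
                scored.append((score, self.chunks[i]))
        scored.sort(key=lambda pair: pair[0], reverse=True)
        return scored[:k]


chunks = [
    "The Eiffel Tower is a wrought-iron tower in Paris.",
    "Python is a programming language created by Guido van Rossum.",
    "Paris is the capital of France.",
]
retriever = TfidfRetriever(chunks)
results = retriever.search("programming language")
```

The top-scoring chunks would then be handed to the optional LLM step as context. Storing vectors as sparse dictionaries keeps the sketch dependency-free; a real index over many articles would likely want an inverted index instead of scanning every chunk.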

Very minimal dependencies and runs completely locally.

Repo: https://github.com/yacine204/Aiki

Would really appreciate your feedback.

submitted by /u/Just_Jaguar3701