Hey everyone,
I've been working on a RAG system and would really appreciate some feedback from people who have built or scaled similar systems.
This isn't just a basic "upload + ask" demo — I tried to design it more like something you'd actually ship.
What it does
- Authenticated users with document ownership
- Document-scoped retrieval (to avoid cross-doc leakage)
- Agent loop with tool calling (retriever as a tool)
- Query refinement + semantic cache
- Pluggable embeddings + optional reranking
- Evaluation pipeline with run history and case inspection
- Built-in UI for asking questions and running evals
Tech stack
- FastAPI + SQLAlchemy + Postgres (pgvector)
- Chroma for vector storage
- OpenAI / HuggingFace embeddings
- Optional Cohere reranker
- Dockerized setup
github repo : https://github.com/mahmoudsamy7729/agentic-rag
[link] [comments]




![[P] I trained an AI to play Resident Evil 4 Remake using Behavioral Cloning + LSTM](/_next/image?url=https%3A%2F%2Fexternal-preview.redd.it%2FzgmJOxETuqgqlsgMxeBl7S4gZNDHf_K3U9w883ioT4M.jpeg%3Fwidth%3D320%26crop%3Dsmart%26auto%3Dwebp%26s%3Da63f97b9d03c40b846cd3eaac472e78050020a43&w=3840&q=75)