I built a solo AI platform from Bahrain with no funding, no team and no ad spend - here's what's inside it after 4 months

Reddit r/artificial / 4/28/2026

💬 OpinionDeveloper Stack & InfrastructureTools & Practical UsageIndustry & Market MovesModels & Research

共有:

Key Points

Bahrain在住の39歳の個人開発者が、資金・チーム・広告費なしで4か月かけて多モデルAIプラットフォーム「AskSary」を立ち上げた。
AskSaryの中核は、モデルごとに失われがちな会話コンテキストをまたいで保持する永続メモリ層で、端末やモデルを切り替えても内容を引き継ぐ設計になっている。
GPT/Claude/Grok/Gemini/DeepSeekなど主要モデルを一つの画面に統合し、自動ルーティングと手動切替を用意している。
Google DriveやNotion連携、RAG向けナレッジベース、最大500MBまでのアップロード、YouTube URL/ファイルの動画分析、画像・音楽・動画生成、音声の双方向会話など多彩な機能を提供している。
提供状況として129か国で利用され、日々40件の新規登録があり、4週間ほどで1080件のサインアップに到達したと述べている。

I built a solo AI platform from Bahrain with no funding, no team and no ad spend - here's what's inside it after 4 months

https://reddit.com/link/1sxotqx/video/xlaqd9i8guxg1/player

I'm a self-taught developer, 39 years old, based in Bahrain. Four months ago I started building AskSary - a multi-model AI platform with a persistent memory layer that sits above all the models.

The core idea: the model is not the identity. Most AI tools lose your context the moment you switch models. I built the layer that remembers you across all of them.

Here's what's shipped so far:

Models & Routing Every major model in one place - GPT-5.2, Claude Sonnet 4.6, Grok 4, Gemini 3.1 Pro, DeepSeek R1, O1 Reasoning, Gemini Ultra and more - with smart auto-routing or manual override.

Memory & Context Persistent cross-model memory. Start with Claude on your phone, switch to GPT on your laptop - it already knows what you discussed. Proactive personalisation that messages you first on login before you've typed a word.

Integrations Google Drive and Notion - connect once, pull files and pages directly into chat or your RAG Knowledge Base. Unlimited uploads up to 500MB per file via OpenAI Vector Store.

Video Analysis - Gemini native video understanding for YouTube URL analysis (no download required, processed natively) and direct file upload up to 500MB. Full breakdown of visuals, audio, dialogue, editing style and key moments.

Generation Image generation and editing, video studio across Luma, Veo and Kling, music generation via ElevenLabs, video analysis via upload or YouTube URL.

Builder Tools Vision to Code, Web Architect, Game Engine, Code Lab with SQL Architect, Bug Buster, Git Guru and more. Tavily web search across all models.

Voice & Audio Real-time 2-way voice chat at near-zero latency, AI podcast mode downloadable as MP3, Voiceover, Voice Notes, Voice Tuner.

Platform Custom agents, 30+ live interactive themes, smart search, media gallery, folder organisation, full RTL support across 26 languages, iOS and Android apps, Apple Vision Pro.

Where it is now 129 countries. Currently at 40 new signups a day. 1080 Signup's so far after 4 weeks or so. MRR just started. Zero ad spend. All of it built solo, one feature at a time, on a balcony in Bahrain.

The Stack: Frontend - Next.js, Capacitor (iOS and Android) and Vanilla JS / React

Backend - Vercel serverless functions, Firebase / Firestore (database + auth) and Firebase Admin SDK

AI Models - OpenAI (GPT, GPT-Image-1), Anthropic (Claude), Google (Gemini), xAI (Grok), DeepSeek

Generation APIs - Luma AI (video), Kling via Replicate (video), Veo via Replicate (video), ElevenLabs (music), Flux via Replicate (image editing), Meshy (3D — coming soon)

Integrations - Google Drive (OAuth 2.0), Notion (OAuth 2.0), Tavily (web search), OpenAI Vector Store (RAG), Stripe (payments), CloudConvert (document conversion), Sentry (error tracking), Formidable (file handling)

Rendering - Mermaid (flow charts) and MathJax

Platforms - Web, iOS, Android, Apple Vision Pro (visionOS)

Languages - 26 UI languages with full RTL support

asksary.com

Happy to answer questions on any part of the build - stack, architecture, API cost management, anything.

submitted by /u/Beneficial-Cow-7408
[link] [comments]