| been building this on and off for a while and finally got it to a point where i'm not embarrassed to share it, so here goes. it's called Shadow AI. basically a voice-first AI companion that runs on your own windows machine. you just talk to it and it talks back, no typing required. i mostly made it because i was tired of opening a chat window and pecking at a keyboard every time i wanted to ask something or get a quick task done. stuff it can do: - actual voice conversations, not the awkward push-to-talk kind. you just talk. - you can talk to it in literally any language. switch mid-sentence if you want, it keeps up. this honestly turned out way better than i expected. - web search built in. it runs its own local SearXNG instance so you're not piping your searches through some third party, and no docker setup or any of that nonsense, it just works after install. - it remembers stuff. it builds up a memory of things you've told it across sessions so you're not reintroducing yourself every time. - reminders / scheduled tasks. tell it to remind you about something later and it actually does. - optional google stuff if you want it. gmail, calendar, drive, docs, sheets, contacts. it's completely optional and the app works fine without ever touching it. there's a little in-app guide that walks you through connecting your own google account so nothing goes through me or any server. - it can learn little skills over time based on how you use it. a few honest notes: it's windows only. that's not a "for now" thing, it's just what i built it for and where i use it, so don't expect a mac/linux build. it's bring-your-own-key. you paste in your own free gemini api key during setup. costs me nothing and costs you nothing (the free tier is plenty for normal use), and your key stays local on your machine. it's open source under AGPL-3.0 and the whole thing runs locally on 127.0.0.1, nothing phones home. if you want to read the code or check what it's doing, it's all there. closest comparison i can give is "think OpenClaw but voice-only," just so you have a mental picture. for the record this is an independent project, not affiliated with or endorsed by OpenClaw in any way, "OpenClaw" is the trademark of its respective owner. i'm just using it as a reference point. it's a solo project so issues are welcome but i can't promise i'll get to all of them, and i'm not really taking pull requests, i'd rather keep it built the way i want it. just being upfront about that. happy to answer questions. would genuinely love to hear what people think. https://github.com/shadowdoggie/shadow-ai EDIT: I am sorry, i messed up big time, i realize that now. My post title says "local", and as of right now there is no local model available yet, and i am truly sorry for that, and you folks rightfully downvoted my comments into oblivion. I promise i will make it up to you folks, and i already put it on the roadmap on the github repo, and even if i violated any rules for posting and the post gets removed, i will still make sure the application WILL get local model options for both voice aswell as the subagents feature. Again my sincere apologies. [link] [comments] |
Windows向け:どの言語でも話せるローカル音声AIを作った(オープンソース、キーは自分で用意)
Reddit r/LocalLLaMA / 2026/5/30
💬 オピニオンDeveloper Stack & InfrastructureTools & Practical UsageModels & Research
要点
- Shadow AIは、Windows上でローカルに動作するオープンソースの音声中心AIコンパニオンで、ユーザーは話すだけで音声で返答を受けられ、入力なしで利用できます。
- プッシュ・トゥ・トークではない実際の音声会話に対応し、事実上あらゆる言語を扱えて、文中の途中で言語を切り替えることも可能です。
- ローカルSearXNGによる内蔵ウェブ検索や、セッションをまたいだ記憶機能により、毎回の自己紹介や繰り返しの手間を減らせます。
- 音声コマンドでリマインダーや予約(スケジュール)タスクを設定でき、オプションとしてGoogleサービス連携も用意されています。
- BYO-key方式で、セットアップ時にユーザー自身のGemini APIキー(無料枠)を貼り付け、127.0.0.1上で完結して外部へ送信せず、AGPL-3.0で公開されています。




