Membership lesson · Build Your Jarvis · Module 10 — Voice

Jarvis listens — push-to-talk, locally

The ears get built. core/voice/ arrives with the local speech-to-text engine and the member's language pick, English-fast or multilingual. There is one honest kettle-moment while the model downloads, after which nothing spoken ever leaves the machine. Push-to-talk sits on the member's chosen key: the mic captures while the key is held and not otherwise, structurally, with a visible indicator whenever the app can hear. Transcripts land in the input box for the glance-then-send habit, because mishears get caught where they're cheapest, before they become asks. The auto-send toggle exists, default off, framed as a loosening to earn rather than a convenience to assume. Proof is the privacy receipt the whole local-first decision was for: the aeroplane-mode test (wifi off, still listening, still transcribing) plus a deliberate mumble caught at review. Read-side first, as every module has taught: the ears work and earn trust before the mouth says a word.
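The push-to-talk flow described above can be sketched as a small state machine. This is a minimal illustration, not the lesson's actual core/voice/ code: the class name `PushToTalk`, its methods, and the `transcribe` and `send` callables are all hypothetical stand-ins for the local speech-to-text engine and the "submit the ask" step. The point it tries to show is the shape of the guarantee: the audio buffer only exists while the key is held, and a transcript goes to the draft for review unless auto-send has been switched on.

```python
from dataclasses import dataclass
from typing import Callable, Optional


@dataclass
class PushToTalk:
    """Hypothetical controller; all names here are illustrative assumptions."""

    transcribe: Callable[[bytes], str]   # stand-in for a local STT engine
    send: Callable[[str], None]          # stand-in for submitting the ask
    auto_send: bool = False              # default off, as the lesson describes
    draft: str = ""                      # models the input box
    _buffer: Optional[bytearray] = None  # exists only while the key is held

    @property
    def listening(self) -> bool:
        # Drives the visible "the app can hear" indicator.
        return self._buffer is not None

    def key_down(self) -> None:
        # Ignore key-repeat events so a held key does not reset the capture.
        if self._buffer is None:
            self._buffer = bytearray()

    def on_audio(self, chunk: bytes) -> None:
        # With no buffer there is nowhere for audio to go, so it is dropped.
        if self._buffer is not None:
            self._buffer.extend(chunk)

    def key_up(self) -> None:
        if self._buffer is None:
            return
        audio, self._buffer = bytes(self._buffer), None
        text = self.transcribe(audio)
        if self.auto_send:
            self.send(text)
        else:
            self.draft = text  # glance first, then send

    def confirm(self) -> None:
        # The member reviewed the draft and chose to send it.
        if self.draft:
            self.send(self.draft)
            self.draft = ""
```

With a fake transcriber such as `lambda audio: f"{len(audio)} bytes heard"`, audio fed before `key_down()` is discarded, and after `key_up()` the text sits in `draft` until `confirm()` is called. A real build would go further and open the microphone stream itself only on key-down, so that the "not otherwise" holds at the device level rather than only in this buffer logic.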

This lesson ships with the paid path. Checkout isn't live yet — start with the free projects (Ground Zero + Off the Grid), or join the waitlist.

Warning

Real power. Educational use only.

What we teach you to build is genuinely powerful — uncensored assistants, agents, and automations on your own hardware. In the wrong hands, that is as dangerous as malicious code in the wrong hands. We do not teach illegal, malicious, or harmful use. You are responsible for what you deploy.
