🎙
RP AIA · an area of the work

Dictation

Push-to-talk speech → text, offline and private. Wispr Flow, replaced.

LIVE
Typing is slow, and the fastest dictation tools are cloud-bound: they send your voice, your half-formed thoughts, and your jargon to someone else's server. Dictation makes speaking to your machine effortless and completely private: push to talk, get clean text in any app, fully on-device. A Wispr Flow you actually own.
A privacy-first voice-to-text tool. Push-to-talk only, with no always-on mic. A local on-device speech-to-text engine transcribes, a vocabulary layer corrects your jargon, and the text auto-pastes into any app. It can read text back aloud, fully offline. Every dictation is saved to a personal repository.
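To make the vocabulary layer concrete, here is a minimal sketch of one way such a corrector could work. It is an illustration under stated assumptions, not the tool's actual implementation: the function name `correct_vocabulary` and the mis-heard variants in the table are hypothetical, and only the canonical terms (Voice AI, Text AI) come from this page.

```python
import re

# Hypothetical table: mis-heard variant -> canonical term.
# The variants are invented for illustration; a real table would be
# built from the user's own dictation history.
VOCABULARY = {
    "voice a i": "Voice AI",
    "voice ay eye": "Voice AI",
    "text a i": "Text AI",
}


def correct_vocabulary(text: str, vocabulary: dict = VOCABULARY) -> str:
    """Replace known mis-heard variants with their canonical spelling."""
    # Try longer variants first so a long phrase wins over a shorter one.
    for variant in sorted(vocabulary, key=len, reverse=True):
        canonical = vocabulary[variant]
        # Word boundaries keep "voice a i" from matching inside "voice a idea".
        pattern = re.compile(r"\b" + re.escape(variant) + r"\b", re.IGNORECASE)
        # A function replacement avoids backslash handling in the canonical term.
        text = pattern.sub(lambda _match, c=canonical: c, text)
    return text
```

Calling `correct_vocabulary("we shipped voice a i today")` would yield `"we shipped Voice AI today"`. A plain lookup table is the simplest possible choice here; fuzzy matching or a learned model could take its place without changing the interface.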
🌱 Seed
Push-to-talk speech → text, fully on-device.
← shaped by typing is slow and cloud dictation leaks your words.
🛤 Path
Built a local on-device speech-to-text engine + push-to-talk hotkey + auto-paste into any app.
← shaped by privacy-first: no always-on mic, nothing leaves the machine.
🔀 Pivot
From raw transcription to a vocabulary corrector that learns your jargon (Voice AI, Text AI, 4M SAI…) so the words come out right.
← shaped by raw transcripts mangle domain terms; fidelity matters more than raw speed.
💎 Crystal
STT → vocabulary correction → auto-clipboard → saved to a personal repository, with offline read-aloud. A working Wispr Flow replacement.
← shaped by a complete daily-driver, not a demo.
⭐ Principle
Speak naturally, get clean text anywhere, privately, routing correction depth by need.
← shaped by voice as the natural, private way in.
  • ✓ STT engine live-verified end-to-end, on-device
  • ✓ Push-to-talk hotkey + mic capture working
  • ✓ Vocabulary corrector (Voice AI, Text AI, 4M SAI…) verified
  • ✓ Auto-clipboard inject + session save working
  • ✓ Read-aloud (offline) implemented and tested
  • → Optional LLM cleanup pass (vocabulary-aware)
  • → Menu-bar app packaging for daily use
  • → Launch-at-login + standalone app distribution
  • → Seed vocabulary + style from past dictation history
★ the moonshot

Ensembled multi-model transcription with on-demand arbitration, routing error-correction depth by need: noise to the ensemble, jargon to vocabulary, long-form to a relay pool.
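One way to picture "routing by need" is a small decision function. The sketch below is speculative: the `Utterance` fields, both thresholds, and the `single_model` fallback are assumptions introduced for illustration, and only the three route targets (ensemble, vocabulary, relay pool) come from the description above.

```python
from dataclasses import dataclass


@dataclass
class Utterance:
    noise_level: float       # 0.0 (clean) to 1.0 (very noisy); hypothetical measure
    has_unknown_terms: bool  # True if the transcript contains out-of-vocabulary words
    duration_seconds: float  # length of the recording


NOISE_THRESHOLD = 0.5        # hypothetical cut-off for "noisy"
LONG_FORM_SECONDS = 120.0    # hypothetical cut-off for "long-form"


def route(utterance: Utterance) -> list:
    """Return the correction stages an utterance should pass through."""
    routes = []
    if utterance.noise_level >= NOISE_THRESHOLD:
        routes.append("ensemble")      # noise goes to the model ensemble
    if utterance.has_unknown_terms:
        routes.append("vocabulary")    # jargon goes to vocabulary correction
    if utterance.duration_seconds >= LONG_FORM_SECONDS:
        routes.append("relay_pool")    # long-form goes to the relay pool
    # Assumed fallback: a clean, short, jargon-free clip needs no extra depth.
    return routes or ["single_model"]
```

An utterance can match more than one rule, so the function returns a list of stages and leaves their ordering and arbitration to whatever runs them.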

Imagine this working on your everyday tasks. The deepest how reveals itself when we build it together.
