Dwani is a premium local push-to-talk dictation utility for macOS. Hold a key, speak your thoughts, and release. The transcript instantly streams, formatting itself perfectly into whatever application you are writing in.
Select a mockup, choose your HUD overlay skin, and click-and-hold the trigger button to dictate.
Every decision in Dwani centers around providing a zero-friction, native macOS experience.
No wake words, no delays. Hold Right ⌘ (or a custom hotkey), speak instantly, and let go. The transcription inserts exactly at your focused text cursor.
No audio files are ever sent to servers. Everything transcribes locally on your Apple Neural Engine, keeping your dictation confidential.
Seamlessly weave commands. Say "press tab" or "select all" mid-sentence, and Dwani executes native keyboard shortcuts instead of writing text.
Bind spoken shortcuts directly to system workflows. Say "open Claude Code" to run AppleScripts, launch terminal commands, or load URLs.
Train your speech. Map custom strings (e.g. heard words like "type script" or "react js") to print perfectly as "TypeScript" or "React" on output.
Intelligent app-matching rules. Dwani formats casual capitalization for Slack, while preserving lowercase code syntax and tabs in VS Code.
A direct comparison between built-in system dictation and Dwani's developer-focused voice engine.
| Capability | Default macOS Dictation | Dwani (Parakeet/Nemotron) |
|---|---|---|
| Global Trigger | Fn double-tap (slow to start, lacks hold/release mapping) | Push-to-Talk (Hold Right ⌘-zero start latency) |
| Data Processing | Hybrid cloud servers (sends voice files over network) | 100% offline local Neural Engine (private & offline) |
| App Customization | Writes identical text blocks in all applications | App-aware (formats lower-case code vs capitalized chat) |
| Spoken Shortcuts | Writes out words literally ("press enter") | Simulates keystroke actions ("press enter", "tab") |
| Workflow Hooks | None | Execute terminal commands & AppleScripts directly |
| Jargon Training | Manual Keyboard shortcuts list replacement rules | Phonetic custom vocabulary compilation sandbox |
Dwani ships NVIDIA's **Parakeet TDT v3** model natively integrated via FluidAudio, with support for the cutting-edge **Nemotron ASR** engine currently in parallel development. Both operate directly on the Apple Silicon Neural Engine (ANE) to provide unparalleled response times and record-low Word Error Rates (WER).
*Real-Time Factor (RTFx) represents processing throughput: higher is faster. Parakeet TDT v3 and Nemotron ASR complete transcription in microseconds, using almost zero CPU cycles by utilizing the Apple Silicon Neural Engine.
Configure how Dwani reformats shorthand text or technical jargon instantly.
Everything you need to know about setting up and running Dwani on your Mac.
Experience dictation at the speed of thought. 100% offline, private, and highly customizable for developer workflows.