
Everything You Need to Dictate Effortlessly
From local AI transcription to intelligent post-processing, Dikt gives you a complete speech-to-text workflow that respects your privacy.
Powerful Transcription Engine
Multiple transcription engines working together to give you the best possible results, whether you're online or offline.
- Local Whisper.cpp with multiple model sizes (tiny to large)
- OpenAI Whisper API for cloud accuracy
- GPT-4o Audio for high-quality cloud transcription
- Real-time streaming preview — words appear as you speak
- Automatic failover between providers
- Retry with exponential backoff
- Multi-language support (100+ languages)
- Model download manager
- Audio dimming — automatically dims system audio during dictation
- GPU acceleration — run local Whisper with CUDA or DirectML for faster transcription (Auto/CPU/CUDA/DirectML)
- Noise profile learning — record 3 seconds of ambient noise; Dikt filters it out of future recordings using spectral subtraction
- Auto-correction suggestions — word-level confidence scores highlight uncertain words before injection so you can review and fix them
AI-Powered Cleanup
Raw transcription is just the beginning. AI post-processing transforms your spoken words into polished, ready-to-use text.
- Grammar and punctuation correction
- Filler word removal ("um", "uh", "like")
- Voice commands ("period", "comma", "new line", "new paragraph")
- Editing voice commands ("scratch that", "delete last word", "select all")
- AI Command Mode — transform selected text with voice instructions (Ctrl+Shift+Space)
- Context-aware formatting adjusts tone by app (formal for email, casual for chat)
- Custom vocabulary for proper nouns, technical terms, and brand names
- Snippets — text shortcuts with {date} and {time} variables
- Profanity filter — automatically removes curse words with built-in + custom word lists
- Swear Jar — fun metrics tracking words cleaned, cost, and daily trends
- Powered by GPT-4o-mini and Claude Haiku
- BYOK: Use your own API keys, pay only for what you use
- Real-time translation — dictate in any language and have the output automatically translated (e.g. speak Spanish, get English text)
- Word correction training — define wrong→correct pairs (e.g. "mispelt" → "misspelled") applied as a post-processing pass before cleanup
- Markdown voice mode — say "heading one", "bullet point", "open code block" and get proper Markdown syntax; auto-enabled in Obsidian, Typora, and VS Code
- Custom AI persona library — choose from 6 built-in personas (Technical Writer, Doctor, Journalist, Lawyer, Code Reviewer, Casual) or create your own
- Multi-turn AI Command Mode — AI Command Mode remembers your last 3 instructions for back-and-forth editing; say "start over" to reset
Works Everywhere You Type
Dikt integrates at the system level so you can dictate into any application without switching windows or copying text.
- Global hotkey (customizable, default Ctrl+Alt+Space)
- Push-to-talk or toggle mode — choose your style
- Push-to-talk grace period — quick-tap toggles recording without holding
- Automatic text injection at cursor position
- Append mode for continuous dictation
- Works with Notepad, Word, Slack, VS Code, Chrome, email, any text field
- Obsidian integration — dictate directly into your vault notes
- Notion integration — push transcriptions to Notion pages via API
- VS Code extension — dictate code comments, docstrings, and commit messages
- Output templates — format with {text}, {date}, {time} placeholders
- Clipboard re-inject queue — when injection fails (no focused window), text is queued; click the overlay to inject into whatever window you focus next
- Code dictation mode — natural language dictation auto-formatted as code when dictating in an IDE (Rider, Visual Studio, Cursor, IntelliJ)
- Wake word / always-on listening — say "Hey Dikt" to start recording hands-free (uses Windows built-in speech recognition)
Privacy-First Design
Your voice data is yours. Dikt is built from the ground up to keep your data private and secure.
- Local-first: transcribe without any internet connection
- DPAPI encryption for API keys at rest
- No data sent to servers unless you choose cloud mode
- BYOK: your keys, your data, your control
- Atomic file writes prevent settings corruption
- Local-only mode disables all network features
Beautiful & Functional
A modern, polished interface that looks great and stays out of your way when you need to focus.
- Modern Fluent Design (WPF-UI)
- Light, dark, and system theme support
- System tray operation — runs quietly in the background
- Dashboard with usage charts and statistics
- Full transcription history with search and playback
- Export in TXT, CSV, JSON, SRT formats
- Daily dictation streak — track your streak and earn milestone badges
- Usage dashboard with words, time saved, provider breakdown, and streak stats
- Cloud sync for vocabulary, snippets, and profiles across devices (Pro)
- Voice profiles — multiple users per machine, each with own settings and history
- TTS preview before inject — hear your transcription read back before it's typed; choose Windows built-in voice or high-quality ElevenLabs neural voice
Accessible to Everyone
Designed to be usable by everyone, with full support for assistive technologies and flexible configuration.
- Full keyboard navigation
- Screen reader compatible
- Setup wizard for first-time configuration
- Notification overlay — configurable type (Minimal/Line/Box), style (Pill/Box/Text), 9 screen positions, multi-monitor display selector, and size scaling
- Compact overlay — persistent floating status window with live waveform indicators (8 styles), Box-mode transcription preview, controls, and history
- Compact overlay features — click-to-open, lock position, auto-hide, multi-monitor support, and size scaling
- Typing animation — letter-by-letter text display in overlays
- Overlay toggle in status bar for quick access
- Customizable notification styles (toast, popup, overlay)
Team Workspaces
Share vocabulary, snippets, and formatting profiles across your team. Everyone stays consistent, no one starts from scratch.
- Shared team vocabulary — proper nouns and jargon available to everyone
- Shared snippets — team-wide text shortcuts with variables
- Shared context profiles — consistent tone across apps
- Per-seat billing — add or remove members anytime
- Team admin controls — owners and admins manage shared settings
- Personal overrides — your personal settings always take priority
- Automatic sync — team settings pull alongside personal cloud sync
- Invite by email — members join with a simple invitation flow
Ready to Get Started?
14-day free trial. No credit card required. All features included.