How Dikt Works

From voice to polished text in seconds. A simple four-step pipeline that runs locally or in the cloud.

The Pipeline

1

Record

Press your global hotkey — or say "Hey Dikt" with wake word listening — and start talking. NAudio captures high-quality audio from your microphone in the background, in any application. Noise profile learning filters out ambient sound, and system audio is optionally dimmed during recording.

2

Transcribe

Your audio is converted to text using Whisper.cpp locally on your machine (with optional GPU acceleration via CUDA or DirectML), via OpenAI's Whisper API, or GPT-4o Audio for maximum accuracy.

3

AI Cleanup

Optional AI post-processing with Claude or GPT fixes grammar, punctuation, and filler words. Choose an AI persona for your writing style, apply word correction rules, translate to another language, or enable Markdown voice mode for structured documents. A profanity filter removes curse words automatically.

4

Inject

The polished text is automatically injected at your cursor position in whatever application you're using. Optionally hear it read back with TTS preview before injection. If injection fails, the clipboard re-inject queue saves it for one-click retry.

Local vs Cloud Transcription

Choose the approach that fits your needs. Use both with automatic failover.

Local (Whisper.cpp)Cloud (OpenAI API)GPT-4o Audio
CostFreePay-per-usePay-per-use
Internet RequiredNoYesYes
PrivacyAudio stays on deviceAudio sent to OpenAIAudio sent to OpenAI
AccuracyGood (varies by model)ExcellentBest
SpeedDepends on hardwareFast (server-side)Fast (server-side)
EngineWhisper.cpp modelsOpenAI Whisper APIGPT-4o multimodal
GPU AccelerationCUDA / DirectMLN/A (server-side)N/A (server-side)

Privacy & Security

Your voice data is yours. Dikt is designed from the ground up to keep your information private and secure.

  • DPAPI encryption for all API keys stored on disk
  • No server-side storage of your transcriptions or audio
  • Local-only mode disables all network features entirely
  • Anonymous telemetry is opt-out — disable anytime in Settings
  • Atomic file writes prevent settings corruption

Whisper Model Comparison

Choose the model that balances speed, accuracy, and disk space for your needs.

ModelSizeSpeedAccuracyBest For
tiny~75 MBFastestBasicQuick drafts, low-resource machines
base~142 MBFastGoodGeneral use with decent hardware
small~466 MBModerateVery GoodBalanced speed and accuracy
medium~1.5 GBSlowerExcellentHigh-accuracy offline transcription
large-v3-turbo~1.5 GBModerateBestBest accuracy-to-speed ratio, multi-language
large-v3~2.9 GBSlowestBestMaximum accuracy, multi-language

Ready to Try Dikt?

14-day free trial. No credit card required. All features included.

Stay in the loop

Get product updates, tips, and news delivered to your inbox.