On-device AI transcription. No cloud required.
Run whisper.cpp on Android with one Gradle line. 99 languages, on-device — your audio never leaves the phone. No NDK setup, no source build, no API key. Start free, go Pro for real-time streaming.
// on-device AI · arm64-v8a · API 24+ · Android 15 ready · 16 KB aligned
Your audio stays on the phone.
Cloud APIs send your audio to a server. Whisper Android runs the AI model locally — nothing uploaded, nothing stored, nothing billed per minute. Private by design, works offline, zero latency to a server.
Transcribe files for free. Stream in real time with Pro.
The free tier gives you full whisper.cpp transcription in one Gradle line. Pro adds real-time streaming from the microphone with AI-powered voice detection (Silero VAD), so you can build dictation, live subtitles and voice assistants.
- Full whisper.cpp transcription (file → text)
- 99 languages · auto-detect · translate to English
- Segments with timestamps · processing time
- Clean Kotlin coroutine API
- arm64-v8a · CPU NEON optimized
- Everything in Free
- Real-time streaming (mic → text, VAD-segmented)
- Silero VAD bundled — AI voice detection, skip silence
- Dual AI pipeline: Silero detects speech, Whisper transcribes
- arm64-v8a + x86_64 (emulators, Chromebooks)
- Quantized model support (q4_0, q5_1, q8_0)
Downloaded separately — pick what fits your app.
Models are not bundled in the AAR (too large). Download the one that matches your speed/quality tradeoff and ship it with your app or download it at first launch. All models support 99 languages.
No NDK. No source build. Just add the dependency.
The prebuilt AAR includes the compiled libwhisper.so and the Kotlin API. Add one line to your build file, download a model, transcribe.
Decode any audio → transcribe with Whisper.
Whisper needs 16 kHz mono PCM. Pair it with FFmpegKit to decode MP4, MKV, AAC, OGG or any format into a WAV that Whisper can transcribe — a full media-to-text pipeline on device.
