Kokoro-82M
Kokoro-82M is a compact 82-million-parameter text-to-speech model that produces lifelike, natural-sounding speech quickly enough for real-time use, and it runs locally on everyday hardware with no GPU needed. Customize the output with 11+ voices and speed controls from 0.1x to 5x, and feed it long text for voiceovers, apps, and real-time interactions.
Optimized for clear, natural speech synthesis.
Get Started
Text to Speech
Turn written words into audio.
Paste your script, select a preset voice, and generate high-quality spoken audio instantly.
Browse Voice Library
Find the perfect sound.
Listen to samples of all available voices to find the right tone for your project before you generate.
Why use Kokoro-82M?
High-Quality Speech Synthesis
Generates natural-sounding speech with a StyleTTS 2-based architecture and only 82M parameters, and is reported to outperform larger models on benchmarks.
Efficient Long-Text Processing
Automatically splits and processes text of any length, so long-form content needs no manual segmentation.
Voice and Speed Control
Offers 11 distinct voices in American and British English, with adjustable speed from 0.1x to 5x.
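For readers curious how long-text splitting and speed limits might work behind the scenes, here is a minimal Python sketch. It is an illustration only, not Kokoro-82M's actual implementation: the function names `split_text` and `clamp_speed`, the 300-character chunk size, and the sentence-boundary rule are all assumptions. Only the 0.1x to 5x range comes from this page.

```python
import re

# Speed range quoted on this page; everything else below is illustrative.
MIN_SPEED, MAX_SPEED = 0.1, 5.0


def clamp_speed(speed: float) -> float:
    """Keep a requested speed multiplier inside the supported range."""
    return max(MIN_SPEED, min(MAX_SPEED, speed))


def split_text(text: str, max_chars: int = 300) -> list[str]:
    """Greedily pack whole sentences into chunks of at most max_chars.

    A single sentence longer than max_chars is kept as its own chunk
    rather than being cut mid-sentence.
    """
    # Split after ., ! or ? when followed by whitespace (a simple heuristic).
    sentences = re.split(r"(?<=[.!?])\s+", text.strip())
    chunks: list[str] = []
    current = ""
    for sentence in sentences:
        if not sentence:
            continue
        candidate = f"{current} {sentence}".strip()
        if current and len(candidate) > max_chars:
            # Adding this sentence would overflow, so close the current chunk.
            chunks.append(current)
            current = sentence
        else:
            current = candidate
    if current:
        chunks.append(current)
    return chunks
```

Each chunk could then be synthesized in turn and the audio segments joined in order. Splitting on sentence boundaries keeps pauses and intonation natural at the joins.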
Try These with Kokoro-82M
"Once upon a time in a misty forest, a brave little fox named Finn discovered a hidden glowing cave filled with ancient treasures. As he ventured deeper, whispers of forgotten legends echoed around him, guiding him to the heart of the mountain where destiny awaited."
Highlights a whimsical adventure tale to showcase narrative flow and emotional buildup.
"Hey there, have you tried the new Kokoro TTS yet? It's super lightweight, runs on almost any hardware, and sounds incredibly natural—like chatting with a friend over coffee."
Demonstrates casual dialogue with enthusiasm to test everyday conversational intonation.
"Kokoro-82M is an 82-million-parameter text-to-speech model that generates high-quality English speech using under 2 GB of VRAM, supporting voices like af_heart and bm_george for real-time applications."
Explains key technical features clearly to evaluate informative and precise delivery.
"In the dead of night, thunder crashed as the hero faced the towering shadow beast, his heart pounding with fury. With a final roar, he struck true, shattering the darkness forever!"
Emphasizes intense action and emotion to highlight dramatic pacing and vocal intensity.
How to generate
Go to Tool
Navigate to the "Text to Speech" page.
Select Model
Choose Kokoro-82M and pick a Voice.
Enter Text
Type or paste your script to be spoken.
Generate
Click Generate, then download your MP3.
Compare Voice Models
Unsure which voice sounds best? Test Kokoro-82M against others in our Speech Playground.
Open Speech Playground
Made with ❤ by AI4Chat