High Fidelity Audio

Kokoro-82M

Kokoro-82M is a tiny 82M-parameter text-to-speech AI that delivers lifelike, natural-sounding speech faster than cloud APIs, running locally on everyday hardware with no GPU needed. Customize with 11+ voices, speed controls from 0.1x to 5x, and seamless handling of long text for voiceovers, apps, and real-time interactions.

10+ Languages

67 Voices

Ultra-Low Latency

Optimized for clear, natural speech synthesis.

Get Started

Text to Speech

Turn written words into audio.
Paste your script, select a preset voice, and generate high-quality spoken audio instantly.

Generate Audio

Browse Voice Library

Find the perfect sound.
Listen to samples of all available voices to find the right tone for your project before you generate.

Audition Voices

Why use Kokoro-82M?

High-Quality Speech Synthesis

Generates natural-sounding speech using StyleTTS2 architecture with only 82M parameters, outperforming larger models on benchmarks

Efficient Long-Text Processing

Automatically splits and processes text of any length for seamless handling of long-form content

Voice and Speed Control

Offers 11 distinct voices (American/British English) and adjustable speed from 0.1x to 5x

Try These with Kokoro-82M

Storytelling copy

"Once upon a time in a misty forest, a brave little fox named Finn discovered a hidden glowing cave filled with ancient treasures. As he ventured deeper, whispers of forgotten legends echoed around him, guiding him to the heart of the mountain where destiny awaited."

Highlights a whimsical adventure tale to showcase narrative flow and emotional buildup.

Conversational copy

"Hey there, have you tried the new Kokoro TTS yet? It's super lightweight, runs on almost any hardware, and sounds incredibly natural—like chatting with a friend over coffee."

Demonstrates casual dialogue with enthusiasm to test everyday conversational intonation.

Educational copy

"Kokoro-82M is an 82-million-parameter text-to-speech model that generates high-quality English speech using under 2 GB of VRAM, supporting voices like af_heart and bm_george for real-time applications."

Explains key technical features clearly to evaluate informative and precise delivery.

Dramatic copy

"In the dead of night, thunder crashed as the hero faced the towering shadow beast, his heart pounding with fury. With a final roar, he struck true, shattering the darkness forever!"

Emphasizes intense action and emotion to highlight dramatic pacing and vocal intensity.

Sample scripts — click any card to copy

How to generate

Go to Tool

Navigate to the "Text to Speech" page.

Select Model

Choose Kokoro-82M and pick a Voice.

Enter Text

Type or paste your script to be spoken.

Generate

Click generate and download your MP3 instantly.

Made with ❤ by AI4Chat

Try AI4Chat for $1!

Upgrade to Premium

Credits Exhausted