Hebrew TTS Snapshot

A snapshot of Hebrew text-to-speech capabilities as of 22 March 2025, comparing voice quality across multiple TTS providers — including voice cloning experiments via Replicate.

Last updated: 06/04/2026

Hebrew TTS Snapshot

A snapshot of Hebrew text-to-speech capabilities as of 22 March 2025, comparing voice quality across multiple TTS providers — including voice cloning experiments via Replicate.

Key Findings

Voice Cloning Method

Voice clones for MiniMax and Chatterbox were generated using Replicate:

  1. ~1 minute of English source audio per voice (see voice-sources/english/)

  2. Voice clone IDs created on Replicate from those samples

  3. Hebrew text generated using the cloned voice IDs with Hebrew language parameter

  4. Hebrew reference audio also tested (see voice-sources/hebrew/)

MiniMax used the T2A v2.6 Turbo model with voice clones and Hebrew boost enabled.

Repository Structure

├── samples/                    # Generated TTS audio output
│   ├── chatterbox/             # Chatterbox multilingual voice cloning (via Replicate)
│   │   ├── run1/               # English reference audio, unvowelised input
│   │   └── run2/               # Hebrew reference audio
│   ├── edge-tts/               # Microsoft Edge TTS
│   │   ├── avri-100pc/         # Avri voice, normal speed
│   │   ├── avri-70pc/          # Avri voice, 70% speed
│   │   ├── hila-100pc/         # Hila voice, normal speed
│   │   └── hila-70pc/          # Hila voice, 70% speed
│   ├── elevenlabs/             # ElevenLabs v3 model (Rachel, Adam, Bella)
│   ├── gemini/                 # Google Gemini 2.5 Flash Preview TTS (Puck, Zephyr)
│   ├── minimax/                # MiniMax T2A v2.6 Turbo voice clones (Corn, Herman)
│   └── resemble/               # Resemble AI
│       ├── stock/              # Avigail (Hebrew preset voice)
│       └── voice-clone/        # Chatterbox multilingual clone (Herman)
├── voice-sources/              # Input audio used for voice cloning
│   ├── english/                # ~1 min English samples (corn, daniel, herman)
│   └── hebrew/                 # Hebrew reference samples
│       ├── corn/
│       ├── daniel/
│       └── herman/
├── texts/                      # Hebrew text prompts
│   ├── source/                 # Texts used for TTS generation (+ PDF versions)
│   └── target/                 # Additional test texts (cooking, music, weather)
└── resources.md                # Links to related tools and services

Provider Details

MiniMax (Best Results)

Edge TTS

ElevenLabs

Google Gemini

Chatterbox (via Resemble/Replicate)

Resemble AI

Resources