A comprehensive catalog of AI models for the Hebrew language, covering LLMs, speech-to-text, text-to-speech, NLP, embeddings, translation, and more. This is a community resource maintained on GitHub.
GitHub: danielrosehill/Hebrew-AI-Models
A catalog of AI models for the Hebrew language — LLMs, STT, TTS, NLP, embeddings, translation, and more
Large Language Models (LLMs)
dicta-il (Israel Center for Text Analysis)
DictaLM-3.0-24B — Latest generation Hebrew LLM, 24B parameters with Base, Instruct, and Thinking variants
DictaLM-3.0-Nemotron-12B — Nemotron-based Hebrew LLM with Base and Instruct variants
DictaLM-3.0-1.7B — Compact Hebrew LLM with Base, Instruct, and Thinking variants
DictaLM 2.0 — Previous-gen Hebrew LLM (Mistral-based), 7B with Instruct variant
BEREL 3.0 — Biblical/Rabbinic Hebrew language model
yam-peleg
Hebrew-Gemma-11B-V2 — Hebrew-adapted Gemma 11B with V2, Instruct, and Base variants
Hebrew-Mistral-7B — Hebrew-adapted Mistral 7B, including 200K extended context version
Other Hebrew LLMs
Hebrew_Nemo — Hebrew fine-tune of Mistral Nemo 12B
HebrewGPT (Slasky) — Hebrew-native GPT trained from scratch in 1B and 296M sizes
hebrew-math-tutor-v1 (Intel) — Hebrew math tutoring model, 4B
Llama-3.2-3B-Hebrew-Master — Hebrew fine-tune of Llama 3.2
Speech-to-Text / ASR
ivrit-ai
The leading Hebrew ASR project, community-driven. GitHub: ivrit-ai/ivrit.ai
ivrit.ai codebase
whisper-large-v3-turbo-ct2 — Top Hebrew ASR model, CTranslate2 optimized (22K+ downloads)
whisper-large-v3 — Hebrew fine-tuned Whisper large-v3
pyannote-speaker-diarization-3.1 — Hebrew speaker diarization
imvladikon
wav2vec2-xls-r-300m-hebrew — Most downloaded Hebrew model (144K downloads)
wav2vec2-xls-r-1b-hebrew — Hebrew fine-tuned XLS-R 1B
Text-to-Speech (TTS)
An emerging area for Hebrew with active research:
HebTTS (Technion) — Diacritic-free Hebrew TTS research
slp-rl/HebTTS ★ 109The official implementation of "A Language Modeling Approach to Diacritic-Free Hebrew TTS"
aihebrewslmsttsisrawave — Hebrew TTS project
thewh1teagle/israwave ★ 40Mission to create a Hebrew TTS model as powerful and user-friendly as WaveNet
hebrewisraelpytorchttsZonos-Hebrew — Hebrew variant of Zonos TTS
hebrewTTS-orpheus3b — Hebrew Orpheus TTS 3B
phonikud — Hebrew diacritization/phonemization for TTS pipelines
mms-tts-heb — Hebrew MMS TTS
NLP Models
BERT-based Foundation Models
AlephBERT-base (onlplab/BIU) — Foundational Hebrew BERT (29.6K downloads)
heBERT — Hebrew BERT with NER and sentiment analysis variants
DictaBERT — Comprehensive Hebrew NLP toolkit with NER, sentiment, morphology, parsing, diacritization
HeRo — Hebrew RoBERTa with long-context variant
Embeddings and Translation
neodictabert-bilingual-embed — Bilingual Hebrew-English embeddings
sentence-transformers-alephbert — Hebrew sentence embeddings (4.9K downloads)
opus-mt-en-he (Helsinki-NLP) — English to Hebrew translation
opus-mt-tc-big-he-en (Helsinki-NLP) — Hebrew to English, large model
Key Organizations
dicta-il — Israel Center for Text Analysis. Most comprehensive Hebrew NLP toolkit
ivrit-ai — Leading Hebrew speech recognition, community-driven
onlplab (Bar-Ilan University) — AlephBERT foundational model
slprl (Technion) — Speech Processing Lab, TTS research
imvladikon — wav2vec2 Hebrew (144K downloads), sentence transformers
Helsinki-NLP — OPUS-MT Hebrew translation models
The full catalog includes OCR/vision models, speech foundation models, summarization, curated Hugging Face collections, and more. Visit the GitHub repository for the complete list.