Hebrew AI Models: A Comprehensive Catalog
A comprehensive catalog of AI models for the Hebrew language, covering LLMs, speech-to-text, text-to-speech, NLP, embeddings, translation, and more. This is a community resource maintained on GitHub.
GitHub: danielrosehill/Hebrew-AI-Models
A catalog of AI models for the Hebrew language — LLMs, STT, TTS, NLP, embeddings, translation, and more
Large Language Models (LLMs)
dicta-il (Israel Center for Text Analysis)
DictaLM-3.0-24B — Latest generation Hebrew LLM, 24B parameters with Base, Instruct, and Thinking variants
DictaLM-3.0-Nemotron-12B — Nemotron-based Hebrew LLM with Base and Instruct variants
DictaLM-3.0-1.7B — Compact Hebrew LLM with Base, Instruct, and Thinking variants
DictaLM 2.0 — Previous-gen Hebrew LLM (Mistral-based), 7B with Instruct variant
BEREL 3.0 — Biblical/Rabbinic Hebrew language model
yam-peleg
Hebrew-Gemma-11B-V2 — Hebrew-adapted Gemma 11B with V2, Instruct, and Base variants
Hebrew-Mistral-7B — Hebrew-adapted Mistral 7B, including 200K extended context version
Other Hebrew LLMs
Hebrew_Nemo — Hebrew fine-tune of Mistral Nemo 12B
HebrewGPT (Slasky) — Hebrew-native GPT trained from scratch in 1B and 296M sizes
hebrew-math-tutor-v1 (Intel) — Hebrew math tutoring model, 4B
Llama-3.2-3B-Hebrew-Master — Hebrew fine-tune of Llama 3.2
Speech-to-Text / ASR
ivrit-ai
The leading Hebrew ASR project, community-driven. GitHub: ivrit-ai/ivrit.ai
ivrit.ai codebase
whisper-large-v3-turbo-ct2 — Top Hebrew ASR model, CTranslate2 optimized (22K+ downloads)
whisper-large-v3 — Hebrew fine-tuned Whisper large-v3
pyannote-speaker-diarization-3.1 — Hebrew speaker diarization
imvladikon
wav2vec2-xls-r-300m-hebrew — Most downloaded Hebrew model (144K downloads)
wav2vec2-xls-r-1b-hebrew — Hebrew fine-tuned XLS-R 1B
Text-to-Speech (TTS)
An emerging area for Hebrew with active research:
HebTTS (Technion) — Diacritic-free Hebrew TTS research
slp-rl/HebTTS ★ 108The official implementation of "A Language Modeling Approach to Diacritic-Free Hebrew TTS"
aihebrewslmsttsisrawave — Hebrew TTS project
thewh1teagle/israwave ★ 39Mission to create a Hebrew TTS model as powerful and user-friendly as WaveNet
hebrewisraelpytorchttsZonos-Hebrew — Hebrew variant of Zonos TTS
hebrewTTS-orpheus3b — Hebrew Orpheus TTS 3B
phonikud — Hebrew diacritization/phonemization for TTS pipelines
mms-tts-heb — Hebrew MMS TTS
NLP Models
BERT-based Foundation Models
AlephBERT-base (onlplab/BIU) — Foundational Hebrew BERT (29.6K downloads)
heBERT — Hebrew BERT with NER and sentiment analysis variants
DictaBERT — Comprehensive Hebrew NLP toolkit with NER, sentiment, morphology, parsing, diacritization
HeRo — Hebrew RoBERTa with long-context variant
Embeddings and Translation
neodictabert-bilingual-embed — Bilingual Hebrew-English embeddings
sentence-transformers-alephbert — Hebrew sentence embeddings (4.9K downloads)
opus-mt-en-he (Helsinki-NLP) — English to Hebrew translation
opus-mt-tc-big-he-en (Helsinki-NLP) — Hebrew to English, large model
Key Organizations
dicta-il — Israel Center for Text Analysis. Most comprehensive Hebrew NLP toolkit
ivrit-ai — Leading Hebrew speech recognition, community-driven
onlplab (Bar-Ilan University) — AlephBERT foundational model
slprl (Technion) — Speech Processing Lab, TTS research
imvladikon — wav2vec2 Hebrew (144K downloads), sentence transformers
Helsinki-NLP — OPUS-MT Hebrew translation models
The full catalog includes OCR/vision models, speech foundation models, summarization, curated Hugging Face collections, and more. Visit the GitHub repository for the complete list.
Daniel Rosehill
AI developer and technologist specializing in AI systems, workflow orchestration, and automation. Specific interests include agentic AI, workflows, MCP, STT and ASR, and multimodal AI.