Other
Text Generation
Text To Image
Text Classification
Fill Mask
Text2text Generation
Automatic Speech Recognition
Image Text To Text
Sentence Similarity
Token Classification
Feature Extraction
Translation
Image Classification
Text To Speech
Question Answering
Image To Image
Summarization
Object Detection
Image Segmentation
Image To Text
Text To Video
Audio Classification
Zero Shot Image Classification
Image Feature Extraction
Zero Shot Classification
Audio To Audio
Visual Question Answering
Reinforcement Learning
Image To Video
Text To Audio
Video Classification
Unconditional Image Generation
Time Series Forecasting
Video Text To Text
Any To Any
Audio Text To Text
Depth Estimation
Text To 3d
Image To 3d
Robotics
Mask Generation
Table Question Answering
Voice Activity Detection
Document Question Answering
Keypoint Detection
Zero Shot Object Detection
Tabular Classification
Graph Ml
Tabular Regression
Automatic Speech Recognition
List of Automatic Speech Recognition 194 Open Source LLM models.

whisper large v3 turbo

whisper large v3

seamless m4t v2 large

speaker diarization 3.1

whisper.cpp

speaker diarization

whisper small

Crisper Whisper

moonshine tiny

anime whisper

canary 1b

whisper medium

whisper small cantonese

faster whisper large v3

parakeet rnnt 1.1b

moonshine base

overlapped speech detection

mms 300m 1130 forced aligner

wav2vec2 large xlsr 53 english

wav2vec2 large xlsr persian v2

voice activity detection

whisper base

whisper large
Paraformer large

mms 1b all

seamless m4t large

distil whisper large v3 es

whisperkit coreml
whisper large v3 russian

moonshine

faster whisper large v3 turbo ct2

kotoba whisper v2.2
whisper large v3 turbo swissgerman

moonshine base O N N X

kb whisper large beta

xls r 2b nl v2_lm 5gram os2_hunspell

fonxlsr

hubert large ls960 ft

s2t small librispeech asr

wav2vec2 base 960h

wav2vec2 large 960h lv60 self

wav2vec2 large xlsr 53 japanese

wav2vec2 large xlsr 53 persian

wav2vec2 large xlsr 53 polish

wav2vec2 large xlsr 53 portuguese

wav2vec2 xls r 1b russian
wav2vec2 large xlsr korean

wav2vec2 large xlsr persian v3

wav2vec2 large xlsr turkish

dvoice amharic

asr wav2vec2 dvoice darija

Sharif wav2vec2

stt_zh_conformer_transducer_large
wav2vec2 large chinese zh cn

whisper tiny

whisper base.en

stt_be_conformer_ctc_large

stt_be_conformer_transducer_large

wav2vec2 xls r 300m en atc atcosim

wav2vec2 xls r 300m en atc uwb atcc and atcosim

wav2vec2 large 960h lv60 self en atc uwb atcc

whisper large zh cv11
whisper large v2 cv11 german

whisper medium da

whisper small luxembourgish

asr whisper large v2 commonvoice fa

whisper large v2 Ko
whisper_tflite
wav2vec2 large xlsr moroccan darija

whisper base

whisper large v2

whisper medium

whisper large
Bangla A S R
faster whisper large v2 japanese 5k steps
faster whisper large v2 Ko

wav2vec2 large xlsr 53 english

vosk model small ru

sherpa onnx pruned transducer stateless7 streaming id

faster whisper tiny
M E Ra Li O N Audio L L M Whisper S E A L I O N

wav2vec2 large xlsr 53 th

wav2vec2 large xlsr 53 vietnamese

sinai voice ar stt
wav2vec2 large xlsr cantonese

hubert xlarge ls960 ft

wav2vec2 base 100k voxpopuli

wav2vec2 base 10k voxpopuli ft sk

wav2vec2 large 960h

wav2vec2 large robust ft swbd 300h

wav2vec2 lv 60 espeak cv ft

wav2vec2 xlsr 53 espeak cv ft
wav2vec2 large xlsr indonesian

wav2vec2 large xlsr 53 arabic

wav2vec2 large xlsr 53 chinese zh cn

wav2vec2 large xlsr 53 french

wav2vec2 large xlsr 53 russian

wav2vec2 large xlsr 53 spanish

wav2vec2 urdu

unispeech sat base 100h libri ft
wav2vec2 xls r 300m cv7 turkish
wav2vec2 xls r 300m cv8 turkish

asr transformer aishell

asr wav2vec2 commonvoice en

wav2vec2 xlsr multilingual 56
wav2vec2 xls r 300m phoneme

stt_en_conformer_transducer_xlarge

hubert large arabic egyptian

indicwav2vec hindi

whisper tiny.en

whisper small.en

stt_ru_conformer_transducer_large

whisper base european

whisper small hi

wav2vec2 base vi vlsp2020
pop2piano

lyric alignment
asr wav2vec2 ctc french

whisper small pt
whisper_italian

whisper large czech cv11

whisper large v2

wav2vec2 mbart50 ru

whisper medium french
whisper medium cv tr

stt_eo_conformer_transducer_large

whisper base ar quran
whisper medium pashto

whisper large v2 cantonese
whisper medium ko zeroth
whisper S V

whisper medium portuguese
whisper tiny zh
whisper large v2 ru

whisper large v2 mix jp

whisper kannada tiny

whisper small sindhi

whisper tamil medium

whisper medium turkish 2
whisper medium french

Whisper Small Kinyarwanda

whisper hindi large v2

whisper large v2 japanese 5k steps

Whisper Hindi2 Hinglish Prime

Belle whisper large v3 turbo zh

Whisper Hindi2 Hinglish Swift

whisper largev2 cantonese peft lora

faster whisper large v2

parakeet tdt_ctc 110m

whisper large icelandic 10k steps 1000h

Vi Whisper medium

distil large v3

wav2vec2 large xlsr bahasa indonesia

hindi_model_with_lm_vakyansh

Quran_speech_recognizer

wav2vec2 large xlsr 53 arabic egyptian
wav2vec2 large xlsr kazakh

wav2vec2 large xlsr 53 ukrainian

wav2vec2 large xls r 300m ha cv8

wav2vec2 large xlsr basque

wav2vec2 xls r 300m cs 250

wav2vec2 large xlsr 53 arabic

Hoon_ Chung_jsut_asr_train_asr_conformer8_raw_char_sp_valid.acc.ave

brianyan918_iwslt22_dialect_train_asr_conformer_ctc0.3_lr2e 3_warmup15k_newspecaug

data2vec audio base 960h

s2t medium mustc multilingual st

wav2vec2 base 10k voxpopuli ft es

wav2vec2 large robust ft libri 960h

wav2vec2 large xlsr 53 portuguese

wav2vec2 large xlsr 53 spanish

wav2vec2 xls r 2b 21 to en

romanian wav2vec2
espnet2_librispeech_100_conformer_word

wav2vec2 large xlsr 53 dutch

wav2vec2 large xlsr open brazilian portuguese v2
xls r uyghur cv7

wav2vec2 base 100k gtzan music genres

wav2vec2 xlsr greek speech emotion recognition

xlsr sg lm

wav2vec2 base vietnamese 250h

wav2vec2 large xlsr turkish demo colab

speaker segmentation

wav2vec2 xlsr 1b finnish lm v2

farsi_commonvoice_blstm
viwav2vec2 base 3k

dvoice kabyle

asr wav2vec2 dvoice wolof

wav2vec2 bloom speech bam

wav2vec2 bloom speech snk

stt_uk_citrinet_1024_gamma_0_25

whisper medium.en
wav2vec2 large english T I M I T phoneme_v3

whisper large v3 turbo german

stt_en_fastconformer_tdt_large
Newsletter
Get updates on new LLM models, and other cool AI stuff.