Image To Text
List of 44 open source Image To Text models.

blip image captioning base

trocr large printed

blip image captioning large
vit gpt2 image captioning

OCR Donut CORD

trocr base handwritten

pix2text mfr
Qwen2 VL 7B Captioner Relaxed
musk
Florence 2 base gemini 2.0 flash thinking exp 1219 v0.2

Image Guard
manga ocr base

trocr base printed

trocr large handwritten
donut base finetuned cord v2
donut base

git base coco

git large coco

pix2struct textcaps base
ko trocr

recognize_anything_model
invoice parser

mblip mt0 xl
uae license detection

trocr large spanish

llava llama 3 8b v1_1 gguf

thai trocr

ocr for captcha

trocr small handwritten

trocr small printed
manga ocr
donut base finetuned rvlcdip

iam_handwriting_ocr

git base

blip2 flan t5 xl coco

donut base finetuned invoices

pix2struct base

TrOCR

nougat base

trocr base stage1

trocr large stage1

doctr dummy torch crnn mobilenet v3 small

trocr large str

kosmos 2 patch14 224