Visual Question Answering
A list of 23 open-source LLM models for Visual Question Answering.

deplot

internlm xcomposer2d5 ol 7b

Mini C P M V 2

blip vqa base

matcha chart2text pew

Astro L La V A_v2

Era X V L 7 B V2.0 Preview

vilt b32 finetuned vqa

blip vqa capfilt large

pix2struct infographics vqa large

Ziya B L I P2 14 B Visual v1

Ivy V L llava

Mini C P M V

Vintern 1 B v2

Era X V L 2 B V1.5

Vinqw 1 B v1

git base vqav2

Era X V L 7 B V1.5

Video Refer 7 B stage2.5

pix2struct widget captioning large

pix2struct screen2words base

pix2struct screen2words large

Video Refer 7 B