merve's Collections
Video Language Models
Updated Aug 1
A collection of video-language models
Video Llava • Space • Running on Zero • 16
llava-hf/LLaVA-NeXT-Video-7B-hf • Video-Text-to-Text • Updated Aug 16 • 428k • 35
llava-hf/LLaVA-NeXT-Video-7B-DPO-hf • Video-Text-to-Text • Updated Aug 16 • 14.6k • 5
llava-hf/LLaVA-NeXT-Video-7B-32K-hf • Image-Text-to-Text • Updated Aug 16 • 6.22k • 6
Llava Interleave • Space • Running on Zero • 29