Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
qihoo360 's Collections
RefVTON
RzenEmbed
TinyR1
FG-CLIP 2
360Zhinao3
FG-CLIP
360Zhinao
360Zhinao2
Light-R1
Light-IF

FG-CLIP 2

updated Nov 6

FG-CLIP 2 is the foundation model for fine-grained vision-language understanding in both English and Chinese.

Upvote
5

  • FG-CLIP 2: A Bilingual Fine-grained Vision-Language Alignment Model

    Paper • 2510.10921 • Published Oct 13 • 10

  • qihoo360/fg-clip2-base

    Zero-Shot Image Classification • 0.4B • Updated Nov 6 • 4.6k • 21

  • qihoo360/fg-clip2-large

    Zero-Shot Image Classification • 0.9B • Updated Oct 20 • 1k • 9

  • qihoo360/fg-clip2-so400m

    Zero-Shot Image Classification • 1B • Updated Oct 20 • 740 • 5

  • qihoo360/LIT-CN

    Updated Oct 20 • 62 • 1

  • qihoo360/BoxClass-CN

    Updated Oct 15 • 34 • 1

  • qihoo360/DOCCI-CN

    Viewer • Updated Oct 20 • 5k • 68 • 1

  • qihoo360/DCI-CN

    Updated Oct 20 • 65

  • Running
    1

    FG CLIP2 Densefeature Demo

    🦀
    1

    Visualize image similarity to labels


  • Running
    2

    FG CLIP2 Retrieval Demo

    💻
    2

    Classify images based on given labels

Upvote
5
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs