Li's picture

Li

Jia-ao

·

grayJiaaoLi

AI & ML interests

None yet

Organizations

upvoted a paper 4 months ago

Idea2Story: An Automated Pipeline for Transforming Research Concepts into Complete Scientific Narratives

Paper • 2601.20833 • Published Jan 28 • 183

upvoted a paper 5 months ago

Qwen2.5-Coder Technical Report

Paper • 2409.12186 • Published Sep 18, 2024 • 154

upvoted an article 6 months ago

Article

Supercharge your OCR Pipelines with Open Models

+5

merve, ariG23498, davanstrien, hynky, andito, reach-vb, pcuenq

•

Oct 21, 2025

• 313

upvoted a collection 8 months ago

Qwen2.5

Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 43 items • Updated Mar 2 • 720

upvoted a collection 12 months ago

Qwen3

84 items • Updated Dec 31, 2025 • 1.79k

upvoted a paper about 1 year ago

Universal NER: A Gold-Standard Multilingual Named Entity Recognition Benchmark

Paper • 2311.09122 • Published Nov 15, 2023 • 8

upvoted an article about 1 year ago

Article

Introducing EuroBERT: A High-Performance Multilingual Encoder Model

EuroBERT

•

Mar 10, 2025

• 147

upvoted a collection over 1 year ago

SmolLM2

State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 16 items • Updated May 5, 2025 • 308

upvoted an article over 1 year ago

Article

Introduction to ggml

+1

ngxson, ggerganov, slaren

•

Aug 13, 2024

• 286

upvoted 2 articles almost 2 years ago

Article

XetHub is joining Hugging Face!

yuchenglow, julien-c

•

Aug 8, 2024

• 117

Article

SmolLM - blazingly fast and remarkably powerful

+1

loubnabnl, anton-l, eliebak

•

Jul 16, 2024

• 455

upvoted a collection almost 2 years ago

🪐 SmolLM

A series of smol LLMs: 135M, 360M and 1.7B. We release base and Instruct models as well as the training corpus and some WebGPU demos • 12 items • Updated May 5, 2025 • 251

upvoted 2 articles almost 2 years ago

Article

Docmatix - a huge dataset for Document Visual Question Answering

andito, HugoLaurencon

•

Jul 18, 2024

• 78

Article

Ethics and Society Newsletter #6: Building Better AI: The Importance of Data Quality

+8

evijit, frimelle, yjernite, meg, irenesolaiman, dvilasuero, fdaudens, BrigitteTousi, giadap, sasha

•

Jun 24, 2024

• 34

upvoted a collection about 2 years ago

Leaderboards and benchmarks ✨

Cool leaderboard spaces collection for models across modalities! Text, vision, audio, ... • 88 items • Updated Mar 2 • 119