airlsyn

AI & ML interests

AI & RL

Recent Activity

upvoted an article about 23 hours ago

We Got Claude to Fine-Tune an Open Source LLM

updated a collection 2 days ago

Multi-Modal

liked a dataset 5 days ago

openbmb/InfLLM-V2-data-5B

Organizations

None yet

upvoted an article about 23 hours ago

We Got Claude to Fine-Tune an Open Source LLM

Article • 3 days ago • 235

upvoted a paper 12 days ago

Scaling Spatial Intelligence with Multimodal Foundation Models

Paper • 2511.13719 • Published 20 days ago • 44

upvoted a paper 18 days ago

Souper-Model: How Simple Arithmetic Unlocks State-of-the-Art LLM Performance

Paper • 2511.13254 • Published 20 days ago • 134

upvoted a paper about 1 month ago

Supervised Reinforcement Learning: From Expert Trajectories to Step-wise Reasoning

Paper • 2510.25992 • Published Oct 29 • 44

upvoted an article about 1 month ago

Streaming datasets: 100x More Efficient

Article • Oct 27

upvoted a paper about 1 month ago

Scalable Vision Language Model Training via High Quality Data Curation

Paper • 2501.05952 • Published Jan 10 • 5

upvoted an article about 1 month ago

Supercharge your OCR Pipelines with Open Models

Article • Oct 21 • 273

upvoted a paper about 2 months ago

From Pixels to Words -- Towards Native Vision-Language Primitives at Scale

Paper • 2510.14979 • Published Oct 16 • 65

upvoted an article 2 months ago

Smol2Operator: Post-Training GUI Agents for Computer Use

Article • Sep 23 • 130

upvoted a paper 2 months ago

Reinforcement Learning on Pre-Training Data

Paper • 2509.19249 • Published Sep 23 • 68

upvoted 2 papers 3 months ago

TiKMiX: Take Data Influence into Dynamic Mixture for Language Model Pre-training

Paper • 2508.17677 • Published Aug 25 • 14

R1-Reward: Training Multimodal Reward Model Through Stable Reinforcement Learning

Paper • 2505.02835 • Published May 5 • 29

upvoted an article 4 months ago

Introducing Trackio: A Lightweight Experiment Tracking Library from Hugging Face

Article • Jul 29 • 202

upvoted a paper 4 months ago

Group Sequence Policy Optimization

Paper • 2507.18071 • Published Jul 24 • 313

upvoted an article 5 months ago

OpenReasoning-Nemotron: A Family of State-of-the-Art Distilled Reasoning Models

Article • Jul 18

upvoted a paper 5 months ago

Test-Time Scaling with Reflective Generative Model

Paper • 2507.01951 • Published Jul 2 • 107

upvoted 3 articles 5 months ago

makeMoE: Implement a Sparse Mixture of Experts Language Model from Scratch

Article • May 7, 2024 • 109

Efficient MultiModal Data Pipeline

Article • Jul 8

SmolLM3: smol, multilingual, long-context reasoner

Article • Jul 8 • 734

upvoted a paper 5 months ago

Does Math Reasoning Improve General LLM Capabilities? Understanding Transferability of LLM Reasoning

Paper • 2507.00432 • Published Jul 1 • 79
