NickyNicky (Nicky)

upvoted 4 articles 9 months ago

Article

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

Feb 7, 2025

•

282

Article

Introducing AutoRound: Intel’s Advanced Quantization for LLMs and VLMs

+7

Apr 29, 2025

•

44

Article

Gemma 3n fully available in the open-source ecosystem!

+6

Jun 26, 2025

•

121

Article

Training and Finetuning Sparse Embedding Models with Sentence Transformers v5

Jul 1, 2025

•

138

upvoted an article about 1 year ago

Article

Introducing EuroBERT: A High-Performance Multilingual Encoder Model

Mar 10, 2025

•

147

upvoted a paper about 1 year ago

Magma: A Foundation Model for Multimodal AI Agents

Paper • 2502.13130 • Published Feb 18, 2025 • 58

upvoted 2 articles about 1 year ago

Article

Open R1: Update #2

Feb 10, 2025

•

218

Article

wHy DoNt YoU jUsT uSe ThE lLaMa ToKeNiZeR??

Sep 27, 2024

•

54

upvoted a paper about 1 year ago

RuSentNE-2023: Evaluating Entity-Oriented Sentiment Analysis on Russian News Texts

Paper • 2305.17679 • Published May 28, 2023 • 2

upvoted 7 articles about 1 year ago

Article

Open-source DeepResearch – Freeing our search agents

+3

Feb 4, 2025

•

1.32k

Article

Welcome to Inference Providers on the Hub 🔥

+5

Jan 28, 2025

•

495

Article

The AI tools for Art Newsletter - Issue 1

Jan 31, 2025

•

84

Article

The N Implementation Details of RLHF with PPO

+1

Oct 24, 2023

•

72

Article

PEFT: Parameter-Efficient Fine-Tuning Methods for LLMs

Jan 24, 2025

•

58

Article

Distributed SFT with trl and DeepSpeed Part 1: Starting Locally

Jan 23, 2025

•

4

Article

We now support VLMs in smolagents!

+1

Jan 24, 2025

•

113

upvoted a collection about 1 year ago

ProLIP

Collection

Official ProLIP weights, Probabilistic Language-Image Pre-Training (ICLR 2025) • 7 items • Updated Apr 18, 2025 • 10

upvoted 3 articles about 1 year ago

Article

How to Expand Your AI Music Generations of 30 Seconds to Several Minutes

Dec 13, 2024

•

3

Article

Simplifying Alignment: From RLHF to Direct Preference Optimization (DPO)

Jan 19, 2025

•

46

Article

Fine-tune ModernBERT for RAG with Synthetic Data

Jan 20, 2025

•

42

Nicky

AI & ML interests

Organizations

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

Introducing AutoRound: Intel’s Advanced Quantization for LLMs and VLMs

Gemma 3n fully available in the open-source ecosystem!

Training and Finetuning Sparse Embedding Models with Sentence Transformers v5

Introducing EuroBERT: A High-Performance Multilingual Encoder Model

Magma: A Foundation Model for Multimodal AI Agents

Open R1: Update #2

wHy DoNt YoU jUsT uSe ThE lLaMa ToKeNiZeR??

RuSentNE-2023: Evaluating Entity-Oriented Sentiment Analysis on Russian News Texts

Open-source DeepResearch – Freeing our search agents

Welcome to Inference Providers on the Hub 🔥

The AI tools for Art Newsletter - Issue 1

The N Implementation Details of RLHF with PPO

PEFT: Parameter-Efficient Fine-Tuning Methods for LLMs

Distributed SFT with trl and DeepSpeed Part 1: Starting Locally

We now support VLMs in smolagents!

ProLIP

How to Expand Your AI Music Generations of 30 Seconds to Several Minutes

Simplifying Alignment: From RLHF to Direct Preference Optimization (DPO)

Fine-tune ModernBERT for RAG with Synthetic Data

Nicky

AI & ML interests

Organizations

NickyNicky's activity

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

Introducing AutoRound: Intel’s Advanced Quantization for LLMs and VLMs

Gemma 3n fully available in the open-source ecosystem!

Training and Finetuning Sparse Embedding Models with Sentence Transformers v5

Introducing EuroBERT: A High-Performance Multilingual Encoder Model

Open R1: Update #2

wHy DoNt YoU jUsT uSe ThE lLaMa ToKeNiZeR??

Open-source DeepResearch – Freeing our search agents

Welcome to Inference Providers on the Hub 🔥

The AI tools for Art Newsletter - Issue 1

The N Implementation Details of RLHF with PPO

PEFT: Parameter-Efficient Fine-Tuning Methods for LLMs

Distributed SFT with trl and DeepSpeed Part 1: Starting Locally

We now support VLMs in smolagents!

How to Expand Your AI Music Generations of 30 Seconds to Several Minutes

Simplifying Alignment: From RLHF to Direct Preference Optimization (DPO)

Fine-tune ModernBERT for RAG with Synthetic Data