view article Article DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge Feb 7 • 253
view article Article Introducing AutoRound: Intel’s Advanced Quantization for LLMs and VLMs +7 Apr 29 • 43
view article Article Training and Finetuning Sparse Embedding Models with Sentence Transformers v5 Jul 1 • 130
view article Article Introducing EuroBERT: A High-Performance Multilingual Encoder Model Mar 10 • 146
RuSentNE-2023: Evaluating Entity-Oriented Sentiment Analysis on Russian News Texts Paper • 2305.17679 • Published May 28, 2023 • 2
ProLIP Collection Official ProLIP weights, Probabilistic Language-Image Pre-Training (ICLR 2025) • 7 items • Updated Apr 18 • 10
view article Article How to Expand Your AI Music Generations of 30 Seconds to Several Minutes Dec 13, 2024 • 3
view article Article Simplifying Alignment: From RLHF to Direct Preference Optimization (DPO) Jan 19 • 37