Thierry Herrmann

thierryh

AI & ML interests

deep learning, machine learning

Organizations

None yet

upvoted 2 articles 9 months ago

Article

Sparse Mixture of Experts Language Model from Scratch: Extending makeMoE with Expert Capacity

AviSoori1x

•

Mar 18, 2024

• 14

Article

makeMoE: Implement a Sparse Mixture of Experts Language Model from Scratch

AviSoori1x

•

May 7, 2024

• 121

upvoted 6 articles about 1 year ago

Article

Train 400x faster Static Embedding Models with Sentence Transformers

tomaarsen

•

Jan 15, 2025

• 230

Article

Training and Finetuning Embedding Models with Sentence Transformers

tomaarsen

•

May 28, 2024

• 274

Article

Training and Finetuning Reranker Models with Sentence Transformers

tomaarsen

•

Mar 26, 2025

• 194

Article

Faster Text Generation with Self-Speculative Decoding

ariG23498, melhoushi, pcuenq, reach-vb

•

Nov 20, 2024

• 65

Article

Controlling Language Model Generation with NVIDIA's LogitsProcessorZoo

ariG23498, aerdem4

•

Dec 23, 2024

• 51

Article

Visualize and understand GPU memory in PyTorch

qgallouedec

•

Dec 24, 2024

• 270

upvoted an article over 1 year ago

Article

From PyTorch DDP to Accelerate to Trainer, mastery of distributed training with ease

muellerzr

•

Oct 21, 2022

• 44

Thierry Herrmann

AI & ML interests

Organizations

thierryh's activity

Sparse Mixture of Experts Language Model from Scratch: Extending makeMoE with Expert Capacity

makeMoE: Implement a Sparse Mixture of Experts Language Model from Scratch

Train 400x faster Static Embedding Models with Sentence Transformers

Training and Finetuning Embedding Models with Sentence Transformers

Training and Finetuning Reranker Models with Sentence Transformers

Faster Text Generation with Self-Speculative Decoding

Controlling Language Model Generation with NVIDIA's LogitsProcessorZoo

Visualize and understand GPU memory in PyTorch

From PyTorch DDP to Accelerate to Trainer, mastery of distributed training with ease