AI & ML interests

Community of researchers interested in OpenBuddy Early Access. Please note that this is not an official group for OpenBuddy, and the members have no affiliation with the OpenBuddy team. Any independent researcher can apply to join by submitting the form available on our GitHub.

raincandy-u 
posted an update 29 days ago
Introducing Rain-v2: Democratizing LLM training on gaming GPUs! ⚡

Following Rain-100M, we’re scaling up. Rain-v2 features a larger training dataset.

We’ve published a comprehensive blog post covering the end-to-end journey, from raw data collection to rigorous evaluation and safety testing.

HF Repo: 🤗 raincandy-u/Rain-v2

Blog: 📚 https://angelkawaii.xyz/2026/01/29/rain-v2/

Special thanks to the open-source community and the SmolLM2 team for their foundational work! 🚀

HuggingFaceTB

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model (2502.02737)
raincandy-u 
posted an update about 1 month ago
🤗 Just released Rain-100M, an experimental ~97M-parameter Qwen3-style language model trained from random initialization.

Repo: raincandy-u/Rain-100M

Data: HuggingFaceFW/fineweb-edu, ~3B tokens, English only

Tokenizer: custom 16k BPE, context length 4096

Architecture: 12 Transformer layers, hidden size 768, 12 heads, MLP 2048, SiLU, bf16
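As a rough sanity check on the "~97M" figure, the listed dimensions can be plugged into a back-of-the-envelope parameter count. This is only a sketch under assumptions that the post does not state: a vocabulary of exactly 16,000 tokens, tied input/output embeddings, full multi-head attention with four hidden-by-hidden projections, a gated (SwiGLU-style) MLP with three weight matrices, two RMSNorm weights per layer plus a final norm, and no biases. The function name `estimate_params` is hypothetical.

```python
def estimate_params(vocab_size=16_000, hidden=768, layers=12, mlp=2048,
                    tie_embeddings=True):
    """Rough parameter count for a decoder-only Transformer.

    All structural choices here are assumptions, not details confirmed
    by the post: full multi-head attention, gated MLP, no biases.
    """
    embed = vocab_size * hidden          # token embedding table
    attn = 4 * hidden * hidden           # q, k, v and output projections
    gated_mlp = 3 * hidden * mlp         # gate, up and down projections
    norms = 2 * hidden                   # two RMSNorm weight vectors per layer
    per_layer = attn + gated_mlp + norms
    final_norm = hidden
    # An untied output head would add a second vocab_size x hidden matrix.
    lm_head = 0 if tie_embeddings else vocab_size * hidden
    return embed + layers * per_layer + final_norm + lm_head


if __name__ == "__main__":
    total = estimate_params()
    print(f"{total / 1e6:.1f}M parameters")  # prints "97.2M parameters"
```

Under these assumptions the estimate lands close to the stated ~97M. A different attention layout (for example grouped-query attention) or an untied output head would shift the total.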


Rain-100M is a raw base model (not instruction-tuned or safety-aligned), aimed at small-scale research, debugging training pipelines, and CPU/edge experiments. If you run evaluations, finetunes, or visualizations with it, I would be very interested in your results!
AtAndDev 
posted an update 7 months ago
Qwen 3 Coder is a personal attack on K2, and I love it.
It achieves near-SOTA on LCB without reasoning.
Finally people are understanding that reasoning isn't necessary for high benches...

Qwen ftw!

DECENTRALIZE DECENTRALIZE DECENTRALIZE
AtAndDev 
posted an update 9 months ago
deepseek-ai/DeepSeek-R1-0528

This is the end
AtAndDev 
posted an update 11 months ago
Llama 4 is out...
AtAndDev 
posted an update 12 months ago
There seem to be multiple paid apps shared here that are based on models on HF, but some ppl sell their wrappers as "products" and promote them here. For a long time, HF was the best and only platform for OSS model stuff, but with the recent AI website builders anyone can create a product (really crappy ones btw) and try to sell it with no contribution to OSS. Please don't do this, or at least try finetuning the models you use...
Sorry for filling y'all's feed with this bs but yk...
AtAndDev 
posted an update 12 months ago
Gemma 3 seems to be really good at human preference. Just waiting for ppl to see it.
AtAndDev 
posted an update about 1 year ago
everywhere i go i see his face
AtAndDev 
posted an update about 1 year ago
Deepseek gang on fire fr fr
AtAndDev 
posted an update about 1 year ago
R1 is out! And with a lot of other R1-related models...
AtAndDev 
posted an update about 1 year ago
@s3nh Hey man check your discord! Got some news.
Niansuh 
posted an update over 1 year ago
raincandy-u 
posted an update over 1 year ago
🤗 I trained what is probably the smallest (~600K parameters) TinyStories model! It really can write grammatically correct stories!

raincandy-u/TinyStories-656K

Try this space based on this minuscule model!

https://huggingface.co/spaces/raincandy-u/Story-Teller

Edit: Moreover, the model weight size is only 1.31MB under bf16, and can be reduced to the 700KB level when using Q8_0 quantization U•ェ•*U
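The two sizes quoted above are consistent with simple arithmetic, sketched here under assumptions: a parameter count of exactly 656,000 (taken from the model name), every weight quantized, and the usual GGML Q8_0 layout of 32 int8 values plus one 2-byte fp16 scale per block. Real files also carry metadata and may keep some tensors at higher precision, so this is only an estimate; the helper names are hypothetical.

```python
def bf16_bytes(n_params):
    """Size of the weights stored as bf16 (2 bytes per parameter)."""
    return n_params * 2


def q8_0_bytes(n_params, block_size=32):
    """Approximate size under Q8_0-style block quantization.

    Assumes each block holds `block_size` int8 values plus one 2-byte
    fp16 scale, i.e. 34 bytes per 32 weights.
    """
    n_blocks = -(-n_params // block_size)  # ceiling division
    return n_blocks * (block_size + 2)


if __name__ == "__main__":
    n = 656_000  # assumption: taken from the "656K" in the model name
    print(f"bf16: {bf16_bytes(n) / 1e6:.2f} MB")  # prints "bf16: 1.31 MB"
    print(f"Q8_0: {q8_0_bytes(n) / 1e3:.0f} KB")  # prints "Q8_0: 697 KB"
```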

Edit: Now 1000K params chat model!

raincandy-u/TinyChat-1776K
Niansuh 
posted an update almost 2 years ago
**Model Names:** gpt-4-turbo-preview, gpt-4-vision-preview, gpt-3.5-turbo-16k
**Searchable Models:** Creative, Balanced, Precise

Image creation will be available soon in NiansuhAI.
**Model Name:** DALL-E 3

https://huggingface.co/spaces/NiansuhAI/LLMs1
---
Niansuh 
posted an update almost 2 years ago
raincandy-u 
posted an update almost 2 years ago
First post, thanks HF! 🤗

Here is a Claude 3 Sonnet generated dataset using prompts from WildChat:

raincandy-u/claudy-chat-5k
ff670 
updated a Space about 2 years ago