beyoru
/

EvolLLM

Text Generation

text-generation-inference

Model card Files Files and versions

You need to agree to share your contact information to access this model

This repository is publicly accessible, but you have to accept the conditions to access its files and content.

Log in or Sign Up to review the conditions and access this model content.

📑 Model Card

💻Github Repo • 🤗Model Collections

Model Details

This model is a merged version of two Qwen base models:

Qwen/Qwen3-4B-Instruct-2507
Qwen/Qwen3-4B-Thinking-2507

Notations:

Evoluation dataset: openai/gsm8k (subset of 100 samples, not trained)
Generation runs: 50
Population size: 10
This model design for instruct model not reasoning model with same function like Qwen3-Instruct-2507
A good start for SFT or GRPO training.

Evaluation

For my evaluation in my agent benchmark is not surpass too much but only 3% with instruct model.
Surpass openfree/Darwin-Qwen3-4B (Evolution model) and base model in ACEBench.

@misc{nafy_qwen_merge_2025,
  title        = {Merged Qwen3 4B Instruct + Thinking Models},
  author       = {Beyoru},
  year         = {2025},
  howpublished = {\url{https://huggingface.co/beyoru/EvolLLM}},
  note         = {Merged model combining instruction-tuned and reasoning Qwen3 variants.},
  base_models  = {Qwen/Qwen3-4B-Instruct-2507, Qwen/Qwen3-4B-Thinking-2507}
}

Downloads last month: 80

Safetensors

Model size

4B params

Tensor type

BF16

·

Model tree for beyoru/EvolLLM

Qwen/Qwen3-4B-Instruct-2507

Qwen/Qwen3-4B-Thinking-2507

Merge model

this model

Finetunes

Dataset used to train beyoru/EvolLLM

Collection including beyoru/EvolLLM

Evolution Model

An evolution merge model • 5 items • Updated 29 days ago