You need to agree to share your contact information to access this model

This repository is publicly accessible, but you have to accept the conditions to access its files and content.

Log in or Sign Up to review the conditions and access this model content.

πŸ“‘ Model Card

πŸ’»Github Repo β€’ πŸ€—Model Collections

Model Details

This model is a merged version of two Qwen base models:

  • Qwen/Qwen3-4B-Instruct-2507
  • Qwen/Qwen3-4B-Thinking-2507

Notations:

  • Evoluation dataset: openai/gsm8k (subset of 100 samples, not trained)
  • Generation runs: 50
  • Population size: 10
  • This model design for instruct model not reasoning model with same function like Qwen3-Instruct-2507
  • A good start for SFT or GRPO training.

Evaluation

  • For my evaluation in my agent benchmark is not surpass too much but only 3% with instruct model.
  • Surpass openfree/Darwin-Qwen3-4B (Evolution model) and base model in ACEBench.
@misc{nafy_qwen_merge_2025,
  title        = {Merged Qwen3 4B Instruct + Thinking Models},
  author       = {Beyoru},
  year         = {2025},
  howpublished = {\url{https://huggingface.co/beyoru/EvolLLM}},
  note         = {Merged model combining instruction-tuned and reasoning Qwen3 variants.},
  base_models  = {Qwen/Qwen3-4B-Instruct-2507, Qwen/Qwen3-4B-Thinking-2507}
}
Downloads last month
80
Safetensors
Model size
4B params
Tensor type
BF16
Β·
Inference Providers NEW
This model isn't deployed by any Inference Provider. πŸ™‹ Ask for provider support

Model tree for beyoru/EvolLLM

Dataset used to train beyoru/EvolLLM

Collection including beyoru/EvolLLM