Zen Reranker
Zen Reranker is a high-performance reranking model for search and retrieval pipelines. Part of the Zen AI model family by Hanzo AI.
Overview
Zen Reranker is optimized for:
- Retrieval-Augmented Generation (RAG) โ re-score retrieved passages for LLM context
- Search quality improvement โ rerank initial BM25/dense retrieval results
- Cross-lingual retrieval โ strong multilingual performance
- DSO integration โ compatible with Hanzo's Decentralized Semantic Optimization
Quick Start
import torch
from transformers import AutoTokenizer, AutoModelForSequenceClassification
model_name = "zenlm/zen-reranker"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForSequenceClassification.from_pretrained(model_name, torch_dtype=torch.float16)
def rerank(query, passages):
pairs = [[query, p] for p in passages]
inputs = tokenizer(
pairs, padding=True, truncation=True,
max_length=512, return_tensors="pt"
)
with torch.no_grad():
scores = model(**inputs).logits.squeeze(-1)
ranked = sorted(zip(passages, scores.tolist()), key=lambda x: x[1], reverse=True)
return ranked
query = "What is the capital of France?"
passages = ["Paris is the capital of France.", "Berlin is in Germany.", "Madrid is in Spain."]
results = rerank(query, passages)
for passage, score in results:
print(f"{score:.3f}: {passage}")
With sentence-transformers
from sentence_transformers import CrossEncoder
model = CrossEncoder("zenlm/zen-reranker")
scores = model.predict([
["What is the capital of France?", "Paris is the capital of France."],
["What is the capital of France?", "Berlin is in Germany."],
])
Specifications
| Attribute | Value |
|---|---|
| Parameters | 4B |
| Architecture | Qwen3ForSequenceClassification |
| Context | 32,768 tokens |
| Languages | 100+ (multilingual) |
| License | Apache 2.0 |
Use Cases
- RAG pipelines โ rerank retrieved chunks before passing to LLM
- Search engines โ improve document ranking quality
- QA systems โ score answer candidates for relevance
- Semantic deduplication โ score similarity for clustering
Abliteration
Like all Zen models, Zen Reranker is abliterated โ refusal bias has been removed using directional ablation via hanzoai/remove-refusals.
Technique: Refusal in LLMs is mediated by a single direction โ Arditi et al.
Model Family
| Model | Parameters | Use Case |
|---|---|---|
| Zen Nano | 0.6B | Edge AI |
| Zen Scribe | 4B | Writing |
| Zen Pro | 8B | Professional AI |
| Zen Max | 671B MoE | Frontier |
| Zen Reranker | 4B | Retrieval |
| Zen Embedding | โ | Embeddings |
Citation
@misc{zen-reranker-2025,
title={Zen Reranker: High-Performance Neural Reranking},
author={Hanzo AI and Zoo Labs Foundation},
year={2025},
url={https://huggingface.co/zenlm/zen-reranker}
}
Part of the Zen model ecosystem by Hanzo AI (Techstars '17) and Zoo Labs Foundation.
- Downloads last month
- 39