davanstrien HF Staff Claude Opus 4.6 (1M context) commited on
Commit
12683d4
·
1 Parent(s): 4f40a48

Fix: use 'document' prompt for index embeddings, not 'query'

Browse files

Qwen3-Embedding-0.6B has separate prompt templates for queries vs
documents. Documents should use 'document' prompt at index time,
'query' prompt at search time. Was using 'query' for both.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Files changed (1) hide show
  1. build_embeddings.py +2 -2
build_embeddings.py CHANGED
@@ -88,7 +88,7 @@ def embed_datasets(model, dataset_source):
88
  summaries,
89
  batch_size=BATCH_SIZE,
90
  show_progress_bar=True,
91
- prompt_name="query",
92
  )
93
  logger.info("Embeddings computed")
94
 
@@ -125,7 +125,7 @@ def embed_models(model, model_source):
125
  summaries,
126
  batch_size=BATCH_SIZE,
127
  show_progress_bar=True,
128
- prompt_name="query",
129
  )
130
  logger.info("Embeddings computed")
131
 
 
88
  summaries,
89
  batch_size=BATCH_SIZE,
90
  show_progress_bar=True,
91
+ prompt_name="document",
92
  )
93
  logger.info("Embeddings computed")
94
 
 
125
  summaries,
126
  batch_size=BATCH_SIZE,
127
  show_progress_bar=True,
128
+ prompt_name="document",
129
  )
130
  logger.info("Embeddings computed")
131