Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Yao Xuesong's picture
1 2 3

Yao Xuesong

NathanYao

AI & ML interests

None yet

Organizations

None yet

authored 2 papers 8 months ago

ToolHop: A Query-Driven Benchmark for Evaluating Large Language Models in Multi-Hop Tool Use

Paper • 2501.02506 • Published Jan 5 • 11

Recitation over Reasoning: How Cutting-Edge Language Models Can Fail on Elementary School-Level Reasoning Problems?

Paper • 2504.00509 • Published Apr 1 • 22
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs