Chatbot Arena: An Open Platform for Evaluating LLMs by Human Preference Paper • 2403.04132 • Published Mar 7, 2024 • 41
An Unsupervised Method for Estimating Class Separability of Datasets with Application to LLMs Fine-Tuning Paper • 2305.15016 • Published May 24, 2023 • 6
RigorLLM: Resilient Guardrails for Large Language Models against Undesired Content Paper • 2403.13031 • Published Mar 19, 2024 • 3
Q-Zoom: Query-Aware Adaptive Perception for Efficient Multimodal Large Language Models Paper • 2604.06912 • Published 16 days ago • 8
A Frame is Worth One Token: Efficient Generative World Modeling with Delta Tokens Paper • 2604.04913 • Published 18 days ago • 10
Improving Semantic Proximity in Information Retrieval through Cross-Lingual Alignment Paper • 2604.05684 • Published 17 days ago • 9
AgentGL: Towards Agentic Graph Learning with LLMs via Reinforcement Learning Paper • 2604.05846 • Published 17 days ago • 10
Graph-Based Chain-of-Thought Pruning for Reducing Redundant Reflections in Reasoning LLMs Paper • 2604.05643 • Published 17 days ago • 13
Personalized RewardBench: Evaluating Reward Models with Human Aligned Personalization Paper • 2604.07343 • Published 16 days ago • 13
TC-AE: Unlocking Token Capacity for Deep Compression Autoencoders Paper • 2604.07340 • Published 16 days ago • 16
Combee: Scaling Prompt Learning for Self-Improving Language Model Agents Paper • 2604.04247 • Published 19 days ago • 30
MARS: Enabling Autoregressive Models Multi-Token Generation Paper • 2604.07023 • Published 16 days ago • 38
Large Language Models Generate Harmful Content Using a Distinct, Unified Mechanism Paper • 2604.09544 • Published 14 days ago • 6
AgentSwing: Adaptive Parallel Context Management Routing for Long-Horizon Web Agents Paper • 2603.27490 • Published 26 days ago • 17