Collections

Collections including paper arxiv:2309.02427

Papers - CoT - Chain of Thought

Contrastive Decoding Improves Reasoning in Large Language Models

Paper • 2309.09117 • Published Sep 17, 2023 • 39
Chain-of-Thought Reasoning Without Prompting

Paper • 2402.10200 • Published Feb 15, 2024 • 109
MathVerse: Does Your Multi-modal LLM Truly See the Diagrams in Visual Math Problems?

Paper • 2403.14624 • Published Mar 21, 2024 • 53
Chain of Thought Empowers Transformers to Solve Inherently Serial Problems

Paper • 2402.12875 • Published Feb 20, 2024 • 13

Cognitive Architectures for Language Agents

Paper • 2309.02427 • Published Sep 5, 2023 • 8
Direct Preference Optimization: Your Language Model is Secretly a Reward Model

Paper • 2305.18290 • Published May 29, 2023 • 63
Orca 2: Teaching Small Language Models How to Reason

Paper • 2311.11045 • Published Nov 18, 2023 • 77
Pretraining Data Mixtures Enable Narrow Model Selection Capabilities in Transformer Models

Paper • 2311.00871 • Published Nov 1, 2023 • 3

Augmenting Pre-trained Language Models with QA-Memory for Open-Domain Question Answering

Paper • 2204.04581 • Published Apr 10, 2022 • 1
Retrieval-Augmented Multimodal Language Modeling

Paper • 2211.12561 • Published Nov 22, 2022 • 1
When Not to Trust Language Models: Investigating Effectiveness of Parametric and Non-Parametric Memories

Paper • 2212.10511 • Published Dec 20, 2022 • 1
Memorizing Transformers

Paper • 2203.08913 • Published Mar 16, 2022 • 2

Ada-Instruct: Adapting Instruction Generators for Complex Reasoning

Paper • 2310.04484 • Published Oct 6, 2023 • 5
Diversity of Thought Improves Reasoning Abilities of Large Language Models

Paper • 2310.07088 • Published Oct 11, 2023 • 5
Adapting Large Language Models via Reading Comprehension

Paper • 2309.09530 • Published Sep 18, 2023 • 81
Democratizing Reasoning Ability: Tailored Learning from Large Language Model

Paper • 2310.13332 • Published Oct 20, 2023 • 16

KITAB: Evaluating LLMs on Constraint Satisfaction for Information Retrieval

Paper • 2310.15511 • Published Oct 24, 2023 • 5
ToolChain*: Efficient Action Space Navigation in Large Language Models with A* Search

Paper • 2310.13227 • Published Oct 20, 2023 • 14
Reverse Chain: A Generic-Rule for LLMs to Master Multi-API Planning

Paper • 2310.04474 • Published Oct 6, 2023 • 2
AgentTuning: Enabling Generalized Agent Abilities for LLMs

Paper • 2310.12823 • Published Oct 19, 2023 • 36

Papers - Autonomous Drones

Learning Generalizable Feature Fields for Mobile Manipulation

Paper • 2403.07563 • Published Mar 12, 2024 • 8
Cognitive Architectures for Language Agents

Paper • 2309.02427 • Published Sep 5, 2023 • 8

RA-DIT: Retrieval-Augmented Dual Instruction Tuning

Paper • 2310.01352 • Published Oct 2, 2023 • 7
Self-Consistency Improves Chain of Thought Reasoning in Language Models

Paper • 2203.11171 • Published Mar 21, 2022 • 5
MemGPT: Towards LLMs as Operating Systems

Paper • 2310.08560 • Published Oct 12, 2023 • 8
Take a Step Back: Evoking Reasoning via Abstraction in Large Language Models

Paper • 2310.06117 • Published Oct 9, 2023 • 2

Diversity of Thought Improves Reasoning Abilities of Large Language Models

Paper • 2310.07088 • Published Oct 11, 2023 • 5
Reverse Chain: A Generic-Rule for LLMs to Master Multi-API Planning

Paper • 2310.04474 • Published Oct 6, 2023 • 2
Promptor: A Conversational and Autonomous Prompt Generation Agent for Intelligent Text Entry Techniques

Paper • 2310.08101 • Published Oct 12, 2023 • 2
Instance Needs More Care: Rewriting Prompts for Instances Yields Better Zero-Shot Performance

Paper • 2310.02107 • Published Oct 3, 2023 • 3

Branch-Solve-Merge Improves Large Language Model Evaluation and Generation

Paper • 2310.15123 • Published Oct 23, 2023 • 8
Let's Synthesize Step by Step: Iterative Dataset Synthesis with Large Language Models by Extrapolating Errors from Small Models

Paper • 2310.13671 • Published Oct 20, 2023 • 19
Self-Convinced Prompting: Few-Shot Question Answering with Repeated Introspection

Paper • 2310.05035 • Published Oct 8, 2023 • 1
Chain-of-Thought Reasoning is a Policy Improvement Operator

Paper • 2309.08589 • Published Sep 15, 2023 • 2
