Reasoning
updated
Chain-of-Knowledge: Integrating Knowledge Reasoning into Large Language
Models by Learning from Knowledge Graphs
Paper
• 2407.00653
• Published
• 13
Step-DPO: Step-wise Preference Optimization for Long-chain Reasoning of
LLMs
Paper
• 2406.18629
• Published
• 42
Whiteboard-of-Thought: Thinking Step-by-Step Across Modalities
Paper
• 2406.14562
• Published
• 28
Buffer of Thoughts: Thought-Augmented Reasoning with Large Language
Models
Paper
• 2406.04271
• Published
• 29
Iterative Reasoning Preference Optimization
Paper
• 2404.19733
• Published
• 49
FlowMind: Automatic Workflow Generation with LLMs
Paper
• 2404.13050
• Published
• 34
Cognitive Map for Language Models: Optimal Planning via Verbally
Representing the World Model
Paper
• 2406.15275
• Published
• 12
Learn Beyond The Answer: Training Language Models with Reflection for
Mathematical Reasoning
Paper
• 2406.12050
• Published
• 19
Improve Mathematical Reasoning in Language Models by Automated Process
Supervision
Paper
• 2406.06592
• Published
• 29
Transformers meet Neural Algorithmic Reasoners
Paper
• 2406.09308
• Published
• 44
Test of Time: A Benchmark for Evaluating LLMs on Temporal Reasoning
Paper
• 2406.09170
• Published
• 27
To Compress or Not to Compress- Self-Supervised Learning and Information
Theory: A Review
Paper
• 2304.09355
• Published
• 6
Towards Building Specialized Generalist AI with System 1 and System 2
Fusion
Paper
• 2407.08642
• Published
• 11
Teaching Large Language Models to Reason with Reinforcement Learning
Paper
• 2403.04642
• Published
• 49
Chain-of-Thought Reasoning Without Prompting
Paper
• 2402.10200
• Published
• 109
Self-Discover: Large Language Models Self-Compose Reasoning Structures
Paper
• 2402.03620
• Published
• 117
Skywork-Math: Data Scaling Laws for Mathematical Reasoning in Large
Language Models -- The Story Goes On
Paper
• 2407.08348
• Published
• 52
Case2Code: Learning Inductive Reasoning with Synthetic Data
Paper
• 2407.12504
• Published
• 8
Internal Consistency and Self-Feedback in Large Language Models: A
Survey
Paper
• 2407.14507
• Published
• 46
CoD, Towards an Interpretable Medical Agent using Chain of Diagnosis
Paper
• 2407.13301
• Published
• 55
Self-Training with Direct Preference Optimization Improves
Chain-of-Thought Reasoning
Paper
• 2407.18248
• Published
• 33
Mutual Reasoning Makes Smaller LLMs Stronger Problem-Solvers
Paper
• 2408.06195
• Published
• 73
On the Diagram of Thought
Paper
• 2409.10038
• Published
• 13
Not All LLM Reasoners Are Created Equal
Paper
• 2410.01748
• Published
• 29