-
OpenChat: Advancing Open-source Language Models with Mixed-Quality Data
Paper ⢠2309.11235 ⢠Published ⢠15 -
Orca 2: Teaching Small Language Models How to Reason
Paper ⢠2311.11045 ⢠Published ⢠77 -
MetaMath: Bootstrap Your Own Mathematical Questions for Large Language Models
Paper ⢠2309.12284 ⢠Published ⢠18
Collections
Discover the best community collections!
Collections including paper arxiv:2309.11235
-
SOLAR 10.7B: Scaling Large Language Models with Simple yet Effective Depth Up-Scaling
Paper ⢠2312.15166 ⢠Published ⢠60 -
PowerInfer: Fast Large Language Model Serving with a Consumer-grade GPU
Paper ⢠2312.12456 ⢠Published ⢠44 -
Cached Transformers: Improving Transformers with Differentiable Memory Cache
Paper ⢠2312.12742 ⢠Published ⢠14 -
Mini-GPTs: Efficient Large Language Models through Contextual Pruning
Paper ⢠2312.12682 ⢠Published ⢠10
-
Understanding LLMs: A Comprehensive Overview from Training to Inference
Paper ⢠2401.02038 ⢠Published ⢠65 -
Learning To Teach Large Language Models Logical Reasoning
Paper ⢠2310.09158 ⢠Published ⢠1 -
ChipNeMo: Domain-Adapted LLMs for Chip Design
Paper ⢠2311.00176 ⢠Published ⢠9 -
WizardMath: Empowering Mathematical Reasoning for Large Language Models via Reinforced Evol-Instruct
Paper ⢠2308.09583 ⢠Published ⢠7
-
Moral Foundations of Large Language Models
Paper ⢠2310.15337 ⢠Published ⢠1 -
Specific versus General Principles for Constitutional AI
Paper ⢠2310.13798 ⢠Published ⢠3 -
Contrastive Prefence Learning: Learning from Human Feedback without RL
Paper ⢠2310.13639 ⢠Published ⢠25 -
RLAIF: Scaling Reinforcement Learning from Human Feedback with AI Feedback
Paper ⢠2309.00267 ⢠Published ⢠52
-
Mistral 7B
Paper ⢠2310.06825 ⢠Published ⢠55 -
Llama 2: Open Foundation and Fine-Tuned Chat Models
Paper ⢠2307.09288 ⢠Published ⢠247 -
OpenChat: Advancing Open-source Language Models with Mixed-Quality Data
Paper ⢠2309.11235 ⢠Published ⢠15 -
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
Paper ⢠2501.12948 ⢠Published ⢠430
-
deepseek-ai/deepseek-coder-6.7b-instruct
Text Generation ⢠7B ⢠Updated ⢠46.5k ⢠459 -
OpenChat: Advancing Open-source Language Models with Mixed-Quality Data
Paper ⢠2309.11235 ⢠Published ⢠15 -
openchat/openchat-3.5-1210
Text Generation ⢠7B ⢠Updated ⢠641 ⢠278 -
File Research
š
-
S-LoRA: Serving Thousands of Concurrent LoRA Adapters
Paper ⢠2311.03285 ⢠Published ⢠32 -
Tailoring Self-Rationalizers with Multi-Reward Distillation
Paper ⢠2311.02805 ⢠Published ⢠7 -
Ultra-Long Sequence Distributed Transformer
Paper ⢠2311.02382 ⢠Published ⢠6 -
OpenChat: Advancing Open-source Language Models with Mixed-Quality Data
Paper ⢠2309.11235 ⢠Published ⢠15
-
Ensemble-Instruct: Generating Instruction-Tuning Data with a Heterogeneous Mixture of LMs
Paper ⢠2310.13961 ⢠Published ⢠5 -
Fabricator: An Open Source Toolkit for Generating Labeled Training Data with Teacher LLMs
Paper ⢠2309.09582 ⢠Published ⢠4 -
Auto-Instruct: Automatic Instruction Generation and Ranking for Black-Box Language Models
Paper ⢠2310.13127 ⢠Published ⢠12 -
Evaluating the Robustness to Instructions of Large Language Models
Paper ⢠2308.14306 ⢠Published ⢠1
-
TheBloke/Llama-2-7B-Chat-GGML
Text Generation ⢠Updated ⢠542 ⢠872 -
uonlp/CulturaX
Viewer ⢠Updated ⢠7.18B ⢠9.25k ⢠555 -
OpenChat: Advancing Open-source Language Models with Mixed-Quality Data
Paper ⢠2309.11235 ⢠Published ⢠15 -
Self-Instruct: Aligning Language Model with Self Generated Instructions
Paper ⢠2212.10560 ⢠Published ⢠9
-
OpenChat: Advancing Open-source Language Models with Mixed-Quality Data
Paper ⢠2309.11235 ⢠Published ⢠15 -
Orca 2: Teaching Small Language Models How to Reason
Paper ⢠2311.11045 ⢠Published ⢠77 -
MetaMath: Bootstrap Your Own Mathematical Questions for Large Language Models
Paper ⢠2309.12284 ⢠Published ⢠18
-
Mistral 7B
Paper ⢠2310.06825 ⢠Published ⢠55 -
Llama 2: Open Foundation and Fine-Tuned Chat Models
Paper ⢠2307.09288 ⢠Published ⢠247 -
OpenChat: Advancing Open-source Language Models with Mixed-Quality Data
Paper ⢠2309.11235 ⢠Published ⢠15 -
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
Paper ⢠2501.12948 ⢠Published ⢠430
-
SOLAR 10.7B: Scaling Large Language Models with Simple yet Effective Depth Up-Scaling
Paper ⢠2312.15166 ⢠Published ⢠60 -
PowerInfer: Fast Large Language Model Serving with a Consumer-grade GPU
Paper ⢠2312.12456 ⢠Published ⢠44 -
Cached Transformers: Improving Transformers with Differentiable Memory Cache
Paper ⢠2312.12742 ⢠Published ⢠14 -
Mini-GPTs: Efficient Large Language Models through Contextual Pruning
Paper ⢠2312.12682 ⢠Published ⢠10
-
deepseek-ai/deepseek-coder-6.7b-instruct
Text Generation ⢠7B ⢠Updated ⢠46.5k ⢠459 -
OpenChat: Advancing Open-source Language Models with Mixed-Quality Data
Paper ⢠2309.11235 ⢠Published ⢠15 -
openchat/openchat-3.5-1210
Text Generation ⢠7B ⢠Updated ⢠641 ⢠278 -
File Research
š
-
S-LoRA: Serving Thousands of Concurrent LoRA Adapters
Paper ⢠2311.03285 ⢠Published ⢠32 -
Tailoring Self-Rationalizers with Multi-Reward Distillation
Paper ⢠2311.02805 ⢠Published ⢠7 -
Ultra-Long Sequence Distributed Transformer
Paper ⢠2311.02382 ⢠Published ⢠6 -
OpenChat: Advancing Open-source Language Models with Mixed-Quality Data
Paper ⢠2309.11235 ⢠Published ⢠15
-
Understanding LLMs: A Comprehensive Overview from Training to Inference
Paper ⢠2401.02038 ⢠Published ⢠65 -
Learning To Teach Large Language Models Logical Reasoning
Paper ⢠2310.09158 ⢠Published ⢠1 -
ChipNeMo: Domain-Adapted LLMs for Chip Design
Paper ⢠2311.00176 ⢠Published ⢠9 -
WizardMath: Empowering Mathematical Reasoning for Large Language Models via Reinforced Evol-Instruct
Paper ⢠2308.09583 ⢠Published ⢠7
-
Ensemble-Instruct: Generating Instruction-Tuning Data with a Heterogeneous Mixture of LMs
Paper ⢠2310.13961 ⢠Published ⢠5 -
Fabricator: An Open Source Toolkit for Generating Labeled Training Data with Teacher LLMs
Paper ⢠2309.09582 ⢠Published ⢠4 -
Auto-Instruct: Automatic Instruction Generation and Ranking for Black-Box Language Models
Paper ⢠2310.13127 ⢠Published ⢠12 -
Evaluating the Robustness to Instructions of Large Language Models
Paper ⢠2308.14306 ⢠Published ⢠1
-
Moral Foundations of Large Language Models
Paper ⢠2310.15337 ⢠Published ⢠1 -
Specific versus General Principles for Constitutional AI
Paper ⢠2310.13798 ⢠Published ⢠3 -
Contrastive Prefence Learning: Learning from Human Feedback without RL
Paper ⢠2310.13639 ⢠Published ⢠25 -
RLAIF: Scaling Reinforcement Learning from Human Feedback with AI Feedback
Paper ⢠2309.00267 ⢠Published ⢠52
-
TheBloke/Llama-2-7B-Chat-GGML
Text Generation ⢠Updated ⢠542 ⢠872 -
uonlp/CulturaX
Viewer ⢠Updated ⢠7.18B ⢠9.25k ⢠555 -
OpenChat: Advancing Open-source Language Models with Mixed-Quality Data
Paper ⢠2309.11235 ⢠Published ⢠15 -
Self-Instruct: Aligning Language Model with Self Generated Instructions
Paper ⢠2212.10560 ⢠Published ⢠9