-
Moral Foundations of Large Language Models
Paper ā¢ 2310.15337 ā¢ Published ā¢ 1 -
Specific versus General Principles for Constitutional AI
Paper ā¢ 2310.13798 ā¢ Published ā¢ 3 -
Contrastive Prefence Learning: Learning from Human Feedback without RL
Paper ā¢ 2310.13639 ā¢ Published ā¢ 25 -
RLAIF: Scaling Reinforcement Learning from Human Feedback with AI Feedback
Paper ā¢ 2309.00267 ā¢ Published ā¢ 47
Collections
Discover the best community collections!
Collections including paper arxiv:2309.11235
-
SOLAR 10.7B: Scaling Large Language Models with Simple yet Effective Depth Up-Scaling
Paper ā¢ 2312.15166 ā¢ Published ā¢ 57 -
PowerInfer: Fast Large Language Model Serving with a Consumer-grade GPU
Paper ā¢ 2312.12456 ā¢ Published ā¢ 41 -
Cached Transformers: Improving Transformers with Differentiable Memory Cache
Paper ā¢ 2312.12742 ā¢ Published ā¢ 13 -
Mini-GPTs: Efficient Large Language Models through Contextual Pruning
Paper ā¢ 2312.12682 ā¢ Published ā¢ 9
-
OpenChat: Advancing Open-source Language Models with Mixed-Quality Data
Paper ā¢ 2309.11235 ā¢ Published ā¢ 15 -
Orca 2: Teaching Small Language Models How to Reason
Paper ā¢ 2311.11045 ā¢ Published ā¢ 72 -
MetaMath: Bootstrap Your Own Mathematical Questions for Large Language Models
Paper ā¢ 2309.12284 ā¢ Published ā¢ 18
-
S-LoRA: Serving Thousands of Concurrent LoRA Adapters
Paper ā¢ 2311.03285 ā¢ Published ā¢ 29 -
Tailoring Self-Rationalizers with Multi-Reward Distillation
Paper ā¢ 2311.02805 ā¢ Published ā¢ 4 -
Ultra-Long Sequence Distributed Transformer
Paper ā¢ 2311.02382 ā¢ Published ā¢ 3 -
OpenChat: Advancing Open-source Language Models with Mixed-Quality Data
Paper ā¢ 2309.11235 ā¢ Published ā¢ 15
-
Understanding LLMs: A Comprehensive Overview from Training to Inference
Paper ā¢ 2401.02038 ā¢ Published ā¢ 63 -
Learning To Teach Large Language Models Logical Reasoning
Paper ā¢ 2310.09158 ā¢ Published ā¢ 1 -
ChipNeMo: Domain-Adapted LLMs for Chip Design
Paper ā¢ 2311.00176 ā¢ Published ā¢ 9 -
WizardMath: Empowering Mathematical Reasoning for Large Language Models via Reinforced Evol-Instruct
Paper ā¢ 2308.09583 ā¢ Published ā¢ 7
-
Ensemble-Instruct: Generating Instruction-Tuning Data with a Heterogeneous Mixture of LMs
Paper ā¢ 2310.13961 ā¢ Published ā¢ 5 -
Fabricator: An Open Source Toolkit for Generating Labeled Training Data with Teacher LLMs
Paper ā¢ 2309.09582 ā¢ Published ā¢ 4 -
Auto-Instruct: Automatic Instruction Generation and Ranking for Black-Box Language Models
Paper ā¢ 2310.13127 ā¢ Published ā¢ 12 -
Evaluating the Robustness to Instructions of Large Language Models
Paper ā¢ 2308.14306 ā¢ Published ā¢ 1
-
TheBloke/Llama-2-7B-Chat-GGML
Text Generation ā¢ Updated ā¢ 2.66k ā¢ 866 -
uonlp/CulturaX
Viewer ā¢ Updated ā¢ 7.18B ā¢ 18.5k ā¢ 492 -
OpenChat: Advancing Open-source Language Models with Mixed-Quality Data
Paper ā¢ 2309.11235 ā¢ Published ā¢ 15 -
Self-Instruct: Aligning Language Model with Self Generated Instructions
Paper ā¢ 2212.10560 ā¢ Published ā¢ 9
-
OpenChat: Advancing Open-source Language Models with Mixed-Quality Data
Paper ā¢ 2309.11235 ā¢ Published ā¢ 15 -
openchat/openchat-3.5-0106
Text Generation ā¢ Updated ā¢ 30.8k ā¢ 351 -
openchat/openchat-3.5-1210
Text Generation ā¢ Updated ā¢ 2.05k ā¢ 274 -
openchat/openchat_3.5
Text Generation ā¢ Updated ā¢ 45.1k ā¢ 1.12k