Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up

Collections

Discover the best community collections!

Collections including paper arxiv:2406.04093

Transformer^2: Self-adaptive LLMs

Paper • 2501.06252 • Published Jan 9 • 53
Parameters vs FLOPs: Scaling Laws for Optimal Sparsity for Mixture-of-Experts Language Models

Paper • 2501.12370 • Published 21 days ago • 10
Self-Refine: Iterative Refinement with Self-Feedback

Paper • 2303.17651 • Published Mar 30, 2023 • 2
Probing-RAG: Self-Probing to Guide Language Models in Selective Document Retrieval

Paper • 2410.13339 • Published Oct 17, 2024

Papers - Training - Sparse Learning - k-Sparse Autoencoder

k-Sparse Autoencoders

Paper • 1312.5663 • Published Dec 19, 2013 • 1
Scaling and evaluating sparse autoencoders

Paper • 2406.04093 • Published Jun 6, 2024 • 3

Papers - Text - SAE - Sparse Autoencoders

Do I Know This Entity? Knowledge Awareness and Hallucinations in Language Models

Paper • 2411.14257 • Published Nov 21, 2024 • 11
Scaling and evaluating sparse autoencoders

Paper • 2406.04093 • Published Jun 6, 2024 • 3
Gemma Scope: Open Sparse Autoencoders Everywhere All At Once on Gemma 2

Paper • 2408.05147 • Published Aug 9, 2024 • 39
Disentangling Dense Embeddings with Sparse Autoencoders

Paper • 2408.00657 • Published Aug 1, 2024 • 1

Papers - Training - Scaling Properties

Physics of Language Models: Part 3.3, Knowledge Capacity Scaling Laws

Paper • 2404.05405 • Published Apr 8, 2024 • 10
Scaling Laws for Precision

Paper • 2411.04330 • Published Nov 7, 2024 • 7
Scaling and evaluating sparse autoencoders

Paper • 2406.04093 • Published Jun 6, 2024 • 3

mechanistic interpretability with sparse autoencoders

A collection of papers that I found useful for learning about using Sparse Autoencoders for finding interpretable features in language models

Sparse Autoencoders Find Highly Interpretable Features in Language Models

Paper • 2309.08600 • Published Sep 15, 2023 • 13
Scaling and evaluating sparse autoencoders

Paper • 2406.04093 • Published Jun 6, 2024 • 3
Sparse Feature Circuits: Discovering and Editing Interpretable Causal Graphs in Language Models

Paper • 2403.19647 • Published Mar 28, 2024 • 3
Gemma Scope: Open Sparse Autoencoders Everywhere All At Once on Gemma 2

Paper • 2408.05147 • Published Aug 9, 2024 • 39

Papers - Training

SELF: Language-Driven Self-Evolution for Large Language Model

Paper • 2310.00533 • Published Oct 1, 2023 • 2
GrowLength: Accelerating LLMs Pretraining by Progressively Growing Training Length

Paper • 2310.00576 • Published Oct 1, 2023 • 2
A Pretrainer's Guide to Training Data: Measuring the Effects of Data Age, Domain Coverage, Quality, & Toxicity

Paper • 2305.13169 • Published May 22, 2023 • 3
Transformers Can Achieve Length Generalization But Not Robustly

Paper • 2402.09371 • Published Feb 14, 2024 • 14

Company

TOS Privacy About Jobs

Website

Models Datasets Spaces Pricing Docs