Article From Llasa to Llasagna: Finetuning LLaSA to generate Italian speech and other languages • By Steveeeeeeen and 1 other • about 6 hours ago • 13
Lina-Speech: Gated Linear Attention is a Fast and Parameter-Efficient Learner for text-to-speech synthesis • Paper • 2410.23320 • Published Oct 30, 2024 • 8
Article Transformers.js v3: WebGPU support, new models & tasks, and more… • Oct 22, 2024 • 67
Llama3-8B-1.58 Collection A trio of powerful models: fine-tuned from Llama3-8b-Instruct, with BitNet architecture! • 3 items • Updated Sep 14, 2024 • 11
Article Fine-tuning LLMs to 1.58bit: extreme quantization made easy • Sep 18, 2024 • 217
Nemotron 4 340B Collection Nemotron-4: open models for Synthetic Data Generation (SDG). Includes Base, Instruct, and Reward models. • 4 items • Updated 25 days ago • 161
Embedding Model Datasets Collection A curated subset of the datasets that work out of the box with Sentence Transformers: https://huggingface.co/datasets?other=sentence-transformers • 67 items • Updated Jul 3, 2024 • 103
Article LLM Comparison/Test: Llama 3 Instruct 70B + 8B HF/GGUF/EXL2 (20 versions tested and compared!) • By wolfram • Apr 24, 2024 • 61
Gemma release Collection Groups the Gemma models released by the Google team. • 40 items • Updated Dec 13, 2024 • 329
Canonical models Collection This collection lists all the historical (pre-"Hub") canonical model checkpoints, i.e. repos that were not under an org or user namespace. • 68 items • Updated Feb 13, 2024 • 14
SigLIP Collection Contrastive (sigmoid) image-text models from https://arxiv.org/abs/2303.15343 • 10 items • Updated Dec 13, 2024 • 51
Switch-Transformers release Collection This release includes various MoE (Mixture of Experts) models based on the T5 architecture. The base models use from 8 to 256 experts. • 9 items • Updated Dec 13, 2024 • 17