Leonard Püttmann

puettmann

AI & ML interests

None yet

Recent Activity

liked a model 7 days ago
lerobot/pi0
liked a model 12 days ago
mistralai/Mistral-Small-24B-Base-2501

Organizations

Kern AI GmbH

puettmann's activity

reacted to anakin87's post with 👍 21 days ago
๐๐ž๐ฐ ๐ˆ๐ญ๐š๐ฅ๐ข๐š๐ง ๐’๐ฆ๐š๐ฅ๐ฅ ๐‹๐š๐ง๐ ๐ฎ๐š๐ ๐ž ๐Œ๐จ๐๐ž๐ฅ๐ฌ: ๐†๐ž๐ฆ๐ฆ๐š ๐๐ž๐จ๐ ๐ž๐ง๐ž๐ฌ๐ข๐ฌ ๐œ๐จ๐ฅ๐ฅ๐ž๐œ๐ญ๐ข๐จ๐ง ๐Ÿ’Ž๐ŸŒ๐Ÿ‡ฎ๐Ÿ‡น

I am happy to release two new language models for the Italian language!

💪 Gemma 2 9B Neogenesis ITA
anakin87/gemma-2-9b-neogenesis-ita
Building on the impressive work by VAGO Solutions, I applied Direct Preference Optimization with a mix of Italian and English data.
Using Spectrum, I trained only 20% of the model's layers.
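
For the curious, here is a minimal sketch of what this setup could look like with TRL. It is not the exact recipe (see the training notebook linked below for that): the base model id, dataset name, layer indices, and hyperparameters are all illustrative assumptions.

```python
# A minimal sketch, not the exact training script: freeze all weights, unfreeze
# a Spectrum-selected subset of decoder layers, then run DPO with TRL.
# Model id, dataset name, layer indices, and hyperparameters are assumptions.
import torch
from datasets import load_dataset
from transformers import AutoModelForCausalLM, AutoTokenizer
from trl import DPOConfig, DPOTrainer

model_id = "VAGOsolutions/SauerkrautLM-gemma-2-9b-it"  # assumed VAGO Solutions base
model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype=torch.bfloat16)
tokenizer = AutoTokenizer.from_pretrained(model_id)

# Spectrum ranks layers by signal-to-noise ratio and keeps the top ones trainable.
# Stand-in indices covering roughly 20% of Gemma 2 9B's 42 decoder layers:
trainable_layers = {0, 5, 10, 15, 20, 25, 30, 35}
for name, param in model.named_parameters():
    param.requires_grad = any(f"layers.{i}." in name for i in trainable_layers)

# Preference data with "prompt"/"chosen"/"rejected" columns (hypothetical dataset).
dataset = load_dataset("my-org/ita-en-preference-pairs", split="train")

args = DPOConfig(
    output_dir="gemma-2-9b-neogenesis-ita",
    beta=0.1,  # common DPO default, not necessarily the value used here
    per_device_train_batch_size=1,
    gradient_accumulation_steps=8,
)
trainer = DPOTrainer(model=model, args=args, train_dataset=dataset,
                     processing_class=tokenizer)
trainer.train()
```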

📊 Evaluated on the Open ITA LLM leaderboard (mii-llm/open_ita_llm_leaderboard), this model achieves strong performance.
To beat it on this benchmark, you'd need a 27B model 😎


🤏 Gemma 2 2B Neogenesis ITA
anakin87/gemma-2-2b-neogenesis-ita
This smaller variant is fine-tuned from Google's original Gemma 2 2B it (instruction-tuned) model.
Through a combination of Supervised Fine-Tuning and Direct Preference Optimization, I trained 25% of the layers using Spectrum.
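
As a rough illustration of the two-stage recipe, the SFT stage could look like the sketch below, with the DPO stage following as in the 9B sketch above. Again, the dataset name, layer indices, and settings are placeholders, not the actual configuration.

```python
# Sketch of the SFT stage for the 2B variant (the DPO stage would follow, as in
# the 9B sketch above). Dataset name, layer indices, and settings are assumptions.
from datasets import load_dataset
from transformers import AutoModelForCausalLM, AutoTokenizer
from trl import SFTConfig, SFTTrainer

model_id = "google/gemma-2-2b-it"  # the starting checkpoint named in the post
model = AutoModelForCausalLM.from_pretrained(model_id)
tokenizer = AutoTokenizer.from_pretrained(model_id)

# Hypothetical Spectrum selection: roughly 25% of Gemma 2 2B's 26 decoder layers.
trainable_layers = {0, 4, 8, 12, 16, 20, 24}
for name, param in model.named_parameters():
    param.requires_grad = any(f"layers.{i}." in name for i in trainable_layers)

dataset = load_dataset("my-org/ita-instructions", split="train")  # hypothetical

trainer = SFTTrainer(
    model=model,
    args=SFTConfig(output_dir="gemma-2-2b-neogenesis-ita-sft"),
    train_dataset=dataset,
    processing_class=tokenizer,
)
trainer.train()
```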

📈 Compared to the original model, it shows improved Italian proficiency, a good result for its small size.
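
Trying the released 2B model is straightforward with the transformers pipeline; this is standard usage, and the Italian prompt and generation length are just examples.

```python
# Quick spin of the released model with the transformers chat pipeline.
import torch
from transformers import pipeline

pipe = pipeline(
    "text-generation",
    model="anakin87/gemma-2-2b-neogenesis-ita",
    torch_dtype=torch.bfloat16,
    device_map="auto",
)
# "Briefly explain what machine learning is." (Italian prompt, just an example)
messages = [{"role": "user", "content": "Spiegami brevemente cos'è il machine learning."}]
out = pipe(messages, max_new_tokens=200)
print(out[0]["generated_text"][-1]["content"])  # the assistant's reply
```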


Both models were developed during the recent #gemma competition on Kaggle.
📓 Training code: https://www.kaggle.com/code/anakin87/post-training-gemma-for-italian-and-beyond


🙏 Thanks to @FinancialSupport and mii-llm for the help during evaluation.