Kazuaki Hiraga

kazuakey

AI & ML interests

NLP, Sentiment Analysis, Named Entity Recognition

Recent Activity

liked a model 4 days ago

mistralai/Mistral-Small-24B-Instruct-2501

liked a dataset 5 days ago

FreedomIntelligence/medical-o1-reasoning-SFT

liked a dataset 5 days ago

FreedomIntelligence/medical-o1-verifiable-problem

kazuakey's activity

upvoted a paper 10 days ago

Towards Best Practices for Open Datasets for LLM Training

Paper • 2501.08365 • Published 28 days ago • 54

upvoted a collection 10 days ago

📐 FineMath

FineMath datasets and ablation models • 14 items • Updated Jan 6 • 19

upvoted a collection 13 days ago

TinySwallow

Compact Japanese models trained with "TAID: Temporally Adaptive Interpolated Distillation for Efficient Knowledge Transfer in Language Models" • 5 items • Updated 13 days ago • 13

upvoted 4 papers 15 days ago

Baichuan-Omni-1.5 Technical Report

Paper • 2501.15368 • Published 17 days ago • 54

Qwen2.5-1M Technical Report

Paper • 2501.15383 • Published 17 days ago • 54

Chain-of-Retrieval Augmented Generation

Paper • 2501.14342 • Published 19 days ago • 48

EmbodiedEval: Evaluate Multimodal LLMs as Embodied Agents

Paper • 2501.11858 • Published 22 days ago • 5

upvoted a paper 19 days ago

UI-TARS: Pioneering Automated GUI Interaction with Native Agents

Paper • 2501.12326 • Published 21 days ago • 49

upvoted an article 21 days ago

Train 400x faster Static Embedding Models with Sentence Transformers

Article • 28 days ago • 142

upvoted a paper 22 days ago

Evolving Deeper LLM Thinking

Paper • 2501.09891 • Published 26 days ago • 105

upvoted an article 26 days ago

How to generate text: using different decoding methods for language generation with Transformers

Article • Mar 1, 2020 • 153

upvoted 2 collections 27 days ago

SmolLM2

State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 16 items • Updated 6 days ago • 227

SmolVLM

State-of-the-art compact VLMs for on-device applications: Base, Synthetic, and Instruct • 5 items • Updated Dec 22, 2024 • 33

upvoted an article 3 months ago

Introducing Observers: AI Observability with Hugging Face datasets through a lightweight SDK

Article • Nov 21, 2024 • 35

upvoted an article 4 months ago

Transformers.js v3: WebGPU support, new models & tasks, and more…

Article • Oct 22, 2024 • 67

upvoted a collection 4 months ago

Llama-3.1-Swallow

9 items • Updated 12 days ago • 5

upvoted 2 articles 4 months ago

Model2Vec: Distill a Small Fast Model from any Sentence Transformer

Article • Oct 14, 2024 • 69

Fine-Tuning 1B LLaMA 3.2: A Comprehensive Step-by-Step Guide with Code

Article • Oct 2, 2024 • 46

upvoted a collection 4 months ago

NVLM 1.0

A family of frontier-class multimodal large language models (LLMs) that achieve state-of-the-art results on vision-language tasks and text-only tasks. • 2 items • Updated 26 days ago • 51

upvoted a collection 5 months ago

Phi-3

Phi-3 family of small language and multi-modal models. Language models are available in short- and long-context lengths. • 26 items • Updated Jan 8 • 554