caution!

To make the model think, you must load this model's specific system prompt with the --jinja option.
Using this option requires updating to llama.cpp b4524 or later.
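
As a rough illustration of the note above, the sketch below checks a llama.cpp build tag against the b4524 minimum and assembles a command line that passes `--jinja`. The binary name `llama-cli`, the helper names, and the model path are assumptions made for this example; only the `--jinja` flag and the b4524 threshold come from this card.

```python
import re
import shlex

# From the caution above: --jinja needs llama.cpp b4524 or later.
MIN_JINJA_BUILD = 4524


def parse_build_number(tag: str) -> int:
    """Extract the integer build number from a tag such as 'b4524' or 'llama.cpp-b4524'."""
    match = re.search(r"b(\d+)$", tag.strip())
    if match is None:
        raise ValueError(f"unrecognized build tag: {tag!r}")
    return int(match.group(1))


def supports_jinja(tag: str) -> bool:
    """Return True when the given build is new enough for the --jinja option."""
    return parse_build_number(tag) >= MIN_JINJA_BUILD


def build_chat_command(model_path: str, binary: str = "llama-cli") -> list[str]:
    """Assemble an argument list; the binary name is an assumption, not stated in this card."""
    # -m selects the GGUF file; --jinja enables the Jinja chat template handling.
    return [binary, "-m", model_path, "--jinja"]


if __name__ == "__main__":
    # Hypothetical path, for illustration only.
    print(shlex.join(build_chat_command("path/to/model.gguf")))
```

The argument list can be handed to `subprocess.run` once the build check passes; nothing is executed here, so the sketch is safe to run without llama.cpp installed.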

What is this?

This is a GGUF-format conversion of karakuri-lm-32b-thinking-2501-exp, a Japanese fine-tune of QwQ-32B-Preview by KARAKURI Inc.

imatrix dataset

To prioritize Japanese capability, the TFMC/imatrix-dataset-for-japanese-llm dataset, which contains a large amount of Japanese text, was used.
In addition, because the CUDA build of llama.cpp now supports bfloat16, the imatrix was computed with the BF16 model, which is the model's original numerical precision.
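
The imatrix step described above could be driven along the following lines. This is a minimal sketch that only builds the argument list: the `llama-imatrix` binary name, the file names, and the GPU layer count are assumptions for illustration, and the card does not state the exact command that was used.

```python
import shlex
from typing import Optional


def build_imatrix_command(
    bf16_model: str,
    calibration_text: str,
    output: str = "imatrix.dat",
    gpu_layers: Optional[int] = None,
    binary: str = "llama-imatrix",  # assumed tool name, not stated in this card
) -> list[str]:
    """Assemble a command that computes an importance matrix from a BF16 GGUF model."""
    # -m: model to evaluate, -f: calibration text file, -o: where to write the imatrix.
    cmd = [binary, "-m", bf16_model, "-f", calibration_text, "-o", output]
    if gpu_layers is not None:
        # -ngl offloads that many layers to the GPU (relevant for the CUDA build).
        cmd += ["-ngl", str(gpu_layers)]
    return cmd


if __name__ == "__main__":
    # Hypothetical file names; the calibration text would come from the
    # TFMC/imatrix-dataset-for-japanese-llm dataset named above.
    print(shlex.join(build_imatrix_command("model-bf16.gguf", "calibration.txt", gpu_layers=99)))
```

Keeping the model at BF16 for this step means the activation statistics are collected at the original precision rather than on an already-quantized model.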

Environment

Quantization was performed with the Windows build of llama.cpp b4514 and the convert-hf-to-gguf.py script released alongside llama.cpp b4524.
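
A conversion and quantization pass of the kind described above might look like the sketch below. It only assembles argument lists. The `llama-quantize` binary name, the `Q4_K_M` quantization type, and all paths are assumptions for illustration. The conversion script name is a parameter because its spelling differs between llama.cpp versions (hyphens in older trees, underscores in newer ones).

```python
import shlex
from typing import Optional


def build_convert_command(
    hf_model_dir: str,
    out_gguf: str,
    convert_script: str = "convert_hf_to_gguf.py",  # spelling varies by llama.cpp version
    python: str = "python",
) -> list[str]:
    """Assemble a command that converts a Hugging Face checkpoint to a BF16 GGUF file."""
    return [python, convert_script, hf_model_dir, "--outtype", "bf16", "--outfile", out_gguf]


def build_quantize_command(
    src_gguf: str,
    dst_gguf: str,
    quant_type: str = "Q4_K_M",  # assumed 4-bit type; the card only says "4-bit"
    imatrix: Optional[str] = None,
    binary: str = "llama-quantize",  # assumed tool name
) -> list[str]:
    """Assemble a quantization command, optionally guided by an importance matrix."""
    cmd = [binary]
    if imatrix is not None:
        cmd += ["--imatrix", imatrix]
    # Positional arguments: input GGUF, output GGUF, then the quantization type.
    cmd += [src_gguf, dst_gguf, quant_type]
    return cmd


if __name__ == "__main__":
    # Hypothetical paths, for illustration only.
    print(shlex.join(build_convert_command("path/to/hf-model", "model-bf16.gguf")))
    print(shlex.join(build_quantize_command("model-bf16.gguf", "model-Q4_K_M.gguf", imatrix="imatrix.dat")))
```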

License

Apache 2.0

Developer

Alibaba Cloud & KARAKURI Inc.

GGUF
Model size: 32.8B params
Architecture: qwen2
Available quantizations: 4-bit, 16-bit
