Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up

kernels-community
/

quantization

Model card Files Files and versions Community

1 contributor

History: 25 commits

danieldk's picture

danieldk HF staff

Build (Torch 2.6)

95272c7 11 days ago

build
Build (Torch 2.6) 11 days ago
compressed_tensors
Sync with vLLM 26 days ago
core
Sync with vLLM 26 days ago
cutlass_extensions
Sync with vLLM 26 days ago
cutlass_w8a8
Sync with vLLM 26 days ago
ext-torch
Export `ScalarType`/`scalartypes` 15 days ago
fp8
Sync with vLLM 26 days ago
gptq_marlin
Sync with vLLM 26 days ago
marlin
Add full Marlin support and tests for Marlin/CUTLASS 2 months ago
tests
Add full Marlin support and tests for Marlin/CUTLASS 2 months ago
.gitattributes

1.56 kB

Build 2 months ago
LICENSE

11.4 kB

Add cutlass_w8a8 2 months ago
README.md

181 Bytes

Fixup metadata 2 months ago
build.toml

2.82 kB

Sync with vLLM 26 days ago
dispatch_utils.h

1.49 kB

Add `scaled_(int|fp8)_quant` and `fp8_marlin_gemm` 2 months ago
flake.lock

2.53 kB

Update flake.lock 15 days ago
flake.nix

257 Bytes

Simplify `flake.nix` about 2 months ago
vectorization.cuh

778 Bytes

Sync with vLLM 26 days ago