kernels-community/quantization
License: apache-2.0
quantization/cutlass_w8a8 (branch: main)
1 contributor
History: 2 commits
Latest commit: Sync with vLLM (0da5bf5), danieldk (HF staff), 27 days ago
File                                  Scan  Size       Last commit       Updated
Epilogues.md                          Safe  5.64 kB    Add cutlass_w8a8  2 months ago
common.hpp                            Safe  808 Bytes  Add cutlass_w8a8  2 months ago
scaled_mm_c2x.cu                      Safe  8.59 kB    Sync with vLLM    27 days ago
scaled_mm_c2x.cuh                     Safe  7.6 kB     Sync with vLLM    27 days ago
scaled_mm_c2x_sm75_dispatch.cuh       Safe  5.08 kB    Add cutlass_w8a8  2 months ago
scaled_mm_c2x_sm80_dispatch.cuh       Safe  5.83 kB    Add cutlass_w8a8  2 months ago
scaled_mm_c2x_sm89_fp8_dispatch.cuh   Safe  16.4 kB    Add cutlass_w8a8  2 months ago
scaled_mm_c2x_sm89_int8_dispatch.cuh  Safe  14.9 kB    Add cutlass_w8a8  2 months ago
scaled_mm_c3x.cu                      Safe  3.58 kB    Sync with vLLM    27 days ago
scaled_mm_c3x.cuh                     Safe  5.77 kB    Sync with vLLM    27 days ago
scaled_mm_c3x_sm90_fp8_dispatch.cuh   Safe  3.64 kB    Sync with vLLM    27 days ago
scaled_mm_c3x_sm90_int8_dispatch.cuh  Safe  5.55 kB    Sync with vLLM    27 days ago
scaled_mm_entry.cu                    Safe  8.32 kB    Sync with vLLM    27 days ago