Haihao Shen
Haihao
·
AI & ML interests
LLM quantization, sparsity, and acceleration
Recent Activity
Organizations
Haihao's activity
-
-
-
-
-
-
-
-
-
-
-
view article
Building Cost-Efficient Enterprise RAG applications with Intel Gaudi 2 and Intel Xeon
published
an
article
about 1 year ago
view article
Accelerate StarCoder with 🤗 Optimum Intel on Xeon: Q8/Q4 and Speculative Decoding