Introducing multi-backends (TRT-LLM, vLLM) support for Text Generation Inference
Accelerating SD Turbo and SDXL Turbo Inference with ONNX Runtime and Olive — Jan 15, 2024
AMD + 🤗: Large Language Models Out-of-the-Box Acceleration with AMD GPU — Dec 5, 2023
Optimum-NVIDIA — Unlock blazingly fast LLM inference in just 1 line of code — Dec 5, 2023
Case Study: Millisecond Latency using Hugging Face Infinity and modern CPUs — Jan 13, 2022
Introducing Optimum: The Optimization Toolkit for Transformers at Scale — Sep 14, 2021