Skip to content

Commit f22086d

Browse files
authored
Improved the excerpts on blog posts (#1828)
Signed-off-by: Chris Abraham <[email protected]>
1 parent a4df587 commit f22086d

4 files changed

+4
-0
lines changed

_posts/2024-10-28-unleashing-ai-mobile.md

+1
Original file line numberDiff line numberDiff line change
@@ -2,6 +2,7 @@
22
layout: blog_detail
33
title: "Unleashing the Power of AI on Mobile: LLM Inference for Llama 3.2 Quantized Models with ExecuTorch and KleidiAI"
44
author: Gian Marco Iodice, Arm and Digant Desai, Meta
5+
excerpt: "At the recent PyTorch Conference, Arm highlighted the widespread impact of its technology, spanning from cloud to edge, emphasizing its commitment to delivering its advanced AI computing capabilities seamlessly to millions of developers worldwide."
56
---
67

78
## Introduction

_posts/2024-11-01-cutlass-ping-pong-gemm-kernel.md

+1
Original file line numberDiff line numberDiff line change
@@ -2,6 +2,7 @@
22
layout: blog_detail
33
title: "Deep Dive on CUTLASS Ping-Pong GEMM Kernel"
44
author: Less Wright, Adnan Hoque
5+
excerpt: "In this post, we provide an overview, with relevant FP8 inference kernel benchmarking, of the CUTLASS Ping-Pong GEMM kernel."
56
---
67

78
![Figure 1. FP8 GEMM Throughput Comparison CUTLASS vs Triton](/assets/images/cutlass-ping-pong-gemm-kernel/fg1.png){:style="width:100%"}

_posts/2024-11-21-rebellions.md

+1
Original file line numberDiff line numberDiff line change
@@ -1,6 +1,7 @@
11
---
22
layout: blog_detail
33
title: "Rebellions Joins the PyTorch Foundation as a General Member"
4+
excerpt: "The PyTorch Foundation, a neutral home for the deep learning community to collaborate on the open source PyTorch framework and ecosystem, is announcing today that Rebellions has joined as a general member."
45
---
56

67
![Rebellions logo](/assets/images/rebellions-logo.svg){:style="max-width:350px;width:100%;float:right;margin: 20px;"}

_posts/2024-11-25-training-using-float8-fsdp2.md

+1
Original file line numberDiff line numberDiff line change
@@ -2,6 +2,7 @@
22
layout: blog_detail
33
title: "Supercharging Training using float8 and FSDP2"
44
author: "IBM and Meta"
5+
excerpt: "In this blog, we will demonstrate how we achieve up to 50% throughput speedup while achieving loss and evaluation benchmark parity in training over FSDP1 bf16 training"
56
---
67

78
**IBM**: Tuan Hoang Trong, Alexei Karve, Yan Koyfman, Linsong Chu, Divya Kumari, Shweta Salaria, Robert Walkup, Praneet Adusumilli, Nirmit Desai, Raghu Ganti, Seetharami Seelam

0 commit comments

Comments
 (0)