Please use this identifier to cite or link to this item: http://hdl.handle.net/11375/30640
Full metadata record (DC field: value [language])

dc.contributor.advisor:     Nicolici, Nicola
dc.contributor.author:      Pogue, Trevor E.
dc.date.accessioned:        2024-12-16T15:22:48Z
dc.date.available:          2024-12-16T15:22:48Z
dc.date.issued:             2025-06
dc.identifier.uri:          http://hdl.handle.net/11375/30640
dc.description.abstract:    [en_US]
  The field of deep learning has seen increasing breakthroughs and commercial
  adoption in recent years, enabling a wide range of applications including
  image and speech recognition, multimedia generation, information
  summarization, and human-like chatbots. This has led to a growing need for
  hardware that can perform deep learning inference quickly and efficiently,
  a task that increasingly demands massive amounts of computation. To address
  this need, recent years have seen many works on optimizing deep learning
  inference in hardware. Systolic arrays are an efficient class of hardware
  designs and a natural starting point for this application. However, after
  hardware-oriented deep learning model optimizations reach their limits,
  after the known parallelism for executing their compute patterns in
  hardware is exhausted, and after technology scaling slows to a halt, an
  accelerator wall limits further improvement on the implementation side. In
  this thesis, we contribute to this field through an under-explored
  direction by presenting new efficient matrix multiplication algorithms and
  systolic-array hardware architectures for them that increase
  performance-per-area by reducing the workload at the algebraic level: they
  compute the same result from a re-arranged compute pattern that requires
  fewer or cheaper hardware operations. We evaluate our architectures in an
  end-to-end deep learning accelerator, demonstrating their ability to
  increase the performance-per-area of hardware accelerators beyond their
  normal theoretical limits.
dc.language.iso:            en [en_US]
dc.subject:                 hardware [en_US]
dc.subject:                 acceleration [en_US]
dc.subject:                 architecture [en_US]
dc.subject:                 performance [en_US]
dc.subject:                 algorithms [en_US]
dc.subject:                 matrix multiplication [en_US]
dc.subject:                 machine learning [en_US]
dc.subject:                 artificial intelligence [en_US]
dc.title:                   Algebraic Enhancements for Systolic Arrays [en_US]
dc.type:                    Thesis [en_US]
dc.contributor.department:  Electrical and Computer Engineering [en_US]
dc.description.degreetype:  Thesis [en_US]
dc.description.degree:      Doctor of Philosophy (PhD) [en_US]
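
The abstract above describes reducing the arithmetic workload of matrix
multiplication at the algebraic level, so that the same result is computed
from a re-arranged pattern with fewer or cheaper hardware operations. Purely
as an illustration of that idea (not the specific algorithms or architectures
contributed by the thesis), the sketch below uses Winograd's 1968 fast
inner-product rearrangement, a classical algorithm of this kind that trades
roughly half of the multiplications for additions; the function name
winograd_matmul and the use of NumPy are our own illustrative choices.

    import numpy as np

    def winograd_matmul(A, B):
        """Compute C = A @ B via Winograd's 1968 fast inner-product
        rearrangement (illustrative sketch; requires an even inner dim)."""
        m, n = A.shape
        n2, p = B.shape
        assert n == n2 and n % 2 == 0, "inner dimension must match and be even"

        # Per-row and per-column correction terms, each computed once and
        # then reused for every output element in that row/column:
        #   xi[i]  = sum_j A[i, 2j] * A[i, 2j+1]
        #   eta[k] = sum_j B[2j, k] * B[2j+1, k]
        xi = np.einsum('ij,ij->i', A[:, 0::2], A[:, 1::2])
        eta = np.einsum('jk,jk->k', B[0::2, :], B[1::2, :])

        C = np.empty((m, p))
        for i in range(m):
            for k in range(p):
                # n/2 multiplications per output element instead of n; the
                # extra additions are cheaper to implement in hardware.
                s = np.dot(A[i, 0::2] + B[1::2, k], A[i, 1::2] + B[0::2, k])
                C[i, k] = s - xi[i] - eta[k]
        return C

    # Quick self-check against the direct product
    A = np.random.rand(4, 6)
    B = np.random.rand(6, 5)
    assert np.allclose(winograd_matmul(A, B), A @ B)

The appeal of such rearrangements for hardware is that the correction terms
are computed once and amortized across an entire row or column of outputs, so
each output element needs about half as many multipliers; this is the kind of
algebraic performance-per-area gain the abstract refers to.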
Appears in Collections: Open Access Dissertations and Theses

Files in This Item:

File                                   Description    Size     Format
pogue_trevor_e_2024december_phd.pdf    Open Access    2.68 MB  Adobe PDF


Items in MacSphere are protected by copyright, with all rights reserved, unless otherwise indicated.
