master-thesis-presentation/slides/pim.md at 3d15758c8228fffb2cb3c6e05de4bd457d02421d

derek/master-thesis-presentation

Derek Christ 3d15758c82 Refactor presentation

2024-04-07 21:21:40 +02:00

Processing-in-Memory

Applicable Workloads

Fully connected layers have a large weight matrix
- The weight matrix does not fit into the on-chip cache
- No data reuse in the matrix (each weight is read once per input vector)
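The lack of reuse can be made concrete with a rough arithmetic-intensity estimate. The sketch below is illustrative only: the layer size (4096×4096), the 16-bit value width, and the 2 MiB cache size are assumptions chosen for the example, not figures from the slides.

```python
def gemv_arithmetic_intensity(rows: int, cols: int, bytes_per_value: int = 2) -> float:
    """FLOPs per byte moved for y = W @ x when every weight is read exactly once."""
    flops = 2 * rows * cols  # one multiply and one add per weight
    bytes_moved = bytes_per_value * (rows * cols + cols + rows)  # W, x and y
    return flops / bytes_moved


def weight_bytes(rows: int, cols: int, bytes_per_value: int = 2) -> int:
    """Memory footprint of the weight matrix in bytes."""
    return rows * cols * bytes_per_value


# Hypothetical example values (assumptions, not taken from the presentation).
ROWS, COLS = 4096, 4096
CACHE_BYTES = 2 * 1024 * 1024  # assumed 2 MiB on-chip cache

intensity = gemv_arithmetic_intensity(ROWS, COLS)
fits_in_cache = weight_bytes(ROWS, COLS) <= CACHE_BYTES

print(f"weights: {weight_bytes(ROWS, COLS) / 2**20:.0f} MiB, fits in cache: {fits_in_cache}")
print(f"arithmetic intensity: {intensity:.3f} FLOP/byte")
```

With these assumed numbers the layer performs roughly one floating-point operation per byte fetched, so it is limited by memory bandwidth rather than by compute.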

preload: false
clicks: 1

Processing-in-Memory

Applicable Workloads

Convolutional layers have a small filter matrix
- The filter matrix fits into the on-chip cache
- Extensive data reuse in the matrix
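For comparison, the reuse in a convolutional layer can be estimated the same way. This is a minimal sketch under assumed dimensions (3×3 kernel, 64 input and 64 output channels, 56×56 output feature map, 16-bit weights); none of these numbers come from the slides.

```python
def conv_weight_bytes(k_h: int, k_w: int, c_in: int, c_out: int,
                      bytes_per_value: int = 2) -> int:
    """Memory footprint of all filter weights of one convolutional layer."""
    return k_h * k_w * c_in * c_out * bytes_per_value


def conv_weight_reuse(out_h: int, out_w: int) -> int:
    """Number of times each filter weight is used: once per output position."""
    return out_h * out_w


# Hypothetical layer shape (assumption for illustration only).
filter_bytes = conv_weight_bytes(3, 3, 64, 64)
reuse = conv_weight_reuse(56, 56)

print(f"filter weights: {filter_bytes / 1024:.0f} KiB")
print(f"each weight is reused {reuse} times")
```

Under these assumptions the filters occupy only tens of KiB and every weight is reused thousands of times, so an on-chip cache already serves the workload well.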

Processing-in-Memory

Applicable Workloads

Suitable candidates for PIM:

Multilayer perceptrons (MLPs)
Layers in recurrent neural networks (RNNs)

Unsuitable candidates for PIM:

Convolutional neural networks (CNNs)

Processing-in-Memory

Architectures

Inside the memory subarray
Near the subarray in the PSA output region
Near the bank in its peripheral region
In the I/O region of the memory

The nearer the computation is to the memory cells, the higher the achievable bandwidth!

Sudarshan et al. “A Critical Assessment of DRAM-PIM Architectures - Trends, Challenges and Solutions”, 2022.

Processing-in-Memory

Samsung's PIM-HBM

Real-world PIM implementation based on HBM2
PIM units embedded at the bank level

Lee et al. “Hardware Architecture and Software Stack for PIM Based on Commercial DRAM Technology: Industrial Product”, 2021.

Processing-in-Memory

Samsung's PIM-HBM

Two 16-wide 16-bit FPUs
Register files and control unit
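A 16-wide, 16-bit FPU pair can be modeled in software as a lane-wise multiply followed by a lane-wise add. The sketch below assumes that the two FPUs are one multiplier array and one adder array operating on IEEE 754 half-precision values; the function names are hypothetical and do not correspond to any real PIM instruction set.

```python
import struct

LANES = 16  # SIMD width named on the slide


def to_fp16(x: float) -> float:
    """Round a Python float to the nearest IEEE 754 half-precision value."""
    return struct.unpack("<e", struct.pack("<e", x))[0]


def simd_mul(a: list[float], b: list[float]) -> list[float]:
    """Lane-wise multiply, rounding each product to 16 bits."""
    assert len(a) == len(b) == LANES
    return [to_fp16(x * y) for x, y in zip(a, b)]


def simd_add(a: list[float], b: list[float]) -> list[float]:
    """Lane-wise add, rounding each sum to 16 bits."""
    assert len(a) == len(b) == LANES
    return [to_fp16(x + y) for x, y in zip(a, b)]


def simd_mac(acc: list[float], a: list[float], b: list[float]) -> list[float]:
    """Multiply-accumulate: acc + a * b in every lane."""
    return simd_add(acc, simd_mul(a, b))


acc = simd_mac([0.0] * LANES, [1.0] * LANES, [0.5] * LANES)
print(acc[0], len(acc))
```

The `to_fp16` helper makes the limited precision visible: a value such as 0.1 is not exactly representable in 16 bits and is rounded before it enters a lane.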

Lee et al. “Hardware Architecture and Software Stack for PIM Based on Commercial DRAM Technology: Industrial Product”, 2021.

layout: figure
figureUrl: /gemv.svg
figureCaption: Procedure to perform a (128×8)×(128) GEMV operation

Processing-in-Memory

Samsung's PIM-HBM

Lee et al. “Hardware Architecture and Software Stack for PIM Based on Commercial DRAM Technology: Industrial Product”, 2021.
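Because the figure itself is not reproduced here, the following sketch shows only one plausible way a GEMV can be decomposed into 16-lane multiply-accumulate steps. It assumes the weight matrix has 8 output rows and 128 input columns, that each row is processed in chunks of 16 lanes, and that the 16 partial sums per row are reduced in a final step; the function name and this decomposition are assumptions, not the procedure from the cited paper.

```python
LANES = 16  # assumed SIMD width of one PIM unit


def pim_style_gemv(matrix: list[list[float]], vector: list[float]) -> list[float]:
    """Compute matrix @ vector using lane-wise partial sums and a final reduction."""
    n_in = len(vector)
    assert n_in % LANES == 0, "input length must be a multiple of the lane count"
    result = []
    for row in matrix:
        assert len(row) == n_in
        acc = [0.0] * LANES  # one accumulator per lane
        for start in range(0, n_in, LANES):
            for lane in range(LANES):
                acc[lane] += row[start + lane] * vector[start + lane]
        result.append(sum(acc))  # reduce the lane-wise partial sums
    return result


# Assumed shape: 8 outputs, 128 inputs. Row r is filled with the value r + 1.
weights = [[float(r + 1)] * 128 for r in range(8)]
inputs = [1.0] * 128
outputs = pim_style_gemv(weights, inputs)
print(outputs)
```

The lane-wise accumulators mirror what a SIMD unit can do without cross-lane communication; only the last step needs a reduction across lanes.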

layout: figure
figureUrl: /layout.svg
figureCaption: Mapping of the weight matrix onto the memory banks

Processing-in-Memory

Samsung's PIM-HBM
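The actual layout from the figure cannot be recovered from the text, so the sketch below only illustrates the general idea of spreading a weight matrix over several banks so that their PIM units can work in parallel. The round-robin assignment of rows to banks and the function name are hypothetical and are not the mapping used by PIM-HBM.

```python
def map_rows_to_banks(n_rows: int, n_banks: int) -> dict[int, list[int]]:
    """Assign matrix rows to banks round-robin (hypothetical mapping)."""
    mapping: dict[int, list[int]] = {bank: [] for bank in range(n_banks)}
    for row in range(n_rows):
        mapping[row % n_banks].append(row)
    return mapping


# Assumed example: 8 matrix rows spread over 4 banks.
layout = map_rows_to_banks(8, 4)
for bank, rows in layout.items():
    print(f"bank {bank}: rows {rows}")
```

Any such mapping has to balance the rows evenly, since the slowest bank determines when the whole operation finishes.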

Processing-in-Memory

Research
