Add complete matrix memory layout example figure
This commit is contained in:
@@ -329,7 +329,8 @@ Those matrix row blocks possibly span over multiple \ac{dram} rows or even other
|
||||
% Once all banks have been accessed, the mapping of the column bits can continue.
|
||||
Furthermore, the number of columns defines the number of iterations the \ac{mac} core of the microkernel has to perform.
|
||||
As always 16 \ac{fp16} elements are packed together in a column-major fashion, and while ensuring that the \ac{am} of the memory controller switches to the next bank after exactly one burst size, the \ac{pim} units each contain 16 different matrix row elements of the same set of 16 matrix columns.
|
||||
Note, that this interleaving of \ac{fp16} vectors is very similar to the chunking of the weight matrix of SK Hynix's Newton architecture, as illustrated in \cref{img:hynix}.
|
||||
\Cref{img:matrix_layout} gives a complete overview of the layout of the weight matrix in the linear address space and its mapping onto the memory banks.
|
||||
Note, that the interleaving of \ac{fp16} vectors is very similar to the chunking of the weight matrix of SK Hynix's Newton architecture, as illustrated in \cref{img:hynix}.
|
||||
|
||||
The input vector must adhere also a special memory layout.
|
||||
Since a vector is essentially a single-column matrix, it is always laid out sequentially in memory.
|
||||
@@ -401,3 +402,11 @@ However, the increased processing bandwidth and the reduced power consumption on
|
||||
In conclusion, \aca{fimdram} is one of the few real \ac{pim} implementations by hardware vendors at this time and promises significant performance gains and higher power efficiency compared to regular \aca{hbm} \ac{dram}.
|
||||
The following \cref{sec:vp} introduces the concept of virtual prototyping, which is the basis for the following implementation of the \aca{fimdram} model in a simulator.
|
||||
|
||||
\begin{landscape}
|
||||
\begin{figure}
|
||||
\input{images/matrix_layout}
|
||||
\caption[Mapping of the weight matrix onto the memory banks and its layout in the linear address space]{Mapping of the weight matrix onto the memory banks and its layout in the linear address space.}
|
||||
\label{img:matrix_layout}
|
||||
\end{figure}
|
||||
\end{landscape}
|
||||
|
||||
|
||||
Reference in New Issue
Block a user