Last correction from Lukas
This commit is contained in:
@@ -32,8 +32,8 @@ A special compiler extension would be able to generate the necessary \ac{ld} and
|
||||
This extension could also make use of so-called non-temporal instructions, which bypass the cache hierarchy on a per-instruction basis instead of preallocating the entire \ac{pim}-enabled memory as non-cacheable.
|
||||
|
||||
In addition to the performance comparison, further research should also model and compare the power efficiency gain of \ac{pim} to the non-\ac{pim} case.
|
||||
Since \ac{pim} not only provides a shorter computation time per operation, but also does not actually drive the memory data bus during operation, it promises good improvements in this area.
|
||||
However, this would require a detailed performance model of both \aca{hbm} and \aca{fimdram}.
|
||||
Since \ac{pim} not only provides a shorter computation time per operation, but also does not actually transfer data out of the \ac{dram} and therefore does not need to drive the data bus during operation, it promises good improvements in this area.
|
||||
However, such research would require a detailed power model of both \aca{hbm} and \aca{fimdram}.
|
||||
|
||||
In conclusion, \ac{pim} is a promising approach to address the future processing and power efficiency needs of \ac{ai} and possibly other applications.
|
||||
Research needs to consider not only the architecture itself, but also the integration of \ac{pim} into applications at the software level.
|
||||
|
||||
Reference in New Issue
Block a user