Minor fixes and compiler-based approach in colcusion
This commit is contained in:
@@ -19,6 +19,10 @@ For a better evaluation of the performance gains of \aca{fimdram}, it should be
|
||||
Effects such as the initialization overhead of \aca{fimdram} can only be evaluated in such an environment.
|
||||
Furthermore, the integration of \aca{fimdram} should be extended to \acp{gpu} or \acp{tpu}, so that the comparison can be extended to the deployment of the real \ac{dnn} applications.
|
||||
|
||||
Further research could also investigate whether the library-based approach of leveraging \ac{pim} could be replaced by a compiler-based approach.
|
||||
A special compiler extension would be able to generate the necessary \ac{ld} and \ac{st} instructions by analyzing the data types of the operands.
|
||||
This extension might also make use of so-called non-temporal instructions that bypass the cache hierarchy on a per-instruction basis.
|
||||
|
||||
In conclusion, \ac{pim} is a promising approach to address the future processing needs of \ac{ai} and possibly other applications.
|
||||
Not only the architecture itself has to be considered, but also the integration of \ac{pim} into the applications at the software level.
|
||||
By overcoming these challenges, \ac{pim} could be part of the solution to increase the performance and energy efficiency of future computing platforms.
|
||||
|
||||
Reference in New Issue
Block a user