gem5/src/arch/amdgpu at 8b78e87f1bfb08ab952fde5161ec9e2397f86014 - gem5

derek/gem5

Files

Matthew Poremba 4d336c0636 arch-vega: Implement buffer_atomic_cmpswap (#439 )

This is a standard compare and swap but implemented on vector memory
buffer instructions (i.e., it is the same as FLAT_ATOMIC_CMPSWAP with
MUBUF's special address calculation).

This was tested using a Tensile kernel, a backend for rocBLAS, which is
used by PyTorch and Tensorflow. Prior to this patch both ML frameworks
crashed. With this patch they both make forward progress.

Change-Id: Ie76447a72d210f81624e01e1fa374e41c2c21e06

2023-10-12 07:33:40 -07:00

common

dev-amdgpu: Update deprecated ports

2023-02-14 18:57:33 +00:00

gcn3

arch-gcn3,arch-vega: Fix ds_read2st64_b32

2023-05-13 20:09:37 +00:00

vega

arch-vega: Implement buffer_atomic_cmpswap (#439 )

2023-10-12 07:33:40 -07:00