gem5/src/gpu-compute at b0d81ec8a2bcece1a510b20d40bd47314b24a89e - gem5 - Gitea: Git with a cup of tea

derek/gem5

Files

History

Matthew Poremba 7d46c50663 arch-vega: Swizzle multi-dword scratch requests (#1445 )

Scratch memory requests that are larger than one dword are using a
different memory layout than global instructions. Rather than being
placed contiguously, each dword is interleaved 64 lanes * 4 bytes away
as described in Section 9.1.5.2. "Swizzled Buffer Addressing" in the
MI300 specification. This was verified by comparing MI300 output (which
uses scratch_ instructions) with MI200 (which uses buffer instructions).
MI300 FashionMNIST bs=1 now matches CPU reference.

This requires several changes to the instruction implementations:
- For stores, data in the GPUDynInst can be swizzled before the data is
written to memory. This is easy to do using a helper method. This is
done in the template<int N> variant of initMemWrite. To use this x2
stores are changed to use template<int N> rather than loading a U64. The
swizzle function is renamed to swizzleAddr to avoid confusion with
swizzleData.
- For loads, data is unswizzled in completeAcc when writing register
values. This is not as easy to implement as a helper and is thus
implemented for the three load instructions that load more than one
dword.
- Accessing swizzled data requires at least one packet per dword. A new
GPU memory helper is added to create these packets for scratch requests
specifically. This is called in the template<int N> variant of
initMemRead / initMemWrite. Loads and stores of x2 are changed to use
this variant instead of accessing a U64.

The GPUDynInst status vector restrictions are increased to allow for
swizzled x4 accesses. For simplicity this does not currently support
misaligned swizzled accesses and will panic upon seeing such a case.

Change-Id: Ic686c51e28e0af029a043d5a5b3d4069f2cb94f9

2024-08-12 06:58:48 -07:00

..

comm.cc

misc: Remove AMD license addition

2021-12-11 04:00:56 +00:00

comm.hh

misc: Remove AMD license addition

2021-12-11 04:00:56 +00:00

compute_unit.cc

gpu-compute: Update Requests for invalidations

2024-08-07 14:37:49 -07:00

compute_unit.hh

gpu-compute,mem,systemc: This commit corrects typos of 'cache' (#1263 )

2024-06-20 09:45:13 -07:00

dispatcher.cc

gpu-compute: update GPUKernelInfo print to print WG number (#1413 )

2024-08-05 12:43:41 -07:00

dispatcher.hh

gpu-compute,configs: Make sim exits conditional

2023-07-07 14:12:54 +00:00

dyn_pool_manager.cc

gpu-compute: Fix register checking and allocation in dyn manager

2022-02-18 18:46:33 +00:00

dyn_pool_manager.hh

misc: Remove AMD license addition

2021-12-11 04:00:56 +00:00

exec_stage.cc

gpu-compute: Fix stat bucket sizes

2024-04-13 15:51:41 -07:00

exec_stage.hh

misc: Remove AMD license addition

2021-12-11 04:00:56 +00:00

fetch_stage.cc

misc: Remove AMD license addition

2021-12-11 04:00:56 +00:00

fetch_stage.hh

misc: Remove AMD license addition

2021-12-11 04:00:56 +00:00

fetch_unit.cc

gpu-compute: Remove unused and redundant functions

2024-02-09 12:17:24 -06:00

fetch_unit.hh

gpu-compute: Remove unused and redundant functions

2024-02-09 12:17:24 -06:00

global_memory_pipeline.cc

gpu-compute: Support Scalar and Vector access to system pages

2022-04-07 20:11:01 +00:00

global_memory_pipeline.hh

misc: Remove AMD license addition

2021-12-11 04:00:56 +00:00

gpu_command_processor.cc

mem,gpu-compute: Implement GPU TCC directed invalidate

2024-04-10 11:35:25 -07:00

gpu_command_processor.hh

gpu-compute: Add support for skipping GPU kernels (#940 )

2024-03-21 07:46:27 -07:00

gpu_compute_driver.cc

misc: Remove all references to GCN3

2024-01-17 11:11:06 -06:00

gpu_compute_driver.hh

misc: Remove all references to GCN3

2024-01-17 11:11:06 -06:00

gpu_dyn_inst.cc

gpu-compute: fix typo in GPUMem debug print (#1412 )

2024-08-05 12:44:13 -07:00

gpu_dyn_inst.hh

arch-vega: Swizzle multi-dword scratch requests (#1445 )

2024-08-12 06:58:48 -07:00

gpu_exec_context.cc

misc: Remove AMD license addition

2021-12-11 04:00:56 +00:00

gpu_exec_context.hh

misc: Remove AMD license addition

2021-12-11 04:00:56 +00:00

gpu_render_driver.cc

gpu-compute: Add mmap functionality to GPURenderDriver

2021-07-09 16:11:20 +00:00

gpu_render_driver.hh

gpu-compute: Add mmap functionality to GPURenderDriver

2021-07-09 16:11:20 +00:00

gpu_static_inst.cc

arch-vega,gpu-compute: Fix misc ubsan runtime errors

2024-05-03 14:26:46 -07:00

gpu_static_inst.hh

gpu-compute: Add MFMA stats (#1248 )

2024-06-15 13:04:00 -07:00

GPU.py

dev-amdgpu,configs,gpu-compute: Add gfx942 version

2024-05-15 12:08:41 -07:00

GPUStaticInstFlags.py

gpu-compute: Add MFMA stats (#1248 )

2024-06-15 13:04:00 -07:00

hsa_queue_entry.hh

dev-amdgpu,configs,gpu-compute: Add gfx942 version

2024-05-15 12:08:41 -07:00

Kconfig

scons: Use Kconfig to configure gem5.

2023-11-23 08:26:10 +08:00

kernel_code.hh

gpu-compute: Update code object to latest LLVM

2024-01-03 15:41:06 -06:00

lds_state.cc

misc: Remove AMD license addition

2021-12-11 04:00:56 +00:00

lds_state.hh

gpu-compute: Add DebugFlag for LDS

2024-05-03 14:31:17 -07:00

LdsState.py

misc: Run pre-commit run --all-files

2023-11-29 22:06:41 -08:00

local_memory_pipeline.cc

misc: Remove AMD license addition

2021-12-11 04:00:56 +00:00

local_memory_pipeline.hh

misc: Remove AMD license addition

2021-12-11 04:00:56 +00:00

misc.hh

misc: Remove AMD license addition

2021-12-11 04:00:56 +00:00

of_scheduling_policy.hh

misc: Remove AMD license addition

2021-12-11 04:00:56 +00:00

operand_info.hh

misc: Remove AMD license addition

2021-12-11 04:00:56 +00:00

pool_manager.cc

misc: Remove AMD license addition

2021-12-11 04:00:56 +00:00

pool_manager.hh

misc: Remove AMD license addition

2021-12-11 04:00:56 +00:00

register_file_cache.cc

gpu-compute: Added register file cache support

2024-01-04 22:43:05 -06:00

register_file_cache.hh

gpu-compute: Added register file cache support

2024-01-04 22:43:05 -06:00

register_file.cc

gpu-compute: Added register file cache support

2024-01-04 22:43:05 -06:00

register_file.hh

gpu-compute: Added register file cache support

2024-01-04 22:43:05 -06:00

register_manager_policy.hh

misc: Remove AMD license addition

2021-12-11 04:00:56 +00:00

register_manager.cc

misc: Remove AMD license addition

2021-12-11 04:00:56 +00:00

register_manager.hh

misc: Remove AMD license addition

2021-12-11 04:00:56 +00:00

rr_scheduling_policy.hh

misc: Remove AMD license addition

2021-12-11 04:00:56 +00:00

scalar_memory_pipeline.cc

gpu-compute: Invalidate Scalar cache when SQC invalidates (#1093 )

2024-05-06 07:35:38 -07:00

scalar_memory_pipeline.hh

gpu-compute: Add support for injecting scalar memory barrier

2024-02-09 12:14:57 -06:00

scalar_register_file.cc

misc: Remove AMD license addition

2021-12-11 04:00:56 +00:00

scalar_register_file.hh

misc: Remove AMD license addition

2021-12-11 04:00:56 +00:00

schedule_stage.cc

misc: Remove all references to GCN3

2024-01-17 11:11:06 -06:00

schedule_stage.hh

misc: Remove AMD license addition

2021-12-11 04:00:56 +00:00

scheduler.cc

misc: Remove AMD license addition

2021-12-11 04:00:56 +00:00

scheduler.hh

misc: Remove AMD license addition

2021-12-11 04:00:56 +00:00

scheduling_policy.hh

misc: Remove AMD license addition

2021-12-11 04:00:56 +00:00

SConscript

gpu-compute: Add DebugFlag for LDS

2024-05-03 14:31:17 -07:00

scoreboard_check_stage.cc

gpu-compute,arch-vega: Implement flat scratch insts

2023-08-26 13:40:12 -05:00

scoreboard_check_stage.hh

misc: Remove AMD license addition

2021-12-11 04:00:56 +00:00

shader.cc

gpu-compute: Update Requests for invalidations

2024-08-07 14:37:49 -07:00

shader.hh

mem,gpu-compute: Implement GPU TCC directed invalidate

2024-04-10 11:35:25 -07:00

simple_pool_manager.cc

misc: Remove AMD license addition

2021-12-11 04:00:56 +00:00

simple_pool_manager.hh

misc: Remove AMD license addition

2021-12-11 04:00:56 +00:00

static_register_manager_policy.cc

misc: Remove AMD license addition

2021-12-11 04:00:56 +00:00

static_register_manager_policy.hh

misc: Remove AMD license addition

2021-12-11 04:00:56 +00:00

vector_register_file.cc

gpu-compute: WAX dependency detection (#731 )

2024-01-05 12:57:24 -06:00

vector_register_file.hh

misc: Remove AMD license addition

2021-12-11 04:00:56 +00:00

wavefront.cc

gpu-compute: Fix architected flat scratch

2024-06-15 15:46:33 -07:00

wavefront.hh

gpu-compute: Fix architected flat scratch

2024-06-15 15:46:33 -07:00