derek/gem5 - gem5 - Gitea: Git with a cup of tea

derek/gem5

Author	SHA1	Message	Date
Kyle Roarty	e2e18d41e1	configs,gpu-compute: Add support for gfx902/Raven This patch adds support for a gfx902 Vega APU, ripping the appropriate values for device_id from the ROCm Thunk (src/topology.c). Note: gfx902 isn't officially supported by ROCm. This means that it may not work for all programs. In particular, rocBLAS is incompatible with gfx902, so anything that uses rocBLAS won't be able to run with gfx902. Change-Id: I48893e7cc9c7e52275fdfd22314f371a9db8e90a Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/47530 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Reviewed-by: Matthew Poremba <matthew.poremba@amd.com> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-07-10 03:42:03 +00:00
Matthew Poremba	897c0c11ed	dev,dev-hsa,gpu-compute: Refactor dmaVirt calls Remove the duplicate dmaVirt calls from HSA packet processor and GPU command processor and move them into their own class. This removes some duplicate code and allows a DmaVirtDevice to be created which will be useful for upcoming full system GPU commits. The DmaVirtDevice is an abstraction of the base DmaDevice but iterates using ChunkGenerator over virtual addresses. Classes which inherit from DmaVirtDevice must provide a translation function to translate from virtual address to physical address. Once translated, the physical address is passed to DmaDevice to do the work. Change-Id: Idd59ccb4d9ba21c0b1150ee328ededf5a88d824e Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/47179 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-07-09 22:40:18 +00:00
Kyle Roarty	1812041dc0	gpu-compute: Update GET_PROCESS_APERTURES IOCTLs The apertures for non-gfx801 GPUs are set differently. If the apertures aren't set properly, ROCm will error out. This change sets the apertures appropriately based on the gfx version of the simulated GPU. It also adds in new functions to set the scratch and lds apertures in GFX9 to mimic the linux kernel. Change-Id: I1fa6f60bc20c7b6eb3896057841d96846460a9f8 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/47529 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Reviewed-by: Matthew Poremba <matthew.poremba@amd.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-07-09 16:22:07 +00:00
Kyle Roarty	ab9e28ddb8	configs,gpu-compute: Set proper dGPUPoolID defaults In GPU.py, dGPUPoolID is defined as an int, but was defaulted to False. Explicitly set it to 0, instead. In apu_se.py, dGPUPoolID was being set to 1, but that was resulting in crashes. Setting it to 0 avoids those crashes. Change-Id: I0f1161588279a335bbd0d8ae7acda97fc23201b5 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/47527 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Reviewed-by: Matthew Poremba <matthew.poremba@amd.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-07-09 16:11:20 +00:00
Kyle Roarty	76888a9cca	gpu-compute: Add mmap functionality to GPURenderDriver dGPUs mmap the GPURenderDriver, however it doesn't appear that they do anything with it. This patch implements the mmap function by just returning the address provided, while not doing anything else Change-Id: Ia010a2aebcf7e2c75e22d93dfb440937d1bef3b1 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/47523 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Reviewed-by: Matthew Poremba <matthew.poremba@amd.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-07-09 16:11:20 +00:00
Kyle Roarty	ebb6c4b99b	gpu-compute: Check for WAX dependences This adds checking if the destination registers are free or busy in the operandsReady() function for both scalar and vector registers. This allows us to catch WAX dependences between instructions. Change-Id: I0fb0b29e9608fca0d90c059422d4d9500d5b2a7d Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/47539 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-07-08 01:18:01 +00:00
Kyle Roarty	02dd6b77ff	arch-gcn3,arch-vega,gpu-compute: Move request counters When the Vega ISA got committed, it lacked the request counter tracking for memory requests that existed in the GCN3 code. Instead of copying over the same lines from the GCN3 code to the Vega code, this commit makes the various memory pipelines handle updating the request counter information instead, as every memory instruction calls a memory pipeline. This commit also adds an issueRequest in scalar_memory_pipeline, as previously, the gpuDynInsts were explicitly placed in the queue of issuedRequests. Change-Id: I5140d3b2f12be582f2ae9ff7c433167aeec5b68e Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/45347 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-07-08 01:18:01 +00:00
Kyle Roarty	3f9b03522c	arch-gcn3,gpu-compute: Set gpuDynInst exec_mask before use vector_register_file uses the exec_mask of a memory instruction in order to determine if it should mark a register as in-use or not. Previously, the exec_mask of memory instructions was only set on execution of that instruction, which occurs after the code in vector_register_file. This led to the code reading potentially garbage data, leading to a scenario where a register would be marked used when it shouldn't be. This fix sets the exec_mask of memory instructions in schedule_stage, which works because the only time the wavefront execMask() is updated is on a instruction executing, and we know the previous instruction will have executed by the time schedule_stage executes, due to the order the pipeline is executed in. This also undoes part of a patch from last year (`62ec973`) which treated the symptom of accidental register allocation, without preventing the registers from being allocated in the first place. This patch also removes now redundant code that sets the exec_mask in instructions.cc for memory instructions Change-Id: Idabd35020000764fb06133ac2458606c1aaf6f04 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/45346 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Reviewed-by: Matthew Poremba <matthew.poremba@amd.com> Maintainer: Matthew Poremba <matthew.poremba@amd.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-07-08 01:18:01 +00:00
Daniel R. Carvalho	5ff1fac819	misc: Rename Debug namespace as debug As part of recent decisions regarding namespace naming conventions, all namespaces will be changed to snake case. gem5::Debug became gem5::debug. Change-Id: Ic04606baab3317d2e58ab3ca9b37fc201c406ee8 Signed-off-by: Daniel R. Carvalho <odanrc@yahoo.com.br> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/47305 Reviewed-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Maintainer: Giacomo Travaglini <giacomo.travaglini@arm.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-07-07 23:18:59 +00:00
Daniel R. Carvalho	60e4ad955d	mem-ruby: Add a ruby namespace Encapsulate all ruby-related files in a ruby namespace. Change-Id: If642c9751ecefc35b45c5dd69d85e67813cc5224 Issued-on: https://gem5.atlassian.net/browse/GEM5-984 Signed-off-by: Daniel R. Carvalho <odanrc@yahoo.com.br> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/47307 Reviewed-by: Bobby R. Bruce <bbruce@ucdavis.edu> Maintainer: Bobby R. Bruce <bbruce@ucdavis.edu> Tested-by: kokoro <noreply+kokoro@google.com>	2021-07-07 23:18:59 +00:00
Giacomo Travaglini	d1cdcb311b	misc: Move Mode and Translation from BaseTLB to BaseMMU This is a step towards moving most of the TLB logic to the MMU class. Change-Id: Id6b1fb30aa89960705f165f9738f5b50aa1e6bdb Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/46779 Tested-by: kokoro <noreply+kokoro@google.com>	2021-07-07 08:44:13 +00:00
Matthew Poremba	7aa3c4c638	gpu-compute: Add missing override in render driver This fixes the build error in the clang-11 compiler check for GCN3_X86. Change-Id: I2245589182b80811b8bc07409196adca98899213 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/47479 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-07-02 00:50:28 +00:00
Daniel R. Carvalho	974a47dfb9	misc: Adopt the gem5 namespace Apply the gem5 namespace to the codebase. Some anonymous namespaces could theoretically be removed, but since this change's main goal was to keep conflicts at a minimum, it was decided not to modify much the general shape of the files. A few missing comments of the form "// namespace X" that occurred before the newly added "} // namespace gem5" have been added for consistency. std out should not be included in the gem5 namespace, so they weren't. ProtoMessage has not been included in the gem5 namespace, since I'm not familiar with how proto works. Regarding the SystemC files, although they belong to gem5, they actually perform integration between gem5 and SystemC; therefore, it deserved its own separate namespace. Files that are automatically generated have been included in the gem5 namespace. The .isa files currently are limited to a single namespace. This limitation should be later removed to make it easier to accomodate a better API. Regarding the files in util, gem5:: was prepended where suitable. Notice that this patch was tested as much as possible given that most of these were already not previously compiling. Change-Id: Ia53d404ec79c46edaa98f654e23bc3b0e179fe2d Signed-off-by: Daniel R. Carvalho <odanrc@yahoo.com.br> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/46323 Maintainer: Bobby R. Bruce <bbruce@ucdavis.edu> Reviewed-by: Bobby R. Bruce <bbruce@ucdavis.edu> Reviewed-by: Matthew Poremba <matthew.poremba@amd.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-07-01 19:08:24 +00:00
Kyle Roarty	c96f43d83e	gpu-compute: Initialize GPUDriver member variables before use A few member variables weren't initialized, but we were assuming that they were 0 when first read. This explicitly sets those variables to 0. Change-Id: I2c840d361ed3a7d306e22dc7561a3870f1ef94a1 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/46248 Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com>	2021-06-30 16:47:43 +00:00
Kyle Roarty	b1df141bed	gpu-compute: Change certain IOCTL errors to warnings There are certain IOCTL errors that were triggering with the change to ROCm 4, however they could be set to warnings without causing any errors in the program Change-Id: Ie0052267f3ccfbdbadb90249b6f19e6a1205f57e Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/46247 Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Matthew Poremba <matthew.poremba@amd.com> Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com>	2021-06-30 16:47:43 +00:00
Kyle Roarty	5fad68f576	dev-hsa,gpu-compute: IOCTL updates for ROCm 4 This change copies over the up-to-date kfd_ioctl.h file from the linux kernel, and updates the gpu_compute_driver to reflect the changes found in the new version of the kfd_ioctl.h file Change-Id: I51e8e7158762f4b7e06c0f84507e5889a17939a2 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/46246 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-06-30 16:47:43 +00:00
Kyle Roarty	f2a029058a	gpu-compute: Ignore GPU kernel names ROCm 4 seems to have updated the akc, and the only real issue that has occured is that we're no longer able to read kernel names in the same way as we were in ROCm 1.6. This patch removes the prior method of reading kernel names and gives all kernels a temporary name Change-Id: I0040e0cf4cd35d6f56ded6a8acfb10c600bcc77a Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/46245 Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Reviewed-by: Matthew Poremba <matthew.poremba@amd.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com>	2021-06-30 16:47:43 +00:00
Kyle Roarty	a71801b9a0	configs,gpu-compute: Add render driver needed for ROCm 4 ROCm 4 utilizes the render driver located at /dev/dri/renderDXXX. This patch implements a very simple driver that just returns a file descriptor when opened, as testing has shown that's all that's needed Change-Id: I65602346cbf17b2dc80e114046ebf5c9830a1507 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/46244 Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com>	2021-06-30 16:47:43 +00:00
Daniel R. Carvalho	98ac080ec4	base-stats,misc: Rename Stats namespace as statistics As part of recent decisions regarding namespace naming conventions, all namespaces will be changed to snake case. ::Stats became ::statistics. "statistics" was chosen over "stats" to avoid generating conflicts with the already existing variables (there are way too many "stats" in the codebase), which would make this patch even more disturbing for the users. Change-Id: If877b12d7dac356f86e3b3d941bf7558a4fd8719 Signed-off-by: Daniel R. Carvalho <odanrc@yahoo.com.br> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/45421 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-05-29 11:13:49 +00:00
Daniel R. Carvalho	4dd099ba3d	misc: Rename Enums namespace as enums As part of recent decisions regarding namespace naming conventions, all namespaces will be changed to snake case. ::Enums became ::enums. Change-Id: I39b5fb48817ad16abbac92f6254284b37fc90c40 Signed-off-by: Daniel R. Carvalho <odanrc@yahoo.com.br> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/45420 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-05-29 11:13:49 +00:00
Gabe Black	a91af24e60	misc: Clean up ISA switching header includes. Remove includes that aren't needed, including ones for config/the_isa.hh. Also stop using switching includes when the ISA is known. Change-Id: I2af6c88dcaf511b086ec808b0ba3196179982af2 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/40336 Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Gabe Black <gabe.black@gmail.com> Maintainer: Gabe Black <gabe.black@gmail.com>	2021-05-28 23:41:03 +00:00
Daniel R. Carvalho	71460cb13e	sim,misc: Rename Int namespace as as_int As part of recent decisions regarding namespace naming conventions, all namespaces will be changed to snake case. sim_clock::Int became sim_clock::as_int. "as_int" was chosen because "int" is a reserved keyword, and this namespace acts as a selector of how to read the internal variables. Another possibility to resolve this would be to remove the namespaces "Float" and "Int" and use unions instead. Change-Id: I65f47608d2212424bed1731c7f53d242d5a7d89a Signed-off-by: Daniel R. Carvalho <odanrc@yahoo.com.br> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/45436 Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Hoa Nguyen <hoanguyen@ucdavis.edu> Maintainer: Gabe Black <gabe.black@gmail.com>	2021-05-26 23:08:21 +00:00
Daniel R. Carvalho	0967a43c10	misc: Rename SimClock namespace as sim_clock As part of recent decisions regarding namespace naming conventions, all namespaces will be changed to snake case. ::SimClock became ::sim_clock. Change-Id: I25b8cfc93f283081bc2add9fdef6fec7d7ff3846 Signed-off-by: Daniel R. Carvalho <odanrc@yahoo.com.br> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/45402 Tested-by: kokoro <noreply+kokoro@google.com> Maintainer: Bobby R. Bruce <bbruce@ucdavis.edu> Reviewed-by: Hoa Nguyen <hoanguyen@ucdavis.edu>	2021-05-26 22:30:33 +00:00
Daniel R. Carvalho	85b8c5b0a3	gpu-compute: Rename prefetch variable as isPrefetch Pave the way for a prefetch namespace. Change-Id: I4372abb5603eb6a920f7ff127cde54cb24e31377 Signed-off-by: Daniel R. Carvalho <odanrc@yahoo.com.br> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/45409 Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Matthew Poremba <matthew.poremba@amd.com> Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matthew Poremba <matthew.poremba@amd.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com>	2021-05-21 23:10:39 +00:00
Bobby R. Bruce	521d04f92f	arch-gcn3: Fixing .fast compilation for gcn3 DPRINTF was altered here: https://gem5-review.googlesource.com/c/public/gem5/+/44988. This change results in DPRINTFs always compiling. As such, the variables decladed within NDEBUG ifdefs, and later used in DPRINTFs, cause an error when compiling .fast. In this patch the NDEBUG ifdefs have been removed. Change-Id: I54992cfe152c84b265e64e1389bf2656c95ba42e Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/45481 Reviewed-by: Matthew Poremba <matthew.poremba@amd.com> Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matthew Poremba <matthew.poremba@amd.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-05-14 18:13:59 +00:00
Daniel R. Carvalho	9b675ebea8	misc: Add missing compiler.hh include Add some missing base/compiler.hh includes. Found by manually checking the files in: grep -r --include \*.hh -L \ '#include "base/compiler.hh"' \ $(grep -r -l "GEM5_" src/) And occasionally checking some .cc files through a similar methodology. Change-Id: I6b6e27189c627bb76ace73c338486743d469be46 Signed-off-by: Daniel R. Carvalho <odanrc@yahoo.com.br> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/45459 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Bobby R. Bruce <bbruce@ucdavis.edu> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-05-14 10:02:14 +00:00
Gabe Black	e1fea279e2	misc: Replace M5_NODISCARD with GEM5_NO_DISCARD. Change-Id: I1ddaf03afe865092d1664e395b51b1f573c19c85 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/45232 Tested-by: kokoro <noreply+kokoro@google.com> Maintainer: Gabe Black <gabe.black@gmail.com> Reviewed-by: Daniel Carvalho <odanrc@yahoo.com.br>	2021-05-11 20:16:31 +00:00
Gabe Black	fb3befcc6d	misc: Replace M5_VAR_USED with GEM5_VAR_USED. Change-Id: I64a874ccd1a9ac0541dfa01971d7d620a98c9d32 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/45231 Tested-by: kokoro <noreply+kokoro@google.com> Maintainer: Gabe Black <gabe.black@gmail.com> Reviewed-by: Daniel Carvalho <odanrc@yahoo.com.br>	2021-05-11 20:16:31 +00:00
Bobby R. Bruce	8f22b3bee8	arch-gcn3: Add missing overrides These overrides are required to compile gcn3_x86 with clang. Change-Id: I65ece501f16a4fbf8ffdc6b754de69fb36ab7515 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/45085 Maintainer: Bobby R. Bruce <bbruce@ucdavis.edu> Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-05-05 04:15:39 +00:00
Bobby R. Bruce	2c8bade2c3	arch-gcn3,misc: Fix .fast compilation errors for GCN3_x86 Unused variable errors occurred when compiling gem5.fast with GCC. This patch fixes this. Change-Id: Iaca1fb8194c2381c0a4ba5d0ea1fb5b8f2a11829 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/44885 Reviewed-by: Bobby R. Bruce <bbruce@ucdavis.edu> Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Reviewed-by: Matthew Poremba <matthew.poremba@amd.com> Maintainer: Bobby R. Bruce <bbruce@ucdavis.edu> Tested-by: kokoro <noreply+kokoro@google.com>	2021-04-27 23:26:04 +00:00
Kyle Roarty	529736a7ce	gpu-compute, dev-hsa: Fix doorbell for gfx900 gfx9 changed the size of the doorbell, and what the write index is when the doorbell is rang. --gfx-version flag is used to set the doorbell size Change-Id: I48e4e57dc1c80a08133b17cdf3f92533b541f7c3 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/42220 Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com>	2021-04-24 15:54:15 +00:00
Kyle Roarty	ec6b325382	gpu-compute, dev-hsa: Remove HSADriver, HSADevice HSADriver/HSADevice were primarily used with GPUCommandProcessor/ GPUComputeDriver. This change merges the classes together to simplify the inheritance hierarchy, as well as removing any casting. Change-Id: I670eb9b49a16c8aba17e13fd1d1287d0621c9f48 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/42219 Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com>	2021-04-24 15:54:15 +00:00
Kyle Roarty	eb09361eef	configs, gpu-compute: Add option to specify gfx version Currently uses gfx801, gfx803, gfx900 for Carrizo, Fiji, and Vega respectively Change-Id: I62758914b6a60f16dd4f2141a23c0a9141a4e1a0 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/42217 Maintainer: Matthew Poremba <matthew.poremba@amd.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-04-24 15:54:15 +00:00
Michael LeBeane	ad43083bb3	gpu-compute: Implement per-request MTYPEs GPU MTYPE is currently set using a global config passed to the PACoalescer. This patch enables MTYPE to be set by the shader on a per-request bases. In real hardware, the MTYPE is extracted from a GPUVM PTE during address translation. However, our current simulator only models x86 page tables which do not have the appropriate bits for GPU MTYPES. Rather than hacking non-x86 bits into our x86 page table models, this patch instead keeps an interval tree of all pages that request custom MTYPES in the driver itself. This is currently only used to map host pages to the GPU as uncacheable, but is easily extensible to other MTYPES. Change-Id: I7daab0ffae42084b9131a67c85cd0aa4bbbfc8d6 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/42216 Maintainer: Matthew Poremba <matthew.poremba@amd.com> Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-04-24 15:54:15 +00:00
Michael LeBeane	a5f55e0be1	gpu-compute: Topology and driver changes for dGPU New topology ripped from Fiji to support dGPU. A dGPU flag is added to the config which is propogated to the driver. The emulated driver is now able to properly deal with dGPU ioctls and mmaps. For now, dGPU physical memory is allocated from the host, but this is easy to change once we get a GPU memory controller up and running. Change-Id: I594418482b12ec8fb2e4018d8d0371d56f4f51c8 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/42214 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-04-15 16:41:11 +00:00
Gabe Black	0dade68dae	arch,cpu,gpu-compute: Further simplify VecRegContainer. Get rid of VecRegT, and a few redundant or unused methods. Change-Id: I6c88c40653e1939fe74b8ffb847ef50ab8064670 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/41995 Reviewed-by: Gabe Black <gabe.black@gmail.com> Maintainer: Gabe Black <gabe.black@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-04-10 07:31:23 +00:00
Kyle Roarty	c734ab7602	dev-hsa,gpu-compute: Fix override for updateHsaSignal Change `965ad12` removed a parameter from the updateHsaSignal function. Change `25e8a14` added the parameter back, but only for the derived class, breaking the override. This patch adds that parameter back to the base class, fixing the override. Change-Id: Id1e96e29ca4be7f3ce244bac83a112e3250812d1 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/44046 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Alex Dutu <alexandru.dutu@amd.com> Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com>	2021-04-03 02:39:27 +00:00
Gabe Black	1791b8732c	scons: Pull domain specific build setup out of SConstruct. Use SConsopts files local to individual domains to pull non-foundational build code out of SConstruct. This greatly simplifies SConstruct, and also makes it easier to find build configuration having to do with particular pieces of gem5. This change also converts some python level variables, all_protocols, protocol_dirs, and slicc_includes, into the environment where the timing of their initialization is more flexible. Change-Id: Ie61ceb75ae9e5557cc400603c972a9582e99c1ea Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/40872 Maintainer: Gabe Black <gabe.black@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Gabe Black <gabe.black@gmail.com>	2021-04-03 01:18:17 +00:00
Kyle Roarty	df5ddabc03	gpu-compute: Fix scalar register ready check Replaces some curly braces that were accidentally removed causing the function to return false even when it shouldn't Change-Id: I15fb4167468c8e3dd1107f1ca3dc98c48df4611b Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/44045 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Reviewed-by: Alex Dutu <alexandru.dutu@amd.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-04-02 20:12:29 +00:00
Kyle Roarty	2bb8d6bc0c	gpu-compute: remove index-based operand access This commit removes functions that indexed into the vectors that held the operands. Instead, for-each loops are used, iterating through one of 6 vectors (src, dst, srcScalar, srcVec, dstScalar, dstVec) that all hold various (potentially overlapping) combinations of the operands. Change-Id: Ia3a857c8f6675be86c51ba2f77e3d85bfea9ffdb Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/42212 Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com>	2021-04-01 02:58:31 +00:00
Kyle Roarty	b40b361bee	arch-vega, gpu-compute: Add vectors to hold op info This removes the need for redundant functions like isScalarRegister/isVectorRegister, as well as isSrcOperand/isDstOperand. Also, the op info is only generated once this way instead of every time it's needed. Change-Id: I8af5080502ed08ed9107a441e2728828f86496f4 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/42211 Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com>	2021-04-01 02:58:31 +00:00
Tony Gutierrez	0e2564a629	arch-gcn3, gpu-compute: Update getRegisterIndex() API This change removes the GPUDynInstPtr argument from getRegisterIndex(). The dynamic inst was only needed to get access to its parent WF's state so it could determine the number of scalar registers the wave was allocated. However, we can simply pass the number of scalar registers directly. This cuts down on shared pointer usage. Change-Id: I29ab8d9a3de1f8b82b820ef421fc653284567c65 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/42210 Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com>	2021-04-01 02:58:31 +00:00
Tony Gutierrez	236b4a502f	gpu-compute: Add operand info class to GPUDynInst This change adds a class that stores operand register info for the GPUDynInst. The operand info is calculated when the instruction object is created and stored for easy access by the RF, etc. Change-Id: I3cf267942e54fe60fcb4224d3b88da08a1a0226e Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/42209 Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com>	2021-04-01 02:58:31 +00:00
Gabe Black	3f67faec83	arch,dev,gpu-compute,sim: Rename isa_traits.hh page_size.hh. The only thing left in isa_traits.hh are two constants, one for the number of bytes in a page, and one for how far to shift an address to get the page number. To make it clear that this is the only thing isa_traits.hh should be used for from this point forward (until it is entirely eliminated), this change renames it to the much less generic page_size.hh. Also, because isa_traits.hh used to have much more stuff in it, it was included in a lot of places it didn't need to be. This change also clears out all these legacy includes while updating the actually needed ones to the new name. Change-Id: I939b01b117c53d620b6b0a98982f6f21dc2ada72 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/40179 Reviewed-by: Gabe Black <gabe.black@gmail.com> Maintainer: Gabe Black <gabe.black@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-03-30 10:17:48 +00:00
Kyle Roarty	c9415dc389	gpu-compute: Remove unused functions These functions were probably used for some stat collection, but they're no longer used, so they're being removed Change-Id: Ic99f22391c0d5ffb0e9963670efb35e503f9957d Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/42202 Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com>	2021-03-25 17:21:16 +00:00
Michael LeBeane	25e8a14a6b	gpu-compute: Support dynamic scratch allocations dGPUs in all versions of ROCm and APUs starting with ROCM 2.2 can under-allocate scratch resources. This patch adds support for the CP to trigger a recoverable error so that the host can attempt to re-allocate scratch to satisfy the currently stalled kernel. Note that this patch does not include a mechanism to handle dynamic scratch allocation for queues with in-flight kernels, as these queues would first need to be drained and descheduled, which would require some additional effort in the hsaPP and HW queue scheduler. If the CP encounters this scenerio it will assert. I suspect this is not a particularly common occurence in most of our applications so it is left as a TODO. This patch also fixes a few memory leaks and updates the old DMA callback object interface to use a much cleaner c++11 lambda interface. Change-Id: Ica8a5fc88888283415507544d6cc49fa748fe84d Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/42201 Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com>	2021-03-25 17:21:08 +00:00
Daniel R. Carvalho	7f1de4e686	misc: Fix coding style for enum's opening braces The systemc dir was not included in this fix. First it was identified that there were only occurrences at 0, 1, and 2 levels of indentation (and 2 of 2 spaces, 1 of 3 spaces and 2 of 12 spaces), using: grep -nrE --exclude-dir=systemc \ "^ enum [A-Za-z]. {$" src/ Then the following commands were run to replace: <indent level>enum X ... { by: <indent level>enum X ... <indent level>{ Level 0: grep -nrl --exclude-dir=systemc \ "^enum [A-Za-z].* {$" src/ \| \ xargs sed -Ei \ 's/^enum ([A-Za-z].) \{$/enum \1\n\{/g' Level 1: grep -nrl --exclude-dir=systemc \ "^ enum [A-Za-z]. {$" src/ \| \ xargs sed -Ei \ 's/^ enum ([A-Za-z].*) \{$/ enum \1\n \{/g' and so on. Change-Id: Ib186cf379049098ceaec20dfe4d1edcedd5f940d Signed-off-by: Daniel R. Carvalho <odanrc@yahoo.com.br> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/43326 Reviewed-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Reviewed-by: Gabe Black <gabe.black@gmail.com> Maintainer: Giacomo Travaglini <giacomo.travaglini@arm.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-03-23 16:26:04 +00:00
Kyle Roarty	f5383a5733	gpu-compute: Fix accidental execution when stopped at barrier Due the compute unit pipeline being executed in reverse order, there exists a scenario where a compute unit will execute an extra instruction when it's supposed to be stopped at a barrier. It occurs as follows: * The ScheduleStage sets a barrier instruction ready to execute. * The ScoreboardCheckStage adds another instruction to the readyList. This is where the barrier is checked, but because the barrier isn't executing yet, the instruction can be passed along to ScheduleStage * The barrier executes, and stalls * The ScheduleStage sees that there's a new instruction and schedules it to be executed. * Only now will the ScoreboardCheckStage realize a barrier is active and stall accordingly * The subsequent instruction executes This patch sets the wavefront status to be S_BARRIER in ScheduleStage instead of in the barrier instruction execution in order to have ScoreboardCheckStage realize that we're going to execute a barrier, preventing it from marking another instruciton as ready. Change-Id: Ib683e2c68f361d7ee60a3beaf53b4b6c888c9f8d Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/41573 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Reviewed-by: Alexandru Duțu <alexandru.dutu@amd.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-03-04 17:37:19 +00:00
Kyle Roarty	a9e0a1ccf1	gpu-compute: Explicitly set driver to nullptr in constructor We have a fail_if in attachDriver to prevent driver from being overwritten. However, the fail_if only checks for if the driver is not nullptr. Previously, in some cases driver was set to garbage, which made the fail_if trip the first time we were assigning the driver. This patch explicitly sets driver to nullptr in the constructor, thus ensuring that it will be nullptr the first time we call attachDriver Change-Id: I325f6033e785025a912e3af3888c66cee0332f40 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/41973 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-03-01 18:10:11 +00:00
Giacomo Travaglini	41928dac80	misc: Remove unused params() definitions Lots of times the params() helper has been defined but not used Change-Id: Id71829aca71341d46964d8f071099342b946b62f Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/41613 Tested-by: kokoro <noreply+kokoro@google.com>	2021-02-19 23:27:34 +00:00

1 2 3 4

191 Commits