derek/gem5 - gem5 - Gitea: Git with a cup of tea

Author	SHA1	Message	Date
Matthew Poremba	b64467025d	arch-vega: Implement SOP2 S_MUL_HI instructions Two new 32-bit signed and unsigned variants of S_MUL were added in gfx900 which operate similarly to S_MUL except that they shift the product by 32 bits after multiplication. Tested with Histogram HIP-Sample and b+tree in rodinia 3.0 HIP port. Change-Id: I1bed32b17ccda7aa47f3b59528eb3304912d3610 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/58473 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-04-11 17:06:43 +00:00
Matthew Poremba	e3f65393fd	dev-amdgpu,arch-vega: Implement TLB invalidation logic Add logic to collect pointers to all GPU TLBs in full system. Implement the invalidate TLBs PM4 packet. The invalidate is done functionally since there is really no benefit to simulating it with timing and there is no support in the TLB to do so. This allows applications with much larger data sets, which may reuse device memory pages, to work in gem5 without possibly crashing due to a stale translation being left over in the TLB. Change-Id: Ia30cce02154d482d8f75b2280409abb8f8375c24 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/58470 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-04-08 17:12:32 +00:00
Gabe Black	e6c0ba97db	scons: Put all config variables in an env['CONF'] sub-dict. This makes what are configuration and what are internal SCons variables explicit and separate, and makes it unnecessary to call out what variables to export to C++. These variables will also be plumbed into and out of kconfiglib in later changes. Change-Id: Iaf5e098d7404af06285c421dbdf8ef4171b3f001 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/56892 Reviewed-by: Andreas Sandberg <andreas.sandberg@arm.com> Maintainer: Gabe Black <gabe.black@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-03-28 20:31:21 +00:00
Matthew Poremba	dd90417211	arch-vega: Bypass Ruby for functional page walks Currently if a Ruby functional access fails to find an address in the caches, it gives up. For functional page table walks we need to be able to go all the way to memory. This adds a pointer to the system object which allows the walker to get a pointer to device memory which can be used to do a functional access directly to memory bypassing Ruby. Change-Id: I0ead6e5e130a0d53021c44ae9221b167c6316ab2 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/57529 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-03-25 19:51:29 +00:00
Matthew Poremba	9cbdf75295	dev-amdgpu: Add VM class for apertures, TranslationGens Create a VM class to reduce clutter in the amdgpu_device.* files. This new file is in charge of reading/writing MMIOs related to VM contexts and apertures. It also provides range checks for various apertures and breaks out the MMIO interface so that there are not overloaded macro definitions in the device MMIO methods. The new translation generator classes for the various apertures are also added to this class. Change-Id: Ic224c1aa485685685b1136a46eed50bcf99d2350 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/53066 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-03-24 14:59:57 +00:00
Matthew Poremba	f64f05eff6	arch-vega: Mark global instructions executed as global The executed_as field is currently not set for global memory instructions. This results in the default of SC_NONE, causing the status vector to be all zeros. The GM pipe sees this and completes the instruction immediately rather than issuing memory requests. This is fixed by marking the instruction as executed as SC_GLOBAL always. Flat instructions use resolvedFlatSegment for this, however since global instructions are known to be global we can set this field directly. This results in the expected issuing of memory requests to GPU memory. Change-Id: Ic23102853ccd49a41e2f083b7bb24f033dfed18a Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/57829 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-03-18 22:44:28 +00:00
Gabe Black	72d67e6426	arch-vega: Replace deprecated Stats namespace recently reintroduced. The deprecated "Stats" namespace was recently reintroduced to the vega TLB code. Replace it with the new statistics namespace. Change-Id: Ie5daf288176ce7e8aadd27b84a70baf4cbc72dff Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/57949 Reviewed-by: Matthew Poremba <matthew.poremba@amd.com> Maintainer: Matthew Poremba <matthew.poremba@amd.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-03-18 20:19:37 +00:00
Matthew Poremba	539a2e2bcd	arch-vega: Add VEGA page tables and TLB Add the page table walker, page table format, TLB, TLB coalescer, and associated support in the AMDGPUDevice. This page table format used the hardware format for dGPU and is very different from APU/GCN3 which use the X86 page table format. In order to support either format for the GPU model, a common TranslationState called GpuTranslation state is created which holds the combined fields of both the APU and Vega translation state. Similarly the TlbEntry is cast at runtime by the corresponding arch files as they are the only files which touch the internals of the TlbEntry. The GPU model only checks if a TlbEntry is non-null and thus does not need to cast to peek inside the data structure. Change-Id: I4484c66239b48df5224d61caa6e968e56eea38a5 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/51848 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-03-17 00:11:14 +00:00
Kyle Roarty	5e721db9a2	arch-vega: Handle signed offsets in Global/Scratch instructions The offset field in Flat-style instructions is treated differently based on if the instruction is Flat or Global/Scratch. In Flat insts, the offset is treated as a 12-bit unsigned number. In Global/Scratch insts, the offset is treated as a 13-bit signed number. This patch updates the calcAddr function for Flat-style instructions to properly sign-extend the offset on Global/Scratch instructions Change-Id: I57f10258c23d900da9bf6ded6717c6e8abd177b7 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/57209 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Matthew Poremba <matthew.poremba@amd.com>	2022-03-01 21:14:38 +00:00
Gabe Black	5df52e0dca	arch-x86: Overhaul how address size is handled, particularly for stack. The stack size is something that applies to addresses when performing accesses as part of some instructions. This was handled inconsistently or incompletely or simply incorrectly in a few ways. First, when pushing or popping from the stack, the address size should be set to the stack size. The data size is generally the operand size. When the stack pointer is incremented/decremented, it should be changed by the data size. When a stack pointer is manipulated, the data size for those calculations should be the stack size. Importantly that does not change the value of the increment/decrement, which is the operand size still. This usage has been fixed throughout. The TLB generally needs to know what the address size was in order to figure out what segment offset was used so that it can do limit checks. There is some inherent inaccuracy in doing things in reverse like this, but that's how it works currently. To find that size, the TLB tried to start from first principles to figure out what the default address size was, and then whether there was an override was passed in through the request flags. This is very inaccurate for a few reasons. First, the override doesn't always apply. Second, the address size used by a particular instruction doesn't have to be based on any particular size, whether that is the default or alternate address size, the stack size, etc. Instead, the instructions now pass the actual size being used in as a 2 bit value (0 -> 1 byte, 1 -> 2 bytes, 2 -> 4 bytes, 3 -> 8 bytes), avoiding most of the inaccuracy and approximation. Because the CPU won't embed any size information into fetches, we'll just assume those have no wrap around within the address size. Finally, there were microops that had been added which overrode the address size to be the stack size internally, and try to help the TLB figure out what to do to figure out the address size. Because both of those things are now handled in a different way, those microops are no longer needed or used and have been deleted. Change-Id: I2b1bdf1acf1540bf643fac6d49fe1a5a576ba5c1 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/55443 Tested-by: kokoro <noreply+kokoro@google.com> Maintainer: Gabe Black <gabe.black@gmail.com> Reviewed-by: Gabe Black <gabe.black@gmail.com>	2022-02-26 01:58:23 +00:00
Matthew Poremba	faf3730559	arch-vega: Fix global 64-bit calcAddr with SGPR base Global instruction address calculation when using an SGPR or SGPR pair as a base address was being calculated incorrectly when 64-bit addresses were to be generated. From the ISA documentation, the SGPR should be read as 32-bit or 64-bit depending on "ADDRESS_MODE." The VGPR-offset (computed from the lower 32-bits of vaddr) should always be 32-bits and the offset is 12 bits from the instruction. This means the 32-bit mask should only be applied to vaddr to get the VGPR-offset rather than the final sum. The SGPR base format is being seen in more recent clang/ROCm versions to avoid unnecessary copies of SGPRs into VGPRs to use VGPRs as the base address. Change-Id: I48910611fcfac5b62bc63496bbaabd6f6e53fe0d Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/55643 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-01-20 16:03:23 +00:00
Matthew Poremba	3ecd28a222	arch-vega: Update FLAT memory access helpers to support LDS This patch ports the changes from a similar patch for arch-gcn3: https://gem5-review.googlesource.com/c/public/gem5/+/48343. Vega already has a helper function to send to the correct pipe depending on the scope, however the initMem helpers currently always assume global scope. In addition the MUBUF WBINVL1 instructions are updated similarly to the GCN3 patch. Change-Id: I612b9198cb56e226721a90e72bba64395c84ebcd Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/55465 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-01-18 15:20:10 +00:00
Matthew Poremba	ff17ecc177	arch-vega: Fix MUBUF out-of-bounds case 1 Ported from https://gem5-review.googlesource.com/c/public/gem5/+/51127: This patch updates the out-of-bounds check to properly check against the correct buffer_offset, which is different depending on if the const_swizzle_enable is true or false. Change-Id: I9757226e62c587b679cab2a42f3616a5dca97e60 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/55464 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-01-18 15:20:10 +00:00
Matthew Poremba	0cb64ce9f0	arch-vega: Free dest registers in non-memory Load DS insts Ported from https://gem5-review.googlesource.com/c/public/gem5/+/48019: Certain DS insts are classified as Loads, but don't actually go through the memory pipeline. However, any instruction classified as a load marks its destination registers as free in the memory pipeline. Because these instructions didn't use the memory pipeline, they never freed their destination registers, which led to a deadlock. This patch explicitly calls the function used to free the destination registers in the execute() method of those Load instructions that don't use the memory pipeline. Change-Id: I8231217a79661ca6acc837b2ab4931b946049a1a Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/55463 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-01-17 23:55:51 +00:00
Matthew Poremba	d6bd91a9fd	arch-vega: Implement large ds_read/write instructions Port large DS read/write instructions from https://gem5-review.googlesource.com/c/public/gem5/+/48342. This implements the 96 and 128b ds_read/write instructions in a similar fashion to the 3 and 4 dword flat_load/store instructions. These instructions are treated as reads/writes of 3 or 4 dwords, instead of as a single 96b/128b memory transaction, due to the limitations of the VecOperand class used in the amdgpu code. In order to handle treating the memory transaction as multiple dwords, the patch also adds in new initMemRead/initMemWrite functions for ds instructions. These are similar to the functions used in flat instructions for the same purpose. Change-Id: Iee2de14eb7f32b6654799d53dc97d806288af98f Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/55344 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-01-11 16:58:09 +00:00
Matthew Poremba	5a94e73d00	arch-vega: Validate if scalar sources are scalar gprs Port the fixes for scalar source checks from arch-gcn3 at https://gem5-review.googlesource.com/c/public/gem5/+/48344. Scalar sources can either be a general-purpose register or a constant register that holds a single value. If we don't check for if the register is a general-purpose register, it's possible that we get a constant register, which then causes all of the register mapping code to break, as the constant registers aren't supposed to be mapped like the general-purpose registers are. This fix adds an isScalarReg check to the instruction encodings that were missing it. Change-Id: I30dd2d082a5a1dcc3075843bcefd325113ed1df6 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/55343 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-01-11 16:58:09 +00:00
Kyle Roarty	f9deeea427	arch-gcn3,arch-vega: Select proper data on misaligned access req1->getSize() returns the size in bytes, but because we're using it in an array index, we need to scale it by the size of the data type. This ensures we give the second request the proper data. Change-Id: I578665406762d5d0c95f2ea8297c362e1cc0620b Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/54503 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Matthew Poremba <matthew.poremba@amd.com>	2021-12-20 18:28:08 +00:00
Matthew Poremba	9313294efe	misc: Remove AMD license addition Remove the line "For use for simulation and test purposes only" in files where AMD is the only copyright holder listed in the header. This happens to be the case for all files where this line exists, removing it completely from gem5. Change-Id: I623f266b002f564301b28774f49081099cfc60fd Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/53943 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-12-11 04:00:56 +00:00
Matthew Poremba	0ae1a9d109	arch-vega: Implement S_SLEEP This is merely copied from arch-gcn3. Change-Id: Ibd2bda37fe9adc083a35efab0f59617d386019b9 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/53883 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-12-11 03:59:19 +00:00
Matthew Poremba	23aec13f07	arch-vega: Implement V_AND_OR_B32 Change-Id: I8daeb8de2db5996e132cf7ed729f02c3c94a6862 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/53868 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-12-11 03:59:19 +00:00
Matthew Poremba	f1e3fa7a3e	arch-vega: Implement V_ADD3_U32 Change-Id: I4d01265f946e289cbff56090c2dd193ea66d5c70 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/53867 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-12-11 03:59:19 +00:00
Matthew Poremba	2abc51e810	arch-vega: Implement V_ADD_LSHL_U32 Change-Id: Ia4e465ef2534fe28dc846f728b2e1da3dfe4f7d6 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/53866 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-12-11 03:59:19 +00:00
Matthew Poremba	04d025806d	arch-vega: Implement V_LSHL_ADD_U32 Change-Id: I986f82e8c6c02b0d62e55fbaed1c3f9e5b2b4a43 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/53865 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-12-11 03:59:19 +00:00
Matthew Poremba	19788d3b56	arch-vega: Implement V_LSHL_OR_B32 Change-Id: I237410e05df9a96323a6ceb7d09ae2a2a8608f16 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/53864 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-12-11 03:59:19 +00:00
Matthew Poremba	b5d8d9ddcc	arch-vega: Implement V_OR3_B32 Change-Id: Id6c074033b08058b739e056f06b40ee5735f8f00 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/53863 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-12-11 03:59:19 +00:00
Matthew Poremba	c028af111a	arch-gcn3,gpu-compute: Move TLB to common folder in amdgpu This TLB is more of an "APU" TLB than anything GCN3 specific. It can be used with either GCN3 or Vega. With this change, VEGA_X86 builds and one can run binaries with Vega ISA code using the same steps as GCN3 but building the Vega ISA instead. Change-Id: I0c92bcd0379a18628dc05cb5af070bdc7e692c7c Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/53803 Maintainer: Bobby Bruce <bbruce@ucdavis.edu> Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-12-09 17:26:15 +00:00
Gabe Black	1c233ee9d2	scons: Add sim_object and enums arguments to SimObject(). This will explicitly declare what SimObject and Enum types need to be set up in C++, which will make importing all the SimObject modules during the setup phase of SCons unnecessary. Change-Id: Id2d7603daf33b236ceaa0789e2f089f589d34e62 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/49406 Reviewed-by: Gabe Black <gabe.black@gmail.com> Maintainer: Gabe Black <gabe.black@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-12-08 08:01:23 +00:00
Gabe Black	f315461bb7	arch,cpu: Stop using and remove ThreadContext::instAddr. Change-Id: I9cd8077fd72a9d7bff20f1bd7ba37e4e038b8fac Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/52062 Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Daniel Carvalho <odanrc@yahoo.com.br> Maintainer: Gabe Black <gabe.black@gmail.com>	2021-11-30 23:30:06 +00:00
Kyle Roarty	223cd52431	arch-gcn3,arch-vega: Don't write exec in v_cmp_f_i32 Per the GCN3 and VEGA ISAs, v_cmpx_* writes exec, while v_cmp_* doesn't. This removes the erroneous exec write in the VOP3 implementation of v_cmp_f_i32. Change-Id: I048e35917163c45b879f38d31a88f3f3d56c0baf Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/52445 Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Reviewed-by: Matthew Poremba <matthew.poremba@amd.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-11-05 19:52:40 +00:00
Matthew Poremba	3112a7f0d0	arch-gcn3,gpu-compute: Move GCN3 specific TLB to arch Move GpuTLB and TLBCoalescer to GCN3 as the TLB format is specific to GCN3 and SE mode / APU simulation. Vega will have its own TLB, coalescer, and walker suitable for a dGPU. This also adds a using alias for the TLB translation state to reduce the number of references to TheISA and X86ISA. X86 specific includes are also removed. Change-Id: I34448bb4e5ddb9980b34a55bc717bbcea0e03db5 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/49847 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-10-04 23:47:03 +00:00
Matthew Poremba	c15e472199	arch-vega: Rework flat instructions to support global Global instructions are new in Vega and are essentially FLAT instructions from GCN3 but guaranteed to go to global memory whereas flat can go to global or local memory. This reworks the flat instruction classes so that the initiateAcc / execute / completeAcc logic can be reused for flat, global, and later scratch subtypes of flat instructions. The decoder creates a flat instruction class which sets instruction flags based on the flat instruction's SEG field. There are new initOperandInfo and generateDissasmbly methods for flat and global. The number of operands and operand index getters are modified to check the flags and return the correct value for the subtype. Change-Id: I1db4a3742aeec62424189e54c38c59d6b1a8d3c1 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/47106 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Reviewed-by: Kyle Roarty <kyleroarty1716@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-10-04 22:51:37 +00:00
Kyle Roarty	008659bee1	arch-gcn3: Fix MUBUF out-of-bounds case 1 This patch updates the out-of-bounds check to properly check against the correct buffer_offset, which is different depending on if the const_swizzle_enable is true or false. Change-Id: I5c687c09ee7f8e446618084b8545b74a84211d4d Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/51127 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Reviewed-by: Matthew Poremba <matthew.poremba@amd.com> Reviewed-by: Alex Dutu <alexandru.dutu@amd.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-09-30 20:22:35 +00:00
Matthew Poremba	5a884fdab5	arch-vega: Fix VEGA_X86 build issues The registerManager was not being dereferenced properly. Also remove non-existent include file. Change-Id: I5dac692abedc327ed83ee904e4c6ac5dac811e4c Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/47105 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-09-27 22:30:30 +00:00
Matthew Poremba	e51e698b74	arch-vega: Update instruction stats These stats were moved to a Stats::Group but the instructions were not updated to use the stats struct. Change-Id: I49348e30bc0988a2a873f51bd7079c1f315649b4 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/47104 Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com>	2021-09-27 22:30:30 +00:00
Matthew Poremba	16de253c15	arch-vega: Add missing functions referenced by insts Some instructions were referencing pc() and isExecMaskRegister() which were not defined. Change-Id: Ic5b3fa9057950ff85603fcb87447a81b6c7f274b Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/47103 Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com>	2021-09-27 22:30:30 +00:00
Matthew Poremba	dca86ddb8d	arch-vega: Issue flat insts using on executedAs() Similar to the flags issue in the previous patch, the FlatGlobal flag does not exist. Change all of the flat instructions to use the same issue logic as GCN3. A helper function is also added as loads and stores use the same interface. The helper function can be more easily updated to support global and scratch subtypes of flat instructions. Change-Id: I394f1d4c59b029201fe2f6075c9dedb3a37dbe31 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/50827 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Reviewed-by: Kyle Roarty <kyleroarty1716@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-09-27 22:30:30 +00:00
Matthew Poremba	753a2c8aac	arch-vega: Update Vega instruction flags The instructions file seems to be assuming a newer pipeline which is not released. The flags are therefore not set in Vega as the newer pipeline infers them. This adds back flags for MemoryRef instructions, fixes waitcnt and removes CondBranch which was not checked and changed to Branch. This also removes unused Cac flags and fixes the casing for ReadsEXEC and WritesEXEC. The remaining flags are not used at all by the pipeline and are removed to avoid confusion as to whether these are needed for GCN3 or not. Change-Id: I976cbd407a466e8ad77c84dbdc29082f49e28f3b Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/47102 Reviewed-by: Kyle Roarty <kyleroarty1716@gmail.com> Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matthew Poremba <matthew.poremba@amd.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-09-27 22:30:30 +00:00
Gabe Black	64168fd4ea	scons: Turn the ISA and GPU ISA lists into construction variables. Change-Id: I4135709f5bceee959b5178a4700656aa782b1d6b Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/48965 Maintainer: Gabe Black <gabe.black@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Bobby R. Bruce <bbruce@ucdavis.edu>	2021-08-07 03:12:56 +00:00
Gabe Black	c8123df754	arch-gcn3: Fix initAtomicAccess. This function used makeAtomicOpFunctor to create a unique_ptr which pointed to an AtomicOpFunctor *, which it immediately extracted with .get(). Then since the temporary unique_ptr went out of scope, it deleted the AtomicOpFunctor which it just returned a pointer to. Instead, that function should create a local unique_ptr to pass ownership of the object off to. It will still be cleaned up when it goes out of scope, but not before it's done being used. Change-Id: I74a0bcbb719a78a3e9ec8cb2ea5aa15120da0456 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/49023 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Reviewed-by: Bobby R. Bruce <bbruce@ucdavis.edu> Reviewed-by: Kyle Roarty <kyleroarty1716@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-08-06 14:05:30 +00:00
Bobby R. Bruce	76ceda55f7	misc: Merge branch 'release-staging-v21-1' into develop Change-Id: I0f69d3d0863f77c02ac8089fb4dccee3aa70a4ea	2021-07-28 17:37:04 -07:00
Gabe Black	b3b81196aa	misc: Replace type_traits.hh XX::value with XX_v. Now that we're using c++17, the type_traits with a ::value member have a _v alias which reduces verbosity. Or in other words std::is_integral<T>::value can be replaced with std::is_integral_v<T> Make this substitution throughout the code base. In places where gem5 introduced its own similar templates, add a V alias, spelled differently to match gem5's internal style. gem5::IsVarArgs<T>::value => gem5::IsVarArgsV<T> Change-Id: I1d84ffc4a236ad699471569e7916ec17fe5f109a Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/48604 Reviewed-by: Daniel Carvalho <odanrc@yahoo.com.br> Maintainer: Bobby R. Bruce <bbruce@ucdavis.edu> Tested-by: kokoro <noreply+kokoro@google.com>	2021-07-28 01:48:03 +00:00
Kyle Roarty	1577897265	arch-gcn3: Validate if scalar sources are scalar gprs Scalar sources can either be a general-purpose register or a constant register that holds a single value. If we don't check for if the register is a general-purpose register, it's possible that we get a constant register, which then causes all of the register mapping code to break, as the constant registers aren't supposed to be mapped like the general-purpose registers are. This fix adds an isScalarReg check to the instruction encodings that were missing it. Change-Id: I3d7d5393aa324737301c3269cc227b60e8a159e4 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/48344 Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Reviewed-by: Bobby R. Bruce <bbruce@ucdavis.edu> Reviewed-by: Matthew Poremba <matthew.poremba@amd.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com>	2021-07-26 18:36:24 +00:00
Kyle Roarty	9a7fc4ff69	arch-gcn3: Implement LDS accesses in Flat instructions Add support for LDS accesses by allowing Flat instructions to dispatch into the local memory pipeline if the requested address is in the group aperture. This requires implementing LDS accesses in the Flat initMemRead/Write functions, in a similar fashion to the DS functions of the same name. Because we now can potentially dispatch to the local memory pipeline, this change also adds a check to regain any tokens we requested as a flat instruction. Change-Id: Id26191f7ee43291a5e5ca5f39af06af981ec23ab Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/48343 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Reviewed-by: Matthew Poremba <matthew.poremba@amd.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-07-26 18:36:16 +00:00
Bobby R. Bruce	c0a3c70304	misc: Merge branch 'release-staging-v21-1' into develop Change-Id: I6ba57d7f70be70ae43fab396780d18623679a59a	2021-07-26 09:48:25 -07:00
Kyle Roarty	523a92f7f0	arch-gcn3: Implement large ds_read/write instructions This implements the 96 and 128b ds_read/write instructions in a similar fashion to the 3 and 4 dword flat_load/store instructions. These instructions are treated as reads/writes of 3 or 4 dwords, instead of as a single 96b/128b memory transaction, due to the limitations of the VecOperand class used in the amdgpu code. In order to handle treating the memory transaction as multiple dwords, the patch also adds in new initMemRead/initMemWrite functions for ds instructions. These are similar to the functions used in flat instructions for the same purpose. Change-Id: I0f2ba3cb7cf040abb876e6eae55a6d38149ee960 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/48342 Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Alex Dutu <alexandru.dutu@amd.com> Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com>	2021-07-24 17:27:02 +00:00
Kyle Roarty	46e62e5eb3	arch-gcn3: Free dest registers in non-memory Load DS insts Certain DS insts are classified as Loads, but don't actually go through the memory pipeline. However, any instruction classified as a load marks its destination registers as free in the memory pipeline. Because these instructions didn't use the memory pipeline, they never freed their destination registers, which led to a deadlock. This patch explicitly calls the function used to free the destination registers in the execute() method of those Load instructions that don't use the memory pipeline. Change-Id: Ic2ac2e232c8fbad63d0c62c1862f2bdaeaba4edf Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/48019 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Reviewed-by: Bobby R. Bruce <bbruce@ucdavis.edu> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-07-14 20:47:27 +00:00
Kyle Roarty	5820818c11	arch-vega: Add fatal when decoding missing insts Certain instructions don't have implementations in instructions.cc, and get decoded as a nullptr. This adds a fatal when decoding a missing instruction, as we aren't able to properly run a program if all its instructions aren't implemented, and it allows us to figure out which instruction is missing due to fatals printing the line they were called. Change-Id: I7e3690f079b790dceee102063773d5fbbc8619f1 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/47522 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-07-13 01:26:39 +00:00
Kyle Roarty	06da510020	arch-vega: Add decoding for implemented insts Certain instructions were implemented in instructions.cc, but weren't actually being decoded by the decoder, causing the decoder to return nullptr for valid instructions. This patch fixes the decoder to return the proper instruction class for implemented instructions Change-Id: I8d8525a1c435147017cb38d9df8e1675986ef04b Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/47521 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Reviewed-by: Matthew Poremba <matthew.poremba@amd.com> Reviewed-by: Alex Dutu <alexandru.dutu@amd.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-07-09 03:49:29 +00:00
Kyle Roarty	9fe9d83e5b	arch-vega: Add missing return to flat_load_dwordx4 Change-Id: Ibf56c25a3d22d3c12ae2c1bb11f00f4a44b5919a Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/47520 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Reviewed-by: Matthew Poremba <matthew.poremba@amd.com> Reviewed-by: Alex Dutu <alexandru.dutu@amd.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-07-09 03:49:29 +00:00
Kyle Roarty	cb73fe1959	arch-vega: Fix s_endpgm instruction Copy over changes that had been made to s_endpgm in GCN3 but weren't added to the Vega implementation Change-Id: I1063f83b1ce8f7c5e451c8c227265715c8f725b9 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/47519 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Reviewed-by: Matthew Poremba <matthew.poremba@amd.com> Reviewed-by: Alex Dutu <alexandru.dutu@amd.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-07-09 03:49:29 +00:00

75 Commits
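
Two arithmetic details described in the log lend themselves to a short sketch: the S_MUL_HI commit (b64467025d) keeps the upper 32 bits of a 64-bit product, and the Global/Scratch offset commit (5e721db9a2) treats the offset as a 13-bit signed number instead of a 12-bit unsigned one. The C++ below is only an illustration under those descriptions. The function names mulHiU32, mulHiI32, flatOffset, and globalOffset are hypothetical and are not taken from the gem5 sources, and the actual gem5 code may be structured quite differently.

```cpp
#include <cstdint>

// Hypothetical helpers for illustration only; not gem5 code.

// Unsigned high multiply: widen both operands to 64 bits, multiply,
// then shift the product right by 32 to keep the upper half.
constexpr uint32_t mulHiU32(uint32_t a, uint32_t b)
{
    return static_cast<uint32_t>(
        (static_cast<uint64_t>(a) * static_cast<uint64_t>(b)) >> 32);
}

// Signed high multiply: same idea with a signed 64-bit product.
// Note: right-shifting a negative value is implementation-defined
// before C++20; mainstream compilers perform an arithmetic shift.
constexpr int32_t mulHiI32(int32_t a, int32_t b)
{
    return static_cast<int32_t>(
        (static_cast<int64_t>(a) * static_cast<int64_t>(b)) >> 32);
}

// Flat instructions: the offset is a 12-bit unsigned number.
constexpr int32_t flatOffset(uint32_t raw)
{
    return static_cast<int32_t>(raw & 0xFFFu);
}

// Global/Scratch instructions: the offset is a 13-bit signed number.
// Flipping the sign bit (bit 12) and subtracting its weight
// sign-extends the field without relying on shift behavior.
constexpr int32_t globalOffset(uint32_t raw)
{
    const uint32_t field = raw & 0x1FFFu;
    return static_cast<int32_t>(field ^ 0x1000u) - 0x1000;
}
```

With these helpers, the same raw bit pattern 0x1FFF yields an offset of 4095 when masked as a Flat offset but -1 when read as a Global/Scratch offset, which is the distinction the commit message draws.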