derek/gem5 - gem5 - Gitea: Git with a cup of tea

derek/gem5

Author	SHA1	Message	Date
Matthew Poremba	7aa3c4c638	gpu-compute: Add missing override in render driver This fixes the build error in the clang-11 compiler check for GCN3_X86. Change-Id: I2245589182b80811b8bc07409196adca98899213 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/47479 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-07-02 00:50:28 +00:00
Daniel R. Carvalho	00cd307b13	mem-garnet: Add a garnet namespace Add a namespace encapsulating all garnet files. GarnetSyntheticTraffic, from cpu/testers/garnet_synthetic_traffic/GarnetSyntheticTraffic.hh has not been added to this namespace. Change-Id: I5304ad3130100ba325e35e20883ee9286f51a75a Issued-on: https://gem5.atlassian.net/browse/GEM5-987 Signed-off-by: Daniel R. Carvalho <odanrc@yahoo.com.br> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/47306 Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Srikant Bharadwaj <srikant.bharadwaj@amd.com> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Srikant Bharadwaj <srikant.bharadwaj@amd.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com>	2021-07-01 19:08:24 +00:00
Daniel R. Carvalho	974a47dfb9	misc: Adopt the gem5 namespace Apply the gem5 namespace to the codebase. Some anonymous namespaces could theoretically be removed, but since this change's main goal was to keep conflicts at a minimum, it was decided not to modify much the general shape of the files. A few missing comments of the form "// namespace X" that occurred before the newly added "} // namespace gem5" have been added for consistency. std out should not be included in the gem5 namespace, so they weren't. ProtoMessage has not been included in the gem5 namespace, since I'm not familiar with how proto works. Regarding the SystemC files, although they belong to gem5, they actually perform integration between gem5 and SystemC; therefore, it deserved its own separate namespace. Files that are automatically generated have been included in the gem5 namespace. The .isa files currently are limited to a single namespace. This limitation should be later removed to make it easier to accomodate a better API. Regarding the files in util, gem5:: was prepended where suitable. Notice that this patch was tested as much as possible given that most of these were already not previously compiling. Change-Id: Ia53d404ec79c46edaa98f654e23bc3b0e179fe2d Signed-off-by: Daniel R. Carvalho <odanrc@yahoo.com.br> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/46323 Maintainer: Bobby R. Bruce <bbruce@ucdavis.edu> Reviewed-by: Bobby R. Bruce <bbruce@ucdavis.edu> Reviewed-by: Matthew Poremba <matthew.poremba@amd.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-07-01 19:08:24 +00:00
Matthew Poremba	d4904b3b89	arch-gcn3: Remove unused files These files are not used but are still popping up in style checks, such as the new python Black checks. Removing these to reduce maintenance overhead for GCN3. Change-Id: I8d78c8246c29637958a8af99c4a9eb6bb8e23e3d Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/47419 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-07-01 13:52:52 +00:00
Daniel R. Carvalho	24041d6b77	mem-cache: Add Signature-Based Hit Predictor replacement policy Add the SHiP Replacement Policy, as described in "SHiP: Signature- based Hit Predictor for High Performance Caching", by Wu et al. Instruction Sequence signatures have not been implemented. Change-Id: I44f00d26eab4c96c9c5bc29740862a87356d30d1 Signed-off-by: Daniel R. Carvalho <odanrc@yahoo.com.br> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/38118 Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com>	2021-07-01 11:21:30 +00:00
Daniel R. Carvalho	4a3e99fbcb	mem-cache: Allow sending packet information to replacement policy Some replacement policies can use information such as address or PC to improve their re-reference prediction. Change-Id: I412eee09efa2f3511ca1ece76fc2732509df4745 Signed-off-by: Daniel R. Carvalho <odanrc@yahoo.com.br> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/38117 Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com>	2021-07-01 11:21:30 +00:00
Daniel R. Carvalho	ef99dc8310	mem-cache: Use PacketPtr in tags's accessBlock Pass the packet to the tags, so that the replacement policies more execution information. Change-Id: I201884a6d60e3299fc3c9befebbb2e8b64a007f0 Signed-off-by: Daniel R. Carvalho <odanrc@yahoo.com.br> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/38116 Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Nikos Nikoleris <nikos.nikoleris@arm.com> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Nikos Nikoleris <nikos.nikoleris@arm.com>	2021-07-01 11:21:30 +00:00
Daniel R. Carvalho	2911839c22	mem-cache: Make WeightedLRU inherit from LRU WeightedLRU adds occupancy information to LRU, so remove the duplicated code. Change-Id: Ifec19ea59fb411a5ed7a891e8957b1ab93cdbf05 Signed-off-by: Daniel R. Carvalho <odanrc@yahoo.com.br> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/47399 Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Nikos Nikoleris <nikos.nikoleris@arm.com>	2021-07-01 11:21:30 +00:00
Sandipan Das	b91cfc14bd	base: Add byte order attribute for object files This adds byte order as an attribute for object files by introducing new members to the ObjectFile class. This is populated by the looking at the ELF headers. Change-Id: Ibe55699175cc0295e0c9d49bdbe02e580988bc4f Signed-off-by: Sandipan Das <sandipan@linux.ibm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/40939 Reviewed-by: Daniel Carvalho <odanrc@yahoo.com.br> Maintainer: Daniel Carvalho <odanrc@yahoo.com.br> Tested-by: kokoro <noreply+kokoro@google.com>	2021-07-01 01:25:23 +00:00
Sandipan Das	a947b45fcf	arch-power: Refactor argument registers This reintroduces the argument register constants that were removed in commit `7bb456f02` ("arch-power: Delete unused register related constants"), adds a definition for the sixth argument register and switches to these constants to specify the arguments used by the system call ABI. Change-Id: I5804f4d2b27a04d0e7b69132e5abce5761b239f5 Signed-off-by: Sandipan Das <sandipan@linux.ibm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/40938 Reviewed-by: Boris Shingarov <shingarov@labware.com> Maintainer: Boris Shingarov <shingarov@labware.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-07-01 01:23:48 +00:00
Sandipan Das	01c9f90e6f	arch-power: Add time base instructions This models a pseudo time base using the simulator ticks and adds the following instructions. * Move From Time Base (mftb) * Move From Time Base Upper (mftbu) Change-Id: Idb619ec3179b2a85925998282075bde8651c68c2 Signed-off-by: Sandipan Das <sandipan@linux.ibm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/40937 Reviewed-by: Boris Shingarov <shingarov@labware.com> Maintainer: Boris Shingarov <shingarov@labware.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-07-01 01:22:33 +00:00
Sandipan Das	09ec394844	arch-power: Add move condition field instructions This adds the following instructions. * Move to CR from XER Extended (mcrxrx) * Move To One Condition Register Field (mtocrf) * Move From One Condition Register Field (mfocrf) Change-Id: I5014160d77b1b759c1cb8cba34e6dd20eb2b5205 Signed-off-by: Sandipan Das <sandipan@linux.ibm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/40936 Reviewed-by: Boris Shingarov <shingarov@labware.com> Maintainer: Boris Shingarov <shingarov@labware.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-07-01 01:22:33 +00:00
Sandipan Das	c8207e2286	arch-power: Fix move condition field instructions This introduces the S field for X form instructions which is used to specify signed versus unsigned comparison. The Power ISA does not specify a formal name for the third 1-bit opcode field required for decoding XFX form move to and from CR field instructions, the S field can be used to achieve the same as it has the same span and position. This fixes the following instructions. * Move To Condition Register Fields (mtcrf) * Move From Condition Register (mfcr) Change-Id: I8d291f707cd063781f0497f7226bebfc47bd9e63 Signed-off-by: Sandipan Das <sandipan@linux.ibm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/40935 Reviewed-by: Boris Shingarov <shingarov@labware.com> Maintainer: Boris Shingarov <shingarov@labware.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-07-01 01:22:33 +00:00
Sandipan Das	57181ee613	arch-power: Add trap instructions This introduces new classes and new formats for D and X form instructions, the TO field that is used to encode the trap conditions and adds the following instructions. * Trap Word Immediate (twi) * Trap Word (tw) * Trap Doubleword Immediate (tdi) * Trap Doubleword (td) Change-Id: I029147ef643c2ee6794426e5e90af4d75f22e92e Signed-off-by: Sandipan Das <sandipan@linux.ibm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/40934 Reviewed-by: Boris Shingarov <shingarov@labware.com> Maintainer: Boris Shingarov <shingarov@labware.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-06-30 22:55:02 +00:00
Bobby Bruce	84837422d8	util-docker: Deprecate 18.04-min_dependencies for 20.04-min While we still support Ubuntu 18.04, we will focus testing more-so on 20.04. Therefore, our min-dependency target will now be 20.04 intead of 18.04. Change-Id: Ib136480e5f9953d216bd5ffc6f0ae3faa1bf2e82 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/47341 Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com>	2021-06-30 16:55:25 +00:00
Bobby Bruce	2d86e88819	tests: Update the documentation in compiler-tests.sh This was out-of-date and has been updated in this commit. Change-Id: I18519bb2111dae4d2f86806618115153fa8d5372 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/47339 Reviewed-by: Bobby R. Bruce <bbruce@ucdavis.edu> Reviewed-by: Daniel Carvalho <odanrc@yahoo.com.br> Maintainer: Bobby R. Bruce <bbruce@ucdavis.edu> Tested-by: kokoro <noreply+kokoro@google.com>	2021-06-30 16:55:25 +00:00
Kyle Roarty	c96f43d83e	gpu-compute: Initialize GPUDriver member variables before use A few member variables weren't initialized, but we were assuming that they were 0 when first read. This explicitly sets those variables to 0. Change-Id: I2c840d361ed3a7d306e22dc7561a3870f1ef94a1 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/46248 Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com>	2021-06-30 16:47:43 +00:00
Kyle Roarty	b1df141bed	gpu-compute: Change certain IOCTL errors to warnings There are certain IOCTL errors that were triggering with the change to ROCm 4, however they could be set to warnings without causing any errors in the program Change-Id: Ie0052267f3ccfbdbadb90249b6f19e6a1205f57e Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/46247 Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Matthew Poremba <matthew.poremba@amd.com> Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com>	2021-06-30 16:47:43 +00:00
Kyle Roarty	5fad68f576	dev-hsa,gpu-compute: IOCTL updates for ROCm 4 This change copies over the up-to-date kfd_ioctl.h file from the linux kernel, and updates the gpu_compute_driver to reflect the changes found in the new version of the kfd_ioctl.h file Change-Id: I51e8e7158762f4b7e06c0f84507e5889a17939a2 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/46246 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-06-30 16:47:43 +00:00
Kyle Roarty	f2a029058a	gpu-compute: Ignore GPU kernel names ROCm 4 seems to have updated the akc, and the only real issue that has occured is that we're no longer able to read kernel names in the same way as we were in ROCm 1.6. This patch removes the prior method of reading kernel names and gives all kernels a temporary name Change-Id: I0040e0cf4cd35d6f56ded6a8acfb10c600bcc77a Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/46245 Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Reviewed-by: Matthew Poremba <matthew.poremba@amd.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com>	2021-06-30 16:47:43 +00:00
Kyle Roarty	a71801b9a0	configs,gpu-compute: Add render driver needed for ROCm 4 ROCm 4 utilizes the render driver located at /dev/dri/renderDXXX. This patch implements a very simple driver that just returns a file descriptor when opened, as testing has shown that's all that's needed Change-Id: I65602346cbf17b2dc80e114046ebf5c9830a1507 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/46244 Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com>	2021-06-30 16:47:43 +00:00
Kyle Roarty	bcd12f301d	arch-x86,sim: Implement sched_getaffinity sched_getaffinity is different from other syscalls in the raw syscall return the size of the cpumask being used to represent the CPU bit mask. Because of this, when a library (libnuma in this case) directly called sched_getaffinity and got a return value of 0, it errored out, thinking that there were no CPUs available. This implementation assumes that all CPUs are available, so it sets all simulated CPUs in the bitmask Change-Id: Id95c919986cc98a411877056256604f57a29f0f9 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/46243 Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com>	2021-06-30 16:47:43 +00:00
Kyle Roarty	2249e4ca71	arch-x86: build with getdents64 if system supports it This patch makes it so the getdents64 syscall is built in gem5 if the underlying host implements the syscall, similar to how the getdents syscall is implemented. The implementation for getdents64 already existed Change-Id: I73b22c8df8df994f3f720e848a7d4f8cd31d318e Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/46242 Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Reviewed-by: Matthew Poremba <matthew.poremba@amd.com> Reviewed-by: Alex Dutu <alexandru.dutu@amd.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com>	2021-06-30 16:47:43 +00:00
Kyle Roarty	f40c07c8c6	arch-x86: Ignore certain syscalls called in ROCm 4 fdatasync, sigaltstack, and prctl are called by the ROCm 4 stack, but were unimplemented. Based on testing, we can change these to ignoreFunc without affecting program correctness. sched_yield gets changed to ignoreWarnOnceFunc, as it gets called significantly more in ROCm 4. Change-Id: I566b1d71d989c54bfc559d5b83790dff73a38b28 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/46241 Tested-by: kokoro <noreply+kokoro@google.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Reviewed-by: Matthew Poremba <matthew.poremba@amd.com>	2021-06-30 16:47:43 +00:00
Kyle Roarty	466e2018a2	configs: Add mem_banks to Carrizo topology ROCm 4 iterates through the mem_banks to find an appropriate place to allocate memory. Previously, Carrizo didn't have any mem_banks, which resulted in the ROCm 4 runtime erroring out, as it didn't know where to allocate memory. The implementation is fairly similar to the implementation used for the Fiji or Vega configs Change-Id: I5bb4e89657d44c6cb690fd224ee1bf1d4d6cf2a5 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/46240 Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Matthew Poremba <matthew.poremba@amd.com> Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Reviewed-by: Bobby R. Bruce <bbruce@ucdavis.edu> Maintainer: Matt Sinclair <mattdsinclair@gmail.com>	2021-06-30 16:47:43 +00:00
Sandipan Das	bc52e3d6c9	configs: Fix waiting on remote debugger Commit `2c75e58cac` ("sim,cpu: Move the remote GDB stub into the workload.") moved "wait_for_remote_gdb" to the Workload class. That breaks se.py since it continues to rely on that being a property of BaseCPU. This ensures that the property is now set via the current Workload instance instead. Also, owing to its boolean nature, the argument should ideally not expect any additional values. Hence, it is associated with the "store_true" action. Change-Id: I4a00b29d283df36ebf833c9125651cd6deb52a4f Signed-off-by: Sandipan Das <sandipan@linux.ibm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/47360 Reviewed-by: Boris Shingarov <shingarov@labware.com> Maintainer: Boris Shingarov <shingarov@labware.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-06-30 16:30:59 +00:00
Daniel R. Carvalho	041134e8d9	mem-cache: Change invalidate signature to not const Allow the replacement policy to be modified when an entry is invalidated. Change-Id: I7f5086795dbb93a6fab2b4994c757d509d782d79 Signed-off-by: Daniel R. Carvalho <odanrc@yahoo.com.br> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/38115 Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Nikos Nikoleris <nikos.nikoleris@arm.com> Maintainer: Nikos Nikoleris <nikos.nikoleris@arm.com>	2021-06-30 02:11:56 +00:00
Daniel R. Carvalho	3f040bab87	mem-cache: Add the DRRIP replacement policy Instantiate the Dynamic Re-Reference Interval Prediction, as defined in "High Performance Cache Replacement Using Re-Reference Interval Prediction (RRIP)", by Jaleel et al. Change-Id: Id1d354c01e63ae49739263647ff25e5665f60d8c Signed-off-by: Daniel R. Carvalho <odanrc@yahoo.com.br> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/37898 Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Bobby R. Bruce <bbruce@ucdavis.edu> Maintainer: Bobby R. Bruce <bbruce@ucdavis.edu>	2021-06-30 02:11:56 +00:00
Daniel R. Carvalho	fd79a761ad	mem-cache: Implement a dueling Replacement Policy Implement a template dueling replacement policy which monitors two replacement policies to decide and select the one that provides the least amount of misses. Change-Id: I6a6e96a9388cce8f8c8cd7b9c1dbe9f0554ccc64 Signed-off-by: Daniel R. Carvalho <odanrc@yahoo.com.br> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/37897 Tested-by: kokoro <noreply+kokoro@google.com>	2021-06-30 02:11:56 +00:00
Daniel R. Carvalho	df5b191609	mem-cache: Creation of dueling classes Table policies (i.e., replacement, compression, etc) behave differently depending on the workload, and it is often desired to be able to selectively switch between them. In this case the relevant metadata for all the policies must be added to all of the entries being analyzed. In order to avoid having to monitor all table entries, a few of these entries are selected to be sampled and estimate overall behavior. These sampled entries belong each to a single policy. Then, based on the predominance of these samples, the winning policy is applied to the other sets (followers). As of now, in order to avoid having to iterate over a vector, there is a limited number of dueling instances, but it may be easily extended, if needed. Based on Set Dueling, proposed in "Adaptive Insertion Policies for High Performance Caching". Change-Id: I692a3e5e0ad98581d68167ad7e6b45ab2f4c7b10 Signed-off-by: Daniel R. Carvalho <odanrc@yahoo.com.br> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/37895 Tested-by: kokoro <noreply+kokoro@google.com> Maintainer: Bobby R. Bruce <bbruce@ucdavis.edu> Reviewed-by: Bobby R. Bruce <bbruce@ucdavis.edu>	2021-06-30 02:11:56 +00:00
Kyle Roarty	858727a341	util: Update GCN Dockerfile for ROCm 4 This now installs ROCm 4 from source instead of ROCm 1.6. Change-Id: I380ca06e93d48475e93d18f69eb97756186772ab Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/46239 Reviewed-by: Matthew Poremba <matthew.poremba@amd.com> Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-06-29 23:58:03 +00:00
Bobby R. Bruce	32b4a8cd36	mem: Fix use-after-free bug mem_pkt was deleted (via `delete respQueue.front()`) then used in the following if statement (at `mem_pkt->isDram()`). This patch fixes this issue. Issue-on: https://gem5.atlassian.net/browse/GEM5-1009 Change-Id: Iac3b9078ce5acbdd87a0737a2c98ad887459661f Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/47239 Reviewed-by: Daniel Carvalho <odanrc@yahoo.com.br> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Nikos Nikoleris <nikos.nikoleris@arm.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Nikos Nikoleris <nikos.nikoleris@arm.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-06-29 20:28:14 +00:00
Michael Boyer	3f5120e01f	arch-vega: Implement non-carry-out VEGA add, sub, and subrev In GCN3, the v_add_u32, v_sub_u32, and v_subrev_u32 instructions write the carry-out value to VCC. VEGA introduces explicit carry-out versions of these instructions (v_add_co_u32, v_sub_co_u32, and v_subrev_co_u32), and modifies the behavior of the baseline, non-carry-out versions to not write to VCC. Previously both the carry-out and non-carry-out versions shared a single implementation that wrote to VCC. This patch correctly implements the non-carry-out versions to avoid the VCC write. This patch also makes the following substitutions for GCN3 instructions that no longer exist in VEGA (this renaming has no functional impact): v_addc_u32 -> v_addc_co_u32 v_subb_u32 -> v_subb_co_u32 v_subbrev_u32 -> v_subbrev_co_u32 Change-Id: I002fa6e9316d38fd4cc3554daff047523cfc12c9 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/47240 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-06-29 17:28:26 +00:00
Sandipan Das	bb46264d40	arch-power: Add doubleword rotate instructions This introduces a new class and a new format for MD and MDS form instructions where the shift amount, mask begin and mask end are specified by two fields that must be concatenated and adds the following instructions. * Rotate Left Doubleword Immediate then Clear Left (rldicl[.]) * Rotate Left Doubleword Immediate then Clear Right (rldicr[.]) * Rotate Left Doubleword Immediate then Clear (rldic[.]) * Rotate Left Doubleword then Clear Left (rldcl[.]) * Rotate Left Doubleword then Clear Right (rldcr[.]) * Rotate Left Doubleword Immediate then Mask Insert (rldimi[.]) Change-Id: Id7f1f24032242ccfdfda2f1aefd6fe9f0331f610 Signed-off-by: Sandipan Das <sandipan@linux.ibm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/40933 Reviewed-by: Boris Shingarov <shingarov@labware.com> Maintainer: Boris Shingarov <shingarov@labware.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-06-29 14:00:38 +00:00
Sandipan Das	ed99a79ca7	arch-power: Add fields for MD and MDS form instructions This introduces the extended opcode fields for MD and MDS form instructions and the mb and me fields which are concatenated with the MB and ME fields respectively for specifying mask bounds for doubleword operands. Change-Id: I2c3366794ed42f5d31ba1d69e360c0ac67c74e06 Signed-off-by: Sandipan Das <sandipan@linux.ibm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/40932 Reviewed-by: Boris Shingarov <shingarov@labware.com> Maintainer: Boris Shingarov <shingarov@labware.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-06-29 14:00:38 +00:00
Sandipan Das	59ae544eb3	arch-power: Fix disassembly for rotate instructions This fixes disassembly generated for integer rotate instructions based on special use cases for which the Power ISA provides extended mnemonics. Change-Id: I8c33e7c8128ad62d856ce050df8a91b2dfd52f4c Signed-off-by: Sandipan Das <sandipan@linux.ibm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/40931 Reviewed-by: Boris Shingarov <shingarov@labware.com> Maintainer: Boris Shingarov <shingarov@labware.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-06-29 14:00:38 +00:00
Sandipan Das	f6ff1b19f7	arch-power: Fix rotate instructions Now that 64-bit registers are being used, the rotation operation changes for words. Instead of just rotating the lower word of the operand, the lower word is first duplicated in the upper word and then rotated. This fixes the following instructions. * Rotate Left Word Immediate then And with Mask (rlwinm[.]) * Rotate Left Word then And with Mask (rlwnm[.]) * Rotate Left Word Immediate then Mask Insert (rlwimi[.]) Change-Id: Ic743bceb8bafff461276984ecc999dedc1f94e9f Signed-off-by: Sandipan Das <sandipan@linux.ibm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/40930 Reviewed-by: Boris Shingarov <shingarov@labware.com> Maintainer: Boris Shingarov <shingarov@labware.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-06-29 14:00:38 +00:00
Sandipan Das	4ccd755b8c	arch-power: Refactor rotate instructions This renames the mask span fields and the rotate helper of the base class. Change-Id: I120006a0c052fcc34eb154a68d4b7f70a464df65 Signed-off-by: Sandipan Das <sandipan@linux.ibm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/40929 Reviewed-by: Boris Shingarov <shingarov@labware.com> Maintainer: Boris Shingarov <shingarov@labware.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-06-29 14:00:38 +00:00
Sandipan Das	d5e37b4e9f	arch-power: Add doubleword shift instructions This introduces a new class and a new format for XS form instructions where the shift amount is specified by two fields that must be concatenated and adds the following instructions. * Shift Left Doubleword (sld[.]) * Shift Right Doubleword (srd[.]) * Shift Right Algebraic Doubleword (srad[.]) * Shift Right Algebraic Doubleword Immediate (sradi[.]) * Extend-Sign Word and Shift Left Immediate (extswsli[.]) Change-Id: If51c676009ddafb40f855b66c00eeeffa5d8874c Signed-off-by: Sandipan Das <sandipan@linux.ibm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/40928 Reviewed-by: Boris Shingarov <shingarov@labware.com> Maintainer: Boris Shingarov <shingarov@labware.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-06-29 14:00:38 +00:00
Sandipan Das	7c60be6012	arch-power: Add fields for XS form instructions This introduces the extended opcode field for XS form instructions and the sh field which is concatenated with the SH field for specifying a shift amount for doubleword operands. Change-Id: I8f7cb3a2fda33b5b0076ffe12ffebeb5ec1c33a6 Signed-off-by: Sandipan Das <sandipan@linux.ibm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/40927 Reviewed-by: Boris Shingarov <shingarov@labware.com> Maintainer: Boris Shingarov <shingarov@labware.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-06-29 14:00:38 +00:00
Sandipan Das	e4bdd7922c	arch-power: Fix disassembly for shift instructions This fixes disassembly generated for integer shift instructions based on the type of operand used for the specifying the shift amount. Change-Id: I4985334e6eaa9c09ce2d4e79b23e1ae7a9cd28c3 Signed-off-by: Sandipan Das <sandipan@linux.ibm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/40926 Reviewed-by: Boris Shingarov <shingarov@labware.com> Maintainer: Boris Shingarov <shingarov@labware.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-06-29 14:00:38 +00:00
Sandipan Das	16022594e5	arch-power: Fix shift instructions Now that 64-bit registers are being used, the instructions must use only the lower word of the operand to be shifted. This fixes the following instructions. * Shift Left Word (slw[.]) * Shift Right Word (srw[.]) * Shift Right Algebraic Word (sraw[.]) * Shift Right Algebraic Word Immediate (srawi[.]) Change-Id: Ibc3124b9e3a8660b0ff9d0178218e34bcc028310 Signed-off-by: Sandipan Das <sandipan@linux.ibm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/40925 Reviewed-by: Boris Shingarov <shingarov@labware.com> Maintainer: Boris Shingarov <shingarov@labware.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-06-29 14:00:38 +00:00
Sandipan Das	4b948e64bd	arch-power: Refactor shift instructions This changes the format for integer shift instructions such that the computation of the carry bit is implicitly handled rather than including it in the definition of an instruction. Change-Id: Ib916597287efd51b2c9e8781209a8019f2fc38e8 Signed-off-by: Sandipan Das <sandipan@linux.ibm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/40924 Reviewed-by: Boris Shingarov <shingarov@labware.com> Maintainer: Boris Shingarov <shingarov@labware.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-06-29 14:00:38 +00:00
Sandipan Das	ab7155a226	arch-power: Add bit permute instructions This adds the following instructions. * Bit Permute Doubleword (bpermd) Change-Id: Iab3cc6729b9d59c95e29b4f1d3e2c0eb48fde917 Signed-off-by: Sandipan Das <sandipan@linux.ibm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/40923 Reviewed-by: Boris Shingarov <shingarov@labware.com> Maintainer: Boris Shingarov <shingarov@labware.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-06-29 14:00:38 +00:00
Sandipan Das	7a4ffa7cd5	arch-power: Add parity instructions This adds the following instructions. * Parity Word (prtyw) * Parity Doubleword (prtyd) Change-Id: Ic102d722f1bc8cea4921ddbf9febfa0e7c0f892e Signed-off-by: Sandipan Das <sandipan@linux.ibm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/40922 Reviewed-by: Boris Shingarov <shingarov@labware.com> Maintainer: Boris Shingarov <shingarov@labware.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-06-29 14:00:38 +00:00
Sandipan Das	f9653e7685	arch-power: Add population count instructions This adds the following instructions. * Population Count Bytes (popcntb) * Population Count Words (popcntw) * Population Count Doubleword (popcntd) Change-Id: Id15188482b45552735c1d960418d5d6ba1f2ede8 Signed-off-by: Sandipan Das <sandipan@linux.ibm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/40921 Reviewed-by: Boris Shingarov <shingarov@labware.com> Maintainer: Boris Shingarov <shingarov@labware.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-06-29 14:00:38 +00:00
Sandipan Das	99495afb20	arch-power: Add zero count instructions This introduces new helpers for finding the count of leading and trailing zero bits in a given value and adds the following instructions. * Count Trailing Zeros Word (cnttzw[.]) * Count Leading Zeros Doubleword (cntlzd[.]) * Count Trailing Zeros Doubleword (cnttzd[.]) Change-Id: I69dad34bc2cffb2ac70ecd3dba7301fa1cdcb340 Signed-off-by: Sandipan Das <sandipan@linux.ibm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/40920 Reviewed-by: Boris Shingarov <shingarov@labware.com> Maintainer: Boris Shingarov <shingarov@labware.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-06-29 14:00:38 +00:00
Sandipan Das	989d1da4ed	arch-power: Add sign-extend instructions This adds the following instructions. * Extend Sign Word (extsw[.]) Change-Id: Ia15fc69de665399f1c8d52ca00d2f7670d553b48 Signed-off-by: Sandipan Das <sandipan@linux.ibm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/40919 Reviewed-by: Boris Shingarov <shingarov@labware.com> Maintainer: Boris Shingarov <shingarov@labware.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-06-29 14:00:38 +00:00
Sandipan Das	a45afa54cb	arch-power: Fix disassembly for logical instructions This fixes disassembly generated for integer logical instructions based on the type of operands and special use cases for which the Power ISA provides extended mnemonics. Change-Id: I6b67569ef413b0b542e35082ca360c9b4262fc5b Signed-off-by: Sandipan Das <sandipan@linux.ibm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/40918 Reviewed-by: Boris Shingarov <shingarov@labware.com> Maintainer: Boris Shingarov <shingarov@labware.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-06-29 14:00:38 +00:00
Sandipan Das	91f13cad50	arch-power: Fix logical instructions Now that 64-bit registers are being used, the instructions performing comparisons must use the entire 64 bits of the register operands. Also, most of these instructions need to determine the nature of the result if the Rc bit is set. This fixes the following instructions. * AND (and[.]) * OR (or[.]) * XOR (xor[.]) * NAND (nand[.]) * NOR (nor[.]) * Equivalent (eqv[.]) * AND with Complement (andc[.]) * OR with Complement (orc[.]) * Extend Sign Byte (extsb[.]) * Extend Sign Halfword (extsh[.]) * Count Leading Zeros Word (cntlzw[.]) * Compare Bytes (cmpb) Change-Id: Ifecb0779fa6e2062d382f9abf8b2cfaf7cea3c96 Signed-off-by: Sandipan Das <sandipan@linux.ibm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/40917 Reviewed-by: Boris Shingarov <shingarov@labware.com> Maintainer: Boris Shingarov <shingarov@labware.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-06-29 14:00:38 +00:00

1 2 3 4 5 ...

17548 Commits