derek/gem5 - gem5 - Gitea: Git with a cup of tea

derek/gem5

Author	SHA1	Message	Date
Hoa Nguyen	d4e5d9b32a	util: Make gerrit bot respect reviewer removal Currently, if a maintainer is removed from a change, the maintainer will be added again. This change prevents the bot from adding the removed maintainer again. The bot will query all updates related to reviewer addition/removal for each new change. If a reviewer has ever been added/removed from a change, that reviewer won't be added to that change again. Change-Id: Ifaab5ebd7ebf3e6453b2551d3e37c1b9e214c906 Signed-off-by: Hoa Nguyen <hoanguyen@ucdavis.edu> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/50187 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com> Maintainer: Bobby R. Bruce <bbruce@ucdavis.edu>	2021-10-11 17:12:59 +00:00
Tom Rollet	13e3521a00	cpu-o3: remove useless 'using'-s Change-Id: Ifa8ef516d0deabb4308bdf3c4b61b88ece149d0e Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/51347 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-10-11 08:19:29 +00:00
Tom Rollet	8a535eac48	cpu-o3: Naming cleanup for LSQRequest and Request 'LSQRequest' are now referred as 'request' 'Request' are now referred as 'req' It makes the code easier to read. Also it makes the naming of Request consistent with the cache. Change-Id: I8ba75b75bd8408e411300d522cc2c8582c334cf5 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/51067 Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Bobby R. Bruce <bbruce@ucdavis.edu> Reviewed-by: Gabe Black <gabe.black@gmail.com> Maintainer: Bobby R. Bruce <bbruce@ucdavis.edu> Maintainer: Gabe Black <gabe.black@gmail.com>	2021-10-11 08:19:29 +00:00
Tom Rollet	de0d6f4116	cpu-o3: remove LSQSenderState The LSQSenderState that was attached to Request was not useful. All the fields were either a duplicate of information in the LSQRequest or totally unused. The LSQRequest class now inherits from Packet::SenderState and is attached to the Packet that are sent to memory. We do not need anymore the indirection Packet->SenderState->LSQRequest. This helps making the code clearer as it was sometimes hard to follow the difference between what the LSQRequest and LSQSenserState was doing (ex: number of outstanding requests in the memory). Change-Id: I5b21e007e6d183c6aa79c27c1787ca56dcbc3fb0 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/50733 Reviewed-by: Bobby R. Bruce <bbruce@ucdavis.edu> Maintainer: Bobby R. Bruce <bbruce@ucdavis.edu> Tested-by: kokoro <noreply+kokoro@google.com>	2021-10-11 08:19:29 +00:00
Matt Sinclair	1120931105	mem-ruby: Move VIPER TCC decrements to action from in_port Currently, the GPU VIPER TCC protocol handles races between atomics in the triggerQueue_in. This in_port does not check for resource availability, which can cause the trigger queue to execute multiple times. Although this is the expected behavior, the code for handling atomic races decrements the atomicDoneCnt flag in the trigger queue, which is not safe since resource contention may cause it to execute multiple times. To resolve this issue, this commit moves the decrementing of this counter to a new action that is called in an event that happens only when the race between atomics is detected. Change-Id: I552fd4f34fdd9ebeec99fb7aeb4eeb7b150f577f Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/51368 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Matthew Poremba <matthew.poremba@amd.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-10-08 22:03:13 +00:00
Matt Sinclair	72ee6d1aad	mem-ruby: Update GPU VIPER TCC protocol to resolve deadlock In the GPU VIPER TCC, programs with mixes of atomics and data accesses to the same address, in the same kernel, can experience deadlock when large applications (e.g., Pannotia's graph analytics algorithms) are running on very small GPUs (e.g., the default 4 CU GPU configuration). In this situation, deadlocks occur due to resource stalls interacting with the behavior of the current implementation for handling races between atomic accesses. The specific order of events causing this deadlock are: 1. TCC is waiting on an atomic to return from directory 2. In the meantime it receives another atomic to the same address -- when this happens, the TCC increments number of atomics to this address (numAtomics = 2) that are pending in TBE, and does a write through of the atomic to the directory. 3. When the first atomic returns from the Directory, it decrements the numAtomics counter. numAtomics was at 2 though, because of step #2. So it doesn't deallocate the TBE entry and calls Event:AtomicNotDone. 4. Another request (a LD) to the same address comes along for the same address. The LD does z_stall since the second atomic is pending –- so the LD retries every cycle until the deadlock counter times out (or until the second atomic comes back). 5. The second atomic returns to the TCC. However, because there are so many LD's pending in the cache, all doing z_stall's and retrying every cycle, there are a lot of resource stalls. So, when the second atomic returns, it is forced to retry its operation multiple times -- and each time it decrements the atomicDoneCnt flag (which was added to catch a race between atomics arriving and leaving the TCC in `7246f70bfb`) repeatedly. As a result atomicDoneCnt becomes negative. 6. Since this atomicDoneCnt flag is used to determine when Event:AtomicDone happens, and since the resource stalls caused the atomicDoneCnt flag to become negative, we never complete the atomic. Which means the pending LD can never access the line, because it's stuck waiting for the atomic to complete. 7. Eventually the deadlock threshold is reached. To fix this issue, this commit changes the VIPER TCC protocol from using z_stall to using the stall_and_wait buffer method that the Directory-level of the SLICC already uses. This change effectively prevents resource stalls from dominating the TCC level, by putting pending requests for a given address in a per-address stall buffer. These requests are then woken up when the pending request returns. As part of this change, this change also makes two small changes to the Directory-level protocol (MOESI_AMD_BASE-dir): 1. Updated the names of the wakeup actions to match the TCC wakeup actions, to avoid confusion. 2. Changed transition(B, UnblockWriteThrough, U) to check all stall buffers, as some requests were being placed later in the stall buffer than was being checked. This mirrors the changes in `187c44fe44` to other Directory transitions to resolve races between GPU and DMA requests, but for transitions prior workloads did not stress. Change-Id: I60ac9830a87c125e9ac49515a7fc7731a65723c2 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/51367 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Matthew Poremba <matthew.poremba@amd.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-10-08 22:03:13 +00:00
Eduardo José Gómez Hernández	2b46872ee9	arch-x86: Fixed M5InternalError when decoding certain bytes 0F 38 is the two bytes prefixes to decode a three-byte opcode. To prevent errors, the two_bytes_opcode decoder will complain if it tries to decode 38 as the opcode, because it is a prefix. The decoder, will treat 38 as a prefix, preventing it to end in the two_byte_opcode decoder. However, using the VEX prefix is possible to reach this forbidden state. The set of bytes C4 01 01 38 00 will trigger the mentioned M5InternalError. The previous instruction is not valid, but it could be decoded from an speculative path. In its place, a UD2 instructtion should be emitted if the VEX prefix is present. Change-Id: I6b7c4b3593dd8e6e8ac99aaf306b8feeb7784b56 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/49990 Reviewed-by: Gabe Black <gabe.black@gmail.com> Maintainer: Gabe Black <gabe.black@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-10-07 07:06:37 +00:00
Austin Harris	2b69ff2afc	ext: Update libelf from elftoolchain 0.7.1 Change-Id: I8f86cd918ad01897c42aa479f9c64520def36830 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/50927 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Austin Harris <mail@austin-harris.com> Reviewed-by: Hoa Nguyen <hoanguyen@ucdavis.edu> Reviewed-by: Bobby R. Bruce <bbruce@ucdavis.edu> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-10-06 15:13:01 +00:00
Gabe Black	74c6297453	scons: Pull makeDebugFlagHH into build_tools. Change-Id: I5c6f38a859b3d61aa47fc84e4e17d9ba8624389a Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/49400 Reviewed-by: Daniel Carvalho <odanrc@yahoo.com.br> Reviewed-by: Bobby R. Bruce <bbruce@ucdavis.edu> Maintainer: Bobby R. Bruce <bbruce@ucdavis.edu> Tested-by: kokoro <noreply+kokoro@google.com>	2021-10-05 00:29:25 +00:00
Matthew Poremba	3112a7f0d0	arch-gcn3,gpu-compute: Move GCN3 specific TLB to arch Move GpuTLB and TLBCoalescer to GCN3 as the TLB format is specific to GCN3 and SE mode / APU simulation. Vega will have its own TLB, coalescer, and walker suitable for a dGPU. This also adds a using alias for the TLB translation state to reduce the number of references to TheISA and X86ISA. X86 specific includes are also removed. Change-Id: I34448bb4e5ddb9980b34a55bc717bbcea0e03db5 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/49847 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-10-04 23:47:03 +00:00
Matthew Poremba	b459e2caed	misc: Add VEGA_X86 build_opt VEGA_X86 build is the same as GCN3_X86 with vega as the GPU ISA. Change-Id: I995947b30c545b1b5e478e8c60deca20b3c0143d Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/51107 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-10-04 22:51:37 +00:00
Matthew Poremba	c15e472199	arch-vega: Rework flat instructions to support global Global instructions are new in Vega and are essentially FLAT instructions from GCN3 but guaranteed to go to global memory where as flat can go to global or local memory. This reworks the flat instruction classes so that the initiateAcc / execute / completeAcc logic can be reused for flat, global, and later scratch subtypes of flat instructions. The decoder creates a flat instruction class which sets instruction flags based on the flat instruction's SEG field. There are new initOperandInfo and generateDissasmbly methods for flat and global. The number of operands and operand index getters are modified to check the flags and return the correct value for the subtype. Change-Id: I1db4a3742aeec62424189e54c38c59d6b1a8d3c1 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/47106 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Reviewed-by: Kyle Roarty <kyleroarty1716@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-10-04 22:51:37 +00:00
Jan Vrany	2b86278a86	base: Only trap to GDB if remote GDB is connected Change-Id: I3a82dc0f3e4f99dd1acfe99c1eb8caaae495e384 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/48184 Maintainer: Bobby R. Bruce <bbruce@ucdavis.edu> Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Boris Shingarov <shingarov@labware.com>	2021-10-04 18:59:14 +00:00
Gabe Black	5ee0b6eab6	scons: Create a Gem5 subclass of the Executable class. The Executable class was used both for the generic gem5 target, and as a base for the GTest binaries, the systemc test binaries, etc. Unfortunately, the gem5 binary needs to include src/base/date.cc, and to ensure that that file is up to date, it needs to depend on all the other object files. No other binary should have that, but it was included by inheritance. Also, depending on the object file works well when those object files and the date.cc object file are all part of the same binary and not mixed and matched. That is not true for the GTest binaries for instance, and so building a unit test would also build all the other unit test object files because they are dependencies for date.to, date.tdo, etc. If they already exist, then they would satisfy the dependency and not be rebuilt. Change-Id: Ia9cdddc5b2593678e714c08655eb440d7f5b5d1f Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/51088 Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Daniel Carvalho <odanrc@yahoo.com.br> Maintainer: Bobby R. Bruce <bbruce@ucdavis.edu>	2021-10-02 16:37:28 +00:00
Mahyar Samani	216311560e	tests: Adding tests to evaluate memory modules. This change adds a script to validate the statistics reported by gem5. It also overrides has_dma_ports for TestBoard to allow other cache hierarchies such as MESITwoLevel connect to this board. Change-Id: Iae0e61c1763c099cf10924a08b3e4989dc31e220 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/50752 Reviewed-by: Bobby R. Bruce <bbruce@ucdavis.edu> Maintainer: Bobby R. Bruce <bbruce@ucdavis.edu> Tested-by: kokoro <noreply+kokoro@google.com>	2021-10-01 19:04:18 +00:00
Giacomo Travaglini	496dc2457c	arch-arm: With FEAT_SEL2 isStage2 -> NS doesn't hold anymore Change-Id: If71be69808ae97a12770b477d4f08005d35bff2d Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/51148 Reviewed-by: Andreas Sandberg <andreas.sandberg@arm.com> Maintainer: Andreas Sandberg <andreas.sandberg@arm.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-10-01 08:21:51 +00:00
Gabe Black	f1735afad3	misc: Include static_inst_fwd.hh in sim/faults.hh. We only need a StaticInstPtr type, so we don't need to include all of static_inst.hh. Also fix up some other files which were including some other things transitively through sim/faults.hh. Change-Id: I912a84963f33d99617f57d59517c402326f7a494 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/50756 Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Maintainer: Giacomo Travaglini <giacomo.travaglini@arm.com>	2021-10-01 04:57:56 +00:00
Kyle Roarty	008659bee1	arch-gcn3: Fix MUBUF out-of-bounds case 1 This patch upates the out-of-bounds check to properly check against the correct buffer_offset, which is different depending on if the const_swizzle_enable is true or false. Change-Id: I5c687c09ee7f8e446618084b8545b74a84211d4d Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/51127 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Reviewed-by: Matthew Poremba <matthew.poremba@amd.com> Reviewed-by: Alex Dutu <alexandru.dutu@amd.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-09-30 20:22:35 +00:00
Giacomo Travaglini	a1de03150f	arch-arm: Remove duplicate sys param from ArmMMU Change-Id: I3f906a1c1076170e8b751d3cfdbf16627cf8c2ac Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/51147 Reviewed-by: Andreas Sandberg <andreas.sandberg@arm.com> Maintainer: Andreas Sandberg <andreas.sandberg@arm.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-09-30 16:27:13 +00:00
Giacomo Travaglini	cab5abd0d9	arch-arm: Add an Armv8.4 ArmRelease object Change-Id: I0fc4301be2b4f3fa29c4320ab747bfa88132d434 Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/51021 Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Andreas Sandberg <andreas.sandberg@arm.com> Maintainer: Andreas Sandberg <andreas.sandberg@arm.com>	2021-09-30 16:03:41 +00:00
Giacomo Travaglini	d2157a81df	arch-arm: Add an Armv8.3 ArmRelease object Change-Id: I7bd5fee4a5958f6669e1d0ac29e0c62f8f019204 Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/51020 Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Andreas Sandberg <andreas.sandberg@arm.com> Maintainer: Andreas Sandberg <andreas.sandberg@arm.com>	2021-09-30 16:03:41 +00:00
Giacomo Travaglini	8156bc0dc1	arch-arm: Add missing Armv8.3 extensions to the enum Change-Id: Id3897c59a12189f4aac6a3923f656e1f6b8f6723 Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/51019 Tested-by: kokoro <noreply+kokoro@google.com> Maintainer: Andreas Sandberg <andreas.sandberg@arm.com> Reviewed-by: Andreas Sandberg <andreas.sandberg@arm.com>	2021-09-30 16:03:41 +00:00
Giacomo Travaglini	a174513320	arch-arm: Add an Armv8.2 ArmRelease object Change-Id: I731dde9687b36dc769b18cadcffe07a70868e965 Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/51018 Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Andreas Sandberg <andreas.sandberg@arm.com> Maintainer: Andreas Sandberg <andreas.sandberg@arm.com>	2021-09-30 16:03:41 +00:00
Giacomo Travaglini	10cbcb14e8	arch-arm: Add missing Armv8.2 extensions to the enum Change-Id: Ie98d06909fada7ca1370f2283ef0fce61b6dc953 Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/51017 Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Andreas Sandberg <andreas.sandberg@arm.com> Maintainer: Andreas Sandberg <andreas.sandberg@arm.com>	2021-09-30 16:03:41 +00:00
Giacomo Travaglini	d8c61b2e24	arch-arm: Add an Armv8.1 ArmRelease object Change-Id: I5638deb77a165bec1ee47d8f1b2bac31647f173a Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/51016 Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Andreas Sandberg <andreas.sandberg@arm.com> Maintainer: Andreas Sandberg <andreas.sandberg@arm.com>	2021-09-30 16:03:41 +00:00
Giacomo Travaglini	03f79f47d8	arch-arm: FEAT_SEL2 is not part of ID_AA64ISAR0_EL1 Change-Id: I81cb3e8f400eaf8abc1dea61f592239e52501ab1 Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/51015 Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Andreas Sandberg <andreas.sandberg@arm.com> Maintainer: Andreas Sandberg <andreas.sandberg@arm.com>	2021-09-30 16:03:41 +00:00
Giacomo Travaglini	e77ebef6b6	arch-arm: Add missing Armv8.1 extensions to the enum Change-Id: I90c7eb2b22d317f5a60b020c731948681e9f91a1 Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/51014 Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Andreas Sandberg <andreas.sandberg@arm.com> Maintainer: Andreas Sandberg <andreas.sandberg@arm.com>	2021-09-30 16:03:41 +00:00
Giacomo Travaglini	24dfe8a41c	arch-arm: Do not use FEAT_SEL2 in SE mode SecureEL2 doesn't make sense for a userspace only simulation Change-Id: Ieda56cc6684f7c011b31ca754e971fb9a9fb6899 Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/51013 Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Andreas Sandberg <andreas.sandberg@arm.com> Maintainer: Andreas Sandberg <andreas.sandberg@arm.com>	2021-09-30 16:03:41 +00:00
Giacomo Travaglini	abbd0fbad9	arch-arm: Use ArmRelease in the ISA class This is removing the cached boolean variables from the ISA class. The ISA is now using a release object. It is importing it from the ArmSystem in case of a FS simulation, and it is using its own ArmRelease object in SE mode This allows us to add/remove SE extensions from python, rather than hardcoding them in the ISA constructor (in case of SE) Change-Id: I2b0b2f113e7bb9e28ac86bf2139413e2a71eeb01 Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/51012 Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Andreas Sandberg <andreas.sandberg@arm.com> Maintainer: Andreas Sandberg <andreas.sandberg@arm.com>	2021-09-30 16:03:41 +00:00
Tom Rollet	f61818b370	cpu-o3: remove false dummy entry in LSQ The constructor of the LoadQueue and StoreQueue were adding an additional entry compared to the given configuration. The removed comment was saying that this additional entry was used as a dummy entry. This is not necessary anymore with the current structure. It was even leading to incorrect behavior as a loadQueue could have one more outstanding load than specified by the configuration. Valgrind does not spot any illegal access. Change-Id: I41507d003e4d55e91215e21f57119af7b3e4d465 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/50732 Reviewed-by: Bobby R. Bruce <bbruce@ucdavis.edu> Maintainer: Bobby R. Bruce <bbruce@ucdavis.edu> Tested-by: kokoro <noreply+kokoro@google.com>	2021-09-30 09:00:21 +00:00
Matt Sinclair	43b8a93748	configs, gpu-compute: update GPU scripts to remove master/slave Update apu_se and underlying configuration files for GPU runs to replace the master/slave terminology. Change-Id: Icf309782f0899dc412eccd27e3ac017902316a70 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/50967 Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Matthew Poremba <matthew.poremba@amd.com> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Bobby R. Bruce <bbruce@ucdavis.edu> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Bobby R. Bruce <bbruce@ucdavis.edu>	2021-09-30 01:28:38 +00:00
Gabe Black	a861b1d8d2	sim: Get rid of the now unused System::getPageShift() method. The other related getPageBytes method is still used for now. Change-Id: I22e04b47d3932751e03efc0918d44fc1627833bd Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/50353 Reviewed-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Maintainer: Giacomo Travaglini <giacomo.travaglini@arm.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-09-30 00:31:36 +00:00
Gabe Black	d6974ef636	mem: Add a page_bytes parameter to the classic prefetcher. This parameter is used to figure out if two addresses are on the same or different pages, and could be used to find what page they were on and the page offset, although it doesn't look like the later two are actually used. This value could possibly come from the TLB parameter attached to the prefetcher, but making it explicit makes these more symmetric with the Ruby prefetcher, and reduces the complexity of the TLB implementation. Change-Id: I6921943c49af19971b84225ecfd1127304363426 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/50352 Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Andreas Sandberg <andreas.sandberg@arm.com> Maintainer: Andreas Sandberg <andreas.sandberg@arm.com>	2021-09-30 00:31:29 +00:00
Gabe Black	13725927a0	mem-ruby: Replace the sys param with a page_shift param. This parameter defaults to a shift which corresponds to a 4K page. Change-Id: I259081a75cd6e7286d65f1e7dcdc657404397426 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/50351 Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com>	2021-09-30 00:31:18 +00:00
Gabe Black	df56bf1d4d	sim: Move System specific code out of MemPools. Move that code into SEWorkload which already has to know about System objects. The MemPool(s) object(s) now only have to worry about AddrRanges and AddrRangeLists and don't have to know or care where they came from. Change-Id: Ic23aeb959d6f666b655d010c8572c41c60b5aa57 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/50350 Tested-by: kokoro <noreply+kokoro@google.com> Maintainer: Gabe Black <gabe.black@gmail.com> Reviewed-by: Matthew Poremba <matthew.poremba@amd.com>	2021-09-30 00:31:10 +00:00
Gabe Black	4f17b72425	sim: Drop a hack from MemPools which reset the free page. This made it skip over 70 pages to be "what it was before" my page table changes. I'm not sure what changes this is referring to, and the class which manages page tables in the guest memory uses the allocPhysPages method to allocate its memory and would cooperate with anything else using this mechanism without having to have special accomodation. I removed this hack and hello world seems to work fine, but there may be some other test case which exposes some problems. Change-Id: I16e0d8835452df9c3e79738a1eed05b4cc9372b7 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/50349 Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Matthew Poremba <matthew.poremba@amd.com> Maintainer: Gabe Black <gabe.black@gmail.com>	2021-09-30 00:31:00 +00:00
Gabe Black	1a1ba692c3	sim: Move the MemPools object out of System and into SEWorkload. This removes the need for all the FullSystem checks in the System class, and simplifies that class in general. Change-Id: Ie8a3bc67db9195027d2111009b15ca59221bdeb2 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/50348 Reviewed-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Maintainer: Giacomo Travaglini <giacomo.travaglini@arm.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-09-30 00:30:49 +00:00
Gabe Black	473d3c72c8	scons: Clone the gem5py_env environment in src/SConscript. This will avoid cross contamination between variants, where the gem5py executable from one variant is used by another variant, potentially crashing SCons on a clean build. Change-Id: I6c1741d431892ff11c2e05a6beb5e87c2b0d67eb Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/51087 Reviewed-by: Bobby R. Bruce <bbruce@ucdavis.edu> Maintainer: Bobby R. Bruce <bbruce@ucdavis.edu> Tested-by: kokoro <noreply+kokoro@google.com>	2021-09-29 23:55:35 +00:00
Giacomo Travaglini	152760ee51	arch-arm: Define an ArmRelease class to handle ISA extensions Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Change-Id: I3240853bd2123a6f24b2bb64c90ad457696f0d93 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/51010 Reviewed-by: Andreas Sandberg <andreas.sandberg@arm.com> Maintainer: Andreas Sandberg <andreas.sandberg@arm.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-09-29 22:16:03 +00:00
Matt Sinclair	7756c5e908	tests: add LULESH to weekly regression LULESH is a popular GPU HPC application that acts as a good test for several memory and compute patterns. Thus, including it in the weekly regressions will help verify correctness and functionality for code that affects the GPU. The default LULESH input runs 10 iterations and takes 3-4 hours. Hence, it is not appropriate for nightly regressions. Change-Id: Ic1b73ab32fdd5cb1b973f2676b272adb91b2a98e Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/50952 Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Bobby R. Bruce <bbruce@ucdavis.edu>	2021-09-29 17:54:50 +00:00
Matt Sinclair	46c4926485	tests: Add HeteroSync to nightly regression HeteroSync does a good job of testing the GPU memory system and atomics support, without requiring a long runtime. Thus, this commit adds a mutex and barrier test from HeteroSync to the nightly regression to ensure these components are tested. Change-Id: I65998a0a63d41dd3ba165c3a000cee7e42e9034a Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/50951 Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Bobby R. Bruce <bbruce@ucdavis.edu>	2021-09-29 17:54:26 +00:00
Gabe Black	663fa147e8	sim: Encapsulate MemPool related System stuff in a MemPools class. Also add a const version of the getPhysMem accessor so it can be used with a const System class. Change-Id: Ieccd0bd4c2c8fe69820eb1a0b0c835722334630d Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/50343 Tested-by: kokoro <noreply+kokoro@google.com> Maintainer: Gabe Black <gabe.black@gmail.com> Reviewed-by: Matthew Poremba <matthew.poremba@amd.com>	2021-09-29 12:17:50 +00:00
Gabe Black	61a6ec7b71	sim: Move serialization logic for MemPools out of System. And move it into the MemPools class itself. The MemPools class should be self contained, and be able to manage its own state. That should not be the responsibility of another containing class. Change-Id: Ib0bf70b57e92698f15bea0cc217dd98ee816d57b Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/50340 Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Maintainer: Giacomo Travaglini <giacomo.travaglini@arm.com>	2021-09-29 12:17:35 +00:00
Gabe Black	0a2ba189d4	sim: Fix a faulty assumption in MemPool. In the MemPool object, the idea of a limit of the pool (largest page) and the total number of pages were conflated, as was the page number of the next "free" page and the total number of pages allocated. Both of those would only be equivalent if the memory pool starts at address zero, which is not generally true and could be true for at most one pool at a time even when it is occasionally true. Instead, this change fixes up MemPool to keep tree values, a starting page number, the page number of the next free page, and the total number of pages in the pool, both allocated and unallocated. With those three values, we can accurately report the number of allocated pages (not just the number of pages of any kind below the next free one), the total number of free pages, and the total number of pages in general (not the largest numbered page in the pool). The value serialized by the System class was adjusted so that it will stay compatible with previous checkpoints. The value unserialized by the system class is passed to the MemPool as a limit, which has not changed and so doesn't need to be updated. It gets translated into the total number of pages in the MemPool constructor. Change-Id: I8268ef410b41bf757df9ee5585ec2f6b0d8499e1 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/50687 Reviewed-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Maintainer: Giacomo Travaglini <giacomo.travaglini@arm.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-09-29 12:17:21 +00:00
Tom Rollet	133399c91d	cpu-o3: remove useless indirection from lsq to cpu Change-Id: Idd2d4277b542da728f0740590ae7ef9ae9b76629 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/50731 Maintainer: Bobby R. Bruce <bbruce@ucdavis.edu> Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Gabe Black <gabe.black@gmail.com>	2021-09-29 07:32:27 +00:00
Tom Rollet	d3df30c885	cpu-o3: replace 'stores' counter per storeQueue.size() Change-Id: If816c1f03969665010a5bd7e993fe7f87ac4d0a3 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/50730 Maintainer: Bobby R. Bruce <bbruce@ucdavis.edu> Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Gabe Black <gabe.black@gmail.com>	2021-09-29 07:32:27 +00:00
Tom Rollet	c03ec6432e	cpu-o3: replace 'loads' counter per loadQueue.size() Change-Id: Id65776b385f571e4e325b0ffa022bfa765c224bf Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/50729 Maintainer: Bobby R. Bruce <bbruce@ucdavis.edu> Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Gabe Black <gabe.black@gmail.com>	2021-09-29 07:32:27 +00:00
Giacomo Travaglini	d1ddbdead8	arch-arm: Prefer haveEL over haveSecurity and haveVirtualization The Arm architecture reference manual pseudocode checks for the presence of an exception level (EL) over "security" and "virtualization" Change-Id: Ia91a9d1848eddc40776627208386a13afdaafda3 Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/51009 Reviewed-by: Andreas Sandberg <andreas.sandberg@arm.com> Maintainer: Andreas Sandberg <andreas.sandberg@arm.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-09-29 07:27:59 +00:00
Gabe Black	ede1ad4b8c	arch,cpu,mem,sim: Fold arch/locked_mem.hh into the BaseISA class. Turn the functions within it into virtual methods on the ISA classes. Eliminate the implementation in MIPS, which was just copy pasted from Alpha long ago. Fix some minor style issues in ARM. Remove templating. Switch from using an "XC" type parameter to using the ThreadContext * installed in all ISA classes. The ARM version of these functions actually depend on the ExecContext delaying writes to MiscRegs to work correctly. More insiduously than that, they also depend on the conicidental ThreadContext like availability of certain functions like contextId and getCpuPtr which come from the class which happened to implement the type passed into XC. To accomodate that, those functions need both a real ThreadContext, and another object which is either an ExecContext or a ThreadContext depending on how the method is called. Jira Issue: https://gem5.atlassian.net/browse/GEM5-1053 Change-Id: I68f95f7283f831776ba76bc5481bfffd18211bc4 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/50087 Maintainer: Gabe Black <gabe.black@gmail.com> Reviewed-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-09-28 19:56:01 +00:00
Giacomo Travaglini	a78fab909a	configs: Remove security option If willing to run a secure software stack, we recommend to use the baremetal.py platform See [1] on how to run gem5 with TF-A [1]: https://community.arm.com/developer/research/b/articles/\ posts/running-trusted-firmware-a-on-gem5 Change-Id: I69f6d672b24cb588c522c6a468e3b19332c9367b Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/51008 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-09-28 07:47:56 +00:00

1 2 3 4 5 ...

18029 Commits