derek/gem5 - gem5 - Gitea: Git with a cup of tea

derek/gem5

Author	SHA1	Message	Date
Gabe Black	1791b8732c	scons: Pull domain specific build setup out of SConstruct. Use SConsopts files local to individual domains to pull non-foundational build code out of SConstruct. This greatly simplifies SConstruct, and also makes it easier to find build configuration having to do with particular pieces of gem5. This change also converts some python level variables, all_protocols, protocol_dirs, and slicc_includes, into the environment where the timing of their initialization is more flexible. Change-Id: Ie61ceb75ae9e5557cc400603c972a9582e99c1ea Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/40872 Maintainer: Gabe Black <gabe.black@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Gabe Black <gabe.black@gmail.com>	2021-04-03 01:18:17 +00:00
Bobby R. Bruce	68064d8043	misc: Merge branch 'release-staging-v21-0' into develop Change-Id: I0ad043ded56fb848e045057a1e7a56ea39797906	2021-03-18 11:13:14 -07:00
Tiago Mück	b13b485095	configs,mem-ruby: CHI-based Ruby protocol This patch add a new Ruby cache coherence protocol based on Arm' AMBA5 CHI specification. The CHI protocol defines and implements two state machine types: - Cache_Controller: generic cache controller that can be configured as: - Top-level L1 I/D cache - A intermediate level (L2, L3, ...) private or shared cache - A CHI home node (i.e. the point of coherence of the system and has the global directory) - A DMA requester - Memory_Controller: implements a CHI slave node and interfaces with gem5 memory controller. This controller has the functionality of a Directory_Controller on the other Ruby protocols, except it doesn't have a directory. The Cache_Controller has multiple cache allocation/deallocation parameters to control the clusivity with respect to upstream caches. Allocation can be completely disabled to use Cache_Controller as a DMA requester or as a home node without a shared LLC. The standard configuration file configs/ruby/CHI.py provides a 'create_system' compatible with configs/example/fs.py and configs/example/se.py and creates a system with private L1/L2 caches per core and a shared LLC at the home nodes. Different cache topologies can be defined by modifying 'create_system' or by creating custom scripts using the structures defined in configs/ruby/CHI.py. This patch also includes the 'CustomMesh' topology script to be used with CHI. CustomMesh generates a 2D mesh topology with the placement of components manually defined in a separate configuration file using the --noc-config parameter. The example in configs/example/noc_config/2x4.yaml creates a simple 2x4 mesh. For example, to run a SE mode simulation, with 4 cores, 4 mem ctnrls, and 4 home nodes (L3 caches): build/ARM/gem5.opt configs/example/se.py \ --cmd 'tests/test-progs/hello/bin/arm/linux/hello' \ --ruby --num-cpus=4 --num-dirs=4 --num-l3caches=4 \ --topology=CustomMesh --noc-config=configs/example/noc_config/2x4.yaml If one doesn't care about the component placement on the interconnect, the 'Crossbar' and 'Pt2Pt' may be used and they do not require the --noc-config option. Additional authors: Joshua Randall <joshua.randall@arm.com> Pedro Benedicte <pedro.benedicteillescas@arm.com> Tuan Ta <tuan.ta2@arm.com> JIRA: https://gem5.atlassian.net/browse/GEM5-908 Change-Id: I856524b0afd30842194190f5bd69e7e6ded906b0 Signed-off-by: Tiago Mück <tiago.muck@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/42563 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-03-16 15:28:44 +00:00
Kyle Roarty	90d2aac515	mem-ruby: Add missing transitions + wakes for Dma events This also changes one of the wakeUpDependents calls to a wakeUpAllDependentsAddr call to prevent a hang. Change-Id: Ia076414e5c6d9c8c0b2576d1f442195d75d275fc Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/42463 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Reviewed-by: Matthew Poremba <matthew.poremba@amd.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-03-11 21:16:22 +00:00
Tiago Mück	5b9517f196	mem-ruby: renamed prefetch stats Splitting hw_prefetches into prefetch_hits and prefetch_misses so both events can be tracked separately. Also added appropriate functions to increment stats. Renamed m_prefetches for consistency. sw_prefetches is not used and has been removed. The sequencer converts SW prefetch requests into a RubyRequestType_LD/RubyRequestType_ST which are handled as demand requests by the all current protocols. Change-Id: Iafa6b31c84843ddd1fad98fa7e5afed02b8c4b4d Signed-off-by: Tiago Mück <tiago.muck@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/41816 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-03-01 22:18:59 +00:00
Tiago Mück	f7a3d8bee4	mem-ruby: fix MI_example functional read Changing AccessPermission to Read_Write for transient states waiting on memory when to or from Invalid. In all cases the memory will have the latest data, so this also modifies functionalRead to always send the access to memory. Change-Id: I99f557539b4f9d0d2f99558752b7ddb7e85ab3c6 Signed-off-by: Tiago Mück <tiago.muck@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/41853 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-03-01 22:08:25 +00:00
Tiago Mück	9396be08da	mem-ruby: RubyRequest getter for request ptr Change-Id: Ib3d12c9030d18d96388dd66f0a409b42543ee9a8 Signed-off-by: Tiago Mück <tiago.muck@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/41814 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-02-24 19:29:29 +00:00
Tiago Mück	8633802c3e	mem-ruby: alternative interface for func. reads A single functionalRead may not be able to get the whole latest copy of the block in protocols that have features such as: - a cache line can be partially present and dirty in a controller - a cache line can be transferred over the network using multiple protocol-level messages To support these cases, this patch adds an alternative function: bool functionalRead(PacketPtr, WriteMask&) Protocols that implement this function can partially update the packet and use the WriteMask to mark updated bytes. The top-level RubySystem:functionalRead then issues functionalRead to controllers until the whole block is read. This patch implements functionalRead(PacketPtr, WriteMask&) for all the common messages and SimpleNetwork. A protocol-specific implementation will be provided in a future patch. The new interface is compiled only if required by the protocol (see src/mem/ruby/system/SConscript). Otherwise the original interface is used thus maintaining compatibility with previous protocols. Change-Id: I4600d5f1d7cc170bd7b09ccd09bfd3bb6605f86b Signed-off-by: Tiago Mück <tiago.muck@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/31416 Reviewed-by: Matthew Poremba <matthew.poremba@amd.com> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-02-19 15:05:10 +00:00
Matthew Poremba	bd02699932	mem-ruby: Make DMASequencer aware of Atomics Add handling for issuing atomic packet types, setting the WriteMask and AtomicOpFunctor in makeRequest. Add an atomicCallback to handle atomic packet type responses. Change-Id: I9775fc110bb99a1740089746f0d1b3deb124b9f5 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/33716 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-02-16 16:48:57 +00:00
Tiago Mück	9c4809b9ab	mem-ruby: intToTick helper Change-Id: I76635228223e9a83eef94a25d166d091315a5e96 Signed-off-by: Tiago Mück <tiago.muck@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/41156 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-02-12 20:31:38 +00:00
Tiago Mück	d789b75a98	mem-ruby: add andMask to WriteMask Change-Id: Ieeb68b405a68226077a2ffee231408f554e758a5 Signed-off-by: Tiago Mück <tiago.muck@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/41154 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-02-11 22:16:29 +00:00
Matthew Poremba	7246f70bfb	mem-ruby: Fix race related to atomics in VIPER There is a race condition in VIPER where an atomic issued to the same address can occur resulting in multiple trigger messages signalling the compleition of the atomic operation. The first message was deallocating the TBE causing the second message to dereference a nullptr when looking up the TBE. A counter is added to track the number of in flight AtomicDone trigger messages. The AtomicDone is not called until the last in flight message arrives at the trigger queue. The remaining messages call AtomicNotDone which simply pops the message from the queue and keeps the TBE allocated. Change-Id: Ie1de0436861a7c393ad6d2fb2faceb83c18d4cc3 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/39175 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-01-15 17:46:38 +00:00
Hoa Nguyen	4c42811ff3	mem-ruby: Move CacheMemory stats used in SLICC to a Stats group This change moves some stats that are used in SLICC to a separate Stats::Group. In order to use stats in SLICC, new functions are added in CacheMemory: - profileDemandHit() - profileDemandMiss() The functions increase the corresponding stat by 1. Change-Id: I52b6fefdf6579a49f626f2fca400641f90800017 Signed-off-by: Hoa Nguyen <hoanguyen@ucdavis.edu> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/37815 Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Tiago Mück <tiago.muck@arm.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com>	2020-12-22 09:52:36 +00:00
Hoa Nguyen	580eb64195	mem-ruby: Fix cache hits being profiled as cache misses There are some instances where a cache hit is profiled as a cache miss. This commit addresses this error. Change-Id: I7dafa806ef3f1e3717650dc25f8657a0ea741dd1 Signed-off-by: Hoa Nguyen <hoanguyen@ucdavis.edu> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/37835 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Reviewed-by: Daniel Gerzhoy <daniel.gerzhoy@gmail.com> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-11-21 00:47:51 +00:00
Brad Beckmann	80221d7e1d	configs,mem-ruby: Remove old GPU ptls These protocols are no longer supported, either because they are not representative of GPU protocols, or because the have not been updated to work with GCN3. Change-Id: I989eeb6826c69225766aaab209302fe638b22719 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/34197 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-11-04 21:09:26 +00:00
Daniel Gerzhoy	efabe5ec1b	mem-ruby: L1/L2 hit/miss tracking for MOESI_AMD_BASE/GPU_VIPER L1 and L2 access tracking was not fully implemented. This patch adds the missing tracking actions, and corrects several errors for the ones that were there. Change-Id: I69a59283274c08e94b6650ab5f586cbfe5432503 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/33915 Maintainer: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com>	2020-10-22 14:47:06 +00:00
Daniel Gerzhoy	85ede9a180	mem-ruby: L3 hit/miss tracking to MOESI_AMD_BASE-dir L3 access tracking added to the directory controller. This commit adds L3 hit/miss tracking to the controller. Hit/miss status is decided when the tag array of the L3 Cache is checked for the first time for any given request. Change-Id: Icac122f59509d79135265fb38b112d3f47419b6f Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/33314 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-10-22 14:45:34 +00:00
Tiago Mück	544bf8bde7	mem-ruby: Expose MessageBuffer methods SLICC interface for checking the capacity of MessageBuffers Change-Id: I28e2d22a405d33fcbe6a183dffc31bd936fa26c4 Signed-off-by: Tiago Mück <tiago.muck@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/31271 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-10-13 15:25:34 +00:00
Tiago Mück	cb48ce2a34	mem-ruby: add addressOffset util Returns the offset of an address with respect to a base address. Looks unnecessary, but SLICC doesn't support casting and the '-' operator for Addr types, so the alternative to this would be to add more some helpers like 'addrToUint64' and 'uint64ToInt'. Change-Id: I90480cec4c8b2e6bb9706f8b94ed33abe3c93e78 Signed-off-by: Tiago Mück <tiago.muck@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/31270 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-10-13 15:25:34 +00:00
Tiago Mück	f8e3ba7b7b	mem-ruby: sequencer callback for unique writes A controller may complete a write without obtaining a full copy of the line. This patch adds a specific callback for this purpose that prevents reads to be coalesced with a write on a potentially incomplete line. Change-Id: I3775f81699f38e406fee28f92c9c8e06deb3d528 Signed-off-by: Tiago Mück <tiago.muck@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/31269 Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Bradford Beckmann <bradford.beckmann@gmail.com>	2020-10-12 14:09:55 +00:00
Tiago Mück	aa8bca47f4	mem-ruby: int to Cycle converter Change-Id: I493b16a0bdd01a4cef4891e273a376ebe9509fe8 Signed-off-by: Tiago Mück <tiago.muck@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/31266 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Matthew Poremba <matthew.poremba@amd.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-10-12 14:09:55 +00:00
Tiago Mück	2cbbd37a82	mem-ruby: missing method in NetDest interface Change-Id: Ibf651c37c50174186daebebc06aa115e6bc2ed33 Signed-off-by: Tiago Mück <tiago.muck@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/31262 Reviewed-by: Matthew Poremba <matthew.poremba@amd.com> Maintainer: Bradford Beckmann <bradford.beckmann@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-10-12 14:09:55 +00:00
Tiago Mück	fd4ae25626	mem-ruby: additional WriteMask methods Change-Id: Ib5d5f892075b38f46d1d802c043853f56e19ea12 Signed-off-by: Tiago Mück <tiago.muck@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/31257 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-10-09 00:07:27 +00:00
Timothy Hayes	0a8a787de3	mem-ruby: HTM mem implementation This patch augments the MESI_Three_Level Ruby protocol with hardware transactional memory support. The HTM implementation relies on buffering of speculative memory updates. The core notifies the L0 cache controller that a new transaction has started and the controller in turn places itself in transactional state (htmTransactionalState := true). When operating in transactional state, the usual MESI protocol changes slightly. Lines loaded or stored are marked as part of a transaction's read and write set respectively. If there is an invalidation request to cache line in the read/write set, the transaction is marked as failed. Similarly, if there is a read request by another core to a speculatively written cache line, i.e. in the write set, the transaction is marked as failed. If failed, all subsequent loads and stores from the core are made benign, i.e. made into NOPS at the cache controller, and responses are marked to indicate that the transactional state has failed. When the core receives these marked responses, it generates a HtmFailureFault with the reason for the transaction failure. Servicing this fault does two things-- (a) Restores the architectural checkpoint (b) Sends an HTM abort signal to the cache controller The restoration includes all registers in the checkpoint as well as the program counter of the instruction before the transaction started. The abort signal is sent to the L0 cache controller and resets the failed transactional state. It resets the transactional read and write sets and invalidates any speculatively written cache lines. It also exits the transactional state so that the MESI protocol operates as usual. Alternatively, if the instructions within a transaction complete without triggering a HtmFailureFault, the transaction can be committed. The core is responsible for notifying the cache controller that the transaction is complete and the cache controller makes all speculative writes visible to the rest of the system and exits the transactional state. Notifting the cache controller is done through HtmCmd Requests which are a subtype of Load Requests. KUDOS: The code is based on a previous pull request by Pradip Vallathol who developed HTM and TSX support in Gem5 as part of his master’s thesis: http://reviews.gem5.org/r/2308/index.html JIRA: https://gem5.atlassian.net/browse/GEM5-587 Change-Id: Icc328df93363486e923b8bd54f4d77741d8f5650 Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/30319 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-09-08 09:13:30 +00:00
Sampad Mohapatra	9d8229c0f1	mem-ruby: Change request to response in MOESI_AMD_Base-dir.sm The responseToDMA MessageBuffer in MOESI_AMD_Base-dir.sm transmits both data and acks, but it's vnet_type is currently set as request. This should be changed to response. Signed-off-by: Sampad Mohapatra <sampad.mohapatra@gmail.com> Change-Id: I0eb9e8fc8e25111849605a710a5150ce5fc3b83b Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/33755 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Srikant Bharadwaj <srikant.bharadwaj@amd.com> Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-08-31 22:54:31 +00:00
Kyle Roarty	187c44fe44	mem-ruby: fix races between data and DMA in MOESI_AMD_Base-dir There are race conditions while running several benchmarks, where the DMA engine and the CorePair simultaneously send requests for the same block. This patch fixes two scenarios (a) If the request from the DMA engine arrives before the one from the CorePair, the directory controller records it as a pending request. However, once the DMA request is serviced, the directory doesn't check for pending requests. The CorePair, consequently, never sees a response to its request and this results in a Deadlock. Added call to wakeUpDependents in the transition from BDR_Pm to U Added call to wakeUpDependents in the transition from BDW_P to U (b) If the request from the CorePair is being serviced by the directory and the DMA requests for the same block, this causes an invalid transition because the current coherence doesn't take care of this scenario. Added transition state where the requests from DMA are added to the stall buffer. Updated B to U CoreUnblock transition to check all buffers, as the DMA requests were being placed later in the stall buffer than was being checked Change-Id: I5a76efef97723bc53cf239ea7e112f84fc874ef8 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/31996 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Reviewed-by: Bradford Beckmann <brad.beckmann@amd.com> Maintainer: Bradford Beckmann <brad.beckmann@amd.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-08-13 19:05:17 +00:00
Tony Gutierrez	44807669a0	configs, mem: Support running VIPER with GCN3 This changeset adds the necessary changes for running GCN3 ISA with VIPER in apu_se.py. Changes to the VIPER protocol configs are made to add support for DMA and scalar caches. hsaTopology is added to help the pseudo FS create the files needed by ROCm to understand the device on which the SW is being run. Change-Id: I0f47a6a36bb241a26972c0faafafcf332a7d7d1f Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/30274 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Bradford Beckmann <brad.beckmann@amd.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-07-28 19:01:09 +00:00
Kyle Roarty	1339a1b080	mem-ruby: add cache hit/miss statistics for TCP and TCC Change-Id: Ifa6fdbb9dd062a3684b9620eac6683c57e651a72 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/30174 Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Bradford Beckmann <brad.beckmann@amd.com> Maintainer: Bradford Beckmann <brad.beckmann@amd.com>	2020-06-20 04:20:45 +00:00
Tony Gutierrez	b811d3a342	mem-ruby: Add DMA support to MOESI_AMD_Base-dir.sm This change adds DMA support to the MOESI_AMD_Base-dir.sm, which is needed to support ROCm apps/GCN3 ISA in the VIPER ptl. The DMA controller is copied from the MOESI_hammer-dma.sm with few modifications. Change-Id: I56141436eee1c8f62c2a0915fa3b63b83bbcbc9a Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/29914 Reviewed-by: Anthony Gutierrez <anthony.gutierrez@amd.com> Maintainer: Anthony Gutierrez <anthony.gutierrez@amd.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-06-19 20:34:06 +00:00
Tuan Ta	18ebe62598	mem-ruby: GCN3 and VIPER integration This patch modifies the Coalescer and VIPER protocol to support memory synchronization requests and write-completion responses that are required by upcoming GCN3 implementation. VIPER protocol is simplified to be a solely write-through protocol. Change-Id: Iccfa3d749a0301172a1cc567c59609bb548dace6 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/29913 Reviewed-by: Anthony Gutierrez <anthony.gutierrez@amd.com> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Bradford Beckmann <brad.beckmann@amd.com> Maintainer: Anthony Gutierrez <anthony.gutierrez@amd.com> Maintainer: Bradford Beckmann <brad.beckmann@amd.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-06-19 20:32:54 +00:00
Tony Gutierrez	b8da9abba7	gpu-compute, mem-ruby, configs: Add GCN3 ISA support to GPU model Change-Id: Ibe46970f3ba25d62ca2ade5cbc2054ad746b2254 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/29912 Reviewed-by: Anthony Gutierrez <anthony.gutierrez@amd.com> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Anthony Gutierrez <anthony.gutierrez@amd.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-06-15 22:45:17 +00:00
adarshpatil	0a5ed3076a	mem-ruby: Fix for Invalid transition in MOESI_CMP_directory Send the correct sharer count from the memory directory to the requesting L2 cache in data message reply. Jira issue: https://gem5.atlassian.net/browse/GEM5-613 Change-Id: If76de630fd0001816e8836d9bf77961a94faaa7c Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/29552 Maintainer: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Pouya Fotouhi <pfotouhi@ucdavis.edu> Reviewed-by: Tiago Mück <tiago.muck@arm.com> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-05-29 19:31:47 +00:00
Tuan Ta	e071f60011	mem-ruby: add function to check for stalled msgs of addr This patch allows a cache controller to check if there is any stalled message of a specific address in the stall_map of an input message buffer. Change-Id: Id2f9bb98a9201a562f2a8cc371e9bb896ac836af Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/28133 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-05-28 23:07:08 +00:00
Tuan Ta	524c22041d	mem-ruby: add slicc stm to defer enqueueing a message This patch enables cache controllers to make response messages in advance, store them in a per-address saved map in an output message buffer and enqueue them altogether in the future. This patch introduces new slicc statement called defer_enqueueing. This patch would help simplify the logic of state machines that deal with coalesing multiple requests from different requestors. Change-Id: I566d4004498b367764238bb251260483c5a1a5e5 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/28132 Reviewed-by: Tuan Ta <qtt2@cornell.edu> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-05-28 23:07:08 +00:00
Bobby R. Bruce	e53de444f6	misc: Merge branch 'release-staging-v20.0.0.0' into develop	2020-05-28 01:04:16 -07:00
Timothy Hayes	97daaf1f2e	mem-ruby: MESI_Two_Level missing function compilation fix The recent commit `dd6cd33` removed the Ruby Sequencer function invalidateSC in favour of doing this implicitely via evictionCallback. The protocol MESI_Two_Level still contains one explicit call to this function, however, this is now superflous as forward_eviction_to_cpu is called in the same transition. This patch removes the remaining calls to invalidateSC. JIRA: https://gem5.atlassian.net/browse/GEM5-499 Change-Id: If51d8bebf6aa39d20789639aab0d262d5173ca59 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/28747 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Ayaz Akram <yazakram@ucdavis.edu> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-05-11 07:55:16 +00:00
Timothy Hayes	264a6392be	mem-ruby: MOESI_CMP_directory sync fix The recent commit `dd6cd33` modified the behaviour of the the Ruby sequencer to handle load linked requests as loads rather than stores. This caused the regression test realview-simple-timing-dual-ruby-ARM-x86_64-opt to become stuck when booting Linux. This patch fixes the issue by adding a missing forward_eviction_to_cpu action to the state transition(OM, Fwd_GETX, IM). Change-Id: I8f253c5709488b07ddc5143a15eda406e31f3cc6 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/28787 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-05-11 07:54:38 +00:00
Ayaz Akram	4f1c4147de	mem-ruby: Deep renaming of Prefetcher to RubyPrefetcher A recent change (https://gem5-review.googlesource.com/c/ public/gem5/+/27949) updated the ruby prefetcher name, which breaks the use of old name in some SLICC files. This change makes sure that the new name is used at all places. Issue-On: https://gem5.atlassian.net/browse/GEM5-498 Change-Id: Ic667b61eac13dc7c267cee7dce3aa970f7ae9a8b Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/28667 Reviewed-by: Timothy Hayes <timothy.hayes@arm.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-05-06 23:17:52 +00:00
Tiago Mück	d9cb548d83	mem-ruby: fix possible MOESI_CMP deadlock Freeing the L2 block only after local invalidates are acked in the OLSF state may lead to a deadlock. Change-Id: Ia4b60e5bc9e2d3315b874a8c6616478db6eb38c1 Signed-off-by: Tiago Mück <tiago.muck@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/21929 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-05-06 14:42:33 +00:00
Tiago Muck	a0130e741b	mem-ruby: Fixed MOESI_CMP_directory resource tracking Fixes a few resource allocation issues in the directory controller: - Added TBE resource checks on allocation. - Now also allocating a TBE when issuing read requests to the controller to allow for a better response to backpressure. Without the TBE as a limiting factor, the directory can have an unbounded amount of outstanding memory requests. - Also allocating a TBE for forwarded requests. Change-Id: I17016668bd64a50a4354baad5d181e6d3802ac46 Signed-off-by: Tiago Mück <tiago.muck@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/21928 Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Pouya Fotouhi <pfotouhi@ucdavis.edu>	2020-05-06 14:42:33 +00:00
Tiago Muck	8ec2abb98a	mem-ruby: fix MOESI_CMP_directory functional reads This patch properly sets the access permissions in all controllers. 'Busy' was used for all transient states, which is incorrect in lots of cases when we still hold a valid copy of the line and are able to handle a functional read. In the L2 controller these states were split to differentiate the access permissions: IFGXX -> IFGXX, IFGXXD IGMO -> IGMO, IGMOU IGMIOF -> IGMIOF, IGMIOFD Same for the dir. controller: IS -> IS, IS_M MM -> MM, MM_M The dir. controllers also has the states WBI/WBS for lines that have been queued for a writeback. In these states we hold the data in the TBE for replying to functional reads until the memory acks the write and we move to I or S. Other minor changes includes updated debug messages and asserts. Change-Id: Ie4f6eac3b4d2641ec91ac6b168a0a017f61c0d6f Signed-off-by: Tiago Mück <tiago.muck@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/21927 Maintainer: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Pouya Fotouhi <pfotouhi@ucdavis.edu> Tested-by: kokoro <noreply+kokoro@google.com>	2020-05-06 14:42:33 +00:00
Tiago Muck	5abac60ccf	mem-ruby: Fix MOESI_CMP_directory DMA handling This patch fixes some issues in the directory controller regarding DMA handling: 1) Junk data messages were being sent immediately in response to DMA reads for a line in the S state (one or more sharers, clean). Now, data is fetched from memory directly and forwarded to the device. Some existing transitions for handling GETS requests are reused, since it's essentially the same behavior (except we don't update the list of sharers for DMAs) 2) DMA writes for lines in the I or S states would always overwrite the whole line. We now check if it's only a partial line write, in which case we fetch the line from memory, update it, and writeback. 3) Fixed incorrect DMA msg size Some existing functions were renamed for clarity. Change-Id: I759344ea4136cd11c3a52f9eaab2e8ce678edd04 Signed-off-by: Tiago Mück <tiago.muck@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/21926 Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Pouya Fotouhi <pfotouhi@ucdavis.edu>	2020-05-06 14:42:33 +00:00
Tiago Muck	b85235b5da	mem-ruby: Missing transition in MOESI_CMP_directory Change-Id: I3aa9cd0230c141128ef5bddc728775b1ea6bbe14 Signed-off-by: Tiago Mück <tiago.muck@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/21925 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Pouya Fotouhi <pfotouhi@ucdavis.edu> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-05-06 14:42:33 +00:00
Tiago Mück	a72eb993e8	mem-ruby: removed unused checkCoherence Change-Id: I108b95513f2828470fe70bad5f136b0721598582 Signed-off-by: Tiago Mück <tiago.muck@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/21924 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-05-06 14:42:33 +00:00
Tiago Mück	daa3dc556e	mem-ruby: removed checkCoherence from MOESI_CMP_directory The implementation is empty and this is not used by other protocols Change-Id: Iaed7d6d4b7ef1eb4cd47bdc0710dc9dbb7a86a0c Signed-off-by: Tiago Mück <tiago.muck@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/21923 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Pouya Fotouhi <pfotouhi@ucdavis.edu> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-05-06 14:42:33 +00:00
Tiago Mück	8213cbcc97	mem-ruby: Removed invalid transition from MOESI_CMP dir When memory data is received we always have a valid directory entry or are in a transient state. Change-Id: I0e9120e320c157fd306909458cbc446275a4f738 Signed-off-by: Tiago Mück <tiago.muck@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/27848 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Pouya Fotouhi <pfotouhi@ucdavis.edu> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com> Tested-by: Gem5 Cloud Project GCB service account <345032938727@cloudbuild.gserviceaccount.com>	2020-05-06 14:42:33 +00:00
Tiago Mück	852198cd7b	mem-ruby: Deallocating unused entries in MOESI_CMP dir Invalid entries are never removed from the directory the Directory controller. This patch fixes this by deallocating the entries when they become invalid. Change-Id: I616686a78c5eddb7748192bf94bb691a4f158cbc Signed-off-by: Tiago Mück <tiago.muck@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/27847 Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Pouya Fotouhi <pfotouhi@ucdavis.edu>	2020-05-06 14:42:33 +00:00
Tiago Mück	9db98e7adb	mem-ruby: Deallocating unused entries in MOESI_CMP L2 Invalid entries are never removed from the directories in the L2 controller. This patch fixes this by deallocating the entries when they become invalid. The NP (not present) state was removed since it's now equivalent to Invalid. Change-Id: Id807b341a2aadb06008491545aca614d5a09b8df Signed-off-by: Tiago Mück <tiago.muck@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/21922 Reviewed-by: Pouya Fotouhi <pfotouhi@ucdavis.edu> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-05-06 14:42:33 +00:00
Tiago Muck	efa6c773b3	mem-ruby: Add deallocate to DirectoryMemory Change-Id: Ib261ec8b302b55e539d8e13064957170412b752c Signed-off-by: Tiago Mück <tiago.muck@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/21920 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-05-06 14:42:33 +00:00
Timothy Hayes	003c08418f	mem-ruby: MESI_Three_level prefetcher page crossing This patch allows MESI_Three_level using the Ruby prefetcher to safely cross page boundaries by determining if an address is bad and cannot be mapped to a memory controller. Change-Id: I675a13dfa6deb5b6a9f986ced5a3130436db911d Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/28048 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-05-02 06:50:57 +00:00

1 2

68 Commits