derek/gem5

Author	SHA1	Message	Date
Samuel Stark	38d360a475	configs, mem-ruby: Implement DVMOps in CHI 1) Handling TLBI/TLBI_SYNC requests from the PE in the CHI Request Node (Generating DVMOps) 2) Adding a new machine type for the Misc Node (MN) that handles DVMOps from the Request Node (RN), following the protocol specified within the Amba 5 CHI Architecture Specification [1] JIRA: https://gem5.atlassian.net/browse/GEM5-1097 [1]: https://developer.arm.com/documentation/ihi0050/latest Change-Id: I9ac00463ec3080c90bb81af721d88d44047123b6 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/57298 Maintainer: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-05-18 08:52:53 +00:00
Tiago Mück	eb0b4ba657	mem-ruby: CHI fix for WUs on local+upstream line Fix for WriteUnique operations on cache lines that are both local and upstream JIRA: https://gem5.atlassian.net/browse/GEM5-1097 Change-Id: I99def32948d3f0ced9cfc7f7712a0f4ae9aab0cd Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/57299 Reviewed-by: Tiago Muck <tiago.muck@arm.com> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Bobby Bruce <bbruce@ucdavis.edu> Tested-by: kokoro <noreply+kokoro@google.com>	2022-04-12 10:21:57 +00:00
Samuel Stark	65f8bf4460	mem-ruby: Support for unaddressed mem requests in the RubyRequest JIRA: https://gem5.atlassian.net/browse/GEM5-1097 Change-Id: I5aa44186888b95f81bec524ff57e8dbf4c9166f8 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/57293 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-04-11 07:31:34 +00:00
Gabe Black	e6c0ba97db	scons: Put all config variables in an env['CONF'] sub-dict. This makes what are configuration and what are internal SCons variables explicit and separate, and makes it unnecessary to call out what variables to export to C++. These variables will also be plumbed into and out of kconfiglib in later changes. Change-Id: Iaf5e098d7404af06285c421dbdf8ef4171b3f001 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/56892 Reviewed-by: Andreas Sandberg <andreas.sandberg@arm.com> Maintainer: Gabe Black <gabe.black@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-03-28 20:31:21 +00:00
Gabe Black	f10fe51e18	scons: Don't accumulate SLICC_INCLUDES. Presumably, these are fixed for whatever protocol that gets selected. We don't need to accumulate includes, we need to set includes to something in particular. If there is a common include which always needs to be used, we can handle that in the SConscript separately from SLICC_INCLUDES. Change-Id: I996d08566944e38e388dc287f644c40366ebba0d Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/56754 Tested-by: kokoro <noreply+kokoro@google.com> Maintainer: Gabe Black <gabe.black@gmail.com> Reviewed-by: Yu-hsin Wang <yuhsingw@google.com>	2022-03-24 22:09:09 +00:00
Matthew Poremba	20d8b388ad	mem-ruby: Enhance MOESI_AMD DmaWrite This enhances MOESI_AMD_Base-dir DmaWrite to enable partial writes. This is currently done by assuming a full cache line, invalidating caches, and transitioning back to unblocked state. The enhanced write supports partial writes (i.e., smaller than cache line size) by first reading memory, merging the modified data, and then writing back to memory. Implementation of this mirrors that of DmaRead in terms of state. This means for each DmaRead state (BDR_PM, BDR_Pm, and BDR_M) there is a write analogue (BDW_PM, BDW_Pm, and BDR_M) and the BDR_P state is removed. Furthermore, this enhanced DmaWrite ... actually writes data to memory instead of relying on DirectoryEntry / backing store for correct data. There are two possible state transitions for DmaWrite now. (1) Memory data arrives before probe response and (2) probe response arrives before memory data. In case (1), probe data overwrites memory data and merges the partial write using the TBE write mask then updates write mask to 'filled' state. In case (2), probe data is merged with the partial data using the TBE write mask then updates write mask to 'filled' state. The memory data will then be clobbered by copying the TBE data over the response since the write mask is now full. Change-Id: I1eebb882b464c4c5ee5fd60932fd38d271ada4d7 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/57410 Reviewed-by: Matthew Poremba <matthew.poremba@amd.com> Maintainer: Matthew Poremba <matthew.poremba@amd.com> Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-03-13 15:31:32 +00:00
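The read-merge-write idea behind the partial DmaWrite in commit 20d8b388ad can be sketched in a few lines of Python. This is only an illustration of merging under a per-byte write mask, not the SLICC implementation: `LINE_SIZE`, `merge_partial_write`, and the sample data are hypothetical, and the probe-response ordering cases are not modeled.

```python
from typing import Sequence

LINE_SIZE = 8  # hypothetical line size, kept small for readability


def merge_partial_write(memory_line: bytes,
                        write_data: bytes,
                        write_mask: Sequence[bool]) -> bytes:
    """Return memory_line with write_data applied wherever write_mask is set."""
    if not (len(memory_line) == len(write_data) == len(write_mask)):
        raise ValueError("line, data, and mask must have the same length")
    # Masked bytes come from the DMA write, the rest from the memory read.
    return bytes(new if masked else old
                 for old, new, masked in zip(memory_line, write_data, write_mask))


# A DMA write that touches only bytes 2 and 3 of the line.
memory_line = bytes([0xAA] * LINE_SIZE)
write_data = bytes([0x00, 0x00, 0x11, 0x22, 0x00, 0x00, 0x00, 0x00])
write_mask = [False, False, True, True, False, False, False, False]

merged = merge_partial_write(memory_line, write_data, write_mask)
print(merged.hex())  # aaaa1122aaaaaaaa
```

Once every byte of the mask is set, the merged buffer alone determines the line, which is presumably why the commit message describes the write mask as reaching a 'filled' state.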
Matthew Poremba	bfcab1258f	mem-ruby: Remove DataBlk from MOESI_AMD DirectoryEntry This protocol is using an old style where read/writes to memory were being done by writing to a DataBlock in a DirectoryMemory entry. This results in having multiple copies of memory, leads to stale copies in at least one memory (usually DRAM), and requires --access-backing-store in most cases to work properly. This changeset removes all references to getDirectoryEntry(...).DataBlk and instead forwards those reads and writes to DRAM always. Change-Id: If2e52151789ad82c7b55c8fa2b41c1f4e5b65994 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/57409 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-03-13 15:31:32 +00:00
Matthew Poremba	6a9dfcef52	mem-ruby: Revert `7018c2b34` This reverts commit `7018c2b34e`. This commit needs more work which will take a while. Meanwhile the nightly tests are broken because of this. Change-Id: I11d01d50ab3a2d8fd649f1a825911e14815b1ca6 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/57109 Reviewed-by: Bobby Bruce <bbruce@ucdavis.edu> Maintainer: Bobby Bruce <bbruce@ucdavis.edu> Tested-by: kokoro <noreply+kokoro@google.com>	2022-02-26 15:19:51 +00:00
Matthew Poremba	1bc23ca966	mem-ruby: Add protocol prints to MOESI_AMD_BASE-dma Change-Id: I59ed7311a8dc2a06ce1df0027891ba8e24e8a89e Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/56447 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-02-17 17:03:19 +00:00
Matthew Poremba	7018c2b34e	mem-ruby: Remove DirectoryMemory storage in MOESI_AMD_BASE-dir This protocol is using an old style where read/writes to memory were being done by writing to a DataBlock in a DirectoryMemory entry. This results in having multiple copies of memory, leads to stale copies in at least one memory (usually DRAM), and requires --access-backing-store in most cases to work properly. This changeset removes all references to getDirectoryEntry(...).DataBlk and instead forwards those reads and writes to DRAM always. This results in new transient states BL_WM, BDW_WM, and B_WM which are blocked states waiting on memory acks indicating a write request is complete. The appropriate transitions are updated to move to these new states and stall states are updated to include them. DMA write ACK is also moved to when the request is sent to memory, rather than when the request is received. Change-Id: Ic5bd6a8a8881d7df782e0f7eed8be9d873610e04 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/56446 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-02-17 17:03:19 +00:00
Matthew Poremba	54fc137945	mem-ruby: Ensure MOESI_AMD_Base-dir has probe destinations The directory has an assert that there is at least one destination for a probe when sending an invalidation or shared probe to coherence end points in the protocol (TCC, LLC). This is not necessarily required, and for certain configurations there will be no probes required and none will be sent. One such configuration is the GPU protocol tester, which would not require a probe to the CPU if it does not exist. To fix this we first collect the probe destinations. Then we check if any destinations exist. If so, we send the probe message. Otherwise we immediately enqueue a probe complete message to the trigger queue. This reorganization prevents messages with no destinations from being enqueued, meeting the criteria for the assertion. Change-Id: If016f457cb8c9e0277a910ac2c3f315c25b50ce8 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/55543 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-02-17 17:03:19 +00:00
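The collect-then-check flow described in commit 54fc137945 might look roughly like the following Python sketch. The queue objects, message tuples, and the `issue_probe` helper are hypothetical stand-ins for the SLICC out_port and trigger queue, chosen only to show the control flow.

```python
from collections import deque


def issue_probe(endpoints, addr, probe_queue, trigger_queue):
    """Send a probe if any endpoint exists, else signal completion directly.

    Returns True when a probe message was enqueued.
    """
    # Step 1: collect the destinations before building any message.
    destinations = sorted(endpoints)
    # Step 2: only enqueue a probe when it has somewhere to go.
    if destinations:
        probe_queue.append(("Probe", addr, destinations))
        return True
    # Step 3: no endpoints, so report the probe phase as already complete.
    trigger_queue.append(("ProbeComplete", addr))
    return False


probe_queue, trigger_queue = deque(), deque()
issue_probe({"TCC", "LLC"}, 0x40, probe_queue, trigger_queue)
issue_probe(set(), 0x80, probe_queue, trigger_queue)
print(list(probe_queue))
print(list(trigger_queue))
```

With this ordering, a message with an empty destination set is never created, so an assertion on "at least one destination" cannot fire for a legitimately probe-free configuration.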
Tiago Mück	b354e1a252	mem-ruby: Fix handling of stale CleanUnique JIRA: https://gem5.atlassian.net/browse/GEM5-1185 Fixed an issue in which a CleanUnique responder would incorrectly deallocate the cache block when handling a stale CU when the state is UD_RU or UC_RU (thus incorrectly transitioning to RU). The fix is to handle stale CUs similarly to stale WBs where we override the dataValid TBE field to prevent the wrong state transition. This patch moves the stale code path to a separate transition (similarly to stale WBs/Evicts) and moves the dataValid override to Initiate_Request_Stale so it applies to all stale request types. Notice now the stale field is also set on stale Comp_UC responses. Additional minor change: CheckUpgrade_FromRU is the same as CheckUpgrade_FromStore so it was removed. Change-Id: I0a2cedcfde1dc30d67aa2c16d71b7470369c2b6e Signed-off-by: Tiago Mück <tiago.muck@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/56810 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Meatboy 106 <garbage2collector@gmail.com>	2022-02-17 15:21:45 +00:00
Gabriel Busnot	8a7fcd340f	mem-ruby: Add missing CHI transition SD_RSC + *_Stale->BUSY_BLKD Related JIRA: https://gem5.atlassian.net/browse/GEM5-1180 Change-Id: Ife83bebcaa48345633fce0a0de08394e30c1a796 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/56083 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Tiago Muck <tiago.muck@arm.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-01-28 07:01:14 +00:00
Matthew Poremba	9313294efe	misc: Remove AMD license addition Remove the line "For use for simulation and test purposes only" in files where AMD is the only copyright holder listed in the header. This happens to be the case for all files where this line exists, removing it completely from gem5. Change-Id: I623f266b002f564301b28774f49081099cfc60fd Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/53943 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-12-11 04:00:56 +00:00
Gabe Black	1c233ee9d2	scons: Add sim_object and enums arguments to SimObject(). This will explicitly declare what SimObject and Enum types need to be set up in C++, which will make importing all the SimObject modules during the setup phase of SCons unnecessary. Change-Id: Id2d7603daf33b236ceaa0789e2f089f589d34e62 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/49406 Reviewed-by: Gabe Black <gabe.black@gmail.com> Maintainer: Gabe Black <gabe.black@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-12-08 08:01:23 +00:00
Matthew Poremba	e0d62e510d	configs,mem-ruby: Remove reference to old GPU ptls GPU_VIPER_Baseline, GPU_VIPER_Region, and GPU_RfO were removed some time ago. Change-Id: If873b0cfe8cc2b3096cbe97d4e13a8e02d2ec567 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/53703 Reviewed-by: Bobby Bruce <bbruce@ucdavis.edu> Maintainer: Bobby Bruce <bbruce@ucdavis.edu> Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-12-07 20:26:17 +00:00
Matthew Poremba	c5ba40cfe1	mem-ruby: Add GPUonly parameter for VIPER Currently MOESI_AMD_Base used in VIPER has a CPUonly parameter which indicates that messages should not try to add GPU SLICC controllers as destinations. This adds the analogue GPUonly parameter which indicates that requests should not try to add CPU SLICC controllers. Also adds an assert to ensure the outgoing message has at least one destination. This assert would indicate a misconfiguration. Change-Id: Ibb0affd4606084fca021f0e7c117d4ff8c06d429 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/51928 Tested-by: kokoro <noreply+kokoro@google.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com>	2021-10-26 15:52:11 +00:00
Matthew Poremba	55fdf4be52	mem-ruby: Add missing CPUonly check for VIPER The CPUonly variable in MOESI_AMD_Base's Directory indicates that probes should not be sent to any GPU SLICC controllers as they are not part of CPU. There is one CPUonly check missing which causes problems in GPU-only Ruby networks as there is no route to any controllers with that MachineType. Add a condition to check CPUonly and do nothing in that case. Change-Id: I41b6c04feec473e34b04402adfb5978e75b847b6 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/51927 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-10-26 15:52:11 +00:00
Matt Sinclair	118677218d	mem-ruby: fix typo in GPU VIPER TCC comment `72ee6d1a` fixed a deadlock in the GPU VIPER TCC. However, it inadvertently added a typo to the comments explaining the change. This commit fixes that. Change-Id: Ibba835aa907be33fc3dd8e576ad2901d5f8f509c Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/51687 Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-10-17 04:07:49 +00:00
Matt Sinclair	1120931105	mem-ruby: Move VIPER TCC decrements to action from in_port Currently, the GPU VIPER TCC protocol handles races between atomics in the triggerQueue_in. This in_port does not check for resource availability, which can cause the trigger queue to execute multiple times. Although this is the expected behavior, the code for handling atomic races decrements the atomicDoneCnt flag in the trigger queue, which is not safe since resource contention may cause it to execute multiple times. To resolve this issue, this commit moves the decrementing of this counter to a new action that is called in an event that happens only when the race between atomics is detected. Change-Id: I552fd4f34fdd9ebeec99fb7aeb4eeb7b150f577f Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/51368 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Matthew Poremba <matthew.poremba@amd.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-10-08 22:03:13 +00:00
Matt Sinclair	72ee6d1aad	mem-ruby: Update GPU VIPER TCC protocol to resolve deadlock In the GPU VIPER TCC, programs with mixes of atomics and data accesses to the same address, in the same kernel, can experience deadlock when large applications (e.g., Pannotia's graph analytics algorithms) are running on very small GPUs (e.g., the default 4 CU GPU configuration). In this situation, deadlocks occur due to resource stalls interacting with the behavior of the current implementation for handling races between atomic accesses. The specific order of events causing this deadlock is: 1. TCC is waiting on an atomic to return from directory 2. In the meantime it receives another atomic to the same address -- when this happens, the TCC increments number of atomics to this address (numAtomics = 2) that are pending in TBE, and does a write through of the atomic to the directory. 3. When the first atomic returns from the Directory, it decrements the numAtomics counter. numAtomics was at 2 though, because of step #2. So it doesn't deallocate the TBE entry and calls Event:AtomicNotDone. 4. Another request (a LD) comes along for the same address. The LD does z_stall since the second atomic is pending -- so the LD retries every cycle until the deadlock counter times out (or until the second atomic comes back). 5. The second atomic returns to the TCC. However, because there are so many LD's pending in the cache, all doing z_stall's and retrying every cycle, there are a lot of resource stalls. So, when the second atomic returns, it is forced to retry its operation multiple times -- and each time it decrements the atomicDoneCnt flag (which was added to catch a race between atomics arriving and leaving the TCC in `7246f70bfb`) repeatedly. As a result atomicDoneCnt becomes negative. 6. Since this atomicDoneCnt flag is used to determine when Event:AtomicDone happens, and since the resource stalls caused the atomicDoneCnt flag to become negative, we never complete the atomic. This means the pending LD can never access the line, because it's stuck waiting for the atomic to complete. 7. Eventually the deadlock threshold is reached. To fix this issue, this commit changes the VIPER TCC protocol from using z_stall to using the stall_and_wait buffer method that the Directory-level of the SLICC already uses. This change effectively prevents resource stalls from dominating the TCC level, by putting pending requests for a given address in a per-address stall buffer. These requests are then woken up when the pending request returns. As part of this change, this commit also makes two small changes to the Directory-level protocol (MOESI_AMD_BASE-dir): 1. Updated the names of the wakeup actions to match the TCC wakeup actions, to avoid confusion. 2. Changed transition(B, UnblockWriteThrough, U) to check all stall buffers, as some requests were being placed later in the stall buffer than was being checked. This mirrors the changes in `187c44fe44` to other Directory transitions to resolve races between GPU and DMA requests, but for transitions prior workloads did not stress. Change-Id: I60ac9830a87c125e9ac49515a7fc7731a65723c2 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/51367 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Matthew Poremba <matthew.poremba@amd.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-10-08 22:03:13 +00:00
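The difference between retrying every cycle and parking requests in a per-address stall buffer, as described in commit 72ee6d1aad, can be illustrated with a small Python model. The `StallBuffer` class and its method names are hypothetical and only loosely mirror the SLICC stall_and_wait and wakeup actions; no resource model or timing is included.

```python
from collections import defaultdict, deque


class StallBuffer:
    """Hypothetical per-address buffer for requests blocked on a pending op."""

    def __init__(self):
        self._waiting = defaultdict(deque)

    def stall_and_wait(self, addr, request):
        # Park the request instead of retrying it every cycle.
        self._waiting[addr].append(request)

    def wake_up_all(self, addr):
        # Release every request parked on addr, in arrival order.
        return list(self._waiting.pop(addr, ()))

    def pending(self, addr):
        return len(self._waiting.get(addr, ()))


buf = StallBuffer()
# Loads arrive while an atomic to 0x40 is still outstanding.
buf.stall_and_wait(0x40, "LD#1")
buf.stall_and_wait(0x40, "LD#2")
buf.stall_and_wait(0x80, "LD#3")

# The atomic returns: only the requests for 0x40 are replayed.
woken = buf.wake_up_all(0x40)
print(woken, buf.pending(0x40), buf.pending(0x80))
```

Because parked requests consume no retry bandwidth, the returning atomic in this model would be processed once rather than repeatedly, which is the property the commit relies on.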
Daecheol You	82db312550	mem-ruby: Add (RUSC, LocalHN_Eviction) transition During full system simulation on CHI, the LocalHN_Eviction event occasionally occurred in the RUSC state. Thus, the change adds the RUSC state to the transition. Change-Id: Ibff382c38a092895bc03a4a64cf072ae752decf3 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/49263 Reviewed-by: Tiago Mück <tiago.muck@arm.com> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-08-24 00:17:32 +00:00
Carlos Falquez	6d07200693	mem-ruby: Add (BUSY_BLKD,SnpOnceFwd) transition Add (BUSY_BLKD,SnpOnceFwd) cache transition to the Ruby CHI protocol. Change-Id: I150880b26dee869b48cfd16fb661b9487527a8cd Signed-off-by: Carlos Falquez <c.falquez@fz-juelich.de> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/46901 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Tiago Mück <tiago.muck@arm.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-06-29 02:13:54 +00:00
Gabriel Busnot	9c2aac17b9	mem-ruby: Rename WriteMask::cmpMask to containsMask Avoids confusion as the function tests for inclusions and not for equality. Change-Id: I4cd10e08af46f69feed26afc2d6c7f809bc5192b Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/46560 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Tiago Mück <tiago.muck@arm.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-06-08 07:56:09 +00:00
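The distinction motivating the rename in commit 9c2aac17b9 (inclusion versus equality) is easy to show with integer bitmasks in Python. The two helper functions are hypothetical and assume that "contains" means every bit set in the second mask is also set in the first; they are not the gem5 WriteMask methods.

```python
def equals_mask(mask: int, other: int) -> bool:
    """True only when both masks have exactly the same bits set."""
    return mask == other


def contains_mask(mask: int, other: int) -> bool:
    """True when every bit set in other is also set in mask (inclusion)."""
    return (mask & other) == other


line_mask = 0b1111   # bytes 0-3 are valid
read_mask = 0b0110   # a read that needs bytes 1-2

print(contains_mask(line_mask, read_mask))  # True
print(equals_mask(line_mask, read_mask))    # False
print(contains_mask(read_mask, line_mask))  # False
```

Inclusion is not symmetric, so the argument order matters; a name like `cmpMask` hides that, which seems to be the confusion the rename addresses.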
Gabriel Busnot	5a5fb03c77	mem-ruby: Fix wrong test in CHI functional reads A bad write mask inclusion test in CHI cache functionalRead and CHI data message functionalRead was causing clean data not to be read in some cases. The issue is detailed in issue GEM5-1002. Change-Id: I91254fa87636e8d22a8b2f27ad375f68f997932d Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/46559 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Tiago Mück <tiago.muck@arm.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-06-08 07:56:09 +00:00
Hoa Nguyen	5c34457a38	mem-ruby: replace desks, add desc where required Events in *.sm are required to have "desc" defined. JIRA: https://gem5.atlassian.net/browse/GEM5-999 Change-Id: I95f59c422bdd264a9e1077b75bf7a0e9f39685aa Signed-off-by: Hoa Nguyen <hoanguyen@ucdavis.edu> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/46119 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-05-27 23:42:53 +00:00
Bobby R. Bruce	4607a67c74	mem-ruby: Fix nonsensical check in MOESI_CMP_token-L1cache This check always equated to False. It should be an 'or' not an 'and' comparison. The Clang 11 compiler threw an "overlapping comparisons always evaluate to false" error for the code generated from this. Change-Id: I299dc6fa8206d5e85d59ba8353bf16102b8e5e1b Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/45799 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-05-20 23:23:42 +00:00
Matt Sinclair	0eef1069cb	ruby: fix typo in VIPER TCC triggerQueue The GPU VIPER TCC protocol accidentally used "TiggerMsg" instead of "TriggerMsg" for the triggerQueue_in port. This was a benign bug because the msg type is not used in the in_port implementation but still makes the SLICC harder to understand, so fixing it is worthwhile. Change-Id: I88cbc72bac93bcc58a66f057a32f7bddf821cac9 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/44905 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Matthew Poremba <matthew.poremba@amd.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-04-28 01:25:45 +00:00
Bobby R. Bruce	291bc67ef1	misc,mem-ruby: Fixing unused variable error for fast builds This fixes the broken compiler tests for .fast builds: https://www.mail-archive.com/gem5-dev@gem5.org/msg38412.html Change-Id: Ibc377a57ce6455ca709003f326b0ca8d4c01377b Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/44086 Reviewed-by: Gabe Black <gabe.black@gmail.com> Reviewed-by: Tiago Mück <tiago.muck@arm.com> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Bobby R. Bruce <bbruce@ucdavis.edu> Tested-by: kokoro <noreply+kokoro@google.com>	2021-04-07 16:55:22 +00:00
Gabe Black	dd0d54d749	scons: Narrow the scope of the -Wno-parentheses flag. This was added to avoid warnings from code generated as part of Ruby's AST. Instead of applying this to all of gem5, apply it only to files generated by Ruby. Change-Id: I2b11d2df3cb631debdc594059d9d480a0e695c59 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/40958 Reviewed-by: Gabe Black <gabe.black@gmail.com> Maintainer: Gabe Black <gabe.black@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-04-03 05:53:01 +00:00
Gabe Black	1791b8732c	scons: Pull domain specific build setup out of SConstruct. Use SConsopts files local to individual domains to pull non-foundational build code out of SConstruct. This greatly simplifies SConstruct, and also makes it easier to find build configuration having to do with particular pieces of gem5. This change also converts some python level variables, all_protocols, protocol_dirs, and slicc_includes, into the environment where the timing of their initialization is more flexible. Change-Id: Ie61ceb75ae9e5557cc400603c972a9582e99c1ea Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/40872 Maintainer: Gabe Black <gabe.black@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Gabe Black <gabe.black@gmail.com>	2021-04-03 01:18:17 +00:00
Bobby R. Bruce	68064d8043	misc: Merge branch 'release-staging-v21-0' into develop Change-Id: I0ad043ded56fb848e045057a1e7a56ea39797906	2021-03-18 11:13:14 -07:00
Tiago Mück	b13b485095	configs,mem-ruby: CHI-based Ruby protocol This patch adds a new Ruby cache coherence protocol based on Arm's AMBA5 CHI specification. The CHI protocol defines and implements two state machine types: - Cache_Controller: generic cache controller that can be configured as: - Top-level L1 I/D cache - An intermediate level (L2, L3, ...) private or shared cache - A CHI home node (i.e. the point of coherence of the system and has the global directory) - A DMA requester - Memory_Controller: implements a CHI slave node and interfaces with gem5 memory controller. This controller has the functionality of a Directory_Controller on the other Ruby protocols, except it doesn't have a directory. The Cache_Controller has multiple cache allocation/deallocation parameters to control the clusivity with respect to upstream caches. Allocation can be completely disabled to use Cache_Controller as a DMA requester or as a home node without a shared LLC. The standard configuration file configs/ruby/CHI.py provides a 'create_system' compatible with configs/example/fs.py and configs/example/se.py and creates a system with private L1/L2 caches per core and a shared LLC at the home nodes. Different cache topologies can be defined by modifying 'create_system' or by creating custom scripts using the structures defined in configs/ruby/CHI.py. This patch also includes the 'CustomMesh' topology script to be used with CHI. CustomMesh generates a 2D mesh topology with the placement of components manually defined in a separate configuration file using the --noc-config parameter. The example in configs/example/noc_config/2x4.yaml creates a simple 2x4 mesh. 
For example, to run an SE mode simulation with 4 cores, 4 mem ctrls, and 4 home nodes (L3 caches): build/ARM/gem5.opt configs/example/se.py \ --cmd 'tests/test-progs/hello/bin/arm/linux/hello' \ --ruby --num-cpus=4 --num-dirs=4 --num-l3caches=4 \ --topology=CustomMesh --noc-config=configs/example/noc_config/2x4.yaml If one doesn't care about the component placement on the interconnect, the 'Crossbar' and 'Pt2Pt' topologies may be used and they do not require the --noc-config option. Additional authors: Joshua Randall <joshua.randall@arm.com> Pedro Benedicte <pedro.benedicteillescas@arm.com> Tuan Ta <tuan.ta2@arm.com> JIRA: https://gem5.atlassian.net/browse/GEM5-908 Change-Id: I856524b0afd30842194190f5bd69e7e6ded906b0 Signed-off-by: Tiago Mück <tiago.muck@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/42563 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-03-16 15:28:44 +00:00
Kyle Roarty	90d2aac515	mem-ruby: Add missing transitions + wakes for Dma events This also changes one of the wakeUpDependents calls to a wakeUpAllDependentsAddr call to prevent a hang. Change-Id: Ia076414e5c6d9c8c0b2576d1f442195d75d275fc Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/42463 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Reviewed-by: Matthew Poremba <matthew.poremba@amd.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-03-11 21:16:22 +00:00
Tiago Mück	5b9517f196	mem-ruby: renamed prefetch stats Splitting hw_prefetches into prefetch_hits and prefetch_misses so both events can be tracked separately. Also added appropriate functions to increment stats. Renamed m_prefetches for consistency. sw_prefetches is not used and has been removed. The sequencer converts SW prefetch requests into a RubyRequestType_LD/RubyRequestType_ST which are handled as demand requests by all current protocols. Change-Id: Iafa6b31c84843ddd1fad98fa7e5afed02b8c4b4d Signed-off-by: Tiago Mück <tiago.muck@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/41816 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-03-01 22:18:59 +00:00
Tiago Mück	f7a3d8bee4	mem-ruby: fix MI_example functional read Changing AccessPermission to Read_Write for transient states waiting on memory when transitioning to or from Invalid. In all cases the memory will have the latest data, so this also modifies functionalRead to always send the access to memory. Change-Id: I99f557539b4f9d0d2f99558752b7ddb7e85ab3c6 Signed-off-by: Tiago Mück <tiago.muck@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/41853 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-03-01 22:08:25 +00:00
Tiago Mück	9396be08da	mem-ruby: RubyRequest getter for request ptr Change-Id: Ib3d12c9030d18d96388dd66f0a409b42543ee9a8 Signed-off-by: Tiago Mück <tiago.muck@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/41814 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-02-24 19:29:29 +00:00
Tiago Mück	8633802c3e	mem-ruby: alternative interface for func. reads A single functionalRead may not be able to get the whole latest copy of the block in protocols that have features such as: - a cache line can be partially present and dirty in a controller - a cache line can be transferred over the network using multiple protocol-level messages To support these cases, this patch adds an alternative function: bool functionalRead(PacketPtr, WriteMask&) Protocols that implement this function can partially update the packet and use the WriteMask to mark updated bytes. The top-level RubySystem::functionalRead then issues functionalRead to controllers until the whole block is read. This patch implements functionalRead(PacketPtr, WriteMask&) for all the common messages and SimpleNetwork. A protocol-specific implementation will be provided in a future patch. The new interface is compiled only if required by the protocol (see src/mem/ruby/system/SConscript). Otherwise the original interface is used, thus maintaining compatibility with previous protocols. Change-Id: I4600d5f1d7cc170bd7b09ccd09bfd3bb6605f86b Signed-off-by: Tiago Mück <tiago.muck@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/31416 Reviewed-by: Matthew Poremba <matthew.poremba@amd.com> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-02-19 15:05:10 +00:00
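The "issue functionalRead to controllers until the whole block is read" loop from commit 8633802c3e can be sketched in Python as follows. The `Controller` class, the dictionary of valid bytes, and the first-writer-wins policy are assumptions made for illustration; the real interface is the C++ `functionalRead(PacketPtr, WriteMask&)` and the priority between copies is protocol-specific.

```python
from typing import Dict, List, Optional


class Controller:
    """Hypothetical controller holding only some bytes of a cache line."""

    def __init__(self, valid_bytes: Dict[int, int]):
        self.valid_bytes = valid_bytes  # byte offset -> value

    def functional_read(self, packet: bytearray, mask: List[bool]) -> bool:
        """Fill bytes this controller holds that are not yet marked in mask."""
        updated = False
        for offset, value in self.valid_bytes.items():
            if not mask[offset]:
                packet[offset] = value
                mask[offset] = True
                updated = True
        return updated


def system_functional_read(controllers: List[Controller],
                           size: int) -> Optional[bytes]:
    """Query controllers in order until every byte of the block is marked."""
    packet = bytearray(size)
    mask = [False] * size
    for ctrl in controllers:
        ctrl.functional_read(packet, mask)
        if all(mask):
            return bytes(packet)
    return None  # no combination of controllers covered the whole block


lower = Controller({0: 0x10, 1: 0x11})
upper = Controller({2: 0x12, 3: 0x13})
print(system_functional_read([lower, upper], 4))
print(system_functional_read([lower], 4))
```

The mask is what lets a partially present line contribute its bytes without the caller having to know in advance which controller holds which part.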
Matthew Poremba	bd02699932	mem-ruby: Make DMASequencer aware of Atomics Add handling for issuing atomic packet types, setting the WriteMask and AtomicOpFunctor in makeRequest. Add an atomicCallback to handle atomic packet type responses. Change-Id: I9775fc110bb99a1740089746f0d1b3deb124b9f5 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/33716 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-02-16 16:48:57 +00:00
Tiago Mück	9c4809b9ab	mem-ruby: intToTick helper Change-Id: I76635228223e9a83eef94a25d166d091315a5e96 Signed-off-by: Tiago Mück <tiago.muck@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/41156 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-02-12 20:31:38 +00:00
Tiago Mück	d789b75a98	mem-ruby: add andMask to WriteMask Change-Id: Ieeb68b405a68226077a2ffee231408f554e758a5 Signed-off-by: Tiago Mück <tiago.muck@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/41154 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-02-11 22:16:29 +00:00
Matthew Poremba	7246f70bfb	mem-ruby: Fix race related to atomics in VIPER There is a race condition in VIPER where an atomic issued to the same address can occur, resulting in multiple trigger messages signalling the completion of the atomic operation. The first message was deallocating the TBE, causing the second message to dereference a nullptr when looking up the TBE. A counter is added to track the number of in-flight AtomicDone trigger messages. The AtomicDone is not called until the last in-flight message arrives at the trigger queue. The remaining messages call AtomicNotDone, which simply pops the message from the queue and keeps the TBE allocated. Change-Id: Ie1de0436861a7c393ad6d2fb2faceb83c18d4cc3 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/39175 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-01-15 17:46:38 +00:00
Hoa Nguyen	4c42811ff3	mem-ruby: Move CacheMemory stats used in SLICC to a Stats group This change moves some stats that are used in SLICC to a separate Stats::Group. In order to use stats in SLICC, new functions are added in CacheMemory: - profileDemandHit() - profileDemandMiss() The functions increase the corresponding stat by 1. Change-Id: I52b6fefdf6579a49f626f2fca400641f90800017 Signed-off-by: Hoa Nguyen <hoanguyen@ucdavis.edu> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/37815 Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Tiago Mück <tiago.muck@arm.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com>	2020-12-22 09:52:36 +00:00
Hoa Nguyen	580eb64195	mem-ruby: Fix cache hits being profiled as cache misses There are some instances where a cache hit is profiled as a cache miss. This commit addresses this error. Change-Id: I7dafa806ef3f1e3717650dc25f8657a0ea741dd1 Signed-off-by: Hoa Nguyen <hoanguyen@ucdavis.edu> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/37835 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Reviewed-by: Daniel Gerzhoy <daniel.gerzhoy@gmail.com> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-11-21 00:47:51 +00:00
Brad Beckmann	80221d7e1d	configs,mem-ruby: Remove old GPU ptls These protocols are no longer supported, either because they are not representative of GPU protocols, or because they have not been updated to work with GCN3. Change-Id: I989eeb6826c69225766aaab209302fe638b22719 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/34197 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-11-04 21:09:26 +00:00
Daniel Gerzhoy	efabe5ec1b	mem-ruby: L1/L2 hit/miss tracking for MOESI_AMD_BASE/GPU_VIPER L1 and L2 access tracking was not fully implemented. This patch adds the missing tracking actions, and corrects several errors for the ones that were there. Change-Id: I69a59283274c08e94b6650ab5f586cbfe5432503 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/33915 Maintainer: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com>	2020-10-22 14:47:06 +00:00
Daniel Gerzhoy	85ede9a180	mem-ruby: L3 hit/miss tracking to MOESI_AMD_BASE-dir L3 access tracking added to the directory controller. This commit adds L3 hit/miss tracking to the controller. Hit/miss status is decided when the tag array of the L3 Cache is checked for the first time for any given request. Change-Id: Icac122f59509d79135265fb38b112d3f47419b6f Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/33314 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-10-22 14:45:34 +00:00
Tiago Mück	544bf8bde7	mem-ruby: Expose MessageBuffer methods SLICC interface for checking the capacity of MessageBuffers Change-Id: I28e2d22a405d33fcbe6a183dffc31bd936fa26c4 Signed-off-by: Tiago Mück <tiago.muck@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/31271 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-10-13 15:25:34 +00:00
Tiago Mück	cb48ce2a34	mem-ruby: add addressOffset util Returns the offset of an address with respect to a base address. Looks unnecessary, but SLICC doesn't support casting and the '-' operator for Addr types, so the alternative to this would be to add some more helpers like 'addrToUint64' and 'uint64ToInt'. Change-Id: I90480cec4c8b2e6bb9706f8b94ed33abe3c93e78 Signed-off-by: Tiago Mück <tiago.muck@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/31270 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-10-13 15:25:34 +00:00
Tiago Mück	f8e3ba7b7b	mem-ruby: sequencer callback for unique writes A controller may complete a write without obtaining a full copy of the line. This patch adds a specific callback for this purpose that prevents reads from being coalesced with a write on a potentially incomplete line. Change-Id: I3775f81699f38e406fee28f92c9c8e06deb3d528 Signed-off-by: Tiago Mück <tiago.muck@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/31269 Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Bradford Beckmann <bradford.beckmann@gmail.com>	2020-10-12 14:09:55 +00:00

98 Commits