derek/gem5 - gem5 - Gitea: Git with a cup of tea

derek/gem5

Author	SHA1	Message	Date
Tiago Mück	027b508a38	mem-ruby: fix missing transition in CHI-mem JIRA: https://gem5.atlassian.net/browse/GEM5-1195 Change-Id: I0aae4b9042cb6565c77cc8781b514a9e65ab161b Signed-off-by: Tiago Mück <tiago.muck@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/63676 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Bobby Bruce <bbruce@ucdavis.edu> Tested-by: kokoro <noreply+kokoro@google.com>	2022-09-28 18:56:04 +00:00
Tiago Mück	c6a460eff4	mem-ruby: fix CHI memory controller Break up the transition to READING_MEM into two separate steps so contention at the requestToMemory queue won't block the TBE initialization. JIRA: https://gem5.atlassian.net/browse/GEM5-1195 Change-Id: Ifa0ee589bde67eb30e7c0b315ff41f22b61e8db7 Signed-off-by: Tiago Mück <tiago.muck@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/63675 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Bobby Bruce <bbruce@ucdavis.edu> Tested-by: kokoro <noreply+kokoro@google.com>	2022-09-28 18:56:04 +00:00
Daecheol You	e8ff8817e3	mem-ruby: bug fix for stale WriteBack Finish_CopyBack_Stale is scheduled only when the requestor is the last sharer. This prevents the cacahe evicting the line which was already evicted while the stale WriteBack transaction was stalled. Wrong condition check in Finish_CopyBack_Stale for eviction is also removed. Change-Id: Ib66acc1b9e4a6f7cea373e1fb37375427897d48d Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/63611 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-09-19 01:57:23 +00:00
Tiago Muck	f6b2793b91	Revert "mem-ruby: bug fix for Finish_CopyBack_Stale" This reverts commit `f7cf47bc31`. Reason for revert: introduces an issue when handling a stale WriteBack Change-Id: I4bd370911cb003c0c99e5fd14866b8c98afa80e2 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/63412 Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com>	2022-09-12 14:52:38 +00:00
Daecheol You	f7cf47bc31	mem-ruby: bug fix for Finish_CopyBack_Stale I made a mistake in the change below: https://gem5-review.googlesource.com/c/public/gem5/+/58413 Checking the requestor in the sharer list for eviction should be removed now. If the sharer count is zero, the requestor can't be in the sharer list. Change-Id: I304d2dd7df1aff4907801664a260c35c490a2136 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/62991 Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com>	2022-09-09 20:38:20 +00:00
Jarvis Jia	b86088008a	mem-ruby: Fix replacement policy updates with stores in MI_example The current MI_example protocol's L1 caches updates the MRU information twice per store requests that miss -- once when the request reaches Ruby and once when the store miss is returned from another level of the memory hierarchy. Although this approach does not cause any correctness bugs for replacement policies like LRU since this request is the LRU in both cases, it does not work correctly for other policies like SecondChance and LFU, where updating the information twice (for misses) causes them to devolve to LRU. Note that this was not directly a problem with Ruby previously, because it only supported LRU-based policies that were unaffected by this. However, with the integration of 20879 Ruby now uses the same replacement policies as Classic (which has additional, non-LRU based replacement policies). This patch resolves this problem by not updating the MRU information a second time for the misses. It has been tested and validated with the replacement policy tests in 20880, and it modifies the store instead of the load in 62232. Change-Id: I8436e3e537da0ee5841c59a94fa5e5c30105529f Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/63191 Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com>	2022-09-09 15:19:54 +00:00
Bobby R. Bruce	2bc5a8b71a	misc: Run pre-commit run on all files in repo The following command was run: ``` pre-commit run --all-files ``` This ensures all the files in the repository are formatted to pass our checks. Change-Id: Ia2fe3529a50ad925d1076a612d60a4280adc40de Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/62572 Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Andreas Sandberg <andreas.sandberg@arm.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com>	2022-08-24 21:47:07 +00:00
Jarvis Jia	2816598831	mem-ruby: Fix replacement policy updates in MI_example The current MI_example protocol's L1 caches updates the MRU information twice per request on misses -- once when the request reaches Ruby and once when the miss is returned from another level of the memory hierarchy. Although this approach does not cause any correctness bugs for replacement policies like LRU since this request is the LRU in both cases, it does not work correctly for other policies like SecondChance and LFU, where updating the information twice (for misses) causes them to devolve to LRU. Note that this was not directly a problem with Ruby previously, because it only supported LRU-based policies that were unaffected by this. However, with the integration of 20879 Ruby now uses the same replacement policies as Classic (which has additional, non-LRU based replacement policies). This patch resolves this problem by not updating the MRU information a second time for the misses. It has been tested and validated with the replacement policy tests in 20880. Change-Id: I82a57abf2a16d70820413ba8118378f2e91fd7fb Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/62232 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com>	2022-08-19 03:08:02 +00:00
Richard Cooper	b893344b7d	mem-ruby: Add descriptions to the CHI DVM symbols. This commit adds `desc` descriptions to the new symbols introduced with CHI DVM support. The generation of the SLICC HTML documentation requires each symbol to have a description, so a build with `SLICC_HTML=True` will fail without this change. Change-Id: I06f3bdd33edd1ff6e4bec35b01a460b9359ed9f6 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/60869 Maintainer: Bobby Bruce <bbruce@ucdavis.edu> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-07-06 17:09:46 +00:00
Matt Sinclair	9c1af09605	mem-ruby, gpu-compute: update TCP,SQC to pass hit/miss Previously, the GPU SQC and TCP Ruby protocols always told the Sequencer that the externalHit field was false. This impacts the statistics and profiling, because the Sequencer uses this hit/miss information both for profiling and the coalescer's statistics. To resolve this, this commit updates the GPU SQC and TCP Ruby protocols to pass the appropriate hit/miss information into the Sequencer's readCallback and hitCallback functions. Change-Id: Ib74af09b66fa8866eee72d3a9ab0e8a8f2196c03 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/60652 Reviewed-by: Matthew Poremba <matthew.poremba@amd.com> Maintainer: Matthew Poremba <matthew.poremba@amd.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-06-21 22:59:05 +00:00
Matt Sinclair	669eb6a6fa	mem-ruby, gpu-compute: add hit/miss profiling to SQC This commit updates the Ruby SQC (GPU L1 I$) to perform hit and miss profiling on each request that reaches it. Change-Id: I736521b89b5d37d950265f32cf1a6d2ee5316dba Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/60651 Maintainer: Matthew Poremba <matthew.poremba@amd.com> Reviewed-by: Matthew Poremba <matthew.poremba@amd.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-06-21 22:58:42 +00:00
Kyle Roarty	f876e60bc2	mem-ruby: Fix deadlock in GPU VIPER TCC A deadlock occured where we got a RdBlk while in W, which put us in WI while we wait for a writeback to complete. This would cause the request to be stalled while the writeback was occuring, but when the writeback completed (WBAck), we never woke up the requests and thus never completed the RdBlk. This commit adds a wakeup when we receive a WBAck while in WI. Change-Id: I01edf1d7a47757b4f680baf9f33a1a6aa37e7e25 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/59352 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Reviewed-by: Matthew Poremba <matthew.poremba@amd.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-06-06 18:28:52 +00:00
Daecheol You	9bfffe0f34	mem-ruby: modify the TBE data state for ReadOnce_HitUpstream When ReadOnce request hits upstream, set dataToBeInvalid to true for R* states so that the line from the upstream is successfully dropped at the end by Finalize_UpdateCacheFromTBE. For UD_RU and UC_RU state, set dataValid to true to prevent it changing to RU state when it doesn't get the snoop data response. Change-Id: Ie83c511e8d158e18abc5c9c16bc6040ce73587bf Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/58411 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Tiago Muck <tiago.muck@arm.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-06-03 09:31:21 +00:00
Tiago Mück	e4274cabd9	mem-ruby: fix Evict request for CHI excl. caches Assume core C1 with private L1/L2 and a shared exclusive L3. C1 has a line in SC state, while the state in the L3 is RUSC (L3 has exclusive accesses and upstream requester has line in SC). When C1 evicts the line (Evict request), the L3 has to issue a WriteEvictFull to the home node, however the L3 doesn't have a copy of the line. This fix handling Evict requests when the line state is RUSC. When the last sharer issues an Evict request, the responder may issue SnpOnce the obtain a copy the line if needed. JIRA: https://gem5.atlassian.net/browse/GEM5-1195 Change-Id: Ic8f4e10b38d95cd6d84f8d65b87b0c94fcf52eea Signed-off-by: Tiago Mück <tiago.muck@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/59991 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com>	2022-06-01 15:23:47 +00:00
Tiago Mück	612f242359	mem-ruby: fix CHI snoops clearing WU data When just forwarding a WU request, the controller waits until the WU is acked from downstream before sending the ack upstream. This prevents snoops clearing valid WU data. JIRA: https://gem5.atlassian.net/browse/GEM5-1195 This was more likely to happen with shared exclusive caches, e.g: assume core C1 and C2 with private L1/L2 and a shared exclusive L3. C1 has as dirty copy of the line while C2 issues a WriteUnique request to that line. The line state is RU in the L3, so the L3 will just forward the request to the HNF, so: - C2 issues WU to L3 cache - L3 acks the WU, allowing C2 to send the data, while concurrently forwarding the WU to the HNF. - L3 receives data from C2 - HNF sends invalidating snoops upstream because line is RU - The snoop hazards with the pending WU at the L3 and invalidates the data previously received. This causes an assertion to fail when we resume handling the WU. Change-Id: I51e457e0bdb648c0fff3f702b7d2c95dcf431dc5 Signed-off-by: Tiago Mück <tiago.muck@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/59990 Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com>	2022-06-01 15:23:47 +00:00
Tiago Mück	1dfd319d98	mem-ruby: fix data state for partial WU When receiving data from a WriteUniquePtl we were wrongfully clearing the data valid flag. JIRA: https://gem5.atlassian.net/browse/GEM5-1195 Change-Id: I5c17433f1cfb706e443a0169a9f0e99ff5c1fcc0 Signed-off-by: Tiago Mück <tiago.muck@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/59989 Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com>	2022-06-01 15:23:47 +00:00
Tiago Mück	dc33a16993	mem-ruby: fix functionalRead on pending CU Normally we don't check the TBE data if there are outstanding response messages for the transaction because that means the latest valid data is either in another cache or within an inflight message. However this is not the case when we have either a pending CleanUnique or we are handling CleanUnique. So bypass the pending message check in this case. Change-Id: I5f31039ca2a01a6a68fee8e0f3cf02c7e437b43e Signed-off-by: Tiago Mück <tiago.muck@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/57395 Reviewed-by: Daecheol You <daecheol.you@samsung.com> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-06-01 15:23:47 +00:00
Tiago Mück	23888df8a9	mem-ruby: fix MaintainCoherence typo Change-Id: Iee3319e1d470898c727747894287029e1b0ab102 Signed-off-by: Tiago Mück <tiago.muck@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/57394 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com>	2022-06-01 15:23:47 +00:00
Tiago Mück	12641069de	mem-ruby: reuse existing event on CleanUnique Reuse the existing MaintainCoherence event to schedule writebacks or cache fill after a CleanUnique. JIRA: https://gem5.atlassian.net/browse/GEM5-1195 Change-Id: I127ebf78736b8312ccf2b18cf7c586eb5a77f373 Signed-off-by: Tiago Mück <tiago.muck@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/57393 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Bobby Bruce <bbruce@ucdavis.edu> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-06-01 15:23:47 +00:00
Tiago Mück	d1d6b4cb9e	mem-ruby: fix inconsistent WBs for dirty data Initiate_MaitainCoherence would not trigger a writeback if tbe.dataMaybeDirtyUpstream is set due to the assumption that the upstream cache would writeback any dirty data. However this is not the case if we use this action finalize a CleanUnique, e.g.: - L1-A has data in SC - L1-B has data in SD - L2 has data in RUSD (L2 is an exclusive cache) - L1-A sends CleanUnique to L2 - L2 invalidates L1-B and receives dirty data. - L2 acks the CleanUnique; L1-A is now UC - L2 has the dirty data but drops it because dataMaybeDirtyUpstream - L1-A doesn't modify the data and eventually evicts it with WriteEvict - Data from WriteEvicts are dropped at the HNF and we lose the line This patch removes the tbe.dataMaybeDirtyUpstream check. Instead it only skips the WriteBack if an upstream cache is in SD state, when it's guaranteed it will writeback the dirty data. JIRA: https://gem5.atlassian.net/browse/GEM5-1195 Change-Id: I6722bc25068b0c44afcf261abc8824f1d80c09f9 Signed-off-by: Tiago Mück <tiago.muck@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/57392 Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Daecheol You <daecheol.you@samsung.com> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com>	2022-06-01 15:23:47 +00:00
Tiago Mück	183e8e2b61	mem-ruby: fix state updates on WriteCleanFull - fix wrong variable check at UpdateDirState_FromReqDataResp - even after a WriteClean, dataMaybeDirtyUpstream still applies if there is an exclusive owner upstream. JIRA: https://gem5.atlassian.net/browse/GEM5-1195 Change-Id: If1fa3ee40e30226db3a66c34633316e751eb7c4d Signed-off-by: Tiago Mück <tiago.muck@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/57391 Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Daecheol You <daecheol.you@samsung.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com>	2022-06-01 15:23:47 +00:00
Tiago Mück	5faa7aaffd	mem-ruby: removed check for WriteCleanFull Relaxed check on Send_WriteCleanFull. That data state may actually happen if the writeback was triggered by a CleanUnique request. JIRA: https://gem5.atlassian.net/browse/GEM5-1195 Change-Id: I33ec5693df09efe39345f403c5b6d3388f1a5056 Signed-off-by: Tiago Mück <tiago.muck@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/57390 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Daecheol You <daecheol.you@samsung.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-06-01 15:23:47 +00:00
Tiago Mück	ff5aafa1e9	mem-ruby: fix CHI wrong response to ReadShared When an exclusive cache is responding to a ReadShared and the line is unique, it send the data in unique state without checking if the line already has other sharers in other upstream caches. This patch fixes this issue and also cleans up Send_CompData. JIRA: https://gem5.atlassian.net/browse/GEM5-1195 Change-Id: Ica7c2afafb55750681b39ae7de99a665689ecb8a Signed-off-by: Tiago Mück <tiago.muck@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/57389 Maintainer: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Bobby Bruce <bbruce@ucdavis.edu> Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com>	2022-06-01 15:23:47 +00:00
Daecheol You	073dc853f4	mem-ruby: fix the condition for stale WriteCleanFull WriteCleanFull can be requested for the cache line in SD state (e.g. Local eviction of a cache line in SD_RSC state). In this case, the requestor is the owner of the cache line, but it doesn't have it with exclusive right. Thus, 'ownerIsExcl == false' should be removed from the stale condition. Change-Id: I4d34021ac31b2e8600c24689a03a3b8fa18aa1f7 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/58412 Tested-by: kokoro <noreply+kokoro@google.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com>	2022-05-28 04:57:39 +00:00
Daecheol You	eaf23bcd9f	mem-ruby: fix sharer update for stale WriteCleanFull Initiate_CopyBack_Stale removes the requestor from the sharer list. However, if CBWrData_SC is the data response of stale WriteCleanFull, the requestor should remain in the sharer list. Thus, whether to send a Evict or not can be decided after the data response arrives. For this, FinishCopyBack_Stale event was added as the last event to handle Evict. Change-Id: Ic3e3a1e4d74b24b9aa328b2ddfa817db44f24e4e Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/58413 Tested-by: kokoro <noreply+kokoro@google.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Tiago Muck <tiago.muck@arm.com> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com>	2022-05-27 04:02:56 +00:00
Daecheol You	8b648ac856	mem-ruby: add missing response for ReadOnce When HNF snoops an RNF with SnpOnce to process ReadOnce request (e.g. DMA read request), the RNF can respond with SnpRespData_UC if the cache line is in UC. Thus, SnpRespData_UC was added to the transition events. Change-Id: Ife242e75feb9d2451eb99511e21833d9d190a6c3 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/58410 Maintainer: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Tiago Muck <tiago.muck@arm.com> Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com>	2022-05-26 00:38:23 +00:00
Bobby R. Bruce	770f470495	arch-arm: Fixed ARM/gem5.fast compilation failures The compiler-tests were failing: https://jenkins.gem5.org/job/compiler-checks/238 This was due to an `error: unused variable` error being thrown in cases where a variable was declared and used soley in an `assert` within a SLICC file. Assertions of this kind are stripped during .fast compilation. This patch fixes this. Change-Id: I3a91ac8b1a51de7ddffd6a1cff602a934862b49c Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/59829 Maintainer: Bobby Bruce <bbruce@ucdavis.edu> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-05-20 17:59:57 +00:00
Samuel Stark	38d360a475	configs, mem-ruby: Implement DVMOps in CHI 1) Handling TLBI/TLBI_SYNC requests from the PE in the CHI Request Node (Generating DVMOps) 2) Adding a new machine type for the Misc Node (MN) that handles DVMOps from the Request Node (RN), following the protocol specified within the Amba 5 CHI Architecture Specification [1] JIRA: https://gem5.atlassian.net/browse/GEM5-1097 [1]: https://developer.arm.com/documentation/ihi0050/latest Change-Id: I9ac00463ec3080c90bb81af721d88d44047123b6 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/57298 Maintainer: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-05-18 08:52:53 +00:00
Tiago Mück	eb0b4ba657	mem-ruby: CHI fix for WUs on local+upstream line Fix for WriteUnique operations on cache lines that are both local and upstream JIRA: https://gem5.atlassian.net/browse/GEM5-1097 Change-Id: I99def32948d3f0ced9cfc7f7712a0f4ae9aab0cd Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/57299 Reviewed-by: Tiago Muck <tiago.muck@arm.com> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Bobby Bruce <bbruce@ucdavis.edu> Tested-by: kokoro <noreply+kokoro@google.com>	2022-04-12 10:21:57 +00:00
Samuel Stark	65f8bf4460	mem-ruby: Support for unaddressed mem requests in the RubyRequest JIRA: https://gem5.atlassian.net/browse/GEM5-1097 Change-Id: I5aa44186888b95f81bec524ff57e8dbf4c9166f8 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/57293 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-04-11 07:31:34 +00:00
Gabe Black	e6c0ba97db	scons: Put all config variables in an env['CONF'] sub-dict. This makes what are configuration and what are internal SCons variables explicit and separate, and makes it unnecessary to call out what variables to export to C++. These variables will also be plumbed into and out of kconfiglib in later changes. Change-Id: Iaf5e098d7404af06285c421dbdf8ef4171b3f001 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/56892 Reviewed-by: Andreas Sandberg <andreas.sandberg@arm.com> Maintainer: Gabe Black <gabe.black@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-03-28 20:31:21 +00:00
Gabe Black	f10fe51e18	scons: Don't accumulate SLICC_INCLUDES. Presumably, these are fixed for whatever protocol that gets selected. We don't need to accumulate includes, we need to set includes to something in particular. If there is a common include which always needs to be used, we can handle that in the SConscript separately from SLICC_INCLUDES. Change-Id: I996d08566944e38e388dc287f644c40366ebba0d Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/56754 Tested-by: kokoro <noreply+kokoro@google.com> Maintainer: Gabe Black <gabe.black@gmail.com> Reviewed-by: Yu-hsin Wang <yuhsingw@google.com>	2022-03-24 22:09:09 +00:00
Matthew Poremba	20d8b388ad	mem-ruby: Enhance MOESI_AMD DmaWrite This enhances MOESI_AMD_Base-dir DmaWrite to enable partial writes. This is currently done by assuming a full cache line, invalidating caches, and transitioning back to unblocked state. The enhanced write supports partial writes (i.e., smaller than cache line size) by first reading memory, merging the modified data, and then writing back to memory. Implementation of this mirrors that of DmaRead in terms of state. This means for each DmaRead state (BDR_PM, BDR_Pm, and BDR_M) there is a write analogue (BDW_PM, BDW_Pm, and BDR_M) and the BDR_P state is removed. Furthermore, this enhanced DmaWrite ... actually writes data to memory instead of relying on DirectoryEntry / backing store for correct data. There are two possible state transitions for DmaWrite now. (1) Memory data arrives before probe response and (2) probe response arrives before memory data. In case (1), probe data overwrites memory data and merges the partial write using the TBE write mask then updates write mask to 'filled' state. In case (2), probe data is merged with the partial data using the TBE write mask then updates write mask to 'filled' state. The memory data will then be clobbered by copying the TBE data over the response since the write mask is now full. Change-Id: I1eebb882b464c4c5ee5fd60932fd38d271ada4d7 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/57410 Reviewed-by: Matthew Poremba <matthew.poremba@amd.com> Maintainer: Matthew Poremba <matthew.poremba@amd.com> Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-03-13 15:31:32 +00:00
Matthew Poremba	bfcab1258f	mem-ruby: Remove DataBlk from MOESI_AMD DirectoryEntry This protocol is using an old style where read/writes to memory were being done by writing to a DataBlock in a DirectoryMemory entry. This results in having multiple copies of memory, leads to stale copies in at least one memory (usually DRAM), and require --access-backing-store in most cases to work properly. This changeset removes all references to getDirectoryEntry(...).DataBlk and instead forwards those reads and writes to DRAM always. Change-Id: If2e52151789ad82c7b55c8fa2b41c1f4e5b65994 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/57409 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-03-13 15:31:32 +00:00
Matthew Poremba	6a9dfcef52	mem-ruby: Revert `7018c2b34` This reverts commit `7018c2b34e`. This commit needs more work which will take a while. Meanwhile the nightly tests are broken because of this. Change-Id: I11d01d50ab3a2d8fd649f1a825911e14815b1ca6 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/57109 Reviewed-by: Bobby Bruce <bbruce@ucdavis.edu> Maintainer: Bobby Bruce <bbruce@ucdavis.edu> Tested-by: kokoro <noreply+kokoro@google.com>	2022-02-26 15:19:51 +00:00
Matthew Poremba	1bc23ca966	mem-ruby: Add protocol prints to MOESI_AMD_BASE-dma Change-Id: I59ed7311a8dc2a06ce1df0027891ba8e24e8a89e Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/56447 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-02-17 17:03:19 +00:00
Matthew Poremba	7018c2b34e	mem-ruby: Remove DirectoryMemory storage in MOESI_AMD_BASE-dir This protocol is using an old style where read/writes to memory were being done by writing to a DataBlock in a DirectoryMemory entry. This results in having multiple copies of memory, leads to stale copies in at least one memory (usually DRAM), and require --access-backing-store in most cases to work properly. This changeset removes all references to getDirectoryEntry(...).DataBlk and instead forwards those reads and writes to DRAM always. This results in new transient states BL_WM, BDW_WM, and B_WM which are blocked states waiting on memory acks indicating a write request is complete. The appropriate transitions are updates to move to these new states and stall states are updated to include them. DMA write ACK is also moved to when the request is sent to memory, rather than when the request is received. Change-Id: Ic5bd6a8a8881d7df782e0f7eed8be9d873610e04 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/56446 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-02-17 17:03:19 +00:00
Matthew Poremba	54fc137945	mem-ruby: Ensure MOESI_AMD_Base-dir has probe destinations The directory has an assert that this is at least one destination for a probe when sending an invalidation or shared probe to coherence end points in the protocol (TCC, LLC). This is not necessarily request and for certain configurations there will be no probes required and none will be sent. One such configuration is the GPU protocol tester which would not require a probe to the CPU if it does not exist. To fix this we first collect the probe destinations. Then we check if any destinations exist. If so, we send the probe message. Otherwise we immediately enqueue a probe complete message to the trigger queue. This reorganization prevents messages with no destinations from being enqueued, meeting the criteria for the assertion. Change-Id: If016f457cb8c9e0277a910ac2c3f315c25b50ce8 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/55543 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-02-17 17:03:19 +00:00
Tiago Mück	b354e1a252	mem-ruby: Fix handling of stale CleanUnique JIRA: https://gem5.atlassian.net/browse/GEM5-1185 Fixed an issue in which a CleanUnique responder would incorrectly deallocate the cache block when handling an stale CU when the state is UD_RU or UC_RU (thus incorrectly transitioning to RU). The fix is to handle stale CUs similarly to stale WBs where we override the dataValid TBE field to prevent the wrong state transition. This patch moves the stale code path to a separate transition (similarly to stale WBs/Evicts) and moves the dataValid override to Initiate_Request_Stale so it applies to all stale request types. Notice now the stale field is also set on stale Comp_UC responses. Additional minor change: CheckUpgrade_FromRU is the same as CheckUpgrade_FromStore so it was removed. Change-Id: I0a2cedcfde1dc30d67aa2c16d71b7470369c2b6e Signed-off-by: Tiago Mück <tiago.muck@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/56810 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Meatboy 106 <garbage2collector@gmail.com>	2022-02-17 15:21:45 +00:00
Gabriel Busnot	8a7fcd340f	mem-ruby: Add missing CHI transition SD_RSC + *_Stale->BUSY_BLKD Related JIRA: https://gem5.atlassian.net/browse/GEM5-1180 Change-Id: Ife83bebcaa48345633fce0a0de08394e30c1a796 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/56083 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Tiago Muck <tiago.muck@arm.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-01-28 07:01:14 +00:00
Matthew Poremba	9313294efe	misc: Remove AMD license addition Remove the line "For use for simulation and test purposes only" in files were AMD is the only copyright holder listed in the header. This happens to be the case for all files where this line exists, removing it completely from gem5. Change-Id: I623f266b002f564301b28774f49081099cfc60fd Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/53943 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-12-11 04:00:56 +00:00
Gabe Black	1c233ee9d2	scons: Add sim_object and enums arguments to SimObject(). This will explicitly declare what SimObject and Enum types need to be set up in C++, which will make importing all the SimObject modules during the setup phase of SCons uneccessary. Change-Id: Id2d7603daf33b236ceaa0789e2f089f589d34e62 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/49406 Reviewed-by: Gabe Black <gabe.black@gmail.com> Maintainer: Gabe Black <gabe.black@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-12-08 08:01:23 +00:00
Matthew Poremba	e0d62e510d	configs,mem-ruby: Remove reference to old GPU ptls GPU_VIPER_Baseline, GPU_VIPER_Region, and GPU_RfO were removed some time ago. Change-Id: If873b0cfe8cc2b3096cbe97d4e13a8e02d2ec567 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/53703 Reviewed-by: Bobby Bruce <bbruce@ucdavis.edu> Maintainer: Bobby Bruce <bbruce@ucdavis.edu> Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-12-07 20:26:17 +00:00
Matthew Poremba	c5ba40cfe1	mem-ruby: Add GPUonly parameter for VIPER Currently MOESI_AMD_Base used in VIPER has a CPUonly parameter which indicates that messages should not try to add GPU SLICC controllers as destinations. This adds the analogue GPUonly parameter which indicates that requests should not try to add CPU SLICC controllers. Also adds an assert to ensure the outgoing message has at least one destination. This assert would indicate a misconfiguration. Change-Id: Ibb0affd4606084fca021f0e7c117d4ff8c06d429 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/51928 Tested-by: kokoro <noreply+kokoro@google.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com>	2021-10-26 15:52:11 +00:00
Matthew Poremba	55fdf4be52	mem-ruby: Add missing CPUonly check for VIPER The CPUonly variable in MOESI_AMD_Base's Directory indicates that probes should not be sent to any GPU SLICC controllers as they are not part of CPU. There is one CPUonly check missing which causes problems in GPU-only Ruby networks as there is no route to any controllers with that MachineType. Add a condition to check CPUonly and do nothing in that case. Change-Id: I41b6c04feec473e34b04402adfb5978e75b847b6 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/51927 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-10-26 15:52:11 +00:00
Matt Sinclair	118677218d	mem-ruby: fix typo in GPU VIPER TCC comment `72ee6d1a` fixed a deadlock in the GPU VIPER TCC. However, it inadvertently added a typo to the comments explaining the change. This commit fixes that. Change-Id: Ibba835aa907be33fc3dd8e576ad2901d5f8f509c Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/51687 Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-10-17 04:07:49 +00:00
Matt Sinclair	1120931105	mem-ruby: Move VIPER TCC decrements to action from in_port Currently, the GPU VIPER TCC protocol handles races between atomics in the triggerQueue_in. This in_port does not check for resource availability, which can cause the trigger queue to execute multiple times. Although this is the expected behavior, the code for handling atomic races decrements the atomicDoneCnt flag in the trigger queue, which is not safe since resource contention may cause it to execute multiple times. To resolve this issue, this commit moves the decrementing of this counter to a new action that is called in an event that happens only when the race between atomics is detected. Change-Id: I552fd4f34fdd9ebeec99fb7aeb4eeb7b150f577f Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/51368 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Matthew Poremba <matthew.poremba@amd.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-10-08 22:03:13 +00:00
Matt Sinclair	72ee6d1aad	mem-ruby: Update GPU VIPER TCC protocol to resolve deadlock In the GPU VIPER TCC, programs with mixes of atomics and data accesses to the same address, in the same kernel, can experience deadlock when large applications (e.g., Pannotia's graph analytics algorithms) are running on very small GPUs (e.g., the default 4 CU GPU configuration). In this situation, deadlocks occur due to resource stalls interacting with the behavior of the current implementation for handling races between atomic accesses. The specific order of events causing this deadlock are: 1. TCC is waiting on an atomic to return from directory 2. In the meantime it receives another atomic to the same address -- when this happens, the TCC increments number of atomics to this address (numAtomics = 2) that are pending in TBE, and does a write through of the atomic to the directory. 3. When the first atomic returns from the Directory, it decrements the numAtomics counter. numAtomics was at 2 though, because of step #2. So it doesn't deallocate the TBE entry and calls Event:AtomicNotDone. 4. Another request (a LD) to the same address comes along for the same address. The LD does z_stall since the second atomic is pending –- so the LD retries every cycle until the deadlock counter times out (or until the second atomic comes back). 5. The second atomic returns to the TCC. However, because there are so many LD's pending in the cache, all doing z_stall's and retrying every cycle, there are a lot of resource stalls. So, when the second atomic returns, it is forced to retry its operation multiple times -- and each time it decrements the atomicDoneCnt flag (which was added to catch a race between atomics arriving and leaving the TCC in `7246f70bfb`) repeatedly. As a result atomicDoneCnt becomes negative. 6. Since this atomicDoneCnt flag is used to determine when Event:AtomicDone happens, and since the resource stalls caused the atomicDoneCnt flag to become negative, we never complete the atomic. Which means the pending LD can never access the line, because it's stuck waiting for the atomic to complete. 7. Eventually the deadlock threshold is reached. To fix this issue, this commit changes the VIPER TCC protocol from using z_stall to using the stall_and_wait buffer method that the Directory-level of the SLICC already uses. This change effectively prevents resource stalls from dominating the TCC level, by putting pending requests for a given address in a per-address stall buffer. These requests are then woken up when the pending request returns. As part of this change, this change also makes two small changes to the Directory-level protocol (MOESI_AMD_BASE-dir): 1. Updated the names of the wakeup actions to match the TCC wakeup actions, to avoid confusion. 2. Changed transition(B, UnblockWriteThrough, U) to check all stall buffers, as some requests were being placed later in the stall buffer than was being checked. This mirrors the changes in `187c44fe44` to other Directory transitions to resolve races between GPU and DMA requests, but for transitions prior workloads did not stress. Change-Id: I60ac9830a87c125e9ac49515a7fc7731a65723c2 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/51367 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Matthew Poremba <matthew.poremba@amd.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-10-08 22:03:13 +00:00
Daecheol You	82db312550	mem-ruby: Add (RUSC, LocalHN_Eviction) transition During full system simulation on CHI, LocalHN_Eviction event on the RUSC state occured occasionally. Thus, the change adds RUSC state to the transition. Change-Id: Ibff382c38a092895bc03a4a64cf072ae752decf3 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/49263 Reviewed-by: Tiago Mück <tiago.muck@arm.com> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-08-24 00:17:32 +00:00
Carlos Falquez	6d07200693	mem-ruby: Add (BUSY_BLKD,SnpOnceFwd) transition Add (BUSY_BLKD,SnpOnceFwd) cache transition to the Ruby CHI protocol. Change-Id: I150880b26dee869b48cfd16fb661b9487527a8cd Signed-off-by: Carlos Falquez <c.falquez@fz-juelich.de> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/46901 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Tiago Mück <tiago.muck@arm.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-06-29 02:13:54 +00:00

1 2 3

125 Commits