derek/gem5 - gem5 - Gitea: Git with a cup of tea

derek/gem5

Author	SHA1	Message	Date
Tiago Mück	eb0b4ba657	mem-ruby: CHI fix for WUs on local+upstream line Fix for WriteUnique operations on cache lines that are both local and upstream JIRA: https://gem5.atlassian.net/browse/GEM5-1097 Change-Id: I99def32948d3f0ced9cfc7f7712a0f4ae9aab0cd Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/57299 Reviewed-by: Tiago Muck <tiago.muck@arm.com> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Bobby Bruce <bbruce@ucdavis.edu> Tested-by: kokoro <noreply+kokoro@google.com>	2022-04-12 10:21:57 +00:00
Samuel Stark	7e84a14a26	mem-ruby: AbstractController unaddressed profiling Adds support for profiling "unaddressed" transactions, which are associated with a unique ID rather than a memory address, to AbstractController. JIRA: https://gem5.atlassian.net/browse/GEM5-1097 Change-Id: Ib75f3f38dc4910acc2ad4f1c7bf88c9193568203 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/57297 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-04-11 09:57:43 +00:00
Samuel Stark	920859e191	mem-ruby: Added upstream_nodes to AbstractController Added support for an upstream_nodes NetAddr list in AbstractController, which will be used in future CHI work. JIRA: https://gem5.atlassian.net/browse/GEM5-1097 Change-Id: I30a6d621d7f201d89f0b13dab8ed4dd1f1f6caa3 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/57296 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-04-11 09:57:43 +00:00
Samuel Stark	65f8bf4460	mem-ruby: Support for unaddressed mem requests in the RubyRequest JIRA: https://gem5.atlassian.net/browse/GEM5-1097 Change-Id: I5aa44186888b95f81bec524ff57e8dbf4c9166f8 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/57293 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-04-11 07:31:34 +00:00
Samuel Stark	32ed7794d8	mem-ruby: Add TLBI callbacks to the RubyPort JIRA: https://gem5.atlassian.net/browse/GEM5-1097 Change-Id: I984fd497b7209772106150abb853c91c3d818dfd Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/57295 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-04-11 07:31:34 +00:00
Giacomo Travaglini	38fe886ee3	mem-ruby: Support for mem commands in the Sequencer The isPhysMemAddress checks if a valid memory address refers to physical memory. This can't be used for memory commands a they don't hold a valid address/size Change-Id: Ib39c759aa90ab50ffe2036b5f0ae17627f57e5f5 Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/58510 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-04-06 08:37:11 +00:00
Giacomo Travaglini	5747822292	mem: Add Request factory method for memory management command This should be used to construct memory management Requests (Not requiring an address nor a size) Change-Id: Id1b6f1032c1390210a216cd77c7dd0cec14e962f Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/58357 Reviewed-by: Andreas Sandberg <andreas.sandberg@arm.com> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-04-05 09:24:16 +00:00
Giacomo Travaglini	05f1975832	mem: Introduce Request::isMemMgmt to cover memory management cmds It will check if the request is a TLB invalidation or a transactional memory request Change-Id: I84351a13a6806d8119e4efa8ef98ab150976c8ab Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/58509 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Andreas Sandberg <andreas.sandberg@arm.com> Maintainer: Andreas Sandberg <andreas.sandberg@arm.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-04-05 09:24:16 +00:00
Jui-min Lee	118b069d5d	mem: Align mmap offset to page boundary If we create abstract memories with a sub-page size on a system with shared backstore, the offset of next mmap might become non-page-align and cause an invalid argument error. In this CL, we always upscale the range size to multiple of page before updating the offset, so the offset is always on page boundary. Change-Id: I3a6adf312f2cb5a09ee6a24a87adc62b630eac66 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/58289 Reviewed-by: Gabe Black <gabe.black@gmail.com> Maintainer: Gabe Black <gabe.black@gmail.com> Reviewed-by: Boris Shingarov <shingarov@labware.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-03-30 05:18:52 +00:00
Gabe Black	e6c0ba97db	scons: Put all config variables in an env['CONF'] sub-dict. This makes what are configuration and what are internal SCons variables explicit and separate, and makes it unnecessary to call out what variables to export to C++. These variables will also be plumbed into and out of kconfiglib in later changes. Change-Id: Iaf5e098d7404af06285c421dbdf8ef4171b3f001 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/56892 Reviewed-by: Andreas Sandberg <andreas.sandberg@arm.com> Maintainer: Gabe Black <gabe.black@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-03-28 20:31:21 +00:00
Jui-min Lee	75eedb1d0b	mem: Add SharedMemoryServer Add an utility class that provides a service for another process query and get the fd of the corresponding region in gem5's physmem. Basically, the service works in this way: 1. client connect to the unix socket created by a SharedMemoryServer 2. client send a request {start, end} to gem5 3. the server locates the corresponding shared memory 4. gem5 response {offset} and pass {fd} in ancillary data mmap fd at offset will provide the client the view into the physical memory of the request range. Change-Id: I9d42fd8a41fc28dcfebb45dec10bc9ebb8e21d11 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/57729 Reviewed-by: Gabe Black <gabe.black@gmail.com> Maintainer: Gabe Black <gabe.black@gmail.com> Reviewed-by: Boris Shingarov <shingarov@labware.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-03-28 08:26:44 +00:00
Matthew Poremba	9df61a8aea	mem: Add setter for RequestorID in request This is more convenient than setVirt for changing the requestor ID. This field is modified frequently in disjoint Ruby network topologies to specify which Ruby network a request should be routed through. Change-Id: If37d13207e3b2b5c62362bab9a0e1250c392be63 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/57650 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-03-25 19:51:29 +00:00
Gabe Black	f10fe51e18	scons: Don't accumulate SLICC_INCLUDES. Presumably, these are fixed for whatever protocol that gets selected. We don't need to accumulate includes, we need to set includes to something in particular. If there is a common include which always needs to be used, we can handle that in the SConscript separately from SLICC_INCLUDES. Change-Id: I996d08566944e38e388dc287f644c40366ebba0d Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/56754 Tested-by: kokoro <noreply+kokoro@google.com> Maintainer: Gabe Black <gabe.black@gmail.com> Reviewed-by: Yu-hsin Wang <yuhsingw@google.com>	2022-03-24 22:09:09 +00:00
Jui-min Lee	667308ae7f	mem: Add option to remove shared memory at the end Add a new option `auto_unlink_shared_backstore` to System so it will remove the shared backstore used in physical memories when the System is getting destructed. This will prevent unintended memory leak. If the shared memory is designed to live through multiple round of simulations, you may set the option to false to prevent the removal. Test: Run a simulation with shared_backstore set, and see whether there is anything left in /dev/shm/ after simulation ends. Change-Id: I0267b643bd24e62cb7571674fe98f831c13a586d Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/57469 Reviewed-by: Daniel Carvalho <odanrc@yahoo.com.br> Maintainer: Daniel Carvalho <odanrc@yahoo.com.br> Tested-by: kokoro <noreply+kokoro@google.com>	2022-03-17 01:29:54 +00:00
Matthew Poremba	7cfe88df74	mem: Add system request flag for dGPUs dGPUs can translate a virtual address and will not know if the address resides in system/host memory or device/dGPU memory until the translation is complete. In order to mark requests as going to either system memory or device memory we add a field to the Request class. Change-Id: Ib1e80e8d03ecdfeb11c24d979ccc4b912ce07f91 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/51852 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com>	2022-03-17 00:11:14 +00:00
Samuel Stark	e41323fb93	mem: Add TlbiExtSync packet type JIRA: https://gem5.atlassian.net/browse/GEM5-1097 Change-Id: I45435326daca599ac973c747777ecac52bf7fd33 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/57290 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Andreas Sandberg <andreas.sandberg@arm.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-03-15 16:20:49 +00:00
Samuel Stark	d64a2ba541	mem: Add external TLBI flags to the Request object * TLBI_EXT_SYNC: This flag tells the CPU model that a remote TLBI Sync has been requested * TLBI_EXT_SYNC_COMP: This flag tells the interconnect that a remote TLBI Sync request has completed JIRA: https://gem5.atlassian.net/browse/GEM5-1097 Change-Id: I459d22f112038cc1427e24999904ba74c1c08cfb Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/57289 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Andreas Sandberg <andreas.sandberg@arm.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-03-15 16:20:26 +00:00
Gabe Black	06117275fa	scons: Make all sticky variables automatically exported. All sticky vars are exported, but not all exported vars are sticky. The vars which are exported but not sticky are (at least in general) found with Configure() style measurement. Change-Id: Idebf17e44c2eeca745cdfdd9f42eddcfdb0cf9ed Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/56891 Maintainer: Gabe Black <gabe.black@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Andreas Sandberg <andreas.sandberg@arm.com>	2022-03-15 00:45:30 +00:00
Matthew Poremba	20d8b388ad	mem-ruby: Enhance MOESI_AMD DmaWrite This enhances MOESI_AMD_Base-dir DmaWrite to enable partial writes. This is currently done by assuming a full cache line, invalidating caches, and transitioning back to unblocked state. The enhanced write supports partial writes (i.e., smaller than cache line size) by first reading memory, merging the modified data, and then writing back to memory. Implementation of this mirrors that of DmaRead in terms of state. This means for each DmaRead state (BDR_PM, BDR_Pm, and BDR_M) there is a write analogue (BDW_PM, BDW_Pm, and BDR_M) and the BDR_P state is removed. Furthermore, this enhanced DmaWrite ... actually writes data to memory instead of relying on DirectoryEntry / backing store for correct data. There are two possible state transitions for DmaWrite now. (1) Memory data arrives before probe response and (2) probe response arrives before memory data. In case (1), probe data overwrites memory data and merges the partial write using the TBE write mask then updates write mask to 'filled' state. In case (2), probe data is merged with the partial data using the TBE write mask then updates write mask to 'filled' state. The memory data will then be clobbered by copying the TBE data over the response since the write mask is now full. Change-Id: I1eebb882b464c4c5ee5fd60932fd38d271ada4d7 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/57410 Reviewed-by: Matthew Poremba <matthew.poremba@amd.com> Maintainer: Matthew Poremba <matthew.poremba@amd.com> Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-03-13 15:31:32 +00:00
Matthew Poremba	bfcab1258f	mem-ruby: Remove DataBlk from MOESI_AMD DirectoryEntry This protocol is using an old style where read/writes to memory were being done by writing to a DataBlock in a DirectoryMemory entry. This results in having multiple copies of memory, leads to stale copies in at least one memory (usually DRAM), and require --access-backing-store in most cases to work properly. This changeset removes all references to getDirectoryEntry(...).DataBlk and instead forwards those reads and writes to DRAM always. Change-Id: If2e52151789ad82c7b55c8fa2b41c1f4e5b65994 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/57409 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-03-13 15:31:32 +00:00
Jui-min Lee	23e6607507	mem: Fix phy mem with shm and multiple abstr mem Previously, all abstract memory backed by the same physical memory will use the exact same chunk of shared memory if sharedBackstore is set. It means that all abstract memories, despite setting to a different range, will still be map to the same chunk of memory. As a result, setting the sharedBackstore not only allows our host system to share gem5 memory, it also enforces multiple gem5 memories to share the same content. Which will significantly affect the simulation result. Furthermore, the actual size of the shared memory will be determined by the last backingStore created. If the last one is unfortunately smaller than any previous backingStore, this may invalid previous mapped region and cause a SIGBUS upon access (on linux). In this CL, we put all backingStores of those abstract memories side by side instead of stacking them all together. So the behavior of abstract memories will be kept consistent whether the sharedBackstore is set or not, yet presist the ability to access those memories from host. Change-Id: Ic4ec25c99fe72744afaa2dfbb48cd0d65230e9a8 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/57369 Reviewed-by: Yu-hsin Wang <yuhsingw@google.com> Reviewed-by: Gabe Black <gabe.black@gmail.com> Maintainer: Gabe Black <gabe.black@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-03-10 05:59:56 +00:00
Gabe Black	288e5c47fa	mem: Create a SysBridge object to bridge between Systems interconnect. It's possible to bridge together the memory interconnect of two systems, either as parallel peers, or one nested inside the other. Each System will have its own set of RequestorIDs, and using an ID from one System inside the other can lead to a number of different problems. This change adds a new SimObject called SysBridge which connects two Systems interconnect together. The object allocates a requestor ID in each system, and for all PacketPtrs passing through it, the requestor ID from the target system is installed in the associated Request. On the way back, either inline or in a split, delayed response, the original RequestorID is restored by reinstalling the original Request object. Change-Id: I237c668962a04ef6dfc872df16762a884c05ede9 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/54743 Reviewed-by: Jesse Pai <jessepai@google.com> Maintainer: Gabe Black <gabe.black@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-03-10 04:03:39 +00:00
Jason Lowe-Power	01785b5d0e	mem-ruby: Reset stats in Ruby correctly Change-Id: Ie60c6f4be7b2a2705dc6da77b8b3d03717f13188 Signed-off-by: Jason Lowe-Power <jason@lowepower.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/57269 Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Bobby Bruce <bbruce@ucdavis.edu> Maintainer: Bobby Bruce <bbruce@ucdavis.edu>	2022-03-03 02:06:54 +00:00
Alex Richardson	6de0156cf7	mem-cache: Avoid calling .front() on a possibly empty std::list In the call to MSHR::promoteWritable() the deferredTargets list can be empty, so we should check that case before calling .front(). The new logic matches MSHR::promoteReadable(). Change-Id: Ic1d05e42f32b2c02226ca88d2155225f592f667f Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/57249 Reviewed-by: Daniel Carvalho <odanrc@yahoo.com.br> Maintainer: Daniel Carvalho <odanrc@yahoo.com.br> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-03-02 09:59:59 +00:00
Samuel Stark	77263615db	mem: Add TLB invalidation flags to the Request object Some ISAs implement TLB invalidation across multiple cores (TLB shootdown) by broadcasting invalidation messages to every PE in a target shareability domain. These messages originate by specific instructions and can be cathegorized in two macro groups 1) TLB Invalidation instructions: generating the invalidation request Example: * Arm: TLBI instruction [1] * AMD64: INVLPGB instruction [2] 2) TLB Invalidation sync instructions: serialization point, ensuring completion of outstanding invalidation requests Example: * Arm: DSB instruction [1] * AMD64: TLBSYNC instruction [2] This patch is introducing TLBI and SYNC operations in the memory subsystem by adding the following Request flags: * TLBI (1) * TLBI_SYNC (2) JIRA: https://gem5.atlassian.net/browse/GEM5-1097 [1]: https://developer.arm.com/documentation/ddi0487/gb/ [2]: https://www.amd.com/system/files/TechDocs/24594.pdf Change-Id: Ib5b025d0f6bc0edaf4f11a66593947a72ba32b8f Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/56596 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Andreas Sandberg <andreas.sandberg@arm.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Andreas Sandberg <andreas.sandberg@arm.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-03-02 08:22:41 +00:00
Hoa Nguyen	0fefc76fe6	mem-cache: Fix unit inconsistencies in base cache stats Most latency stats are described to have Cycle unit in the comments. However, most of them are calculated from Tick. Also, the unit of `demandAvgMissLatency` is incorrect. Change-Id: Ib1b9b7c6fa4404cecb3982b3799753df19774623 Signed-off-by: Hoa Nguyen <hoanguyen@ucdavis.edu> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/56989 Reviewed-by: Daniel Carvalho <odanrc@yahoo.com.br> Maintainer: Daniel Carvalho <odanrc@yahoo.com.br> Tested-by: kokoro <noreply+kokoro@google.com>	2022-02-27 23:01:03 +00:00
Matthew Poremba	6a9dfcef52	mem-ruby: Revert `7018c2b34` This reverts commit `7018c2b34e`. This commit needs more work which will take a while. Meanwhile the nightly tests are broken because of this. Change-Id: I11d01d50ab3a2d8fd649f1a825911e14815b1ca6 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/57109 Reviewed-by: Bobby Bruce <bbruce@ucdavis.edu> Maintainer: Bobby Bruce <bbruce@ucdavis.edu> Tested-by: kokoro <noreply+kokoro@google.com>	2022-02-26 15:19:51 +00:00
Gabe Black	001e17890c	misc: Use the new bufval helpers in RegClass and Packet. Those makes generally useful mechanisms are now available to any code that wants to use it, and are covered by a unit test. Change-Id: If918eba3b81443019c5789ab132de45c65f93072 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/57150 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Daniel Carvalho <odanrc@yahoo.com.br> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-02-26 09:56:26 +00:00
Matthew Poremba	1bc23ca966	mem-ruby: Add protocol prints to MOESI_AMD_BASE-dma Change-Id: I59ed7311a8dc2a06ce1df0027891ba8e24e8a89e Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/56447 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-02-17 17:03:19 +00:00
Matthew Poremba	7018c2b34e	mem-ruby: Remove DirectoryMemory storage in MOESI_AMD_BASE-dir This protocol is using an old style where read/writes to memory were being done by writing to a DataBlock in a DirectoryMemory entry. This results in having multiple copies of memory, leads to stale copies in at least one memory (usually DRAM), and require --access-backing-store in most cases to work properly. This changeset removes all references to getDirectoryEntry(...).DataBlk and instead forwards those reads and writes to DRAM always. This results in new transient states BL_WM, BDW_WM, and B_WM which are blocked states waiting on memory acks indicating a write request is complete. The appropriate transitions are updates to move to these new states and stall states are updated to include them. DMA write ACK is also moved to when the request is sent to memory, rather than when the request is received. Change-Id: Ic5bd6a8a8881d7df782e0f7eed8be9d873610e04 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/56446 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-02-17 17:03:19 +00:00
Matthew Poremba	54fc137945	mem-ruby: Ensure MOESI_AMD_Base-dir has probe destinations The directory has an assert that this is at least one destination for a probe when sending an invalidation or shared probe to coherence end points in the protocol (TCC, LLC). This is not necessarily request and for certain configurations there will be no probes required and none will be sent. One such configuration is the GPU protocol tester which would not require a probe to the CPU if it does not exist. To fix this we first collect the probe destinations. Then we check if any destinations exist. If so, we send the probe message. Otherwise we immediately enqueue a probe complete message to the trigger queue. This reorganization prevents messages with no destinations from being enqueued, meeting the criteria for the assertion. Change-Id: If016f457cb8c9e0277a910ac2c3f315c25b50ce8 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/55543 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-02-17 17:03:19 +00:00
Tiago Mück	b354e1a252	mem-ruby: Fix handling of stale CleanUnique JIRA: https://gem5.atlassian.net/browse/GEM5-1185 Fixed an issue in which a CleanUnique responder would incorrectly deallocate the cache block when handling an stale CU when the state is UD_RU or UC_RU (thus incorrectly transitioning to RU). The fix is to handle stale CUs similarly to stale WBs where we override the dataValid TBE field to prevent the wrong state transition. This patch moves the stale code path to a separate transition (similarly to stale WBs/Evicts) and moves the dataValid override to Initiate_Request_Stale so it applies to all stale request types. Notice now the stale field is also set on stale Comp_UC responses. Additional minor change: CheckUpgrade_FromRU is the same as CheckUpgrade_FromStore so it was removed. Change-Id: I0a2cedcfde1dc30d67aa2c16d71b7470369c2b6e Signed-off-by: Tiago Mück <tiago.muck@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/56810 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Meatboy 106 <garbage2collector@gmail.com>	2022-02-17 15:21:45 +00:00
Daniel R. Carvalho	43df899229	mem-cache,tests: Add unit test for ReplaceableEntry Add a unit test for ReplacementPolicy::ReplaceableEntry. Change-Id: Iaa0c0cfdf1745b7b4d9efbe8ccab8f002a1bcee8 Signed-off-by: Daniel R. Carvalho <odanrc@yahoo.com.br> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/44110 Reviewed-by: Bobby Bruce <bbruce@ucdavis.edu> Maintainer: Bobby Bruce <bbruce@ucdavis.edu> Tested-by: kokoro <noreply+kokoro@google.com>	2022-02-09 21:16:35 +00:00
Majid Jalili	714b9b2356	mem-cache: adding round-robin aribitration to multiprefetchers To find a candidate in cache base.cc, function getPacket is called. In case of multi-prefetchers, we alyways start from the first prefetcher. Given the default value for "latency" is 1, there is always a candidate ready for prefech by prefetcher 0. Hence, we need an arbitration mechansim to cycle through all prefechers. To make this fair, we added a variable to save what prefetcher first used to get a packet from, and in the next round, we start from the next prefetcher to give every prefetcher a chance to be the first one in a round-robin fashion. JIRA Ticket: https://gem5.atlassian.net/browse/GEM5-1169 Change-Id: I1c6a267b2bf71764559a080371c1d7f8be95ac71 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/56265 Reviewed-by: Daniel Carvalho <odanrc@yahoo.com.br> Maintainer: Daniel Carvalho <odanrc@yahoo.com.br> Tested-by: kokoro <noreply+kokoro@google.com>	2022-02-08 16:51:46 +00:00
Giacomo Travaglini	7129e2559e	mem-ruby: Fix -Werror=unused-variable from recent ruby patch One of the recent ruby patches [1] adopted iteration over an unordered_map via structured binding. As of now it is not possible to ignore one of the unpacked variables, and, if unused, a warning might be triggered by some compilers. With this patch we are fixing the building error by using range-based for loops without structured binding [1]: https://gem5-review.googlesource.com/c/public/gem5/+/55723 Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Change-Id: I882158cc2aeccc58d30318f29470505c53baf3e2 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/56104 Maintainer: Bobby Bruce <bbruce@ucdavis.edu> Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Meatboy 106 <garbage2collector@gmail.com>	2022-01-28 09:05:22 +00:00
Gabriel Busnot	8a7fcd340f	mem-ruby: Add missing CHI transition SD_RSC + *_Stale->BUSY_BLKD Related JIRA: https://gem5.atlassian.net/browse/GEM5-1180 Change-Id: Ife83bebcaa48345633fce0a0de08394e30c1a796 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/56083 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Tiago Muck <tiago.muck@arm.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-01-28 07:01:14 +00:00
Gabriel Busnot	748b613c94	mem-ruby: Fix switch storage in SimpleNetwork In SimpleNetwork, switches were assigned an index depending on their position in params().routers. But switches are also referenced by their router_id parameter in other locations of the ruby network system (e.g., src and dst node parameter in links). If the router_id does not match the position in SimpleNetwork::m_switches, the network initialization might fail or implement a different topology from what the user intended. This patch fixes this issue by storing switches in a map instead of a vector. Change-Id: I398f950ad404efbf9516ea9bbced598970a2bc24 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/55723 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-01-26 06:43:27 +00:00
Tiago Muck	85a1d43c10	mem-ruby: additional SimpleNetwork stats Additional stats allow more detailed monitoring of switch bandwidth and stalls. Also cleaned up previous Throttle stats to match new stat API. JIRA: https://gem5.atlassian.net/browse/GEM5-920 Change-Id: I56604f315024f19df5f89c6f6ea1e3aa0ea185ea Signed-off-by: Tiago Mück <tiago.muck@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/41865 Reviewed-by: Meatboy 106 <garbage2collector@gmail.com> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-01-25 16:37:46 +00:00
Tiago Mück	9c8f79310f	mem-ruby: add priorities in SimpleNetwork routing Configurations can specify a routing priority for message buffers. This priority is used by SimpleNetwork when checking for messages in the routers' input ports. Higher priority ports are always checked first. JIRA: https://gem5.atlassian.net/browse/GEM5-920 Change-Id: I7e2b35e2cae63086a76def1145f9b4b56220a2ba Signed-off-by: Tiago Mück <tiago.muck@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/41864 Reviewed-by: Meatboy 106 <garbage2collector@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-01-25 16:37:46 +00:00
Tiago Mück	b476d7c1d3	mem-ruby: fine tunning SimpleNetwork buffers If physical_vnets_channels is set we adjust the link buffer sizes and the max_dequeue_rate in order to achieve the expected maximum throughput assuming a fully pipelined link, i.e., throughput of 1 msg per cycle per channel (assuming the channels width matches the protocol logical message size, otherwise maximum throughput may be smaller). JIRA: https://gem5.atlassian.net/browse/GEM5-920 Change-Id: Id99ab745ed54686d8ffcc630d622fb07ac0fc352 Signed-off-by: Tiago Mück <tiago.muck@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/41863 Reviewed-by: Meatboy 106 <garbage2collector@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-01-25 16:37:46 +00:00
Tiago Mück	986e7b90d3	mem-ruby: int/ext SimpleNetwork routing latency One now may specify separate routing latencies for internal and external links using the router's int_routing_latency and ext_routing_latency, respectively. JIRA: https://gem5.atlassian.net/browse/GEM5-920 Change-Id: I5532668bf23fc61d02b978bfd9479023a6ce2b16 Signed-off-by: Tiago Mück <tiago.muck@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/41861 Reviewed-by: Meatboy 106 <garbage2collector@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-01-25 16:37:46 +00:00
Tiago Mück	ac278e44f9	mem-ruby: fix SimpleNetwork WeightBased routing Individual link weights are propagated to the routing algorithms and WeightBased routing now uses this information to select the output link when multiple routing options exist. JIRA: https://gem5.atlassian.net/browse/GEM5-920 Change-Id: I86a4deb610a1b94abf745e9ef249961fb52e9800 Signed-off-by: Tiago Mück <tiago.muck@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/41860 Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-01-25 16:37:46 +00:00
Tiago Mück	f748fbe7e1	mem-ruby: refactor SimpleNetwork buffers This removes the int_link_buffers param from SimpleNetwork. Internal link buffers are now created as children of SimpleIntLink objects. This results in a cleaner configuration and simplifies some code in SimpleNetwork.cc. setup_buffers is also split between Switch.setup_buffers and SimpleIntLink.setup_buffers for clarity. JIRA: https://gem5.atlassian.net/browse/GEM5-920 Change-Id: I68ad36ec0e682b8d5600c2950bcb56debe186af3 Signed-off-by: Tiago Mück <tiago.muck@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/41859 Reviewed-by: Meatboy 106 <garbage2collector@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-01-25 16:37:46 +00:00
Tiago Mück	c3880c2c46	mem-ruby: refactored SimpleNetwork routing The routing algorithm is encapsulated in a separate SimObject to allow user to implement different routing strategies. The default implementation (WeightBased) maintains the original behavior. JIRA: https://gem5.atlassian.net/browse/GEM5-920 Change-Id: I5c8927f358b8b04b2da55e59679c2f629c7cd2f9 Signed-off-by: Tiago Mück <tiago.muck@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/41858 Reviewed-by: Meatboy 106 <garbage2collector@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-01-24 19:09:26 +00:00
Tiago Mück	286c23da52	mem-ruby: fixed SimpleNetwork starvation The round-robing scheduling seed is shared across all ports and vnets in the router and it's possible that, under certain heavy traffic scenarios, the same port will always fill the input buffers before any other port is checked. This patch removes the round-robin scheduling. The port to be checked first is always the one with the oldest message. JIRA: https://gem5.atlassian.net/browse/GEM5-920 Change-Id: I918694d46faa0abd00ce9180bc98c58a9b5af0b5 Signed-off-by: Tiago Mück <tiago.muck@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/41857 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Meatboy 106 <garbage2collector@gmail.com>	2022-01-20 15:26:58 +00:00
Tiago Muck	72185e51b2	mem-ruby: SimpleNetwork router latencies SimpleNetwork takes into account the network router latency parameter. The latency may be set to zero. PerfectSwitch and Throttle events were assigned different priorities to ensure they always execute in the same order for zero-latency forwarding. JIRA: https://gem5.atlassian.net/browse/GEM5-920 Change-Id: I6cae6a0fc22b25078c27a1e2f71744c08efd7753 Signed-off-by: Tiago Mück <tiago.muck@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/41856 Reviewed-by: Meatboy 106 <garbage2collector@gmail.com> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-01-20 15:26:58 +00:00
Tiago Muck	43232cdb9f	mem-ruby: Optionally set Consumer ev. priority JIRA: https://gem5.atlassian.net/browse/GEM5-920 Change-Id: I62dc6656bbed4e7f4d575a6a82ac254382294ed1 Signed-off-by: Tiago Mück <tiago.muck@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/41855 Reviewed-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Reviewed-by: Meatboy 106 <garbage2collector@gmail.com> Maintainer: Bobby Bruce <bbruce@ucdavis.edu> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-01-20 15:26:58 +00:00
Tiago Muck	bab3ce1661	configs,mem-ruby: SimpleNetwork physical channels Setting the physical_vnets_channels parameter enables the emulation of the bandwidth impact of having multiple physical channels for each virtual network. This is implemented by computing bandwidth in a per-vnet/channel basis within Throttle objects. The size of the message buffers are also scaled according to this setting (when buffer are not unlimited). The physical_vnets_bandwidth can be used to override the channel width set for each link and assign different widths for each virtual network. The --simple-physical-channels option can be used with the generic configuration scripts to automatically assign a single physical channel to each virtual network defined in the protocol. JIRA: https://gem5.atlassian.net/browse/GEM5-920 Change-Id: Ia8c9ec8651405eac8710d3f4d67f637a8054a76b Signed-off-by: Tiago Mück <tiago.muck@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/41854 Reviewed-by: Meatboy 106 <garbage2collector@gmail.com> Maintainer: Bobby Bruce <bbruce@ucdavis.edu> Tested-by: kokoro <noreply+kokoro@google.com>	2022-01-20 15:26:58 +00:00
Tiago Mück	87cdf354be	mem-ruby: dequeue rate limit for message buffers The 'max_dequeue_rate' parameter limits the rate at which messages can be dequeued in a single cycle. When set, 'isReady' returns false if after max_dequeue_rate is reached. This can be used to fine tune the performance of cache controllers. For the record, other ways of achieving a similar effect could be: 1) Modifying the SLICC compiler to limit message consumption in the generated wakeup() function 2) Set the buffer size to max_dequeue_rate. This can potentially cut the the expected throughput in half. For instance if a producer can enqueue every cycle, and a consumer can dequeue every cycle, a message can only be actually enqueued every two (assuming buffer_size=1) since the buffer entries available after dequeue are only visible in the next cycle (even if the consumer executes before the producer). JIRA: https://gem5.atlassian.net/browse/GEM5-920 Change-Id: I3a446c7276b80a0e3f409b4fbab0ab65ff5c1f81 Signed-off-by: Tiago Mück <tiago.muck@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/41862 Reviewed-by: Meatboy 106 <garbage2collector@gmail.com> Maintainer: Bobby Bruce <bbruce@ucdavis.edu> Tested-by: kokoro <noreply+kokoro@google.com>	2022-01-20 15:26:58 +00:00
Yu-hsin Wang	52661838a4	ext: upgrade to googletest 1.11.x Upgrade googletest to 1.11.x upstream commit: 8306020a3e9eceafec65508868d7ab5c63bb41f7 sha1sum df8cdd26ee7cdf2a3d9c05a92d3630a96f406422 generated by command: find . -type f ! -name SConscript ! -path "./.*" -print0 \ \| sort -z \| xargs -0 sha1sum \| sha1sum This upgrade is mainly for providing ConditionalMatcher support. Change-Id: I27d971c02c59a3ad42c3002f1b4e1a8b18269c56 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/55384 Maintainer: Gabe Black <gabe.black@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Reviewed-by: Bobby Bruce <bbruce@ucdavis.edu> Maintainer: Bobby Bruce <bbruce@ucdavis.edu>	2022-01-20 01:16:02 +00:00

1 2 3 4 5 ...

3055 Commits