derek/gem5 - gem5 - Gitea: Git with a cup of tea

derek/gem5

Author	SHA1	Message	Date
Gabe Black	b877efa6d4	misc: Update attribute syntax, and reorganize compiler.hh. This change replaces the __attribute__ syntax with the now standard [[]] syntax. It also reorganizes compiler.hh so that all special macros have some explanatory text saying what they do, and each attribute which has a standard version can use that if available and what version of c++ it's standard in is put in a comment. Also, the requirements as far as where you put [[]] style attributes are a little more strict than the old school __attribute__ style. The use of the attribute macros was updated to fit these new, more strict requirements. Change-Id: Iace44306a534111f1c38b9856dc9e88cd9b49d2a Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/35219 Reviewed-by: Daniel Carvalho <odanrc@yahoo.com.br> Maintainer: Gabe Black <gabeblack@google.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-09-28 21:52:59 +00:00
Gabe Black	e5a3584df7	mem-ruby: Remove conditional includes based on THE_ISA in ruby. These were including instruction class definitions from x86 for some reason. There was no code in those .cc files which actually used anything from them, as evidenced by the fact that the GCN3_X86 build still works. No other code in the file was conditionally compiled as of today. Change-Id: I3cef8348fb601dd7af67665cf64bbf514c91c3db Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/34577 Reviewed-by: Matthew Poremba <matthew.poremba@amd.com> Maintainer: Bobby R. Bruce <bbruce@ucdavis.edu> Tested-by: kokoro <noreply+kokoro@google.com>	2020-09-16 06:08:22 +00:00
Jason Lowe-Power	90a6e80962	mem-ruby: Update port names in Ruby After the terminology update commit there were still many confusing names in the Ruby ports. This changeset is a proposal for updating these names. For an example use case, see the following resources changeset. https://gem5-review.googlesource.com/c/public/gem5-resources/+/34416 Change-Id: I01d4f24a70b300e39438ee147dfab7a8d674d5c7 Signed-off-by: Jason Lowe-Power <jason@lowepower.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/34417 Reviewed-by: Ayaz Akram <yazakram@ucdavis.edu> Reviewed-by: Bobby R. Bruce <bbruce@ucdavis.edu> Maintainer: Bobby R. Bruce <bbruce@ucdavis.edu> Tested-by: kokoro <noreply+kokoro@google.com>	2020-09-15 00:25:01 +00:00
Srikant Bharadwaj	7957b1c43b	mem-garnet: Upgrade garnet version to 3.0 This version of garnet includes HeteroGarnet which supports heterogenous interconnect systems, flexible router and link configurations, and better debugging resources. This patch changes the garnet directory structure to not include the version number. The user will be informed about the garnet version being used. Change-Id: Id4763421528305193ae0cd10c159b385a9513553 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/34259 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-09-11 05:38:09 +00:00
Srikant Bharadwaj	28d41f213a	mem-garnet: Allow empty vnet list for garnet network links An empty supporting_vnet list is the default and implies that all vnets are supported. This removes the assert which requires the list to have a minimum list size of 1. Change-Id: I6710ba06041164bbd597d98e75374a26a1aa5655 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/34258 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-09-11 02:00:13 +00:00
Srikant Bharadwaj	94f7736489	mem-garnet: Fix default value of network bridge Initializing the network bridge with NULL causes it to have an class error when instatiating a link. The bridge is only needed whne either a CDC or SerDes is enabled. This is handled later during construction of the GarnetLink. Change-Id: If19a21a6d9bf49449b9c390467d08d3422ae991a Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/34257 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-09-11 02:00:13 +00:00
Shivani Parekh	392c1ced53	misc: Replaced master/slave terminology Change-Id: I4df2557c71e38cc4e3a485b0e590e85eb45de8b6 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/33553 Maintainer: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Daniel Carvalho <odanrc@yahoo.com.br> Reviewed-by: Bobby R. Bruce <bbruce@ucdavis.edu> Tested-by: kokoro <noreply+kokoro@google.com>	2020-09-10 23:02:28 +00:00
Michael Boyer	a9e40fd03a	mem-ruby: Check number of vnets when creating links Added error checking to ensure that the system has sufficient virtual networks when setting latency and weight values. Change-Id: I1b28144bbe9fefab0c0a6227f1fdf4ea10403061 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/32603 Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com>	2020-09-08 22:09:23 +00:00
Timothy Hayes	0a8a787de3	mem-ruby: HTM mem implementation This patch augments the MESI_Three_Level Ruby protocol with hardware transactional memory support. The HTM implementation relies on buffering of speculative memory updates. The core notifies the L0 cache controller that a new transaction has started and the controller in turn places itself in transactional state (htmTransactionalState := true). When operating in transactional state, the usual MESI protocol changes slightly. Lines loaded or stored are marked as part of a transaction's read and write set respectively. If there is an invalidation request to cache line in the read/write set, the transaction is marked as failed. Similarly, if there is a read request by another core to a speculatively written cache line, i.e. in the write set, the transaction is marked as failed. If failed, all subsequent loads and stores from the core are made benign, i.e. made into NOPS at the cache controller, and responses are marked to indicate that the transactional state has failed. When the core receives these marked responses, it generates a HtmFailureFault with the reason for the transaction failure. Servicing this fault does two things-- (a) Restores the architectural checkpoint (b) Sends an HTM abort signal to the cache controller The restoration includes all registers in the checkpoint as well as the program counter of the instruction before the transaction started. The abort signal is sent to the L0 cache controller and resets the failed transactional state. It resets the transactional read and write sets and invalidates any speculatively written cache lines. It also exits the transactional state so that the MESI protocol operates as usual. Alternatively, if the instructions within a transaction complete without triggering a HtmFailureFault, the transaction can be committed. The core is responsible for notifying the cache controller that the transaction is complete and the cache controller makes all speculative writes visible to the rest of the system and exits the transactional state. Notifting the cache controller is done through HtmCmd Requests which are a subtype of Load Requests. KUDOS: The code is based on a previous pull request by Pradip Vallathol who developed HTM and TSX support in Gem5 as part of his master’s thesis: http://reviews.gem5.org/r/2308/index.html JIRA: https://gem5.atlassian.net/browse/GEM5-587 Change-Id: Icc328df93363486e923b8bd54f4d77741d8f5650 Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/30319 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-09-08 09:13:30 +00:00
Srikant Bharadwaj	d690ebed46	mem-garnet: Separable allocator in Garnet not fair enough. Currently there are independent round robin arbiter at each input port and output port. Every time a VC is selected for output allocation round robin is incremented irrespective of if it is selected by its output port or not. This leads to unfair arbitration at input port and is well known[1]. This patch fixes it to increment only if the output port also selects it. [1] D. U. Becker and W. J. Dally, "Allocator implementations for network-on-chip routers," Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis, Portland, OR, 2009, pp. 1-12 Change-Id: I65963fb8082c51c0e3c6e031a8b87b4f5c3626e1 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/32601 Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com>	2020-09-04 22:17:36 +00:00
Srikant Bharadwaj	028a1fa87e	mem-garnet: Add a check to see if router is already scheduled Currently the Switch Allocator takes up most of the simulation wall clock time. This function checks for all VCs to see if it should wakeup next. The input units which are simulated before the switch allocator could have scheduled it already. This patch adds a check for it. Change-Id: I8609d4e7f925aa5e97198f6cd07466530f6fcf4c Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/32600 Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com>	2020-09-04 22:17:36 +00:00
Srikant Bharadwaj	615067c163	mem-garnet: Flexible VCs per Vnet for each router This change allows configuring each router with a certain number of VCs for each VNET. This is beneficial when dealing with heterogenous link widths in a system. Configuring VCs for each router allows one to ensure equal throughput within the network while avoiding head-of-line blocking. Changing a router's VCs number can be done in topology files using the vcs_per_vnet value argument of router. Change-Id: Icf4f510248128429a1a11f19f9802ee96f340611 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/32599 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-09-04 22:17:36 +00:00
Michael LeBeane	6be16d84da	mem-garnet: Initialize unused Credit members The Credit class doesn't initialize a number of its unused base class fields. This leads to non-determanistic traces when printing flits that are Credits. This patch initializes all unused fields to 0. Change-Id: Ib73c652c71a10be57b24c0d6e1ac22eafa421e11 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/32598 Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com>	2020-09-04 22:17:36 +00:00
Srikant Bharadwaj	b9f1c71fe7	mem-garnet: Integration of HeteroGarnet This upgrades the garnet model to support HeteroGarnet 1) Static and dynamic multi-freq domains in network 2) Support for CDC 3) Separate links for each message class 4) Separate linkwidth for each message class 5) Support for SerDes Change-Id: I6d00e3b5cb3745e849d221066cb46b2138c47871 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/32597 Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com>	2020-09-04 22:17:36 +00:00
Sampad Mohapatra	9d8229c0f1	mem-ruby: Change request to response in MOESI_AMD_Base-dir.sm The responseToDMA MessageBuffer in MOESI_AMD_Base-dir.sm transmits both data and acks, but it's vnet_type is currently set as request. This should be changed to response. Signed-off-by: Sampad Mohapatra <sampad.mohapatra@gmail.com> Change-Id: I0eb9e8fc8e25111849605a710a5150ce5fc3b83b Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/33755 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Srikant Bharadwaj <srikant.bharadwaj@amd.com> Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-08-31 22:54:31 +00:00
Shivani Parekh	cf43bc3c8b	mem: Update port terminology Change-Id: Ib4fc8cad7139d4971e74930295a69e576f6da3cf Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/32314 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-08-26 16:48:13 +00:00
Kyle Roarty	b872f02ab1	configs,gpu-compute,mem-ruby: connect gmTokenPorts in apu_se This patch adds gmTokenPorts to the ComputeUnit and RubyGPUCoalescer python classes so the gmTokenPorts can be connected in apu_se. Change-Id: Icf3cb05c757754d6935b46f14e4b1b1d5072c4ca Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/32677 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Reviewed-by: Anthony Gutierrez <anthony.gutierrez@amd.com> Maintainer: Anthony Gutierrez <anthony.gutierrez@amd.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-08-18 23:47:16 +00:00
Gabe Black	9ed3c7668b	misc: Make the stats callbacks use CallbackQueue2. Issue-on: https://gem5.atlassian.net/browse/GEM5-698 Change-Id: Idcbe04bdf4299925f321aa0ece263d86ed3fc8df Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/32645 Reviewed-by: Andreas Sandberg <andreas.sandberg@arm.com> Maintainer: Gabe Black <gabeblack@google.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-08-18 19:53:31 +00:00
Pouya Fotouhi	762153a421	mem-ruby: Fix debug prints for regular Stores In the updated implementation of LL/SC (27103) the default value of success was changed, which results in printing "SC_Failed" for any regular stores. Change-Id: I4f2e0b26233ce0cbdf948aadd19c9d81bf18bec0 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/32514 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-08-13 19:48:30 +00:00
Kyle Roarty	187c44fe44	mem-ruby: fix races between data and DMA in MOESI_AMD_Base-dir There are race conditions while running several benchmarks, where the DMA engine and the CorePair simultaneously send requests for the same block. This patch fixes two scenarios (a) If the request from the DMA engine arrives before the one from the CorePair, the directory controller records it as a pending request. However, once the DMA request is serviced, the directory doesn't check for pending requests. The CorePair, consequently, never sees a response to its request and this results in a Deadlock. Added call to wakeUpDependents in the transition from BDR_Pm to U Added call to wakeUpDependents in the transition from BDW_P to U (b) If the request from the CorePair is being serviced by the directory and the DMA requests for the same block, this causes an invalid transition because the current coherence doesn't take care of this scenario. Added transition state where the requests from DMA are added to the stall buffer. Updated B to U CoreUnblock transition to check all buffers, as the DMA requests were being placed later in the stall buffer than was being checked Change-Id: I5a76efef97723bc53cf239ea7e112f84fc874ef8 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/31996 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Reviewed-by: Bradford Beckmann <brad.beckmann@amd.com> Maintainer: Bradford Beckmann <brad.beckmann@amd.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-08-13 19:05:17 +00:00
Tony Gutierrez	44807669a0	configs, mem: Support running VIPER with GCN3 This changeset adds the necessary changes for running GCN3 ISA with VIPER in apu_se.py. Changes to the VIPER protocol configs are made to add support for DMA and scalar caches. hsaTopology is added to help the pseudo FS create the files needed by ROCm to understand the device on which the SW is being run. Change-Id: I0f47a6a36bb241a26972c0faafafcf332a7d7d1f Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/30274 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Bradford Beckmann <brad.beckmann@amd.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-07-28 19:01:09 +00:00
Daniel R. Carvalho	1ad015389c	mem-ruby: Use lookup function in cache There is a function to perform lookups; there is no need to replicate its code everywhere. Change-Id: I1290594615d282722cd91071be8ef3c372414e4e Signed-off-by: Daniel R. Carvalho <odanrc@yahoo.com.br> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/23946 Reviewed-by: John Alsop <johnathan.alsop@amd.com> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-07-25 10:51:06 +00:00
Daniel R. Carvalho	f54af2863c	mem-ruby: Cleanup replacement_data usage The replacement_data can be assigned as soon as a block is allocated. With this cleanup the lookup function can be used to avoid code duplication. Change-Id: I7561fddaa3ed348866699ecaf1e6aa477ba0bc9a Signed-off-by: Daniel R. Carvalho <odanrc@yahoo.com.br> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/23945 Reviewed-by: John Alsop <johnathan.alsop@amd.com> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-07-25 10:51:06 +00:00
Matthew Poremba	33f3659825	mem-ruby: Getter/setter for atomic ops in WriteMask Adding getter and setter methods for getting and setting the atomic ops in the WriteMask class. This allows for message types with WriteMasks to get or set the atomic ops without explicitly modifying the constructor for the message type. This will beused by the DMASequencer which uses the SequencerMsg type where the constructor is auto generated via SLICC. Change-Id: I71787d294c1b89547618e9a13e386b65bb3e1021 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/31474 Reviewed-by: Bobby R. Bruce <bbruce@ucdavis.edu> Reviewed-by: Anthony Gutierrez <anthony.gutierrez@amd.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-07-24 18:30:08 +00:00
Tony Gutierrez	a408b1ada7	mem-ruby: Add support for MemSync reqs in VIPER Change-Id: Ib129e82be5348c641a8ae18093324bcedfb38abe Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/29939 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Anthony Gutierrez <anthony.gutierrez@amd.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-07-15 18:14:41 +00:00
seanzw	75257c7a42	mem-ruby: Fix type casting in makeNextStrideAddress The RubyPrefetcher uses makeNextStrideAddress() with a negative stride to find prefetched address. The type of this expression is: uint64_t + uint32_t * int; This gives wrong result due to implicit conversion. Fix this with static cast and it works correctly: uint64_t + int * int; Change-Id: I36e17e00d5c66c3699fe1d5b287971225a162d04 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/31314 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-07-15 17:38:12 +00:00
Matthew Poremba	675e01216d	mem-ruby: Support device memories Adds support for device memories in the system and RubySystem classes. Devices may register memory ranges with the system class and packets which originate from the device MasterID will update the device memory in Ruby. In RubySystem functional access is updated to keep the packets within the Ruby network they originated from. Change-Id: I47850df1dc1994485d471ccd9da89e8d88eb0d20 JIRA: https://gem5.atlassian.net/browse/GEM5-470 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/29653 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-07-01 14:38:11 +00:00
Kyle Roarty	1339a1b080	mem-ruby: add cache hit/miss statistics for TCP and TCC Change-Id: Ifa6fdbb9dd062a3684b9620eac6683c57e651a72 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/30174 Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Bradford Beckmann <brad.beckmann@amd.com> Maintainer: Bradford Beckmann <brad.beckmann@amd.com>	2020-06-20 04:20:45 +00:00
Matt Sinclair	8177fc4392	arch-gcn3: add support for unaligned accesses Previously, with HSAIL, we were guaranteed by the HSA specification that the GPU will never issue unaligned accesses. However, now that we are directly running GCN this is no longer true. Accordingly, this commit adds support for unaligned accesses. Moreover, to reduce the replication of nearly identical code for the different request types, I also added new helper functions that are called by all the different memory request producing instruction types in op_encodings.hh. Adding support for unaligned instructions requires changing the statusBitVector used to track the status of the memory requests for each lane from a bit per lane to an int per lane. This is necessary because an unaligned access may span multiple cache lines. In the worst case, each lane may span multiple cache lines. There are corresponding changes in the files that use the statusBitVector. Change-Id: I319bf2f0f644083e98ca546d2bfe68cf87a5f967 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/29920 Reviewed-by: Anthony Gutierrez <anthony.gutierrez@amd.com> Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Anthony Gutierrez <anthony.gutierrez@amd.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-06-19 20:41:18 +00:00
Tony Gutierrez	b811d3a342	mem-ruby: Add DMA support to MOESI_AMD_Base-dir.sm This change adds DMA support to the MOESI_AMD_Base-dir.sm, which is needed to support ROCm apps/GCN3 ISA in the VIPER ptl. The DMA controller is copied from the MOESI_hammer-dma.sm with few modifications. Change-Id: I56141436eee1c8f62c2a0915fa3b63b83bbcbc9a Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/29914 Reviewed-by: Anthony Gutierrez <anthony.gutierrez@amd.com> Maintainer: Anthony Gutierrez <anthony.gutierrez@amd.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-06-19 20:34:06 +00:00
Tuan Ta	18ebe62598	mem-ruby: GCN3 and VIPER integration This patch modifies the Coalescer and VIPER protocol to support memory synchronization requests and write-completion responses that are required by upcoming GCN3 implementation. VIPER protocol is simplified to be a solely write-through protocol. Change-Id: Iccfa3d749a0301172a1cc567c59609bb548dace6 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/29913 Reviewed-by: Anthony Gutierrez <anthony.gutierrez@amd.com> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Bradford Beckmann <brad.beckmann@amd.com> Maintainer: Anthony Gutierrez <anthony.gutierrez@amd.com> Maintainer: Bradford Beckmann <brad.beckmann@amd.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-06-19 20:32:54 +00:00
Tony Gutierrez	b8da9abba7	gpu-compute, mem-ruby, configs: Add GCN3 ISA support to GPU model Change-Id: Ibe46970f3ba25d62ca2ade5cbc2054ad746b2254 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/29912 Reviewed-by: Anthony Gutierrez <anthony.gutierrez@amd.com> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Anthony Gutierrez <anthony.gutierrez@amd.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-06-15 22:45:17 +00:00
Gabe Black	1008b70f31	mem-ruby: Add a missing override. Change-Id: I7651ca0f4658ddd49cfd13d9d5f7e430f416f41f Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/30254 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-06-12 06:23:23 +00:00
Xianwei Zhang	7f4d6c8388	mem-ruby: Add codes for pure virtual functions for compilation Change-Id: Ic34f9ccf10ec28d68eed236dc6246e2ae2ef1b89 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/28409 Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Anthony Gutierrez <anthony.gutierrez@amd.com> Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Anthony Gutierrez <anthony.gutierrez@amd.com>	2020-06-09 20:00:13 +00:00
Tuan Ta	adc9de4d61	mem-ruby: update memory interfaces to support GPU ISA This patch deprecates HSA-based memory request types and adds new types that can be used by real ISA instructions. Change-Id: Ie107a69d8a35e9de0853f1407392ad01a8b3e930 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/28408 Reviewed-by: Anthony Gutierrez <anthony.gutierrez@amd.com> Maintainer: Anthony Gutierrez <anthony.gutierrez@amd.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-06-09 20:00:13 +00:00
Matthew Poremba	bbb6a3fe8d	mem-ruby: Allow MachineID to be unordered key Define an std::hash function so that MachineID may be used as a key type for unordered STL containers. Change-Id: Ibc3bc78149c69683207d8967542fa6e8d545f75c Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/29652 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Bobby R. Bruce <bbruce@ucdavis.edu> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-06-08 16:39:21 +00:00
adarshpatil	0a5ed3076a	mem-ruby: Fix for Invalid transition in MOESI_CMP_directory Send the correct sharer count from the memory directory to the requesting L2 cache in data message reply. Jira issue: https://gem5.atlassian.net/browse/GEM5-613 Change-Id: If76de630fd0001816e8836d9bf77961a94faaa7c Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/29552 Maintainer: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Pouya Fotouhi <pfotouhi@ucdavis.edu> Reviewed-by: Tiago Mück <tiago.muck@arm.com> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-05-29 19:31:47 +00:00
Onur Kayiran	5587dd94f0	mem-ruby: Generate address with masking cacheline bits makeLineAddress function uses m_block_size_bits to create masked addresses. m_block_size_bits is used to specify cache, directory, and memory controller interleaving, and it can be larger than the cache line size. To generate addresses that can align with the cache line rather than the interleaving granularity, a version of makeLineAddress is created to specify bits that need to be masked. Change-Id: I06deec4949da7fa46f1d6f7575334f18ee61c786 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/28135 Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Reviewed-by: Onur Kayıran <onur.kayiran@amd.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com>	2020-05-28 23:07:08 +00:00
Tuan Ta	e071f60011	mem-ruby: add function to check for stalled msgs of addr This patch allows a cache controller to check if there is any stalled message of a specific address in the stall_map of an input message buffer. Change-Id: Id2f9bb98a9201a562f2a8cc371e9bb896ac836af Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/28133 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-05-28 23:07:08 +00:00
Tuan Ta	524c22041d	mem-ruby: add slicc stm to defer enqueueing a message This patch enables cache controllers to make response messages in advance, store them in a per-address saved map in an output message buffer and enqueue them altogether in the future. This patch introduces new slicc statement called defer_enqueueing. This patch would help simplify the logic of state machines that deal with coalesing multiple requests from different requestors. Change-Id: I566d4004498b367764238bb251260483c5a1a5e5 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/28132 Reviewed-by: Tuan Ta <qtt2@cornell.edu> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-05-28 23:07:08 +00:00
Bobby R. Bruce	e53de444f6	misc: Merge branch 'release-staging-v20.0.0.0' into develop	2020-05-28 01:04:16 -07:00
Matthew Poremba	14e349c729	mem-ruby,mem-garnet: Multiple networks per RubySystem Add support for multiple networks per RubySystem. This is done by introducing local IDs to each network and translating from a global ID passed around through Ruby and SLICC code. The local IDs represents the NodeID of a MachineType in the network and are ordered the same way that NodeIDs are ordered using MachineType_base_number. If there are not multiple networks in a RubySystem the local and global IDs are the same value. This is useful in cases where multiple isolated networks are needed to support devices with Ruby caches which do not interact with other networks. For example, a dGPU device will have a cache hierarchy that will not interact with the CPU cache hierachy. Change-Id: I33a917b3a394eec84b16fbf001c3c2c44c047f66 JIRA: https://gem5.atlassian.net/browse/GEM5-445 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/27927 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Bradford Beckmann <brad.beckmann@amd.com> Maintainer: Bradford Beckmann <brad.beckmann@amd.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-05-27 17:06:31 +00:00
Polydoros Petrakis	7695a21404	mem-garnet: Remove extraneous loop in Router resetStats. This outer loop makes no sense. Change-Id: Ibe4b8b50c5843fba2119906f59ea1cb6c1d8c762 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/29254 Reviewed-by: Srikant Bharadwaj <srikant.bharadwaj@amd.com> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-05-21 20:56:49 +00:00
Polydoros Petrakis	8fdad96b7c	mem-garnet,mem-ruby: Properly reset garnet2.0 statistics. Statistics for crossbar activity, and link related statistics were not getting reset when using m5_reset_stats. Change-Id: Ib84c55200e4a86c6f9190de28498112bd43dde9d Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/29253 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Srikant Bharadwaj <srikant.bharadwaj@amd.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-05-21 20:56:25 +00:00
Bobby R. Bruce	ebf5755cce	mem-ruby: Added M5_CLASS_VAR_USED to m_id in OutputUnit Clang 9 throws an error that 'm_id' is unused (encountered when compiling X86.fast). M5_CLASS_VAR_USED has been added to avoid this error. Change-Id: I722edd1429a074ff484b5ebbdc431af0089561b5 Issue-on: https://gem5.atlassian.net/browse/GEM5-560 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/29304 Maintainer: Bobby R. Bruce <bbruce@ucdavis.edu> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-05-20 01:02:33 +00:00
Bobby R. Bruce	a257eef1d2	misc,sim: Tagged API methods in sim/serialize.hh Within this some light refactoring has been carried out to avoid accessing member variable directly and removing some unused/unneeded ones from the codebase. Change-Id: I458494f6466628b213816c81f6a8ce42fb91dc3f Issue-on: https://gem5.atlassian.net/browse/GEM5-172 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/27989 Reviewed-by: Bobby R. Bruce <bbruce@ucdavis.edu> Maintainer: Bobby R. Bruce <bbruce@ucdavis.edu> Tested-by: kokoro <noreply+kokoro@google.com>	2020-05-19 21:58:24 +00:00
Matthew Poremba	27426fab83	mem: Remove infinite queue between Ruby and memory AbstractController sends requests using a QueuedMasterPort which has an implicit buffer which is unbounded. Remove this by changing the port to a MasterPort and implement a retry mechanism for AbstractController. Although the request remains in the MessageBuffer if a retry is needed, the additional retry logic optimizes serviceMemoryQueue slightly and prevents the DRAMCtrl retry stats from being incorrect due to multiple calls to sendTimingReq. Change-Id: I8c592af92a1a499a418f34cfee16dd69d84803ad Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/28387 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Nikos Nikoleris <nikos.nikoleris@arm.com> Maintainer: Bradford Beckmann <brad.beckmann@amd.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-05-12 21:12:21 +00:00
Matthew Poremba	3d57eaf9f5	gpu-compute,mem-ruby: Refactor GPU coalescer Remove the read/write tables and coalescing table and introduce a two levels of tables for uncoalesced and coalesced packets. Tokens are granted to GPU instructions to place in uncoalesced table. If tokens are available, the operation always succeeds such that the 'Aliased' status is never returned. Coalesced accesses are placed in the coalesced table while requests are outstanding. Requests to the same address are added as targets to the table similar to how MSHRs operate. Change-Id: I44983610307b638a97472db3576d0a30df2de600 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/27429 Reviewed-by: Bradford Beckmann <brad.beckmann@amd.com> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Bradford Beckmann <brad.beckmann@amd.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-05-11 21:25:19 +00:00
Timothy Hayes	97daaf1f2e	mem-ruby: MESI_Two_Level missing function compilation fix The recent commit `dd6cd33` removed the Ruby Sequencer function invalidateSC in favour of doing this implicitely via evictionCallback. The protocol MESI_Two_Level still contains one explicit call to this function, however, this is now superflous as forward_eviction_to_cpu is called in the same transition. This patch removes the remaining calls to invalidateSC. JIRA: https://gem5.atlassian.net/browse/GEM5-499 Change-Id: If51d8bebf6aa39d20789639aab0d262d5173ca59 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/28747 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Ayaz Akram <yazakram@ucdavis.edu> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-05-11 07:55:16 +00:00
Timothy Hayes	264a6392be	mem-ruby: MOESI_CMP_directory sync fix The recent commit `dd6cd33` modified the behaviour of the the Ruby sequencer to handle load linked requests as loads rather than stores. This caused the regression test realview-simple-timing-dual-ruby-ARM-x86_64-opt to become stuck when booting Linux. This patch fixes the issue by adding a missing forward_eviction_to_cpu action to the state transition(OM, Fwd_GETX, IM). Change-Id: I8f253c5709488b07ddc5143a15eda406e31f3cc6 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/28787 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-05-11 07:54:38 +00:00

... 5 6 7 8 9 ...

1167 Commits