derek/gem5 - gem5 - Gitea: Git with a cup of tea

derek/gem5

Author	SHA1	Message	Date
Mahyar Samani	acd63fe5ac	mem,ext: Fixed DRAMSim2 Integration Fixed the way callbacks were used due to changes in src/sim/callback.hh. Removed author line in SConsript. Change-Id: I2c2b8dbe13e4f58680806126cd9cf209748e788a Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/33938 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-09-02 15:50:43 +00:00
Timothy Hayes	2427fc2c82	mem: Relax packet limit in packet queue JIRA: https://gem5.atlassian.net/browse/GEM5-587 Change-Id: I4ac24bf18a0aff08a5b33c48179b882b27ef910c Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/30317 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-09-02 08:36:21 +00:00
Timothy Hayes	b01b455537	arch, mem: Initial Hardware Transactional Memory implementation Gem5 Hardware Transactional Memory (HTM) Here we provide a brief note describing HTM support in Gem5 at a high level. HTM is an architectural feature that enables speculative concurrency in a shared-memory system; groups of instructions known as transactions are executed as an atomic unit. The system allows that transactions be executed concurrently but intervenes if a transaction's atomicity/isolation is jeapordised and takes corrective action. In this implementation, corrective active explicitely means rolling back a thread's architectural state and reverting any memory updates to a point just before the transaction began. This HTM implementation relies on-- (1) A checkpointing mechanism for architectural register state. (2) Buffering speculative memory updates. This patch is focusing on the definition of the HTM checkpoint (1) The checkpointing mechanism is architecture dependent. Each ISA leveraging HTM support can define a class HTMCheckpoint inhereting from the generic one (GenericISA::HTMCheckpoint). Those will need to save/restore the architectural state by overriding the virtual HTMCheckpoint::save (when starting a transaction) and HTMCheckpoint::restore (when aborting a transaction). Instances of this class live in O3's ThreadState and Atomic's SimpleThread. It is up to the ISA to populate this instance when executing an instruction that begins a new transaction. JIRA: https://gem5.atlassian.net/browse/GEM5-587 Change-Id: Icd8d1913d23652d78fe89e930ab1e302eb52363d Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/30314 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-09-02 08:30:11 +00:00
Daniel R. Carvalho	79e83c7d95	mem-cache: Fix copy ellision on base compressor Newer compiler versions have a problem with this move as it prevents copy elision. Change-Id: I802703df12e171d6a377b673d0ad7e202456b516 Signed-off-by: Daniel R. Carvalho <odanrc@yahoo.com.br> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/33835 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-09-01 06:29:10 +00:00
Sampad Mohapatra	9d8229c0f1	mem-ruby: Change request to response in MOESI_AMD_Base-dir.sm The responseToDMA MessageBuffer in MOESI_AMD_Base-dir.sm transmits both data and acks, but it's vnet_type is currently set as request. This should be changed to response. Signed-off-by: Sampad Mohapatra <sampad.mohapatra@gmail.com> Change-Id: I0eb9e8fc8e25111849605a710a5150ce5fc3b83b Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/33755 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Srikant Bharadwaj <srikant.bharadwaj@amd.com> Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-08-31 22:54:31 +00:00
Daniel R. Carvalho	c0d67b2263	mem-cache: Use cache's max CR on perfect compressor Use cache's max_compression_ratio to setup the max_compression_ratio of the PerfectCompressor. Change-Id: Ib44aa61975fb2cc52f27f64a86c9df9c5531aa1a Signed-off-by: Daniel R. Carvalho <odanrc@yahoo.com.br> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/33387 Reviewed-by: Nikos Nikoleris <nikos.nikoleris@arm.com> Maintainer: Nikos Nikoleris <nikos.nikoleris@arm.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-08-31 17:45:43 +00:00
Daniel R. Carvalho	8626fe101d	mem-cache: Explicitly define threshold of BDI's sub-compressors Allow all sub-compressors of BDI to be successful as long as they are able to compress. Then, BDI's actual size threshold acts as the cutting point. This situation arises on any multi compressor; yet, generalizing this assumption might be too bold. Change-Id: Iec5057d16d4a7ba5fb573133a30ea10869bd67e0 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/33386 Reviewed-by: Nikos Nikoleris <nikos.nikoleris@arm.com> Maintainer: Nikos Nikoleris <nikos.nikoleris@arm.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-08-31 17:45:43 +00:00
Daniel R. Carvalho	605a0917bb	mem-cache: Make compression size threshold a percentage By changing the parameter into a percentage, changing the block size will automatically reconfigure the size threshold. Also, change the default percentage to 50% to avoid storing blocks unlikely to co-allocate in compressed format. Change-Id: I1458f19db39becc2d40c00269132fea01770016f Signed-off-by: Daniel R. Carvalho <odanrc@yahoo.com.br> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/33385 Reviewed-by: Nikos Nikoleris <nikos.nikoleris@arm.com> Maintainer: Nikos Nikoleris <nikos.nikoleris@arm.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-08-31 17:45:43 +00:00
Daniel R. Carvalho	bc92a06cf5	mem-cache: Add stats for failed compressions Add statistics to keep track of the number of times compression has failed to provide blocks whose compressed size passes the size threshold. Also, update the compressed data's size if compression fails. Change-Id: If3479572bf114f07911238c602ffef3a90b6a931 Signed-off-by: Daniel R. Carvalho <odanrc@yahoo.com.br> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/33384 Reviewed-by: Nikos Nikoleris <nikos.nikoleris@arm.com> Maintainer: Nikos Nikoleris <nikos.nikoleris@arm.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-08-31 17:45:43 +00:00
Daniel R. Carvalho	53ef7c1e6c	mem-cache: Handle zero sizes on compression The size can be zero in special occasions, which would generate divisions by zero. This patch expands the stats to support them. It also fixes the compression factor calculation in the Multi compressor. As a side effect, now that zero sizes are handled, allow the Zero compressor to generate it. Change-Id: I9f7dee76576b09fdc9bef3e1f3f89be3726dcbd9 Signed-off-by: Daniel R. Carvalho <odanrc@yahoo.com.br> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/33383 Reviewed-by: Nikos Nikoleris <nikos.nikoleris@arm.com> Maintainer: Nikos Nikoleris <nikos.nikoleris@arm.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-08-31 17:45:43 +00:00
Daniel R. Carvalho	58c7fc72d3	mem-cache: Add an extra decomp lat to multi compressor There is extra hardware required when dealing with multi compressors. As such, add a parameter to allowing increasing their decompression latency to account for any extra delay. Change-Id: I153e4c5ab6927ac092e2ebd767fe88974597bb20 Signed-off-by: Daniel R. Carvalho <odanrc@yahoo.com.br> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/33382 Reviewed-by: Nikos Nikoleris <nikos.nikoleris@arm.com> Maintainer: Nikos Nikoleris <nikos.nikoleris@arm.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-08-31 17:45:43 +00:00
Daniel R. Carvalho	99a8c5a27a	mem-cache: Store BDI's encoding in tags According to the original paper the compressors' encodings are stored in the tag-store (Storage cost analysis section). Change-Id: I4c34f86022eea6d1ba0ae29dd74d5714bbad367a Signed-off-by: Daniel R. Carvalho <odanrc@yahoo.com.br> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/33381 Reviewed-by: Nikos Nikoleris <nikos.nikoleris@arm.com> Maintainer: Nikos Nikoleris <nikos.nikoleris@arm.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-08-31 17:45:43 +00:00
Daniel R. Carvalho	35f77329e8	mem-cache: Add encoding bits to the data of multi compressors When compressing using a multi-compressor, one must be able to identify which sub-compressor should be used to decompress data. This can be achieved by either adding encoding bits to block's tag or data entry. It was previously assumed that these encoding bits would be added to the tag, but now make it a parameter that defaults to the data entry. Change-Id: Id322425e7a6ad59cb2ec7a4167a43de4c55c482c Signed-off-by: Daniel R. Carvalho <odanrc@yahoo.com.br> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/33380 Reviewed-by: Nikos Nikoleris <nikos.nikoleris@arm.com> Maintainer: Nikos Nikoleris <nikos.nikoleris@arm.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-08-31 17:45:43 +00:00
Daniel R. Carvalho	de94a29f85	mem-cache: Standardize data parsing in compressors The compressors are not able to process a whole line at once, so they must divide it into multiple same-sized chunks. This patch makes the base compressor responsible for this division, so that the derived classes are mostly agnostic to this translation. This change has been coupled with a change of the signature of the public compress() to avoid introducing a temporary function rename. Previously, this function did not return the compressed data, under the assumption that everything related to the compressed data would be handled by the compressor. However, sometimes the units using the compressor could need to know or store the compressed data. For example, when sharing dictionaries the compressed data must be checked to determine if two blocks can co-allocate (DISH, Panda et al. 2016). Change-Id: Id8dbf68936b1457ca8292cc0a852b0f0a2eeeb51 Signed-off-by: Daniel R. Carvalho <odanrc@yahoo.com.br> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/33379 Reviewed-by: Nikos Nikoleris <nikos.nikoleris@arm.com> Maintainer: Nikos Nikoleris <nikos.nikoleris@arm.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-08-31 17:45:43 +00:00
Daniel R. Carvalho	3fc4c0a415	mem-cache: Allow inheriting from DitionaryCompressor's comp data Previously either the compression data was the one declared within DictionaryCompressor, or the derived class would have to override the compress() to use a derived compression data. With this change, the instantiation can be overridden, and thus any derived class can choose the compression data pointer type they need to use. Change-Id: I387936265a3de6785a6096c7a6bd21774202b1c7 Signed-off-by: Daniel R. Carvalho <odanrc@yahoo.com.br> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/33378 Reviewed-by: Nikos Nikoleris <nikos.nikoleris@arm.com> Maintainer: Nikos Nikoleris <nikos.nikoleris@arm.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-08-31 17:45:43 +00:00
Daniel R. Carvalho	8bb0e3749b	mem-cache: Upgrade Compressor::Multi's stats Use new style stats API for Compressor::Multi's stats. Change-Id: Ia0313704cae4e7bd6bc675c71ea75b42a8e542f2 Signed-off-by: Daniel R. Carvalho <odanrc@yahoo.com.br> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/33377 Reviewed-by: Nikos Nikoleris <nikos.nikoleris@arm.com> Maintainer: Nikos Nikoleris <nikos.nikoleris@arm.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-08-31 17:45:43 +00:00
Daniel R. Carvalho	c85c793499	mem-cache: Upgrade BaseDictionaryCompressor's stats Upgrade this compressor's stats to match current stats API. Change-Id: I1cb69230f8deca053bc860cedafc9e6e78446df7 Signed-off-by: Daniel R. Carvalho <odanrc@yahoo.com.br> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/33376 Reviewed-by: Nikos Nikoleris <nikos.nikoleris@arm.com> Maintainer: Nikos Nikoleris <nikos.nikoleris@arm.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-08-31 17:45:43 +00:00
Daniel R. Carvalho	0658b53e93	mem-cache: Fix RepeatedQwords compressor This compressor does not allocate dictionary entries when there is a match. This was causing the compressor to always fail. Change-Id: I50eb56fa284854f3ee87f33af2c6e0a5c5248d7c Signed-off-by: Daniel R. Carvalho <odanrc@yahoo.com.br> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/33375 Reviewed-by: Nikos Nikoleris <nikos.nikoleris@arm.com> Maintainer: Nikos Nikoleris <nikos.nikoleris@arm.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-08-31 17:45:43 +00:00
Daniel R. Carvalho	9e7dbd0544	mem-cache: Fix integer promotion of mask When applying the bitwise not to a short integer the compiler automatically promotes it to an integer. For example, if a 8-bit mask=0xFF, and the compiler decides to promote the mask to 32-bit to apply the bitwise not, ~mask=0xFFFFFF00, which will yield wrong results for popcount(): expected=0, got=24. Change-Id: I95efba5532c27ca004ff6947d4b51a8a14f09741 Signed-off-by: Daniel R. Carvalho <odanrc@yahoo.com.br> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/33374 Reviewed-by: Nikos Nikoleris <nikos.nikoleris@arm.com> Maintainer: Nikos Nikoleris <nikos.nikoleris@arm.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-08-31 17:45:43 +00:00
eavivi	9547cd285c	mem: convert base prefetcher and queued to new style stats Base and Queued inside src/mem/cache/prefetch converted Change-Id: I3d5907b58efefc4d8522b89f073507f2548bff2f Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/33475 Reviewed-by: Daniel Carvalho <odanrc@yahoo.com.br> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-08-31 16:36:53 +00:00
Gabe Black	1d755b4ba1	misc: Clean up usage of arch/isa_traits.hh. isa_traits.hh used to have much more in it, but now it only has PageShift, PageBytes, and (for now) the guest endianness. These values should only be retrieved from the System class generally speaking, so only the system class should include arch/isa_traits.hh. Some gpu compute related files need PageBytes or PageShift. Even though those files don't advertise their ISA dependence, they are tied to x86. In those files, they can include arch/x86/isa_traits.hh. The only other file which legitimately needs arch/isa_traits.hh is the decoder cache since it uses PageBytes to size an array. Change-Id: I12686368715623e3140a68a7027c136bd52567b1 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/33203 Reviewed-by: Gabe Black <gabeblack@google.com> Reviewed-by: Daniel Carvalho <odanrc@yahoo.com.br> Maintainer: Gabe Black <gabeblack@google.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-08-28 07:20:58 +00:00
Shivani Parekh	cf43bc3c8b	mem: Update port terminology Change-Id: Ib4fc8cad7139d4971e74930295a69e576f6da3cf Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/32314 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-08-26 16:48:13 +00:00
Emily Brickey	4810c36401	misc: Updated port classes & refs to remove slaveBind()/UnBind() Change-Id: I9106397b8816d8148dd916510bbcf65ed499d303 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/32309 Reviewed-by: Bobby R. Bruce <bbruce@ucdavis.edu> Reviewed-by: Gabe Black <gabeblack@google.com> Maintainer: Gabe Black <gabeblack@google.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-08-26 16:48:13 +00:00
Shivani	e1f6d22234	mem: Deprecate SlavePort and MasterPort classes After this change, if you use these classes or inherit from these classes, the compiler will now give you a warning that these names are deprecated. Instead, you should use ResponsePort and RequestPort, respectively. This patch simply deprecates these names. The following patches will convert all of the code in gem5 to use these new names. The first step is converting the class names and the uses of these classes, then we will update the variable names to be more precise as well. Change-Id: I5e6e90b2916df4dbfccdaabe97423f377a1f6e3f Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/32308 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-08-26 16:48:13 +00:00
Daniel R. Carvalho	101e16facf	mem-cache: Create Compressor namespace Creation of the Compressor namespace. It encapsulates all the cache compressors, and other classes used by them. The following classes have been renamed: BaseCacheCompressor -> Base PerfectCompressor - Perfect RepeatedQwordsCompressor -> RepeatedQwords ZeroCompressor -> Zero BaseDictionaryCompressor and DictionaryCompressor were not renamed because the there is a high probability that users may want to create a Dictionary class that encompasses the dictionary contained by these compressors. To apply this patch one must force recompilation (e.g., by deleting it) of build/<arch>/params/BaseCache.hh (and any other files that were previously using these compressors). Change-Id: I78cb3b6fb8e3e50a52a04268e0e08dd664d81230 Signed-off-by: Daniel R. Carvalho <odanrc@yahoo.com.br> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/33294 Reviewed-by: Nikos Nikoleris <nikos.nikoleris@arm.com> Maintainer: Nikos Nikoleris <nikos.nikoleris@arm.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-08-25 15:13:05 +00:00
Gabe Black	1cf7b28ba6	mem: Use getGuestByteOrder in the indirect memory prefetcher. Use that instead of accessing TheISA::GuestByteOrder directly. Change-Id: I6fbeb7501aceadb95739bb482215097af18da2fa Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/32926 Reviewed-by: Daniel Carvalho <odanrc@yahoo.com.br> Maintainer: Gabe Black <gabeblack@google.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-08-21 22:18:23 +00:00
Gabe Black	1f7cc16a70	mem: Use the System object's getGuestByteOrder in AbstractMemory. Change-Id: Ifcf3d8dcbee73555b23ec0a8c25572921fca13a6 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/32925 Reviewed-by: Daniel Carvalho <odanrc@yahoo.com.br> Maintainer: Gabe Black <gabeblack@google.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-08-20 05:02:07 +00:00
Kyle Roarty	b872f02ab1	configs,gpu-compute,mem-ruby: connect gmTokenPorts in apu_se This patch adds gmTokenPorts to the ComputeUnit and RubyGPUCoalescer python classes so the gmTokenPorts can be connected in apu_se. Change-Id: Icf3cb05c757754d6935b46f14e4b1b1d5072c4ca Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/32677 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Reviewed-by: Anthony Gutierrez <anthony.gutierrez@amd.com> Maintainer: Anthony Gutierrez <anthony.gutierrez@amd.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-08-18 23:47:16 +00:00
Gabe Black	17afbc2416	misc: Rename CallbackQueue2 to CallbackQueue. Now that the original CallbackQueue has been removed, CallbackQueue2 can fully take it's place. Issue-on: https://gem5.atlassian.net/browse/GEM5-698 Change-Id: I925f647cbbd393045a22f7cbd5d8b4d7d23d19b0 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/32651 Reviewed-by: Andreas Sandberg <andreas.sandberg@arm.com> Maintainer: Andreas Sandberg <andreas.sandberg@arm.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-08-18 23:21:30 +00:00
Gabe Black	9ed3c7668b	misc: Make the stats callbacks use CallbackQueue2. Issue-on: https://gem5.atlassian.net/browse/GEM5-698 Change-Id: Idcbe04bdf4299925f321aa0ece263d86ed3fc8df Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/32645 Reviewed-by: Andreas Sandberg <andreas.sandberg@arm.com> Maintainer: Gabe Black <gabeblack@google.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-08-18 19:53:31 +00:00
Gabe Black	40e8cac306	misc: Make registerExitCallback use CallbackQueue2. Issue-on: https://gem5.atlassian.net/browse/GEM5-698 Change-Id: I526d4a19ca4e54a6469a4ee26693c1c0400fcc70 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/32644 Reviewed-by: Andreas Sandberg <andreas.sandberg@arm.com> Maintainer: Gabe Black <gabeblack@google.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-08-18 11:49:06 +00:00
Gabe Black	316f7d42dc	mem: Use the new type of CallbackQueue in the MemBackdoor. Issue-on: https://gem5.atlassian.net/browse/GEM5-698 Change-Id: Ide40528f8c613b46204550d6e6840a7b274a366a Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/32643 Reviewed-by: Andreas Sandberg <andreas.sandberg@arm.com> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-08-18 11:48:59 +00:00
Isaac Sánchez Barrera	7740fd7714	mem-cache,python: Allow custom TLB and events in each prefetcher. The `BasePrefetcher` python class had members `_events` and `_tlbs` defined as lists, meaning that any call to `list.append` on them would affect `_events` and `_tlbs` for all prefetchers, not just the calling object. This change redefines them as instance members to fix the problem. Change-Id: I68feb1d6d78e2fa5e8775afba8c81c6dd0de6c60 Signed-off-by: Isaac Sánchez Barrera <isaac.sanchez@bsc.es> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/32394 Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Nikos Nikoleris <nikos.nikoleris@arm.com>	2020-08-17 11:35:48 +00:00
Pouya Fotouhi	762153a421	mem-ruby: Fix debug prints for regular Stores In the updated implementation of LL/SC (27103) the default value of success was changed, which results in printing "SC_Failed" for any regular stores. Change-Id: I4f2e0b26233ce0cbdf948aadd19c9d81bf18bec0 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/32514 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-08-13 19:48:30 +00:00
Kyle Roarty	187c44fe44	mem-ruby: fix races between data and DMA in MOESI_AMD_Base-dir There are race conditions while running several benchmarks, where the DMA engine and the CorePair simultaneously send requests for the same block. This patch fixes two scenarios (a) If the request from the DMA engine arrives before the one from the CorePair, the directory controller records it as a pending request. However, once the DMA request is serviced, the directory doesn't check for pending requests. The CorePair, consequently, never sees a response to its request and this results in a Deadlock. Added call to wakeUpDependents in the transition from BDR_Pm to U Added call to wakeUpDependents in the transition from BDW_P to U (b) If the request from the CorePair is being serviced by the directory and the DMA requests for the same block, this causes an invalid transition because the current coherence doesn't take care of this scenario. Added transition state where the requests from DMA are added to the stall buffer. Updated B to U CoreUnblock transition to check all buffers, as the DMA requests were being placed later in the stall buffer than was being checked Change-Id: I5a76efef97723bc53cf239ea7e112f84fc874ef8 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/31996 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Reviewed-by: Bradford Beckmann <brad.beckmann@amd.com> Maintainer: Bradford Beckmann <brad.beckmann@amd.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-08-13 19:05:17 +00:00
Ian Jiang	78bccaf7a8	sim: Move checkpoint parameters for ptable into seperate section In checkpoint output files, the parameters for page table including size and entries are organized not very clearly. For example: [system.cpu.workload] ... ptable.size=... [system.cpu.workload.Entry0] vaddr=... paddr=... flags=... [system.cpu.workload.Entry1] ... This commit moves these parameters into a separate section named 'ptable'. For example: [system.cpu.workload.ptable] size=... [system.cpu.workload.ptable.Entry0] vaddr=... paddr=... flags=... [system.cpu.workload.ptable.Entry1] ... Change-Id: Iaa4129b3f4f090e8c3651bde90524abba0999c7f Signed-off-by: Ian Jiang <ianjiang.ict@gmail.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/31874 Reviewed-by: Daniel Carvalho <odanrc@yahoo.com.br> Reviewed-by: Gabe Black <gabeblack@google.com> Maintainer: Gabe Black <gabeblack@google.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-07-30 07:12:00 +00:00
Tony Gutierrez	44807669a0	configs, mem: Support running VIPER with GCN3 This changeset adds the necessary changes for running GCN3 ISA with VIPER in apu_se.py. Changes to the VIPER protocol configs are made to add support for DMA and scalar caches. hsaTopology is added to help the pseudo FS create the files needed by ROCm to understand the device on which the SW is being run. Change-Id: I0f47a6a36bb241a26972c0faafafcf332a7d7d1f Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/30274 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Bradford Beckmann <brad.beckmann@amd.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-07-28 19:01:09 +00:00
Daniel R. Carvalho	1ad015389c	mem-ruby: Use lookup function in cache There is a function to perform lookups; there is no need to replicate its code everywhere. Change-Id: I1290594615d282722cd91071be8ef3c372414e4e Signed-off-by: Daniel R. Carvalho <odanrc@yahoo.com.br> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/23946 Reviewed-by: John Alsop <johnathan.alsop@amd.com> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-07-25 10:51:06 +00:00
Daniel R. Carvalho	f54af2863c	mem-ruby: Cleanup replacement_data usage The replacement_data can be assigned as soon as a block is allocated. With this cleanup the lookup function can be used to avoid code duplication. Change-Id: I7561fddaa3ed348866699ecaf1e6aa477ba0bc9a Signed-off-by: Daniel R. Carvalho <odanrc@yahoo.com.br> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/23945 Reviewed-by: John Alsop <johnathan.alsop@amd.com> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-07-25 10:51:06 +00:00
Matthew Poremba	33f3659825	mem-ruby: Getter/setter for atomic ops in WriteMask Adding getter and setter methods for getting and setting the atomic ops in the WriteMask class. This allows for message types with WriteMasks to get or set the atomic ops without explicitly modifying the constructor for the message type. This will beused by the DMASequencer which uses the SequencerMsg type where the constructor is auto generated via SLICC. Change-Id: I71787d294c1b89547618e9a13e386b65bb3e1021 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/31474 Reviewed-by: Bobby R. Bruce <bbruce@ucdavis.edu> Reviewed-by: Anthony Gutierrez <anthony.gutierrez@amd.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-07-24 18:30:08 +00:00
Nikos Nikoleris	e50ab5a2ec	mem: Use beats_per_clock as the DDR data rate for DRAMPower The data rate is used by the drampower lib to estimate the power consumption of the DRAM Core. Previously, we used the formula: burst_cycles = divCeil(p->tBURST_MAX, p->tCK); data_rate = p->burst_length / burst_cycles; to derive the data_rate. However, under certain configurations this formula computes the wrong result due to rounding errors. This patch simplifies the way we derive the data_rate by passing the value of the DRAM parameter beats_per_clock. Change-Id: Ic8cd35bb4641d9c0a704675d2672a6fe4f4ec13e Signed-off-by: Nikos Nikoleris <nikos.nikoleris@arm.com> Reviewed-by: Wendy Elsasser <wendy.elsasser@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/30056 Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Daniel Carvalho <odanrc@yahoo.com.br>	2020-07-20 11:47:01 +00:00
Mahyar Samani	0bd936d071	sim: Fixed error when compiling gem5 with dramsim2. Compiling gem5 with dramsim2 included fails due to some inconsistencies in including SimObjects. In this patch this issue is fixed along with temporarily disabling -Werror=nonnull-compare in CCFLAGS. Also, the remote for cloning dramsim2 has been changed. Change-Id: Ia24095150d026d736352aaf0d735b7554ede10bb Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/31434 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-07-17 17:22:47 +00:00
Tony Gutierrez	a408b1ada7	mem-ruby: Add support for MemSync reqs in VIPER Change-Id: Ib129e82be5348c641a8ae18093324bcedfb38abe Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/29939 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Anthony Gutierrez <anthony.gutierrez@amd.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-07-15 18:14:41 +00:00
seanzw	75257c7a42	mem-ruby: Fix type casting in makeNextStrideAddress The RubyPrefetcher uses makeNextStrideAddress() with a negative stride to find prefetched address. The type of this expression is: uint64_t + uint32_t * int; This gives wrong result due to implicit conversion. Fix this with static cast and it works correctly: uint64_t + int * int; Change-Id: I36e17e00d5c66c3699fe1d5b287971225a162d04 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/31314 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-07-15 17:38:12 +00:00
Xianwei Zhang	024f978cff	gpu-compute: enable kernel-end WB functionality Change-Id: Ib17e1d700586d1aa04d408e7b924270f0de82efe Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/29938 Maintainer: Anthony Gutierrez <anthony.gutierrez@amd.com> Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Xianwei Zhang <xianwei.zhang@amd.com>	2020-07-13 23:32:37 +00:00
Boris Shingarov	f7e5985e7b	mem: Optionally share the backing store This patch adds the ability for a host-OS process external to gem5 to access the backing store via POSIX shared memory. The new param shared_backstore of the System object is the filename of the shared memory (i.e., the first argument to shm_open()). Change-Id: I98c948a32a15049a4515e6c02a14595fb5fe379f Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/30994 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-07-08 17:42:25 +00:00
Matthew Poremba	675e01216d	mem-ruby: Support device memories Adds support for device memories in the system and RubySystem classes. Devices may register memory ranges with the system class and packets which originate from the device MasterID will update the device memory in Ruby. In RubySystem functional access is updated to keep the packets within the Ruby network they originated from. Change-Id: I47850df1dc1994485d471ccd9da89e8d88eb0d20 JIRA: https://gem5.atlassian.net/browse/GEM5-470 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/29653 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-07-01 14:38:11 +00:00
Hoa Nguyen	01dd6dd460	mem: Fix python3 incompatibility issue in slicc's HTML builder In python3, an iterator does not have the next() method. next(iterator) works in both python2.7+ and python3. Change-Id: Ic1ceb993018a0f37e8d30086a054ffc2e311bb46 Signed-off-by: Hoa Nguyen <hoanguyen@ucdavis.edu> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/30874 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-07-01 06:41:09 +00:00
Kyle Roarty	1339a1b080	mem-ruby: add cache hit/miss statistics for TCP and TCC Change-Id: Ifa6fdbb9dd062a3684b9620eac6683c57e651a72 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/30174 Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Bradford Beckmann <brad.beckmann@amd.com> Maintainer: Bradford Beckmann <brad.beckmann@amd.com>	2020-06-20 04:20:45 +00:00
Matt Sinclair	8177fc4392	arch-gcn3: add support for unaligned accesses Previously, with HSAIL, we were guaranteed by the HSA specification that the GPU will never issue unaligned accesses. However, now that we are directly running GCN this is no longer true. Accordingly, this commit adds support for unaligned accesses. Moreover, to reduce the replication of nearly identical code for the different request types, I also added new helper functions that are called by all the different memory request producing instruction types in op_encodings.hh. Adding support for unaligned instructions requires changing the statusBitVector used to track the status of the memory requests for each lane from a bit per lane to an int per lane. This is necessary because an unaligned access may span multiple cache lines. In the worst case, each lane may span multiple cache lines. There are corresponding changes in the files that use the statusBitVector. Change-Id: I319bf2f0f644083e98ca546d2bfe68cf87a5f967 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/29920 Reviewed-by: Anthony Gutierrez <anthony.gutierrez@amd.com> Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Anthony Gutierrez <anthony.gutierrez@amd.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-06-19 20:41:18 +00:00

1 2 3 4 5 ...

2680 Commits