derek/gem5 - gem5 - Gitea: Git with a cup of tea

derek/gem5

Author	SHA1	Message	Date
Gabe Black	00876fff20	misc: Replace the GEM5_VAR_USED macro with [[maybe_unused]]. The [[maybe_unused]] attribute is now standard, so we can use that directly without hiding it behind a macro. Change-Id: If24ffd7e50bdb503cb3e6ea61f226ea794e84b8f Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/48511 Reviewed-by: Gabe Black <gabe.black@gmail.com> Maintainer: Gabe Black <gabe.black@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-07-29 10:17:51 +00:00
Yu-hsin Wang	8fa9dceaa8	fastmodel: Remove CortexA76 unpresented resource Change-Id: I8fde5f90cca45df9430c5f4159fa6e8319ad12df Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/44527 Reviewed-by: Gabe Black <gabe.black@gmail.com> Maintainer: Gabe Black <gabe.black@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-07-29 05:46:45 +00:00
Giacomo Travaglini	46a8bc2f56	arch: Provide an alternative view of the TLBs in the BaseMMU It is possible from the MMU to traverse the entire hierarchy of TLBs, starting from the DTB and ITB (generally speaking from the first level) up to the last level via the nextLevel pointer. So in theory no extra data should be stored in the BaseMMU. This design makes some operations a bit more complex. For example if we have a unified (I+D) L2, it will be pointed by both ITB and DTB. If we want to invalidate all TLB entries, we should be careful to not invalidate L2 twice, but if we simply follow the next level pointer, we might do so. This is not a problem from a functional perspective but alters the TLB statistics (a single invalidation is recorded twice) We then provide a different view of the set of TLBs in the system. At the init phase we traverse the TLB hierarchy and we add every TLB to the appropriate set. This makes invalidation (and any operation targeting a specific kind of TLBs) easier. JIRA: https://gem5.atlassian.net/browse/GEM5-790 Change-Id: Ieb833c2328e9daeaf50a32b79b970f77f3e874f7 Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/48146 Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Andreas Sandberg <andreas.sandberg@arm.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Andreas Sandberg <andreas.sandberg@arm.com>	2021-07-28 08:13:09 +00:00
Giacomo Travaglini	76996ea806	arch: Add a nextLevel pointer to BaseTLB This is a step towards supporting multi-level TLBs: Every TLB will have a pointer to the next level TLB in the hierarchy. Example: * L1 I-TLB * L1 D-TLB * L2 shared TLB (I+D) l2 = BaseTLB() itb = BaseTLB(next_level=l2) dtb = BaseTLB(next_level=l2) JIRA: https://gem5.atlassian.net/browse/GEM5-790 Change-Id: I398a17919564aad4b18efb8dace096965781ece1 Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/48145 Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Andreas Sandberg <andreas.sandberg@arm.com> Maintainer: Andreas Sandberg <andreas.sandberg@arm.com>	2021-07-28 08:13:09 +00:00
Giacomo Travaglini	1320dc4278	arch-arm: Remove unused parameter from TLB::insert Change-Id: Iab395834fe8b3fabf4f5f666af1b8790af08182d Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/48144 Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Andreas Sandberg <andreas.sandberg@arm.com> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Andreas Sandberg <andreas.sandberg@arm.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com>	2021-07-28 08:13:09 +00:00
Giacomo Travaglini	65195c8011	arch-arm, configs: Remove ArmITB/ArmDTB Removing ArmITB and ArmDTB makes sense as it implies a fixed 2 TLBs system; by using the generic ArmTLB class we open up to a more generic configuration This is also aligning to the other ISAs Change-Id: Ifc5cf7c41484d4f45b14d1766833ad4c4f7e9e86 Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/48143 Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Andreas Sandberg <andreas.sandberg@arm.com> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Andreas Sandberg <andreas.sandberg@arm.com>	2021-07-28 08:13:09 +00:00
Giacomo Travaglini	9964a3aca7	arch: Add TypeTLB Param in BaseTLB This patch is adding an enum Param in the BaseTLB to tag which kind of translation entries the TLB is holding * instruction: holding instruction entries * data: holding data entries * unified: holding instruction and data entries JIRA: https://gem5.atlassian.net/browse/GEM5-790 Change-Id: I033f840652f354523f48e9eb78033ea759b5d0e0 Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/48142 Tested-by: kokoro <noreply+kokoro@google.com>	2021-07-28 08:13:09 +00:00
Giacomo Travaglini	870f93301f	arch-arm: Move translation logic from the ArmTLB to the ArmMMU This patch is moving most of the TLB code to the MMU. In this way the TLB stops being the main translating agent and becomes a simple "passive" translation cache. All the logic behind virtual memory translation, like * Checking permission/alignment * Issuing page table walks * etc Is now embedded in the MMU model. This will allow us to stack multiple TLBs and to compose arbitrary hierarchies as their sole purpose now is to cache translations JIRA: https://gem5.atlassian.net/browse/GEM5-790 Change-Id: I687c639a56263d5e3bb6633dd8c9666c85edba3a Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/48141 Tested-by: kokoro <noreply+kokoro@google.com>	2021-07-28 08:13:09 +00:00
Gabe Black	b3b81196aa	misc: Replace type_traits.hh XX::value with XX_v. Now that we're using c++17, the type_traits with a ::value member have a _v alias which reduces verbosity. Or on other words std::is_integral<T>::value can be replaced with std::is_integral_v<T> Make this substitution throughout the code base. In places where gem5 introduced it's own similar templates, add a V alias, spelled differently to match gem5's internal style. gem5: :IsVarArgs<T>::value => gem5::IsVarArgsV<T> Change-Id: I1d84ffc4a236ad699471569e7916ec17fe5f109a Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/48604 Reviewed-by: Daniel Carvalho <odanrc@yahoo.com.br> Maintainer: Bobby R. Bruce <bbruce@ucdavis.edu> Tested-by: kokoro <noreply+kokoro@google.com>	2021-07-28 01:48:03 +00:00
Yu-hsin Wang	8b53b8bcdf	fastmodel: Use Iris API to access memory Memory space is not always outside of the CPU. For example the tightly coupled memory (TCM) is inside of the core. To make gdb access those kind of memory, we should use Iris memory API to read and write memory. If we access a memory address not inside the CPU with Iris memory API. The CPU would fire a request via amba transport_dbg. So the change also covers the original behavior. Change-Id: Ie223ab12f9a746ebafa21026a8680222f6ebd593 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/45581 Reviewed-by: Gabe Black <gabe.black@gmail.com> Maintainer: Gabe Black <gabe.black@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-07-28 00:33:02 +00:00
Gabe Black	be3e6174d6	fastmodel: Minimally implement reading MiscRegs for the CortexR52. This currently supports only the CPSR and SPSR currently. The CPSR is needed to be able to read the PC since that also reads other related info which ultimately comes from the CPSR. The SPSR is also set up since it was easy to do at the same time. Change-Id: I977fde47c81927f4972d4da2e781df306dfa3f4e Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/46139 Reviewed-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Maintainer: Giacomo Travaglini <giacomo.travaglini@arm.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-07-27 21:29:06 +00:00
Yu-hsin Wang	4560cf8531	fastmodel: add iris readMem and writeMem function Iris memory API allows us to access the memory inside the core, for example the tightly coupled memory (TCM). If we access a memory address which is not in the CPU, it also fire a request to memory system. Change-Id: I5925214534a10e3a55b780c3d4ed06e7559aafe0 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/45268 Reviewed-by: Gabe Black <gabeblack@google.com> Maintainer: Gabe Black <gabeblack@google.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-07-27 01:07:11 +00:00
Giacomo Travaglini	4ae8db4aa4	arch, arch-arm: Make BaseMMU translate methods virtual As we are shifting towards making the MMU the main translating agent, we need to make those methods virtual to let all ISAs move their TLB::translate* methods to the MMU class JIRA: https://gem5.atlassian.net/browse/GEM5-790 Change-Id: I50c84784546e8148230d79efe4bf010d0e36d6ab Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/48140 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Andreas Sandberg <andreas.sandberg@arm.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Andreas Sandberg <andreas.sandberg@arm.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-07-26 23:29:46 +00:00
Bobby R. Bruce	c0a3c70304	misc: Merge branch 'release-staging-v21-1' into develop Change-Id: I6ba57d7f70be70ae43fab396780d18623679a59a	2021-07-26 09:48:25 -07:00
Gabe Black	cb266a099f	misc: Replace GEM5_FALLTHROUGH with [[fallthrough]]. Now that the [[fallthrough]] attribute is standard (as of c++-17), we can use it directly instead of hiding it behind a macro. Change-Id: I4d11e35b619532b1a3fd8d042265e18c80d86f9b Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/48505 Reviewed-by: Daniel Carvalho <odanrc@yahoo.com.br> Maintainer: Gabe Black <gabe.black@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-07-24 21:57:04 +00:00
Kyle Roarty	523a92f7f0	arch-gcn3: Implement large ds_read/write instructions This implements the 96 and 128b ds_read/write instructions in a similar fashion to the 3 and 4 dword flat_load/store instructions. These instructions are treated as reads/writes of 3 or 4 dwords, instead of as a single 96b/128b memory transaction, due to the limitations of the VecOperand class used in the amdgpu code. In order to handle treating the memory transaction as multiple dwords, the patch also adds in new initMemRead/initMemWrite functions for ds instructions. These are similar to the functions used in flat instructions for the same purpose. Change-Id: I0f2ba3cb7cf040abb876e6eae55a6d38149ee960 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/48342 Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Alex Dutu <alexandru.dutu@amd.com> Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com>	2021-07-24 17:27:02 +00:00
Gabe Black	7daeed83f7	cpu,fastmodel: Eliminate the now unnecessary initMemProxies method. The proxies this method initializes no longer exist, since they're now created locally. Change-Id: I5fd1c99fbc00f5057ea8868e91be02d577b1c176 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/45909 Reviewed-by: Gabe Black <gabe.black@gmail.com> Maintainer: Gabe Black <gabe.black@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-07-23 08:59:54 +00:00
Gabe Black	a6e023906e	fastmodel,cpu: Eliminate the unused getVirtProxy. Change-Id: I84683a3297143102a74ac6dfe744cd5804b83fe4 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/45908 Reviewed-by: Gabe Black <gabe.black@gmail.com> Maintainer: Gabe Black <gabe.black@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-07-23 08:59:36 +00:00
Gabe Black	83b14e569b	misc: Stop using getVirtProxy. The proxies are not used on the critical path, and it's usually implicit whether they should be the FS or SE version. Ideally in the future we won't need to worry about which version we need to use, but the differences haven't quite been abstracted away, and occasionally we need to decide between the two. Change-Id: Idb363d6ddc681f7c1ad5e7aba69865f40aa30dc8 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/45907 Maintainer: Bobby R. Bruce <bbruce@ucdavis.edu> Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Giacomo Travaglini <giacomo.travaglini@arm.com>	2021-07-23 03:42:17 +00:00
Gabe Black	21c1d03dda	arch-x86: De-conditionalize segmentation microops. These were never used with conditions, so the condition check just added overhead. Also, the not-taken path through the instruction didn't actually set the destination to something, meaning that it would set it to something arbitrary and not actually leave it unmodified. Change-Id: I33fef088979b14ad74adf22b26419a1cacf386dd Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/45305 Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Gabe Black <gabe.black@gmail.com> Maintainer: Gabe Black <gabe.black@gmail.com>	2021-07-21 09:45:32 +00:00
Giacomo Travaglini	a10106e94a	arch-arm: Stage1&2 TableWalkers sharing same port This patch reverts part of the changes made by the removal of the Stage2MMU class [1]: Prior to that patch the stage1 and stage2 walkers were sharing the same port (which was instantiated in the Stage2MMU). By removing the Stage2MMU we provided every table walker a unique port. With this patch we are reintroducing port sharing to temporarily fix existing platforms using walker caches. (The long term design goal will be to have a unique page table walker) Those complain if we try to connect a single ported cache to 2 table walker ports (stage1 and stage2) [1]: https://gem5-review.googlesource.com/c/public/gem5/+/45780 Change-Id: Ib68ef97f1e9772a698771269c9a4ec4514f5d4d7 Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/48200 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-07-19 13:27:11 +00:00
Hoa Nguyen	a021618745	arch-riscv: Revert change-45522 This reverts change: https://gem5-review.googlesource.com/c/public/gem5/+/45522. This reverts commit `1cf41d4c54`. Reason for revert: The above commit caused booting Linux using RISCV either to hang or to take significantly time more than to finish. For the v21-1 release, the above commit will be reverted. JIRA: https://gem5.atlassian.net/browse/GEM5-1043 Change-Id: I58fbe96d7ea50031eba40ff49dabdef971faf6ff Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/48099 Reviewed-by: Bobby R. Bruce <bbruce@ucdavis.edu> Maintainer: Bobby R. Bruce <bbruce@ucdavis.edu> Tested-by: kokoro <noreply+kokoro@google.com>	2021-07-16 20:50:47 +00:00
Kyle Roarty	46e62e5eb3	arch-gcn3: Free dest registers in non-memory Load DS insts Certain DS insts are classfied as Loads, but don't actually go through the memory pipeline. However, any instruction classified as a load marks its destination registers as free in the memory pipeline. Because these instructions didn't use the memory pipeline, they never freed their destination registers, which led to a deadlock. This patch explicitly calls the function used to free the destination registers in the execute() method of those Load instructions that don't use the memory pipeline. Change-Id: Ic2ac2e232c8fbad63d0c62c1862f2bdaeaba4edf Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/48019 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Reviewed-by: Bobby R. Bruce <bbruce@ucdavis.edu> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-07-14 20:47:27 +00:00
Javier Garcia Hernandez	4754e32219	arch-arm: Fixes an error related to HTM error code handling Arguments of the function bits(), called in restore method, are the other way around. This leads to wrong retry handling. Jira Issue: https://gem5.atlassian.net/browse/GEM5-1041 Change-Id: I0748b1cad57bea5527ca585852d183bd75b4c9ef Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/47939 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-07-14 17:21:37 +00:00
Kyle Roarty	5820818c11	arch-vega: Add fatal when decoding missing insts Certain instructions don't have implementations in instructions.cc, and get decoded as a nullptr. This adds a fatal when decoding a missing instruction, as we aren't able to properly run a program if all its instructions aren't implemented, and it allows us to figure out which instruction is missing due to fatals printing the line they were called. Change-Id: I7e3690f079b790dceee102063773d5fbbc8619f1 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/47522 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-07-13 01:26:39 +00:00
Kyle Roarty	6d2404acc5	arch-x86: Ignore mbind syscall mbind gets called when running with a dGPU in ROCm 4, but we are able to ignore it without breaking anything Change-Id: I7c1ba47656122a5eb856981dca2a05359098e3b2 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/47526 Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com>	2021-07-09 16:11:20 +00:00
Daniel R. Carvalho	79bab1dc5d	mem: Adopt a memory namespace for memories Encapsulate every class inheriting from Abstract or Physical memories, and the memory controller in a memory namespace. Change-Id: I228f7e55efc395089e3616ae0a0a6325867bd782 Issued-on: https://gem5.atlassian.net/browse/GEM5-983 Signed-off-by: Daniel R. Carvalho <odanrc@yahoo.com.br> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/47309 Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com>	2021-07-09 11:24:10 +00:00
Kyle Roarty	06da510020	arch-vega: Add decoding for implemented insts Certain instructions were implemented in instructions.cc, but weren't actually being decoded by the decoder, causing the decoder to return nullptr for valid instructions. This patch fixes the decoder to return the proper instruction class for implemented instructions Change-Id: I8d8525a1c435147017cb38d9df8e1675986ef04b Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/47521 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Reviewed-by: Matthew Poremba <matthew.poremba@amd.com> Reviewed-by: Alex Dutu <alexandru.dutu@amd.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-07-09 03:49:29 +00:00
Kyle Roarty	9fe9d83e5b	arch-vega: Add missing return to flat_load_dwordx4 Change-Id: Ibf56c25a3d22d3c12ae2c1bb11f00f4a44b5919a Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/47520 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Reviewed-by: Matthew Poremba <matthew.poremba@amd.com> Reviewed-by: Alex Dutu <alexandru.dutu@amd.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-07-09 03:49:29 +00:00
Kyle Roarty	cb73fe1959	arch-vega: Fix s_endpgm instruction Copy over changes that had been made to s_engpgm in GCN3 but weren't added to the Vega implementation Change-Id: I1063f83b1ce8f7c5e451c8c227265715c8f725b9 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/47519 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Reviewed-by: Matthew Poremba <matthew.poremba@amd.com> Reviewed-by: Alex Dutu <alexandru.dutu@amd.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-07-09 03:49:29 +00:00
Richard Cooper	c25ef2d191	arch-arm: Update ARMv8.1-PAN to allow unprivileged instructions. Update the ARMv8.1-PAN implementation to allow specified unprivileged instructions to execute even when the cpsr.pan bit is set. The specified instructions generate memory requests with the TLB::ArmFlags::UserMode flags bit set. See sections D5.4.2 (About PSTATE.PAN) and G5.6.2 (About the PAN bit) of the Arm Architecture Reference Manual for details. https://developer.arm.com/documentation/ddi0487/latest/ Change-Id: I9e904e0154de72c2e4cc70cbc49b3c8407a3cb1d Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/47779 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Giacomo Travaglini <giacomo.travaglini@arm.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-07-08 08:48:51 +00:00
Richard Cooper	704ef29f4f	arch-arm: Fix for build error in recent MacOS 11. On a recent version of MacOS 11, the build fails due to the missing sysctl.h include. Updated the preprocessor macros to include this file for __APPLE__ builds. Change-Id: I985d6c2ea97b82b32750bb562b2051f87d6c2e65 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/47760 Reviewed-by: Nikos Nikoleris <nikos.nikoleris@arm.com> Reviewed-by: Bobby R. Bruce <bbruce@ucdavis.edu> Maintainer: Bobby R. Bruce <bbruce@ucdavis.edu> Tested-by: kokoro <noreply+kokoro@google.com>	2021-07-08 08:48:08 +00:00
Kyle Roarty	02dd6b77ff	arch-gcn3,arch-vega,gpu-compute: Move request counters When the Vega ISA got committed, it lacked the request counter tracking for memory requests that existed in the GCN3 code. Instead of copying over the same lines from the GCN3 code to the Vega code, this commit makes the various memory pipelines handle updating the request counter information instead, as every memory instruction calls a memory pipeline. This commit also adds an issueRequest in scalar_memory_pipeline, as previously, the gpuDynInsts were explicitly placed in the queue of issuedRequests. Change-Id: I5140d3b2f12be582f2ae9ff7c433167aeec5b68e Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/45347 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-07-08 01:18:01 +00:00
Kyle Roarty	3f9b03522c	arch-gcn3,gpu-compute: Set gpuDynInst exec_mask before use vector_register_file uses the exec_mask of a memory instruction in order to determine if it should mark a register as in-use or not. Previously, the exec_mask of memory instructions was only set on execution of that instruction, which occurs after the code in vector_register_file. This led to the code reading potentially garbage data, leading to a scenario where a register would be marked used when it shouldn't be. This fix sets the exec_mask of memory instructions in schedule_stage, which works because the only time the wavefront execMask() is updated is on a instruction executing, and we know the previous instruction will have executed by the time schedule_stage executes, due to the order the pipeline is executed in. This also undoes part of a patch from last year (`62ec973`) which treated the symptom of accidental register allocation, without preventing the registers from being allocated in the first place. This patch also removes now redundant code that sets the exec_mask in instructions.cc for memory instructions Change-Id: Idabd35020000764fb06133ac2458606c1aaf6f04 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/45346 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Reviewed-by: Matthew Poremba <matthew.poremba@amd.com> Maintainer: Matthew Poremba <matthew.poremba@amd.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-07-08 01:18:01 +00:00
Kyle Roarty	ccfee78f3a	arch-gcn3: Read registers in execute instead of initiateAcc Certain memory writes were reading their registers in initiateAcc, which lead to scenarios where a subsequent instruction would execute, clobbering the value in that register before the memory writes' initiateAcc method was called, causing the memory write to read wrong data. This patch moves all register reads to execute, preventing the above scenario from happening. Change-Id: Iee107c19e4b82c2e172bf2d6cc95b79983a43d83 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/45345 Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Reviewed-by: Matthew Poremba <matthew.poremba@amd.com> Reviewed-by: Alex Dutu <alexandru.dutu@amd.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com>	2021-07-08 01:18:01 +00:00
Daniel R. Carvalho	5ff1fac819	misc: Rename Debug namespace as debug As part of recent decisions regarding namespace naming conventions, all namespaces will be changed to snake case. gem5::Debug became gem5::debug. Change-Id: Ic04606baab3317d2e58ab3ca9b37fc201c406ee8 Signed-off-by: Daniel R. Carvalho <odanrc@yahoo.com.br> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/47305 Reviewed-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Maintainer: Giacomo Travaglini <giacomo.travaglini@arm.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-07-07 23:18:59 +00:00
Daniel R. Carvalho	7ded9b414c	arch-arm: Rename debug variables Pave the way for a "debug" namespace. Change-Id: I1796711cbde527269637b30b0b09cd06c9e25fa1 Signed-off-by: Daniel R. Carvalho <odanrc@yahoo.com.br> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/47304 Reviewed-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Maintainer: Giacomo Travaglini <giacomo.travaglini@arm.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-07-07 23:18:59 +00:00
Gabe Black	1c7c825757	arch,kern: Use CRTP to build open flags tables, not macros. Change-Id: I433c064c66254c6e082fd6e37b4364576c2fbc3a Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/45903 Reviewed-by: Gabe Black <gabe.black@gmail.com> Maintainer: Gabe Black <gabe.black@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-07-07 18:05:51 +00:00
Giacomo Travaglini	3472fdadc3	arch-arm: Forward declare kvm_vcpu_init Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Change-Id: I6fa5be48498d1a8f9c070e9ded11e8cadd4b89a1 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/47679 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Daniel Carvalho <odanrc@yahoo.com.br> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-07-07 08:44:24 +00:00
Giacomo Travaglini	d1cdcb311b	misc: Move Mode and Translation from BaseTLB to BaseMMU This is a step towards moving most of the TLB logic to the MMU class. Change-Id: Id6b1fb30aa89960705f165f9738f5b50aa1e6bdb Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/46779 Tested-by: kokoro <noreply+kokoro@google.com>	2021-07-07 08:44:13 +00:00
Gabe Black	d58b4f004e	misc: Remove typedef (struct\|enum) Foo in cpp files. In C, to refer to a type without a struct or enum tag on the type, you need to typedef it like this: typedef struct { } Foo; Foo foo; In C++, this is unnecessary: struct Foo { }; Foo foo; Remove all of the first form in C++ files and replace them with the second form. Change-Id: I37cc0d63b2777466dc6cc51eb5a3201de2e2cf43 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/46199 Reviewed-by: Gabe Black <gabe.black@gmail.com> Maintainer: Gabe Black <gabe.black@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-07-07 08:35:12 +00:00
Giacomo Travaglini	3a72e669dc	arch-arm: No need to copy haveLPAE when switching TLBs When calling the TLB::takeOverFrom, there is no need to copy the fixed haveLPAE variable as it is a system level parameter (from ArmSystem) and it is assumed to be the same for all TLBs (even the switched out) Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Change-Id: I0b010d18ae71e43290f7f76f229c1a231ff42ac0 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/46899 Reviewed-by: Richard Cooper <richard.cooper@arm.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-07-06 12:56:52 +00:00
Giacomo Travaglini	065c9ec3f9	arch-arm: Forward declare kvm_reg_list The adoption of the gem5 namespace [1] broke aarch64 builds with the following error: build/ARM/arch/arm/kvm/base_cpu.cc:198:22: error: aggregate 'gem5::kvm_reg_list regs_probe' has incomplete type and cannot be defined In file included from build/ARM/arch/arm/kvm/base_cpu.cc:38:0: build/ARM/arch/arm/kvm/base_cpu.hh:115:28: note: forward declaration of 'struct gem5::kvm_reg_list' std::unique_ptr<struct kvm_reg_list> tryGetRegList(uint64_t nelem) const; Forward declaring the struct defined in linux/kvm.hh (included in source file) in the global namespace, rather than the gem5 one fixes the problem [1]: https://gem5-review.googlesource.com/c/public/gem5/+/46323 Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Change-Id: I96c7d31aa4810edcf98e23cefeaf4895620b6444 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/47619 Reviewed-by: Daniel Carvalho <odanrc@yahoo.com.br> Reviewed-by: Richard Cooper <richard.cooper@arm.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-07-06 12:41:18 +00:00
Daniel R. Carvalho	4b2118ed4b	misc: Remove sim/cur_tick dependency from sim/core.hh Remove this unnecessary dependency. Fixed all incorrect includes of sim/core.hh. Change-Id: I3ae282dbaeb45fbf4630237a3ab9b1a593ffbe0c Signed-off-by: Daniel R. Carvalho <odanrc@yahoo.com.br> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/43592 Reviewed-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Maintainer: Giacomo Travaglini <giacomo.travaglini@arm.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-07-06 09:59:11 +00:00
Sandipan Das	711f7a3009	arch-power: Add clone support This adds support for the clone() system call using which multiple cpus can be utilized in SE mode. For this change, it should be noted that Linux on Power uses the CLONE_BACKWARDS argument order. Change-Id: Iac91a7d110d9f7a133b8e102ac113f48a431a0d6 Signed-off-by: Kevin Joe <0keik0de@gmail.com> Signed-off-by: Chetan Agarwal <chetanag35@gmail.com> Signed-off-by: Sandipan Das <sandipan@linux.ibm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/47579 Reviewed-by: Boris Shingarov <shingarov@labware.com> Maintainer: Boris Shingarov <shingarov@labware.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-07-05 18:41:35 +00:00
Sandipan Das	80becc6684	arch-power: Update copyrights Change-Id: Ifabd1e7178b5250767a2b560b57570512b732278 Signed-off-by: Sandipan Das <sandipan@linux.ibm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/40948 Reviewed-by: Boris Shingarov <shingarov@labware.com> Maintainer: Boris Shingarov <shingarov@labware.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-07-04 18:20:26 +00:00
Sandipan Das	bef21a9f1c	arch-power: Fix load-store timing sequence To properly implement load-store instructions for use with the TimingSimpleCPU model, the initiateAcc() part of the instruction should only be responsible for performing the effective address computation and then initiating memory access. The completeAcc() part of the instruction should then be responsible for setting the condition register flags or updating the base register based on the outcome of the memory access. This fixes the following instructions: * Load Byte and Zero with Update (lbzu) * Load Halfword and Zero with Update (lhzu) * Load Halfword Algebraic with Update (lhau) * Load Word and Zero with Update (lwzu) * Load Doubleword with Update (ldu) * Load Floating Single with Update (lfsu) * Load Floating Double with Update (lfdu) * Load Byte and Zero with Update Indexed (lbzux) * Load Halfword and Zero with Update Indexed (lhzux) * Load Halfword Algebraic with Update Indexed (lhaux) * Load Word and Zero with Update Indexed (lwzux) * Load Word Algebraic with Update Indexed (lwaux) * Load Doubleword with Update Indexed (ldux) * Load Floating Single with Update Indexed (lfsux) * Load Floating Double with Update Indexed (lfdux) * Load Byte And Reserve Indexed (lbarx) * Load Halfword And Reserve Indexed (lharx) * Load Word And Reserve Indexed (lwarx) * Load Doubleword And Reserve Indexed (ldarx) * Store Byte with Update (stbu) * Store Halfword with Update (sthu) * Store Word with Update (stwu) * Store Doubleword with Update (stdu) * Store Byte with Update Indexed (stbux) * Store Halfword with Update Indexed (sthux) * Store Word with Update Indexed (stwux) * Store Doubleword with Update Indexed (stdux) * Store Byte Conditional Indexed (stbcx.) * Store Halfword Conditional Indexed (sthcx.) * Store Word Conditional Indexed (stwcx.) * Store Doubleword Conditional Indexed (stdcx.) * Store Floating Single with Update (stfsu) * Store Floating Double with Update (stdsu) * Store Floating Single with Update Indexed (stfsux) * Store Floating Double with Update Indexed (stfdux) Change-Id: If5f720619ec3c40a90c1362a9dfc8cc204e57acf Signed-off-by: Sandipan Das <sandipan@linux.ibm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/40947 Reviewed-by: Boris Shingarov <shingarov@labware.com> Maintainer: Boris Shingarov <shingarov@labware.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-07-04 18:20:15 +00:00
Sandipan Das	f019d9f866	arch-power: Add support for trapping user faults This adds support for trapping into GDB when user-mode faults such as those pertaining to alignment (SIGBUS), traps (SIGTRAP) and unimplemented opcodes (SIGILL) are encountered. Change-Id: Ieb557abd4173b5acb4be6f0c30964aea1eba71a5 Signed-off-by: Sandipan Das <sandipan@linux.ibm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/47359 Reviewed-by: Boris Shingarov <shingarov@labware.com> Maintainer: Boris Shingarov <shingarov@labware.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-07-04 18:19:56 +00:00
Sandipan Das	065362ddfc	arch-power: Add multi-mode debugging support This adds multi-mode support for remote debugging via GDB with the addition of the XML target description files for both 32-bit and 64-bit variants of the Power architecture. Proper byte order conversions have also been added. MSR has now been modeled to some extent but it is still not exposed by getRegs() since its a privileged register that cannot be modified from userspace. Similarly, the target descriptions require FPSCR to also be part of the payload and hence, it has been added too. Change-Id: I156fdccb791f161959dbb2c3dd8ab1e510d9cd4b Signed-off-by: Sandipan Das <sandipan@linux.ibm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/40946 Reviewed-by: Boris Shingarov <shingarov@labware.com> Maintainer: Boris Shingarov <shingarov@labware.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-07-04 18:19:47 +00:00
Sandipan Das	6b37a7e02c	arch-power: Fix process initialization During process initialization, special purpose registers should either be explicitly set or cleared. These contain flag bits which might have unforseen side effects on the execution of a program. Change-Id: If7c5af9a93283a53717cc8cbba4bf373a7e40560 Signed-off-by: Sandipan Das <sandipan@linux.ibm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/40945 Reviewed-by: Boris Shingarov <shingarov@labware.com> Maintainer: Boris Shingarov <shingarov@labware.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-07-04 18:19:35 +00:00

1 2 3 4 5 ...

4783 Commits