derek/gem5 - gem5 - Gitea: Git with a cup of tea

derek/gem5

Author	SHA1	Message	Date
Kyle Roarty	bd52b4793d	configs: Replace DirMem w/RubyDirectoryMemory, set addr_ranges This was originally from the GCN staging branch, which only had GPU_VIPER.py, but the other GPU_VIPER configs had DirMem as well, so I applied this change to all of them. The patch replaces the Directory in DirCntrl from DirMem to RubyDirectoryMemory. This fixes errors that DirMem caused relating to setting class variables. It also generates and sets addr_ranges in DirCntrl as RubyDirectoryMemory uses the parent object's addr_ranges in its code The style checker complained about a line length in GPU_VIPER_Region, so the patch also fixes that Change-Id: Icec96777a51d8a826b576fc752fae0f7f15427bc Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/32674 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Reviewed-by: Bradford Beckmann <brad.beckmann@amd.com> Maintainer: Bradford Beckmann <brad.beckmann@amd.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-08-18 19:08:48 +00:00
Emily Brickey	a65eed883b	arch-arm: convert tlb to new style stats Change-Id: I2a3f138b53496be6361a1a2b81fa471a56a4dc10 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/32794 Reviewed-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Maintainer: Giacomo Travaglini <giacomo.travaglini@arm.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-08-18 16:35:02 +00:00
Giacomo Travaglini	eb1ac7a011	arch-arm: Early checking if debug is enabled in TLB The patch is aiming at speeding up gem5 execution. The TLB::translateFs is in the critical path of the simulator: every fetch + ld/st will make use of it. Checking all the time for a breakpoint during fetch is rather expensive; it is better to make use of the cached booleans in SelfDebug to do an early check to see if any of Watchpoint/Breakpopint/VectorCatch/SoftwareStep is enabled. Most workloads won't use them so there's no point on calling the testDebug method Change-Id: I0189b84e0dc2e081acce04ff44787b9f1014477c Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Reviewed-by: Richard Cooper <richard.cooper@arm.com> Reviewed-by: Nikos Nikoleris <nikos.nikoleris@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/32776 Tested-by: kokoro <noreply+kokoro@google.com>	2020-08-18 13:04:58 +00:00
Giacomo Travaglini	81ad7e66db	arch-arm: Rename SelfDebug member variables * enableFlag -> mde The "enableFlag" variable, enabling the Breakpoint, Watchpoint, Vector Catch exceptions is actually the cached version of MDSCR_EL1.MDE. The "enableFlag" name looks too general as it's not covering the Software Step exception case. * bKDE -> kde * bSDD -> sdd The b prefix was likely referring to "breakpoint". However these bitfields are actually used by watchpoints as well. Change-Id: I48b762b32b2d763f4c4ceb7dcc28968cfb470fc1 Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Reviewed-by: Nikos Nikoleris <nikos.nikoleris@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/32775 Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Richard Cooper <richard.cooper@arm.com>	2020-08-18 13:04:58 +00:00
Giacomo Travaglini	b438ec2b1c	arch-arm: Remove setters from SoftwareStep Motivation: Those helpers are used and meant to be used by the parent (SelfDebug) class only. There is no point on exposing them to the outer world. Better to make SelfDebug a friend class and to allow it to access children's private data. Change-Id: Ib945b1aa46742b90062ce7a5de563f164127075f Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Reviewed-by: Nikos Nikoleris <nikos.nikoleris@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/32774 Reviewed-by: Richard Cooper <richard.cooper@arm.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-08-18 13:04:58 +00:00
Gabe Black	40e8cac306	misc: Make registerExitCallback use CallbackQueue2. Issue-on: https://gem5.atlassian.net/browse/GEM5-698 Change-Id: I526d4a19ca4e54a6469a4ee26693c1c0400fcc70 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/32644 Reviewed-by: Andreas Sandberg <andreas.sandberg@arm.com> Maintainer: Gabe Black <gabeblack@google.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-08-18 11:49:06 +00:00
Gabe Black	316f7d42dc	mem: Use the new type of CallbackQueue in the MemBackdoor. Issue-on: https://gem5.atlassian.net/browse/GEM5-698 Change-Id: Ide40528f8c613b46204550d6e6840a7b274a366a Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/32643 Reviewed-by: Andreas Sandberg <andreas.sandberg@arm.com> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-08-18 11:48:59 +00:00
Gabe Black	4f121c11a2	base: Add a new type of CallbackQueue. This type is templated on what arguments the callbacks in it accept, and it inherits directly from std::list instead of containing one and forwarding selected members. This version is called CallbackQueue2, but once all CallbackQueue instances have been replaced it will be renamed to CallbackQueue. Issue-on: https://gem5.atlassian.net/browse/GEM5-698 Change-Id: I32ab7454ea8c6a2af31cbcf5d4932a069ace1cb5 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/32642 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Gabe Black <gabeblack@google.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-08-18 11:48:51 +00:00
Ian Jiang	96f482fd6c	arch-riscv: Fix disassembling of all register instructions How many Rs to output in disassembling register instructions? It does not depend on wheather the register index is zero, but on the count of source registers. This patch fixes the problem. Change-Id: I9a770722003bc6f4a259589a7471a506494d4c86 Signed-off-by: Ian Jiang <ianjiang.ict@gmail.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/32694 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-08-18 01:11:15 +00:00
Emily Brickey	bfc3967695	arch-arm: convert table_walker to new style stats Change-Id: I347a72d33e3d0eb9f60ac01dfa2cc82bdbae3cbb Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/32494 Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Giacomo Travaglini <giacomo.travaglini@arm.com>	2020-08-17 16:57:00 +00:00
Isaac Sánchez Barrera	7740fd7714	mem-cache,python: Allow custom TLB and events in each prefetcher. The `BasePrefetcher` python class had members `_events` and `_tlbs` defined as lists, meaning that any call to `list.append` on them would affect `_events` and `_tlbs` for all prefetchers, not just the calling object. This change redefines them as instance members to fix the problem. Change-Id: I68feb1d6d78e2fa5e8775afba8c81c6dd0de6c60 Signed-off-by: Isaac Sánchez Barrera <isaac.sanchez@bsc.es> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/32394 Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Nikos Nikoleris <nikos.nikoleris@arm.com>	2020-08-17 11:35:48 +00:00
Gabe Black	7121bc58ac	util,systemc: Update the stats API used in one of the examples. A new parameter as added to the initText method in March of this year, but this example code was not updated which prevents it from compiling. This change adds the parameter to the call and sets it to what the documenting comments say is the default, true. Change-Id: Ic8da46dba03f01f338c38a7bc02ba232a90ae349 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/32641 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Gabe Black <gabeblack@google.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-08-16 04:29:34 +00:00
Gabe Black	2e824fa857	util,systemc: Update the gem5-within-systemc TLM example code. Some class names within gem5 changed in March of last year, and this code was not updated to match. Change ExternalMaster::Port to ExternalMaster::ExternalPort, and ExternalSlave::Port to ExternalSlave::ExternalPort. Change-Id: I04c0970c4107de3449473c24c7c6f99ada72bbb3 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/32640 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Gabe Black <gabeblack@google.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-08-16 04:29:27 +00:00
Giacomo Travaglini	db3490445d	util: Add Xen compilation to gen_arm_fs_files.py Change-Id: I61014d9686f0362ebb83dca5d4d33ac08d66d0a7 Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/32557 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Richard Cooper <richard.cooper@arm.com> Maintainer: Gabe Black <gabeblack@google.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-08-15 13:09:17 +00:00
Giacomo Travaglini	7e78402cc7	util: Remove dependency check The list is rather old and it contains some entries which are likely unneeded. Since we are also now able to select specific FS binaries to be compiled individually, there is not point of requiring all components to be installed. Instead, if is better to rely on the error message of building process and let the users figure out which packages they need to install Change-Id: I16c74861cb1f2b09c3e91e408ace01a9bd7a234d Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/32556 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-08-15 13:09:17 +00:00
Giacomo Travaglini	4118feb041	util: Allow the short -j option in gen_arm_fs_files.py Change-Id: I15c3bad13882cd38683b7c733311191e1f51d13f Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/32555 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-08-15 13:09:17 +00:00
Giacomo Travaglini	6d5f631c5c	util: Change gen_arm_fs_files.py to allow selective compilation With the -b/--fs-binaries option it is possible to specify a list of fs binaries to be fetched/compiled. Change-Id: I12a642f65b74e8606c82cdddcbc3a8172bad2381 Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/32554 Reviewed-by: Richard Cooper <richard.cooper@arm.com> Maintainer: Gabe Black <gabeblack@google.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-08-15 13:09:17 +00:00
Bobby R. Bruce	26e3c06e7f	tests: Dropped the i386 host tag in tests Issue-on: https://gem5.atlassian.net/browse/GEM5-532 Change-Id: Ifee50d59c65f8b460248508688232d9253c040b6 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/32596 Reviewed-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Giacomo Travaglini <giacomo.travaglini@arm.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-08-14 18:26:41 +00:00
Giacomo Travaglini	4fbe6cf419	arch-arm: Use isSecure variable for Stage2Lookup TLB entries are tagged with the security state of the cpu instead of the security attribute of the physical address Change-Id: I728ba1c841de1ec6c1ee03aee012b185c968d078 Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Reviewed-by: Nikos Nikoleris <nikos.nikoleris@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/32639 Tested-by: kokoro <noreply+kokoro@google.com>	2020-08-14 13:07:41 +00:00
Giacomo Travaglini	0c28712f51	arch-arm: Fix physmem NS attribute in VMSAv8-32 descriptors The NS field in PTEs descriptors is tagging Secure/Non-secure physical memory (pages). This field is relevant in Secure state only: While in Secure state, software can access both the Secure and Non-secure physical address spaces, software in Non-secure state can only access Non-secure memory; the NS bit is hence discarded/treated as 1. This patch is aligning VMSAv8-32 with VMSAv8-64, which is tagging the pointed memory as Non-secure in case of a Non-secure lookup. The old behaviour was probably not leading to incorrect execution: once a translation completes, the security flag in the memory request is chcked against the security state of the cpu (and not only relying on the NS bit in the TLB entry) if (isSecure && !te->ns) { req->setFlags(Request::SECURE); } so we were already forbidding secure accesses from non secure world if NS = 0. It is however misleading in the debug logs to see tlb entries with NSTID = 1 and NS = 0. Change-Id: I1f964069f88c33fb14362dd4101cb22538907226 Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Reviewed-by: Nikos Nikoleris <nikos.nikoleris@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/32638 Reviewed-by: Richard Cooper <richard.cooper@arm.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-08-14 13:07:41 +00:00
Giacomo Travaglini	d3ec83ae3e	arch-arm: VSTTBR_EL2 doesn't contain a VMID field Change-Id: Ia6e14b509d7016020af9c85941e7b2d89dcdd359 Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Reviewed-by: Nikos Nikoleris <nikos.nikoleris@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/32637 Reviewed-by: Richard Cooper <richard.cooper@arm.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-08-14 13:07:41 +00:00
Giacomo Travaglini	1899320462	arch-arm: Disable HVC when SCR_EL3.HCE is 0 This was already implemented for AArch32 but it had been wrongly removed by: https://gem5-review.googlesource.com/c/public/gem5/+/31394 Change-Id: Ida303d5ccb5d8568ca4e7faaedf9b4efd1cd88b5 Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Reviewed-by: Nikos Nikoleris <nikos.nikoleris@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/32636 Tested-by: kokoro <noreply+kokoro@google.com>	2020-08-14 13:07:41 +00:00
Giacomo Travaglini	735cf323b5	arch-arm: Fix XN in TLB permissions The SIF condition check should be logically ORed with the TLB entry XN attribute, instead of overriding it. Change-Id: I70b38d97bbdc82b9f385d40ad06546785fc2c5bb Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Reviewed-by: Nikos Nikoleris <nikos.nikoleris@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/32635 Tested-by: kokoro <noreply+kokoro@google.com>	2020-08-14 13:07:41 +00:00
Giacomo Travaglini	31d118ffcc	arch-arm: Fix SoftwareStep::debugExceptionReturnSS debugExceptionReturnSS is called on an ERET instruction to check for software step. The method was not using the SPSR.width and it was relying on the more generic ELIs32 to check the execution mode of the destination EL. This is not only an efficiency problem: the helper might not work when returning to EL0. In general it is not possible to understand if EL0 is using AArch32 or AArch64 if the current EL is not EL0 and EL1 is using AArch64. This is instead visible by inspecting the spsr.width during the execution of an ERET instruction Change-Id: Ibc5a43633d0020139f2c0e372959a3ab4880da6e Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Reviewed-by: Richard Cooper <richard.cooper@arm.com> Reviewed-by: Nikos Nikoleris <nikos.nikoleris@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/32634 Tested-by: kokoro <noreply+kokoro@google.com>	2020-08-14 13:07:41 +00:00
Kyle Roarty	9f01f4fd6d	configs: Use proper keywordargs for RedirectPath in apu_se RedirectPath uses app_path and host_paths instead of src and dests. This patch fixes that in apu_se. The patch also changes the formatting for those lines, as simply replacing dests with host_paths put the lines over the 80 char limit. Change-Id: If7e4c41f2f52bc3d5aa26465c786294f9b68f8d3 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/32655 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-08-14 02:07:46 +00:00
Kyle Roarty	58a7be0d46	configs: Remove unneeded variable assignments in apu_se This patch removes: A line assigning a variable to itself An assignment to a variable (chroot) that is never used. The above assignment also caused an error, "'NoneType' object has no attribute 'startswith'" Change-Id: Ib93c25fee4a0f7c1440de8067b086d8b96614796 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/32654 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-08-14 02:07:11 +00:00
Kyle Roarty	e9ff182b31	configs: Remove remnants of /dev/shm mapping from apu_se This patch removes a redirect for /dev/shm. It also removes a function call that cleaned up the /dev/shm redirect Change-Id: Iec2598c715223d079bc5dfd2ea52859945706cfc Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/32354 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-08-14 02:06:54 +00:00
Kyle Roarty	b415e57aeb	util: Add build dependency to gcn Dockerfile src/base/pngwriter.cc requires libpng-dev Change-Id: I7f009cd8f5cacd64150c06b716b1ce3008832910 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/32474 Maintainer: Bobby R. Bruce <bbruce@ucdavis.edu> Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com>	2020-08-13 20:39:14 +00:00
Pouya Fotouhi	762153a421	mem-ruby: Fix debug prints for regular Stores In the updated implementation of LL/SC (27103) the default value of success was changed, which results in printing "SC_Failed" for any regular stores. Change-Id: I4f2e0b26233ce0cbdf948aadd19c9d81bf18bec0 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/32514 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-08-13 19:48:30 +00:00
Kyle Roarty	62ec973244	arch-gcn3: Free registers when execMask = 0 Flat instructions free some of their registers through their memory requests, in particuar a call to scheduleWriteOperandsFromLoad(), which gets called from GlobalMemPipeline::exec. When execMask is 0, the instruction doesn't issue a memory request. This patch adds in a call to scheduleWriteOperandsFromLoad() when execMask is 0 for Flat Load and AtomicReturn instructions, as those are the instructions that call scheduleWriteOperandsFromLoad() in the memory pipeline. This patch also adds in a missing return statement when execMask is 0 in one of the Flat instructions. Change-Id: I09296adb7401e7515d3cedceb780a5df4598b109 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/32234 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Reviewed-by: Anthony Gutierrez <anthony.gutierrez@amd.com> Maintainer: Anthony Gutierrez <anthony.gutierrez@amd.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-08-13 19:39:34 +00:00
Kyle Roarty	187c44fe44	mem-ruby: fix races between data and DMA in MOESI_AMD_Base-dir There are race conditions while running several benchmarks, where the DMA engine and the CorePair simultaneously send requests for the same block. This patch fixes two scenarios (a) If the request from the DMA engine arrives before the one from the CorePair, the directory controller records it as a pending request. However, once the DMA request is serviced, the directory doesn't check for pending requests. The CorePair, consequently, never sees a response to its request and this results in a Deadlock. Added call to wakeUpDependents in the transition from BDR_Pm to U Added call to wakeUpDependents in the transition from BDW_P to U (b) If the request from the CorePair is being serviced by the directory and the DMA requests for the same block, this causes an invalid transition because the current coherence doesn't take care of this scenario. Added transition state where the requests from DMA are added to the stall buffer. Updated B to U CoreUnblock transition to check all buffers, as the DMA requests were being placed later in the stall buffer than was being checked Change-Id: I5a76efef97723bc53cf239ea7e112f84fc874ef8 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/31996 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Reviewed-by: Bradford Beckmann <brad.beckmann@amd.com> Maintainer: Bradford Beckmann <brad.beckmann@amd.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-08-13 19:05:17 +00:00
Kyle Roarty	d542dc838e	arch-gcn3: make read2st64_b32 write proper registers Per the GCN3 ISA, read2st64_b32 writes to consecutive registers Change-Id: Ibc1672584a72cf7de12e06068a03fe304b34dce2 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/32236 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Reviewed-by: Alexandru Duțu <alexandru.dutu@amd.com> Reviewed-by: Bradford Beckmann <brad.beckmann@amd.com> Reviewed-by: Anthony Gutierrez <anthony.gutierrez@amd.com> Maintainer: Anthony Gutierrez <anthony.gutierrez@amd.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-08-13 19:04:36 +00:00
Sooraj Puthoor	c07a3548c4	gpu-compute: Fixing HSA's barrier bit implementation This changeset fixes several bugs in the HSA barrier bit implementation. 1. Forces AQL packet launch to wait for completion of all previous packets 2. Enforces barrier bit blocking only if there are packets pending completion 3. Barrier bit unblocking is correclty done by the last pending packet 4. Implementing barrier bit for all packets to conform to HSA spec Change-Id: I62ce589dff57dcde4d64054a1b6ffd962acd5eb8 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/30354 Reviewed-by: Sooraj Puthoor <puthoorsooraj@gmail.com> Maintainer: Anthony Gutierrez <anthony.gutierrez@amd.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-08-13 02:08:27 +00:00
Kyle Roarty	96cac38b3b	util: Install python six module in gcn dockerfile six is used in develop, but wasn't used in the GCN staging branch. Change-Id: Ic1ca42df871d1e683c288282497267d00421609f Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/32235 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-08-12 20:20:14 +00:00
Gabe Black	abedb7e7a4	arch: Remove the stacktrace.hh switching header file. This is no longer used. Change-Id: I1419b28d51ff603beb7d8ab89632ad7038c3057e Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/32215 Reviewed-by: Bobby R. Bruce <bbruce@ucdavis.edu> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-08-12 06:19:24 +00:00
seanzw	caee304a13	cpu-simple: Fix BaseSimpleCPU to reset group stats BaseSimpleCPU::resetStats() should call Stats::Group::resetStats() to reset new style hierarchy stats. Change-Id: I932280449b29577d214db56ac8347aca4143c949 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/32434 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-08-12 02:17:58 +00:00
Gabe Black	77d1168e78	util: Add stub unit tests for the call types in the m5 utility. These will be filled out in later changes. This CL just adds the plumbing to the build script. Change-Id: If58ea023d0c85eae0160f88217c83fca70346da2 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/27688 Maintainer: Gabe Black <gabeblack@google.com> Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com>	2020-08-12 01:37:09 +00:00
Gabe Black	38672ccfe4	util: Pull "usage()" out of the call types in the m5 utility. Also pull common implementations of some call type methods into the base class, and make disappearing call types clean themselves up to make the test a little simpler and less error prone. Change-Id: Ie178fe02d41587647ddc90a084d1d1142b84dde9 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/27687 Reviewed-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Reviewed-by: Daniel Carvalho <odanrc@yahoo.com.br> Maintainer: Gabe Black <gabeblack@google.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-08-12 01:36:53 +00:00
Giacomo Travaglini	bef04bca28	arch-arm: Reduce boilerplate when extracting SelfDebug from tc Change-Id: I1746400617be64ac9c2f3194442734e178342909 Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Reviewed-by: Richard Cooper <richard.cooper@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/31354 Tested-by: kokoro <noreply+kokoro@google.com>	2020-08-10 19:32:21 +00:00
Giacomo Travaglini	9ba0807ab4	dev-arm: Fix <timer>_CTL_EL<x>.ISTATUS when masking the irq According to the ArmArm: "When the value of the ENABLE bit is 1, ISTATUS indicates whether the timer condition is met. ISTATUS takes no account of the value of the IMASK bit. If the value of ISTATUS is 1 and the value of IMASK is 0 then the timer interrupt is asserted." Since ISTATUS is simply flagging that timer conditions are met, an interrupt mask (via the <timer>_CTL_EL<x>.IMASK) shouldn't reset the field to 0. Clearing the ISTATUS bit leads to the following problem as an example: 1) virtual timer (EL1) issuing a physical interrupt to the GIC 2) hypervisor handling the physical interrupt; setting the CNTV_CTL_EL0.IMASK to 1 before issuing the virtual interrupt to the VM 3) The VM receives the virtual interrupt but it gets confused since CNTV_CTL_EL0.ISTATUS is 0 (due to point 2) What happens when we disable the timer? "When the value of the ENABLE bit is 0, the ISTATUS field is UNKNOWN." So we are allowed to not clear the ISTATUS bit if the timer gets disabled Change-Id: I8eb32459a3ef6829c1910cf63815e102e2705566 Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Reviewed-by: Andreas Sandberg <andreas.sandberg@arm.com> Reviewed-by: Adrian Herrera <adrian.herrera@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/31775 Reviewed-by: Hsuan Hsu <kugwa2000@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-08-10 15:35:54 +00:00
Ciro Santilli	5ede3d6497	sim-se: don't wake up SE futex syscalls on ARM events Before this commit: * SEV events were not waking neither WFE (wrong) nor futex WAIT (correct) * locked memory events (LLSC) due to LDXR and STXR were waking up both WFE (correct) and futex WAIT (wrong) This commit fixes all wrong behaviours mentioned above. The fact that LLSC events were waking up futexes leads to deadlocks, as shown in the test case described at: https://gem5.atlassian.net/browse/GEM5-537 because threads woken up by SVE are not removed from the waiter list for the futex address they are sleeping on. A previous fix atttempt was done at: 1531b56d605d47252dc0620bb3e755b7cf84df97 in which only sleeping threads are woken up. But that is not sufficient, because the futex sleeping thread that was being wrongly woken up on SEV can start to sleep on a second futex. As an example, consider the case where 4 threads are fighting over two critical sections protected by futex1 and futex2 addresses. In this case, one thread wakes up the other thread after it is done with the section. Suppose the following sequence of events: * thread1 is awake and all others are suspended on futex1 * thread1 SEV wakes thread2 from the futex1 while in the critical region 1. This is the wrong behaviour that this patch prevents, because now thread2 is still in the sleeper list for futex1 * thread1 then futex wakes tread3, then proceeds to critical region 2. * thread3 wakes up, but because thread2 has critical region, it sleeps again. * thread2 finishes its work, futex wakes thread3, and then proceeds to futex2 When it reaches futex2, thread1 is still working there, so it sleeps on futex2. * thread3 futex wakes thread2, because it is still wrongly on the sleeper list of futex1. But thread2 is in futex2 now. If it weren't for this mistake, it should have awaken the final thread4 instead. Outcome: thread4 sleeps forever, no other thread ever wakes it, because all other threads have woken from futex1 and awoken another thread. The problem is fixed by adding the waitingTcs unordered_set FutexMap, which is basically an inverse map to FutexMap, which tracks (addr, tgid) -> ThreadContext. This allows us allow to quickly check if a given ThreadContext is waiting on a futex in any address. Then the SEV wakeup code path now checks if the thread is k Change-Id: Icec5e30b041f53e5aa3b6e0d291e77bc0e865984 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/29777 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Brandon Potter <Brandon.Potter@amd.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Brandon Potter <Brandon.Potter@amd.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-08-10 08:52:03 +00:00
Ciro Santilli	9c9fea575a	sim-se: factor out FutexMap::suspend and FutexMap::suspend_bitset Both methods do basically the same, especially since they don't handle the timeout which is basically the only difference between both modes of the syscall (one uses absolute and the other relative time). Remove the WaiterState::WaiterState(ThreadContext* _tc) constructor, since the only calls were from FutexMap::suspend which does not use them anymore. Instead, set the magic 0xffffffff constant as a parameter to suspend_bitset. Change-Id: I69d86bad31d63604657a3c71cf07e5623f0ea639 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/29776 Reviewed-by: Brandon Potter <Brandon.Potter@amd.com> Maintainer: Brandon Potter <Brandon.Potter@amd.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-08-10 08:52:03 +00:00
Ciro Santilli	aa219dba7a	sim-se: split futex_map.cc into header and source files To speed up development when modifying the implementation. Change-Id: I1b3c67c86f8faa38ed81a538521b08e256d21a5a Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/29775 Reviewed-by: Brandon Potter <Brandon.Potter@amd.com> Maintainer: Brandon Potter <Brandon.Potter@amd.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-08-10 08:52:03 +00:00
Matthew Poremba	7fede74467	arch-mips: Remove old TypeBufferArg call TypeBufferArg was replaced by VPtr so this call is no longer needed. This fixes the MIPS build / nightly build. Change-Id: I3880229fa0ad87fad1ca35c136e12efc6c36ceda Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/32414 Reviewed-by: Gabe Black <gabeblack@google.com> Maintainer: Gabe Black <gabeblack@google.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-08-08 17:15:21 +00:00
Ian Jiang	d60f649819	sim: Add checkpoint parameters for VMA list Add checkpoint parameters (together with corresponding serialization and unserialization) for VMA list of class MemState into a separate section named 'vmalist'. Without these VMA list parameters, a page table fault will occur when running with --restore-simpoint-checkpoint, because of an empty VMA list. For example: $ ./build/RISCV/gem5.debug --debug-flags=Exec configs/example/se.py \ -c tests/test-progs/hello/bin/riscv/linux/hello \ --cpu-type=NonCachingSimpleCPU --restore-simpoint-checkpoint \ --checkpoint-dir m5out/ -r 2 ... 2404000: system.switch_cpus: T0 : @_int_malloc+3392 : sd a5, 8(a0) \ : MemWrite : D=0x000000000001ed21 A=0x862e8 panic: Page table fault when accessing virtual address 0x862e8 ... Example checkpoint output: [system.cpu.workload.vmalist] size=3 [system.cpu.workload.vmalist.Vma0] name=stack addrRangeStart=... addrRangeEnd=... [system.cpu.workload.vmalist.Vma1] name=heap addrRangeStart=... addrRangeEnd=... [system.cpu.workload.vmalist.Vma2] ... Change-Id: Ib2fa7ad2c34fe667ce95bc4b10a1affcf60d9c1f Signed-off-by: Ian Jiang <ianjiang.ict@gmail.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/31875 Reviewed-by: Daniel Carvalho <odanrc@yahoo.com.br> Reviewed-by: Bobby R. Bruce <bbruce@ucdavis.edu> Reviewed-by: Alexandru Duțu <alexandru.dutu@amd.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-08-07 01:10:41 +00:00
Boris Shingarov	349cad7703	arch-power: Implement GDB XML target description for PowerPC Change-Id: I2610626a7e1464316ebaa770291d4bdcb59e8856 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/31114 Reviewed-by: Ciro Santilli <ciro.santilli@arm.com> Reviewed-by: Gabe Black <gabeblack@google.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-08-06 15:16:03 +00:00
Gabe Black	45d934725d	cpu: Remove the "profile" parameter and plumbing. This parameter is associated with a periodic event which would take a sample for a kernel profile in FS mode. Unfortunately the only ISA which had working versions of the necessary classes was alpha, and that has been deleted. That means that without additional work for any given ISA, the profile parameter has no chance of working. Ideally, this parameter should be moved to the Workload classes. There it can intrinsically be tied to a particular kernel, rather than having to assume a particular kernel and gate everything on whether you're in FS mode. Because this isn't (IMHO) where this parameter should live in the long term, and because it's currently unusable without additional development for each of the ISAs, I think it makes the most sense to remove the front end for this mechanism from the CPU. Since the sampling/profiling mechanism itself could be useful and could be re-plumbed somewhere else, the back end and its classes are left alone. Change-Id: I2a3319c1d5ad0ef8c99f5d35953b93c51b2a8a0b Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/32214 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Gabe Black <gabeblack@google.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-08-05 23:58:53 +00:00
Gabe Black	26e8d648b4	systemc: Adjust some type names in a couple tests. These names happened to collide with names from gem5 itself, and when linked together produced strange and incorrect results. Ideally gem5's names should go inside a gem5 namespace, but that's a much larger change. Change-Id: Ie7c5f2236678d5dbb722a86321296fce395fbd37 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/32175 Reviewed-by: Bobby R. Bruce <bbruce@ucdavis.edu> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-08-05 23:53:45 +00:00
Gabe Black	d621325db0	systemc: Filter a pydot warning message out when checking test output. This warning can show up if pydot isn't set up properly and doesn't have anything to do with the success of the test. Change-Id: Iddcea5aa27196bc5cf747bf5a295b6c9a91b3d2c Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/32174 Reviewed-by: Bobby R. Bruce <bbruce@ucdavis.edu> Maintainer: Bobby R. Bruce <bbruce@ucdavis.edu> Tested-by: kokoro <noreply+kokoro@google.com>	2020-08-05 23:53:36 +00:00
Gabe Black	7a94f5cbf7	scons,fastmodel: Limit how many instances of simgen can run at once. Each instance of simgen uses a license. If there are only so many to go around, running many instances at once could exhaust the pool of licenses and break the build. The number of licenses may be less than the number of regular build steps we want to do in parallel, but may be greater than zero. To limit them to at most n in parallel where n might be less than j and/or more than 1, we create a group of license slots, assign simgen invocations to a slot, and then use scons's side effect mechanism to ensure no two invocations in the same slot run at the same time. This may be a suboptimal packing if the commands take significantly different amounts of time to run since the slots are preallocated and not demand allocated, but the difference shouldn't normally matter in practice, and scons doesn't provide a better mechanism for partially serializing certain build steps. Change-Id: Ifae58b48ae1b989c1915444bf7564f352f042305 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/32124 Reviewed-by: Bobby R. Bruce <bbruce@ucdavis.edu> Maintainer: Gabe Black <gabeblack@google.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-08-05 23:53:27 +00:00

1 2 3 4 5 ...

15739 Commits