derek/gem5 - gem5 - Gitea: Git with a cup of tea

derek/gem5

Author	SHA1	Message	Date
Erin Le	2db021b27b	mem: Comment removal and adding constexpr to is_secure bools This commit removes some comments and adds constexpr in front of "bool is_secure..." in pif.cc, signature_path.cc, and signature_path_v2.cc Change-Id: Icafe1d7c97d1d3fbf6abc12ba87ebb596255b96f	2024-08-05 15:43:40 -07:00
Erin Le	9adf44ed1f	mem: use is_secure instead of hardcoded false in prefetcher crash This modifies the crash fix so that the function calls that were modified use a local variables called `is_secure` instead of a hardcoded `false`. Some of these existed previously so it made more sense to use them, while others were newly added in to mark where the code might need to be changed later. Change-Id: I0c0d14b74f0ccf70ee5fe7c8b01ed0266353b3c1	2024-08-05 15:43:40 -07:00
Erin Le	b0756bedba	mem: Fix "Need is_secure arg" prefetcher crash This commit fixes the "Need is_secure arg" crash that occurs when using the IndirectMemoryPrefetcher, SignaturePathPrefetcher, SignaturePathPrefetcherV2, STeMSPrefetcher, and PIFPrefetcher. This was done by changing some variables to be AssociativeSet<...> instead of AssociativeCache<...> and changing the affected function calls. Change-Id: I61808c877514efeb73ad041de273ae386711acae	2024-08-05 15:43:40 -07:00
Yu-Cheng Chang	5df08fdb08	arch-riscv: Move pmpReset implementation to MMU::reset() (#1406 ) The PMP is part of RISC-V MMU subssystem, it should be put in RiscvISA::MMU::reset()	2024-08-05 14:21:48 -07:00
Matt Sinclair	edd73bd330	gpu-compute: fix typo in GPUMem debug print (#1412 ) The GPUMem print for when a memstatus request completes accidentally put a newline before the word "complete", causing complete to print on a newline and cause confusion. This commit resolves that.	2024-08-05 12:44:13 -07:00
Matt Sinclair	ba455e2025	gpu-compute: update GPUKernelInfo print to print WG number (#1413 ) Whenever a GPU kernel is launching a new WG, the GPUKernelInfo debug flag will print that the kernel is being launched, without the context of which WG from that kernel is being launched. This has caused some confusion to users, who think the entire kernel is being launched repeatedly. To resolve this confusion, update this print to make it clear which WG is being launched when this print is enabled.	2024-08-05 12:43:41 -07:00
Giacomo Travaglini	d2c8754ab3	mem: Fix name() helper for DRAM rank (#1410 ) At the moment the method simply returns the rank number. This is not particularly useful when enabling debug flags as the beginning of the line prints something like: 1: <debug_message> whereas it should really be: system.dram.rank1: <debug_message> Change-Id: I0136dc3d182afa4ae2e5a719cb366d8d0f444667 Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com>	2024-08-03 22:49:59 +01:00
Alexander Richardson	267817eaa1	arch-riscv: Fix implicit int-to-float conversion in .isa files (#1319 ) Explicitly convert to float/double to fix compiler warnings that I have turned on locally.	2024-07-31 04:24:54 -07:00
Erin (Jianghua) Le	2bfafa726f	sim: Add error message for kernel exceeding memory size (#1329 ) This commit adds an error message to src/sim/kernel_workload.cc to tell the user when the end address of the kernel is greater than the size of memory. The error message also specifies the minimum memory size needed to fit the kernel. Change-Id: I7d8f50889ed8172f64b84f98301a35e5f2f352d3	2024-07-30 19:39:41 -07:00
Yu-Cheng Chang	c13f895af0	arch,cpu: Implement generic reset method for MMU (#1342 ) Implementing generic reset method for MMU allows each ISA implementing their own reset methods. The default reset MMU method is flush all TLB entries. For example, The RISC-V needs to do PMP reset when received the reset signal, but the TLBs don't require to be flushed. Change-Id: I158261570fb6e5216ec105fbdc53460f83f88d15	2024-07-30 09:47:55 +01:00
Alexander Richardson	b64aa0b9b3	arch: Dump semihosting write buffer in debug output (#1389 ) This makes it easier to debug unexpected semihosting outputs (in my case a wrong buffer argument was being passed). Change-Id: I342610a92fb8efe121d030f7b9ea3307efc4fec3	2024-07-30 09:39:05 +01:00
Alexander Richardson	b23a4c7806	arch-arm: Add support for AArch32 PMEVCNTR/PMEVTYPER/PMCCFILTR (#1388 ) These registers were only handled in AArch64 mode but are also accessible as a c14 registers for AArch32. Change-Id: I62fe54427e96265df0589308afa1b5d665dbf210	2024-07-29 18:22:00 +01:00
Alexander Richardson	b51927e7a8	arch-arm: return 64-bit cycle counter for MISCREG_PMCCNTR (#1390 ) In AArch32 mode it is possible to read a 64-bit counter using mrrc. Instead of truncating in the PMU code, just allow the instruction implementation to truncate to 32 bits if accessed using mrc. Change-Id: I77620f6d1852a7d9e79c1ecee50f4297b4103b1c	2024-07-29 16:57:48 +01:00
Matthew Poremba	21f6e166b7	arch-vega: Panic on SDWAB / DPP VOPC unimplemented If SDWAB or DPP are used on a VOPC instruction and those are not implemented, it is highly likely to be a problem for the application. Rather than continue to execute and cause undefined behavior, exit the simulation with a panic showing the line of the instruction causing the issue. Change-Id: Ib3f94df7445d068b26907470c1f733be16cd2fc2	2024-07-25 16:18:14 -07:00
Matthew Poremba	b75fe56da5	arch-vega: Panic unimplemented SDWA/DPP for VOP1/VOP2 Add a panic if SDWA or DPP is used for an instruction which does not implement support for it. If an application uses SDWA or DPP it likely does not operate in the same way as the base instruction and therefore gem5 should panic rather than continue. It is likely data is incorrect which will make it more difficult to debug an application. Change-Id: I68ac448b0d62941761ef4efa0169f95796270f48	2024-07-25 16:18:14 -07:00
Matthew Poremba	6558821e2d	arch-vega: Add SDWAB for v_cmp_{eq,ne}_u16 This shows an example of how to use the previous commit which adds an SDWAB helper. The execute() method of both are the same with the exception of the lambda function passed to the helper method. Change-Id: I5ffe361440b4020b9f7669c0ed946aa6b3bbec25	2024-07-25 16:18:14 -07:00
Matthew Poremba	69338703e7	arch-vega: Implement SDWAB helper Implement a SDWAB helper which accepts a dynamic instruction and a lambda function defining a comparison function taking two values and returning a comparison result of 0 or 1 for false or true. Current instructions which implement SDWA do so on a per-instruction basis which adds a lot of redundant code. This allows for generic SDWAB implementations for VOPC instructions. All modifiers are implemented assuming that SDWBA VOPC instruction comparison types may be U32, I32, F32, U16, I16, F16 (which exist) but is extendible to I8, U8, or F8. Change-Id: Idab58a327c29dd19a1a5457237f3799a04f2031b	2024-07-25 16:18:13 -07:00
Matthew Poremba	a7bc4ca19a	arch-vega: Fix unconditional clamps in VOP3 (#1379 ) Some instructions are clamping floating point outputs unconditionally, leading to incorrect results. This commit finds instructions with this issue and checks the clamp bit before applying clamp. Change-Id: Ibc6de3813d81fd4f9d2c98dd497d19dd34cf6bde	2024-07-25 08:06:00 -07:00
Matthew Poremba	7dae1a1d25	arch-vega: Multiple SOPC fixes (#1366 ) Make S_CMP_LT_U32 use < instead of <=. Change types of EQ / LG for U64 to be U64. Change-Id: Ib0b3b7a46ba1aff16a6d439302ca087d988d6417	2024-07-23 12:45:52 -07:00
Ivana Mitrovic	82c91e8edb	arch-riscv: Improve widening/narrowing vectors overlap check (#1331 ) This PR improves the vector register groups overlap check in widening/narrowing instructions. - Fix wrong illegal overlap condition between VS2 and VD vector register groups. - Also check VS1 vector register group for overlap with VD in vector-vector instructions. - Parametrize widening/narrowing factors in overlap check function to potentially handle more cases. Fixes issue #442.	2024-07-22 10:54:02 -07:00
Erin (Jianghua) Le	b6f8ecb1be	python: move cache coherence protocol check above imports (#1360 ) This commit moves the requires() call that checks the cache coherence protocol above the imports. This change was made for the chi private l1, ruby mesi three level, mesi two level, and mi example cache hierarchies. This ensures that a clear error message about having the wrong coherence protocol is printed, rather than a less useful message. Change-Id: I3bac1ffcb1f8a9d94e486237f880cf248e442ba8	2024-07-22 09:34:04 -07:00
Alexander Richardson	fc59109429	arch,arch-arm: Fix remaining implicit float conversion warnings in .isa (#1327 ) This fixes the remaining implicit int/float conversions and enables the float conversion warnings for clang when building the Arm instruction execution logic. This depends on the previous fixes. Change-Id: I51aac94644a483175842c36da2d49d308aaceb49	2024-07-18 10:43:12 -07:00
Erin (Jianghua) Le	aaa6566548	mem: Change long in src/mem/physical.cc to int64_t (#1275 ) This changes `long`s in src/mem/physical.cc, which are 32 bits or more, to `uint64_t`s, which are exactly 64 bits. Change-Id: I64e089a2ac087bcf58b9c3c918c59dc5ff75d010	2024-07-18 10:12:24 -07:00
Robert Hauser	9b8c84cb5d	arch-riscv: Overwrite getEMI() for timing expr (#1346 ) TimingExpression enables runtime calculation of the commit latency in MinorCPU. For this, machInst is obtained by getEMI() to match it with a given instruction. At default, getEMI() always returns 0 and is therefore overwritten to enable timing expressions for RISC-V. This was already done for ARM (see src/arch/arm/insts/static_inst.hh). Change-Id: I03d669b3439fd24e00cbf893f5db9951dfe56b1f Signed-off-by: Robert Hauser <robert.hauser@uni-rostock.de>	2024-07-12 20:52:24 -07:00
Robert Hauser	5e5e8fb9c6	arch-riscv: Update local interrupts citation (#1347 ) Updated the bib information of the local RISC-V interrupts. Change-Id: I666c3df4529e159bd1946ca1a9623e47f84d5d9e Signed-off-by: Robert Hauser <robert.hauser@uni-rostock.de>	2024-07-12 20:51:49 -07:00
Tommaso Marinelli	e3b41291da	arch-riscv: Check VS1 group for overlap when widening/narrowing Currently, only the VS2 register group is checked for overlap with VD when executing a widening/narrowing instruction. This commits extends the check to VS1, when applicable (i.e. vector-vector operations). Change-Id: I892b7717c01e25546fb41e05afbd08fc40c60c59	2024-07-12 01:17:14 +00:00
Tommaso Marinelli	a8b7e9727d	arch-riscv: Generalize widening/narrowing vectors overlap check As of now, the widening/narrowing vector register groups overlap check always assumes a SEW multiplication factor equal to 2 (for either VD or VS2). This commits aims at making this check more generic. Change-Id: I4311fc3624cd324ccfdf2a1920a19efc85357120	2024-07-12 01:17:14 +00:00
Tommaso Marinelli	5b693fd8b6	arch-riscv: Remove duplicate line Change-Id: I32200aad5a59c9fd85f6ed783a4cebb841bf6ff1	2024-07-12 01:17:14 +00:00
Tommaso Marinelli	fbe6985365	arch-riscv: Fix widening instructions vectors overlap check This commit fixes the overlap check between VS2 and VD register groups in vector widening instructions. While the narrowing instructions check is correct, the widening one has to differentiate between two cases (Vs2 EEW = 2*SEW and Vs2 EEW = SEW). In the first case, overlap is allowed, as the EEW is the same as Vd. In the second case, the overlap legality check has to be adapted to use the Vs2 EMUL to calculate the boundaries. The rule has been derived again from Section 5.2 of RISC-V "V" Vector Extension specifications, version 1.0. The patch also includes some small code refactoring, e.g. using already defined vlmul and constants for vector operands. Fixes issue #442. Change-Id: Ic87095fb9079e6c8f53b9a0d79fbf531a85dc71d	2024-07-12 01:17:14 +00:00
Saúl	8dde32d2dc	arch-riscv: fix initialization for some vector reduction insts (#1340 ) Vector reduce float (widening and non-widening) and integer (widening) instructions initialize the reduce loop operation with the first element of the destination register (i.e. `Vd[0]`). Since all reductions per spec seem to be `Vd[0] = Vs1[0] + Vs2[]` (where `+` is an arbitrary binary op and `` indicates all active elements) gem5 will calculate this incorrectly if `Vd[0]` and/or `Vs1[0]` are non-neutral for the operation (the later case being because it's not taken into account at all). To solve this we just have to initialize the reduction loop to `Vs1[0]` (the non-widening integer reduction already does this).	2024-07-10 22:08:49 -07:00
Yu-Cheng Chang	ce8db85867	cpu: Add cpuIdlePins to indicate the threadContext of CPU is idle (#1285 ) If the threacContext of CPU enters the suspend mode, raise the threadID of threadContext cpu_idle_pins with the high signal to target. If the threadContext of CPU enters the activate mode, lower the threadID of thread cpu_idle_pins with low signal to target.	2024-07-10 10:36:37 +01:00
Yu-Cheng Chang	d54dcac393	arch-riscv: Fix setRegs from GDB failed after #1099 (#1291 ) The gem5 crashed when user try to update register value from GDB because PR[1] changes the index of CSR_XSTATUS to MISCREG_XSTATUS, which is out of NUM_PHYS_MISCREGS. The CSR_XSTATUS should use setRegWithMask to update it. [1] : https://github.com/gem5/gem5/pull/1099 gem5 issue: https://github.com/gem5/gem5/issues/1299 Change-Id: Iefc0d1f5adfb98ecfda0e74907964b47d1864b6d	2024-07-09 15:55:35 -07:00
Jason Lowe-Power	d20512c291	arch-riscv: add agnostic option to vector tail/mask policy for mem and arith instructions (#1135 ) These two commits add agnostic capability for both tail/mask policies, for vector memory and arithmetic instructions respectively. The common policy for instructions is to act as undisturbed if one is (i.e. tail or mask), or write all 1s if none. For those instructions in which multiple micro instructions are instantiated to write to the same register (`VlStride` and `VlIndex` for memory, and `VectorGather`, `VectorSlideUp` and `VectorSlideDown` for arithmetic), a (new) micro instruction named `VPinVdCpyVsMicroInst` has been used to pin the destination register so that there's no need to copy the partial results between them. This idea is similar to what's on ARM's SVE code. This micro also implements the tail/mask policy for this cases. Finally, it's worth noting that while now using an agnostic policy for both tail/mask should remove all dependencies with old destination registers, there's an exception with `VectorSlideUp`. The `vslideup_{vx,vi}` instructions need the elements in the offset to be unchanged. The current implementation overrides the current vta/vma and makes them act as undisturbed, since they require the old destination register anyways. There's a minor issue with this though, as `v{,f}slide1up` variants do not need this, but since they share the same constructor, will act all the same. Related issue #997.	2024-07-08 11:47:11 -07:00
Robert Hauser	77528d1928	systemc: Use headerDelay in timing annotation (#1328 ) 1. Responder (downstream components): When sending a BEGIN_REQ, the timing annotation marks the time when a transaction is visible to the target (see [1] on page 465). When writing the data, the downstream component calculates the transfer time and would send END_REQ after this time (see [1] on page 540). Therefore, not the payloadDelay, but the headerDelay should be used, as already written as a comment in the source files. When reading data, payloadDelay will be 0 anyway. 2. Requester (upstream component): For data read, the begin of the transfer is marked by BEGIN_RESP and the upstream component would delay END_RESP to model the data transfer (see [1] on page 540). Therefore, BEGIN_RESP should be delayed by the headerDelay, not the payloadDelay. [1] "IEEE Standard for Standard SystemC® Language Reference Manual," in IEEE Std 1666-2023 (Revision of IEEE Std 1666-2011), vol., no., pp.1-618, 8 Sept. 2023, doi: 10.1109/IEEESTD.2023.10246125. Change-Id: I3b5e8ad6bc37cbb309b124efdc8764fca3728b7a Signed-off-by: Robert Hauser <robert.hauser@uni-rostock.de>	2024-07-05 09:05:24 -07:00
Giacomo Travaglini	d825103df2	arch-arm: Implement FEAT_TTST (#1323 ) Implement small translation table extension. This feature relaxes the lower limit on the size of the translation tables, by increasing the maximum permitted values of the T1SZ and T0SZ field in: TCR_EL1, TCR_EL2, TCR_EL3,VTCR_EL2 and VSTCR_EL2 Change-Id: I4c2187815b2d7f14407edb38095c6bcc2004b62a Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com>	2024-07-04 09:37:41 +01:00
Giacomo Travaglini	c9d9108978	arch-arm: MISCREG_AT_S1E2R/W are executable from S state (#1322 ) Change-Id: Ieaebdf0d62b5115f8085f478b2da105633b6a26a Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com>	2024-07-04 09:37:17 +01:00
Giacomo Travaglini	f3e3c60805	arch-arm: Proper support for NonSecure IPA space in Secure state Change-Id: Ie2e2278ecdc5213db74999e3561b2918937c2c2e Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com>	2024-07-02 13:16:13 +01:00
Giacomo Travaglini	eb400e773b	arch-arm: Remove makeStage2 from TLBIOp Change-Id: I25276e4b5b7c491e69208044ceb193c67ddfd91c Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com>	2024-07-02 13:15:49 +01:00
Giacomo Travaglini	49ca08b01a	arch-arm: Add isStage2 qualifier to the LongDecriptor We are currently using the LongDecriptor for both stage1 and stage2 translations. There are several cases where the bitfield meaning changes depending on the translation stage. Change-Id: Ic33d9ef225a57fd79ce2b4bf47896aeb6bdd8d9c Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com>	2024-07-02 13:15:31 +01:00
Giacomo Travaglini	9cce68ca71	arch-arm: Replace isSecure boolean with SecurityState enum Change-Id: If01b8b2811b2c028e669ea3700174c7945b07a06 Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com>	2024-07-02 12:45:24 +01:00
Alexander Richardson	d5c0383887	arch-arm: support 64-bit PMCCNTR from AArch32 (#1304 ) For ARMv8 CPUs this register allows reading a 64-bit cycle counter in from 32-bit execution state. Change-Id: I7cd9e2711ada5156920440cc3c89e7a74ca54a49	2024-07-02 08:59:44 +01:00
Giacomo Travaglini	b28659d4f9	arch-arm: Implement FEAT_XS (#1303 ) This patch is adding a functional implementation of FEAT_XS. Unless we operate with DVM enabled, TLBIs broadcasting is accomplished in 0 time; so there is no timing benefit introduced by enabling FEAT_XS other than the way it affects TLB management (invalidation) Change-Id: I067cb8b7702c59c40c9bbb8da536a0b7f3337b5d Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com>	2024-07-02 08:52:59 +01:00
Matt Sinclair	04a3fd5b5d	gpu-compute,mem-ruby: Add RubyHitMiss flag for TCP and TCC cache (#1260 ) Add hit and miss print for TCP and TCC cache with RubyHitMiss debug flag Change-Id: I40ae3449020b917f39ac91d29fa4e1dd7c791e7b	2024-06-30 13:32:01 -05:00
Bobby R. Bruce	b3f23830c9	misc: Update versioning for develop branch Develop for v24.1 Change-Id: I4ef34c4a4ef67d171505ff9380746ae193655305	2024-06-27 23:36:07 -07:00
Bobby R. Bruce	6fcc13cf55	misc: Merge branch stable into develop This guarantees all changes put on the staging branch and, for whatever reason, put on stable are on develop. This syncs the branches. Change-Id: Ib3513f49977bb4ed3046c2d9d6cf162953b15887	2024-06-27 23:27:21 -07:00
Harshil Patel	3acb6e59cf	resources: Update elfie.py to work with obtain_resources (#1289 ) Change-Id: I08c5e50a150c8434c6c2ca36af81fb6ec3915af8	2024-06-27 20:02:57 -07:00
Jarvis Jia	f56571fed9	Merge branch 'develop' into rubyhitmiss	2024-06-27 21:45:08 +08:00
Rajesh Shashi Kumar	3ce5e0584a	arch-arm: This commit fixes a typo in the ARM ldaddalx instruction (#1279 ) The acquire-release flavor of the ldadd instruction should read ldaddalx (eg. ldaddalb/ldaddalh) according to specification. However, this is currently noted as ldadd"la"x (eg. ldaddlab/ldaddlah). Issue: https://github.com/gem5/gem5/issues/1224 Change-Id: Ib932fa0e572207729c923c27f24c34cc21dff0e5 Co-authored-by: Bobby R. Bruce <bbruce@ucdavis.edu>	2024-06-26 09:03:50 -07:00
Harshil Patel	e0d03fbc2f	resources: fix check for additional_params for workloads Change-Id: I0a4b5f0eef6e2f9faf35cea8130572a066aab6cd	2024-06-26 07:13:04 -07:00
Harshil Patel	144a2071fe	resources: fix check for additional_params for workloads Change-Id: I0a4b5f0eef6e2f9faf35cea8130572a066aab6cd	2024-06-25 16:30:07 -07:00

... 3 4 5 6 7 ...

15466 Commits