derek/gem5 - gem5 - Gitea: Git with a cup of tea

derek/gem5

Author	SHA1	Message	Date
Matthew Poremba	2b3beb92ff	dev-amdgpu,gpu-compute,configs: MI300X (#1141 ) Release of MI300X simulation capability: - Implements the required MI300X features over MI200 (currently only architecture flat scratch). - Make the gpu-compute model use MI200 features when MI300X / gfx942 is configured. - Fix up the scratch_ instructions which are seem to be preferred in debug hipcc builds over buffer_. - Add mi300.py config similar to mi200.py. This config can optionally use resources instead of command line args.	2024-05-17 09:26:04 -07:00
Alexander Richardson	716fe6d31d	arch-arm: Fix 32-bit semihosting ABI (#1142 ) It appears we have been trying to read 64-bit arguments for ARM32 since `695583709b`. I noticed that SYS_OPEN was trying to read a really long string as the pathname argument and it turned out it was reading from the wrong stack offset. With this change I can successfully run some of the semihosting tests for ARM32. Change-Id: Ie154052dac4211993fb6c4c99d93990123c2eacf	2024-05-16 10:28:45 -07:00
Alexander Richardson	6b34765d5d	arch-generic: Avoid out-of-memory errors for bad semihosting calls (#1143 ) In BaseSemihosting::readString() we were using the len argument to allocate a std::vector without checking whether the value makes any sense. This resulted in a std::bad_alloc exception being raised prior to https://github.com/gem5/gem5/pull/1142 for my semihosting tests. This commit prevents semihosting from reading more than 64K for string arguments which should be more than sufficient for any valid code. Change-Id: I059669016ee2c5721fedb914595d0494f6cfd4cd	2024-05-16 10:28:10 -07:00
Chong-Teng Wang	adb177dab6	arch-riscv: Fix vrgather instruction (#1134 ) This commit fixes the implementation of vrgather instruction based on rvv 1.0. In section 16.4. Vector Register Gather Instructions, > Vector-scalar and vector-immediate forms of the register gather are also provided. These read one element from the source vector at the given index, and write this value to the active elements of the destination vector register. The index value in the scalar register and the immediate, zero-extended to XLEN bits, are treated as unsigned integers. If XLEN > SEW, the index value is not truncated to SEW bits. The fix zero-extends the index value in the scalar register and the immediate.	2024-05-16 10:12:35 -07:00
Matthew Poremba	c1803eafac	arch-vega: Architected flat scratch and scratch insts Architected flat scratch is added in MI300 which store the scratch base address in dedicated registers rather than in SGPRs. These registers are used by scratch_ instructions. These are flat instruction which explicitly target the private memory aperture. These instructions have a different address calculation than global_ instructions. This change implements architected flat scratch support, fixes the address calculation of scratch_ instructions, and implements decodings for some scratch_ instructions. Previous flat_ instructions which happen to access the private memory aperture have no change in address calculation. Since scratch_ instructions are identical to flat_ instruction except for address calculation, the decodings simply reuse existing flat_ instruction definitions. Change-Id: I1e1d15a2fbcc7a4a678157c35608f4f22b359e21	2024-05-16 09:23:03 -07:00
Chong-Teng Wang	d48191d608	arch-riscv: Add RVV FP16 support (Zvfh & Zvfhmin) (#1123 ) Add support for the following two extensions: [Zvfh](https://github.com/riscv/riscv-v-spec/blob/master/v-spec.adoc#185-zvfh-vector-extension-for-half-precision-floating-point): Vector Extension for Half-Precision Floating-Point [Zvfhmin](https://github.com/riscv/riscv-v-spec/blob/master/v-spec.adoc#184-zvfhmin-vector-extension-for-minimal-half-precision-floating-point): Vector Extension for Minimal Half-Precision Floating-Point For instructions (`vfncvt[.rtz].x[u].f.w`) and (`vfwcvt.f.x[u].v`) which will become defined when `SEW = 8`, a new template `VectorFloatWideningAndNarrowingCvtDecodeBlock` is added and 8-bit floating point type (`float8_t`) is defined. The data type `float8_t` is introduced in the newer `3e` version of the SoftFloat Package, however, the current version in use is `3d` which does not include this definition. Despite this, `float8_t` is utilized solely for constructing the `vfncvt[.rtz].x[u].f.w` and `vfwcvt.f.x[u].v` instructions when `SEW = 8`. There are no operations that directly manipulate data of the `float8_t` type.	2024-05-16 08:37:00 -07:00
Ivana Mitrovic	10b24dc9a4	arch-arm: Implement FEAT_MPAM in CPU (#1082 ) This PR implements FEAT_MPAM on the CPU side. We define a MPAM system registers and a mechanism for tagging memory requests with the MPAM information bundle as specified in existing documentation [1]. What this PR is not covering is the MPAM implementation in a MSC (Memory System Component). Which means at the moment it's only possible to have static partitioning schemes (via the PartitioningPolicies already part of gem5) and there is currently no way to dynamically program partitions at runtime. [1]: https://developer.arm.com/documentation/ddi0487/latest/	2024-05-13 08:56:23 -07:00
Ivana Mitrovic	53245fa0e8	arch-riscv: Fix CSR instruction behavior 2nd attempts (#1099 ) Quote from change[1] > The RISC-V spec clarifies the CSR instruction operation, some of them shall not read or write CSR by the hints of RD/RS1/uimm, but the original version use the 'data != oldData' condition to determine whether write or not, and always read CSR first. See CSR instruction in spec: Section 9.1 Page 56 of https://github.com/riscv/riscv-isa-manual/releases/download/Ratified-IMAFDQC/riscv-spec-20191213.pdf \|\|\|Register operand\|\|\| \|--- \|--- \|--- \|--- \|--- \| \|Instruction\|rd is x0\|rs1 is x0\|Reads CSR\|Writes CSR\| \|CSRRW\|Yes\|-\|No\|Yes\| \|CSRRW\|No\|-\|Yes\|Yes\| \|CSRRS/CSRRC\|-\|Yes\|Yes\|No\| \|CSRRS/CSRRC\|-\|No\|Yes\|Yes\| \|\|\|Immediate operand\|\|\| \|Instruction\|rd is x0\|uimm = 0\|Reads CSR\|Writes CSR\| \|CSRRWI\|Yes\|-\|No\|Yes\| \|CSRRWI\|No\|-\|Yes\|Yes\| \|CSRRSI/CSRRCI\|-\|Yes\|Yes\|No\| \|CSRRSI/CSRRCI\|-\|No\|Yes\|Yes\| The issue cause the ubuntu hanging because we shared the same status CSR with `mstatus`, `sstatus` and `ustatus` and interrupt enabling CSR with mip, sip and uip. We may need to read origin CSR without effect of unmask bits to avoid override the bits of other CSR. Now the ubuntu can work after the patch merged. [1] https://gem5-review.googlesource.com/c/public/gem5/+/67717	2024-05-10 10:21:48 -07:00
Matthew Poremba	e3c2a322a1	arch-vega: Fix SDWA dst select (#1120 ) The destination select should take a value of the selection size (dword, word, or byte) starting at bit 0, move that to the selected destination, and then apply the unused constraint (DST_U) to the remaining word or bytes. Currently the code is selecting the word/byte currently being iterated over, rather than the least significant word/byte. As a result, any selection that is not word 0 or byte 0 will be replaced with the original destination value at those bits. This results in the wrong value. This commit changes the orig bits to be the original dest value at the lowest word / byte location. Tested with the mfma_i32_16x16x16i8 example which uses an SDWA V_OR_B32 to pack i8 values into VGPRs for the MFMA. Change-Id: I54ed819479a25fa9276d29a8f14f0fea7fd71afe	2024-05-10 08:49:13 -07:00
Chong-Teng Wang	8c4d5f8e27	arch-riscv: Fix narrowing/widening type-convert instructions (#1079 ) Correct ei calculation under VectorFloatWideningCvtFormat and VectorFloatNarrowingCvtFormat. Change-Id: I08699ffe3b9f8a7d4543023437626cc054344053	2024-05-09 10:17:15 -07:00
Roger Chang	c1713a0b18	arch-riscv: Fix CSR instruction behavior 2nd attempts Change-Id: Id0a9a374281445c7821863f0f74564857d3d8fa2	2024-05-07 20:32:56 +08:00
Roger Chang	1a81144985	arch-riscv: Move FCSR implementation to isa.cc Change-Id: I132edfe2c0ae4caecaa9e6209249662895b5c608	2024-05-07 20:32:56 +08:00
Matthew Poremba	6ed446e546	arch-x86: Add XCR0 register and add to X86KvmCPU (#1040 ) The extended control registers were not being updated in the KVM thread context nor updated in the KVM state. This was causing issues when checkpointing since the XCR0 value was reverting to the default value rather than what it was previously before the checkpoint. THis was causing multiple applications to crash due to executing instructions which are now illegal instructions due to XCR0 being incorrect. This commit adds the XCR0 as a misc register similar to the exiting x86 control registers and adds all of the helper functions to access and set the register value. It also adds support for updating the KVM CPU's state with the register value and updating the thread context's misc reg value so that it is checkpointed along with the other misc regs. Note that this does not add support for XSAVE of the AVX state (i.e., the upper 128 bits of YMM registers). It does however fix the immediate problem in issue #958 . Change-Id: I97456c8b57cbc7b381bd4be94944ce6567a43c76	2024-05-06 09:58:07 -07:00
Matthew Poremba	cb47755e15	gpu: Consolidated fixes for v24.0 (#1103 ) Includes fixes for several bugs reported via email, self found, and internal reports. Also includes runs through Valgrind and UBsan. See individual commits for more details.	2024-05-06 07:35:57 -07:00
Giacomo Travaglini	7c9925bafa	arch-generic: Fix reading from special :semihosting-features file (#1089 ) The implementation of SYS_FLEN was missing, which caused picolibc to treat this file as not implemented. Additionally, there was a bug in the SYS_READ call that was comparing the wrong variable against the passed buffer length. It was comparing the current file position against the buffer length instead of the number of written bytes. Finally, pos was unititialized which could result in spurious errors. Change-Id: I8b487a79df5970a5001d3fef08d5579bb4aa0dd0	2024-05-06 07:30:13 +01:00
Matthew Poremba	8249d6d1cd	arch-vega: Remove FP asserts in VOP3 lane manip insts The VOP3 instruction encoding generally states that ABS/NEG modifiers in the instruction encoding are only valid on floating point data types. This is currently coded in gem5 to mean floating point instructions. For untyped instructions like V_CNDMASK_B32, we don't actually know what the data type is. We must trust that the compiler did not attempt to apply these bits to non-FP data types. This commit simply removes the asserts. The ABS/NEG modifiers are therefore ignored which is consistent with the ISA documentation. This is done on the lane manipulation instructions V_CNDMASK_B32, V_READLINE_B32, and V_WRITELANE_B32 which are typically used to mask off or move data between registers. Other bitwise instructions (e.g., V_OR_B32) keep the asserts as bitwise operations on FP types are genernally illegal in languages like C++. Change-Id: I478c5272ba96383a063b2828de21d60948b25c8f	2024-05-03 14:31:17 -07:00
Matthew Poremba	0faa9510f9	arch-vega,gpu-compute: Fix misc ubsan runtime errors Three main fixes: - Remove the initDynOperandInfo. UBSAN errors and exits due to things not being captured properly. After a few failed attempts playing with the capture list, just move the lambda to a new method. - Invalid data type size for some thread mask instructions. This might actually have caused silent bugs when the thread id was > 31. - Alignment issues with the operands. Change-Id: I0297e10df0f0ab9730b6f1bd132602cd36b5e7ac	2024-05-03 14:26:46 -07:00
Yu-Cheng Chang	3a2a917a53	arch-riscv: Fix VCSR read behavoir (#1076 ) The VCSR should read the value with VXSAT and VXRM <table class="tableblock frame-all grid-all fit-content center"> <caption class="title">Table 40. vcsr layout</caption> <colgroup> <col> <col> <col> </colgroup> <thead> <tr> <th class="tableblock halign-right valign-top">Bits</th> <th class="tableblock halign-left valign-top">Name</th> <th class="tableblock halign-left valign-top">Description</th> </tr> </thead> <tbody> <tr> <td class="tableblock halign-right valign-top"><p class="tableblock">XLEN-1:3</p></td> <td class="tableblock halign-left valign-top"></td> <td class="tableblock halign-left valign-top"><p class="tableblock">Reserved</p></td> </tr> <tr> <td class="tableblock halign-right valign-top"><p class="tableblock">2:1</p></td> <td class="tableblock halign-left valign-top"><p class="tableblock">vxrm[1:0]</p></td> <td class="tableblock halign-left valign-top"><p class="tableblock">Fixed-point rounding mode</p></td> </tr> <tr> <td class="tableblock halign-right valign-top"><p class="tableblock">0</p></td> <td class="tableblock halign-left valign-top"><p class="tableblock">vxsat</p></td> <td class="tableblock halign-left valign-top"><p class="tableblock">Fixed-point accrued saturation flag</p></td> </tr> </tbody> </table> Change-Id: I1227b920da78026951dfa548e41c8cc56da6caac	2024-05-03 09:53:43 -07:00
Yu-Cheng Chang	8b885222b1	arch-riscv: Fix interrupt and status CSR behavoir (#1091 ) From sepc > Instructions that access a non-existent CSR are reserved. Attempts to access a CSR without appropriate privilege level raise illegal-instruction exceptions or, as described in Section 13.6.1, virtual-instruction exceptions. Attempts to write a read-only register raise illegal-instruction exceptions. A read/write register might also contain some bits that are read-only, in which case writes to the read-only bits are ignored. Setting the bit not in the mask should be ignore rather than raise the illegal exception. The unmask bits of xstatus CSR are `WPRI`, the unmasks bits of xie are `RO`(above priv v1.12) or `WPRI`(priv v1.11 and priv v1.10), the unmask bits of xip CSR are `RO`(above priv v1.12) or `WPRI`(priv v1.11) or `WIRI` (priv v1.10). Note: The workload of `riscv-ubuntu-20.04-boot` uses the priv v1.10. More details please see the `RISC-V spec: Privileged Architecture` v1.10: https://github.com/riscv/riscv-isa-manual/releases/tag/riscv-priv-1.10 v1.11(20190608): https://github.com/riscv/riscv-isa-manual/releases/tag/Ratified-IMFDQC-and-Priv-v1.11 v1.12(20211213): https://github.com/riscv/riscv-isa-manual/releases/tag/Priv-v1.12 Change-Id: I5d6e964e99b30b71da3dc267cd1575665d922633	2024-05-02 09:07:30 -07:00
Giacomo Travaglini	a6b20eae80	Merge branch 'develop' into semihosting-features-fix	2024-05-02 10:12:27 +01:00
Alexander Richardson	aa2fade12e	Drop unrelated change	2024-05-01 18:00:09 +01:00
Alexander Richardson	e7566448fa	arch-generic: More reliable special file name handling in semihosting (#1090 ) Currently, the filesRootDir is prepended for all paths that do not start with '/'. However, we should not be doing this for the special files :tt and :semihosting-features. Noticed this while testing semihosting with a non-empty filesRootDir. Change-Id: I156c8b680cb71cdc88788be3b0e93fc1d52e11e5	2024-05-01 17:41:55 +01:00
Alex Richardson	bb4c13143c	arch-generic: Fix reading from special :semihosting-features file The implementation of SYS_FLEN was missing, which caused picolibc to treat this file as not implemented. Additionally, there was a bug in the SYS_READ call that was comparing the wrong variable against the passed buffer length. It was comparing the current file position against the buffer length instead of the number of written bytes. Finally, pos was unititialized which could result in spurious errors. Change-Id: I8b487a79df5970a5001d3fef08d5579bb4aa0dd0	2024-04-30 16:28:06 -07:00
Yangyu Chen	666d1dd9a2	arch-riscv: Add Integer Conditional operations extension (Zicond) instructions (#1078 ) This PR added RISC-V Integer Conditional Operations Extension, which is in the RVA23U64 Profile Mandatory Base. And the performance of conditional move instructions in micro-architecture is an interesting point to explore. Zicond instructions added: czero.eqz, czero.nez Changes based on spec: https://github.com/riscvarchive/riscv-zicond/releases/download/v1.0.1/riscv-zicond_1.0.1.pdf	2024-04-30 05:44:45 -07:00
Giacomo Travaglini	b8e414f14d	arch-arm: Implement FEAT_MPAM With this patch we are adding the specific logic required to tag memory requests from the PE with MPAM data. This is happening within the MMU before a translation is completed Change-Id: Ifecb285244dd469a639150d69a7e884fe3c441be Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Reviewed-by: Richard Cooper <richard.cooper@arm.com>	2024-04-29 11:32:39 +01:00
Adrián Herrera	c988642ca8	arch-arm: Define system registers for FEAT_MPAM This patch is adding FEAT_MPAM register definition/decoding. Co-authored-by: Hristo Belchev <hristo.belchev@arm.com> Co-authored-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Change-Id: I70483fcc758419365f4b3762479684c6c52f4d62 Signed-off-by: Adrián Herrera <adrian.herrera@arm.com> Reviewed-by: Richard Cooper <richard.cooper@arm.com>	2024-04-29 11:32:39 +01:00
Giacomo Travaglini	65cf6b0a1c	arch-arm: Cache the highestEL in the ISA object This is for fast retrieval of the highest implemented exception level Change-Id: Id631c2b999d46a8b79570e4043ae04bc2b2e7531 Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Reviewed-by: Richard Cooper <richard.cooper@arm.com>	2024-04-29 10:38:08 +01:00
Alexander Richardson	1bb5d3b99e	arch-riscv: Add support for RISC-V semihosting (#681 ) See https://github.com/riscv-software-src/riscv-semihosting for the current specification. Almost all code is shared with the Arm implementation. Tested by running some binaries built with [picolibc](https://github.com/picolibc/picolibc).	2024-04-27 05:12:32 -07:00
Matthew Poremba	a6f2c8afdb	arch-x86: Add XCR0 register and add to X86KvmCPU The extended control registers were not being updated in the KVM thread context nor updated in the KVM state. This was causing issues when checkpointing since the XCR0 value was reverting to the default value rather than what it was previously before the checkpoint. THis was causing multiple applications to crash due to executing instructions which are now illegal instructions due to XCR0 being incorrect. This commit adds the XCR0 as a misc register similar to the exiting x86 control registers and adds all of the helper functions to access and set the register value. It also adds support for updating the KVM CPU's state with the register value and updating the thread context's misc reg value so that it is checkpointed along with the other misc regs. Note that this does not add support for XSAVE of the AVX state (i.e., the upper 128 bits of YMM registers). It does however fix the immediate problem in issue #958 . A checkpoint upgrader is also provided to add the default value of XCR0 if the checkpoint tag is missing. Change-Id: I97456c8b57cbc7b381bd4be94944ce6567a43c76	2024-04-25 11:24:53 -07:00
Giacomo Travaglini	a3d030d161	arch-arm: Add the FAR_EL* register accessor Use it accordingly in the faulting/exception logic Change-Id: I2f6360d04698b6fb7188e776f1d6966e99ce19b1 Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Reviewed-by: Richard Cooper <richard.cooper@arm.com>	2024-04-25 09:45:54 +01:00
Giacomo Travaglini	19628e746d	arch-arm: Add readRegister/writeRegister templates This is adding two templated functions for reading/writing system registers (MiscRegs). It is introducing them inside a new misc_regs namespace. Change-Id: I21233337c057673d46d1147971ebabbfc2c2bb6a Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Reviewed-by: Richard Cooper <richard.cooper@arm.com>	2024-04-25 09:45:00 +01:00
Giacomo Travaglini	bf78579fa5	arch-arm: Change the TlbTestInterface to accept a RequestPtr Now that the Request has been made an Extensible object, it can carry within itself much more data. It makes sense to pass it to the TlbTestInterface as more information about the table walk can be extracted from it. This is also aligning with the testTranslation utility which is expecting a request reference as first argument. Change-Id: I3dbc9a81d6b4bcc1801246ba7eb4136774d8f3c7 Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Reviewed-by: Andreas Sandberg <andreas.sandberg@arm.com> Reviewed-by: Richard Cooper <richard.cooper@arm.com>	2024-04-24 18:12:36 +01:00
Giacomo Travaglini	89323c5112	arch-arm: Group testTranslation and finalizeTranslation together They both make final checks to the VA->PA translation before relinquishing control back to the translate client (usually CPU code) Change-Id: Ib0a9da25404248c22c6a240817d2f50f0913fdf7 Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Reviewed-by: Andreas Sandberg <andreas.sandberg@arm.com> Reviewed-by: Richard Cooper <richard.cooper@arm.com>	2024-04-24 18:12:36 +01:00
Giacomo Travaglini	0c20eb3ec7	arch-arm: Call finalizePhysical even when MMU is off The finalizePhysical is just checking if the physical address falls within the m5op region (if using mmapped m5ops). There's not reason why we shouldn't enable it with virtual memory off Change-Id: I5ab80fd4e7886743abd4b7d85937b72253b578d3 Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Reviewed-by: Andreas Sandberg <andreas.sandberg@arm.com> Reviewed-by: Richard Cooper <richard.cooper@arm.com>	2024-04-24 18:12:36 +01:00
Giacomo Travaglini	a299d2db0c	arch-arm: Move testWalk check within the fetchDescriptor We also unify the fault handling logic; rather than cleaning up the WalkerState in several places scattered throughout the walking code, we handle faults in the top level method Change-Id: Ia22fb6f27044ff445fffbab228777a48efa473cb Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Reviewed-by: Richard Cooper <richard.cooper@arm.com>	2024-04-24 18:12:36 +01:00
Giacomo Travaglini	6d0cb6eaa3	arch-arm: Pull out Request generation from the TableWalker::Port Change-Id: Ie8c309bb79b4ce7c656428660c9e2effd58a89f0 Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Reviewed-by: Andreas Sandberg <andreas.sandberg@arm.com> Reviewed-by: Richard Cooper <richard.cooper@arm.com>	2024-04-24 18:12:36 +01:00
Giacomo Travaglini	e450cfef16	arch-arm: Move testWalk functionality to the TableWalker class It's more efficient to pass a reference of the tester to the TableWalkers. In this way a table walk check is tested directly from the walkers instead of going through the MMU every time. Change-Id: I9820dbabb8b551981005a65efa54a76b1a027541 Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Reviewed-by: Andreas Sandberg <andreas.sandberg@arm.com> Reviewed-by: Richard Cooper <richard.cooper@arm.com>	2024-04-24 18:12:36 +01:00
Giacomo Travaglini	bbe5bf2644	arch-arm: Simplify TableWalker::walk method Change-Id: Ib823b3b577a70f6ec14de854cb9c250faa04e932 Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Reviewed-by: Richard Cooper <richard.cooper@arm.com>	2024-04-24 18:12:36 +01:00
Giacomo Travaglini	9d9b7848bb	arch-arm: Properly compute EL even in stage2 walks This is done in order to differentiate between EL0 (unprivileged) and EL1. Effectively it won't change much as most of the decisions are now taken according to the translation regime which will be the same regardless (EL10) Change-Id: I218037e9c19cf638aff05c51869e439204d9af69 Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Reviewed-by: Richard Cooper <richard.cooper@arm.com> Reviewed-by: Andreas Sandberg <andreas.sandberg@arm.com>	2024-04-24 18:12:36 +01:00
Ivana Mitrovic	c44b8635ab	arch-x86: Movfp account for dataSize=4 (#1024 ) Movfp instruction did not account for only copying the lower half of src register if dataSize is 4. GitHub Issue: #893 I used the test code in issue #893 to verify the fix is working.	2024-04-18 10:36:00 -07:00
Lukas Zenick	01a5edc86e	arch-x86: Use mbits function for clarity Change-Id: I577ee55752f917e561e4c741ba7a19f0229318b5	2024-04-15 22:49:41 -05:00
Lukas Zenick	d67a7797d2	arch-x86: Movfp account for dataSize=4 Change-Id: I97e7a6f2738a57cad9907ddfe5c8030a26c147e8	2024-04-14 15:59:24 -05:00
Matthew Poremba	3db6e86fea	arch-vega: Fix string check warnings on fast build gem5.fast does not currently build if the GPU model is built. This fixes the array-bounds warnings allowing gem5.fast to build again. Change-Id: I463c2847c3ecfd2257a70418fa247090b0493f9b	2024-04-14 12:22:57 -07:00
Ivana Mitrovic	db1c336237	cpu,arch-arm,arch-riscv: adding new instruction types to RISC-V (#589 ) This commit adds more detailed instruction types for RISC-V Vector. Concretely, it substitutes VectorIntegerArith, VectorFloatArith, VectorIntegerReduce and VectorFloatReduce with more specific types related to the operation that each instruction (e.g., VectorIntegerAdd or VectorIntegerMult). Additionaly, fixes two RISC-V instruction types (VectorXXX) that were used in ARM SVE, placing the proper SimdXXX ones. Change-Id: I31774fa6a7cd249abfffec68d11d3d77f08ad70b CC @adriaarmejach	2024-04-11 10:15:56 -07:00
Yu-Cheng Chang	116c483a42	arch-riscv: Make c.flwsp destination register more maintainable (#1006 ) RISC-V C.FLWSP format: ![image](https://github.com/gem5/gem5/assets/32214817/f4c8d114-cd6b-4946-afff-fa752b31e337) The name FC1 and FD share the same bits, change to FC1 to make it better `ee6f1377d7/src/arch/riscv/isa/bitfields.isa (L110)` `ee6f1377d7/src/arch/riscv/isa/operands.isa (L84)` `ee6f1377d7/src/arch/riscv/isa/bitfields.isa (L85)` `ee6f1377d7/src/arch/riscv/isa/operands.isa (L76)`	2024-04-10 08:11:51 -07:00
Hoa Nguyen	bc3627d682	arch-riscv: Remove a tab character (#1010 ) Change-Id: Id54ae8ba37faba11cf4055ddaf7e6b99cfd51999 Signed-off-by: Hoa Nguyen <hn@hnpl.org>	2024-04-10 08:08:57 -07:00
Bobby R. Bruce	3af15a535e	mem-cache, configs, arch-arm: Handle partitioning policies through a PartitionManager (#966 ) This PR is offloading some of the partitioning logic to the partitioning manager, effectively changing the partitioning interface. Rather than always relying on the PartitionFieldExtention data structure to convey partition IDs, we make it implementation defined by introducing the partitioning manager abstraction. We want user to be able to extract the partitionId more flexibly and this requires using a SimObject. Users can extend the PartitioningManager, overriding the readPacketPartitionId, therefore providing their own mean of injecting/extracting partitioning data from a packet	2024-04-08 16:05:17 -07:00
Ivana Mitrovic	a8d778516d	arch-riscv,sim: m5ops argument / return fix for 32 bit RISC-V (#900 ) M5Ops C / C++ functions partially use 64 bit arguments and return value. In general, 64 bit arguments and return values are possible for 32 bit RISC-V systems as well, since the arguments and the return value is split into two registers. However, at the moment, this does not work for 32 bit RISC-V systems on the simulator side, since there is a one to one mapping between argument registers and m5op function parameters. To solve this problem, the get() function of the RISC-V reg_abi is updated. It now will merge two registers if there is a 64 bit argument. For this, the function code has to be passed to the get() function. The default value of this function code is set to 0xF00, since 0x00 is already used for M5_ARM. The parameter list of other get() functions for argument return is also extended by this function code parameter with the keyword [[maybe_unused]]. To enable a return value of size 64 bit, a0 is assigned with the lower 32 bit and a1 with the higher 32 bit. Related Issue: https://github.com/gem5/gem5/issues/881	2024-04-08 10:09:17 -07:00
Robert Hauser	841b821261	arch-riscv: fix c.fswsp source register (#998 ) RISC-V C.FSWSP format (RISC-V Unprivileged ISA V20191213, page 102): \|15-13\|12-7\|6-2\|1-0\| \|-------\|----\|----\|----\| \|funct3\|imm\|rs2\|op\| Source register is bit 2-6, not bit 20-24 `ee6f1377d7/src/arch/riscv/isa/bitfields.isa (L111)` `ee6f1377d7/src/arch/riscv/isa/operands.isa (L86)` `ee6f1377d7/src/arch/riscv/isa/bitfields.isa (L87)` `ee6f1377d7/src/arch/riscv/isa/operands.isa (L80)`	2024-04-08 08:41:11 -07:00
Yu-Cheng Chang	71b0b1f2b6	arch-riscv: Fix c.fsw source register (#1005 ) RISC-V C.FSW format: ![image](https://github.com/gem5/gem5/assets/32214817/31f46525-23e1-4b36-91ee-968f18b9d32a) Source register is bit 2-4, not bit 20-24 `ee6f1377d7/src/arch/riscv/isa/bitfields.isa (L112)` `ee6f1377d7/src/arch/riscv/isa/operands.isa (L88)` `ee6f1377d7/src/arch/riscv/isa/bitfields.isa (L87)` `ee6f1377d7/src/arch/riscv/isa/operands.isa (L80)`	2024-04-08 08:30:54 -07:00

1 2 3 4 5 ...

5917 Commits