derek/gem5 - gem5 - Gitea: Git with a cup of tea

derek/gem5

Author	SHA1	Message	Date
Robert Hauser	5e5e8fb9c6	arch-riscv: Update local interrupts citation (#1347 ) Updated the bib information of the local RISC-V interrupts. Change-Id: I666c3df4529e159bd1946ca1a9623e47f84d5d9e Signed-off-by: Robert Hauser <robert.hauser@uni-rostock.de>	2024-07-12 20:51:49 -07:00
Saúl	8dde32d2dc	arch-riscv: fix initialization for some vector reduction insts (#1340 ) Vector reduce float (widening and non-widening) and integer (widening) instructions initialize the reduce loop operation with the first element of the destination register (i.e. `Vd[0]`). Since all reductions per spec seem to be `Vd[0] = Vs1[0] + Vs2[]` (where `+` is an arbitrary binary op and `` indicates all active elements) gem5 will calculate this incorrectly if `Vd[0]` and/or `Vs1[0]` are non-neutral for the operation (the later case being because it's not taken into account at all). To solve this we just have to initialize the reduction loop to `Vs1[0]` (the non-widening integer reduction already does this).	2024-07-10 22:08:49 -07:00
Yu-Cheng Chang	d54dcac393	arch-riscv: Fix setRegs from GDB failed after #1099 (#1291 ) The gem5 crashed when user try to update register value from GDB because PR[1] changes the index of CSR_XSTATUS to MISCREG_XSTATUS, which is out of NUM_PHYS_MISCREGS. The CSR_XSTATUS should use setRegWithMask to update it. [1] : https://github.com/gem5/gem5/pull/1099 gem5 issue: https://github.com/gem5/gem5/issues/1299 Change-Id: Iefc0d1f5adfb98ecfda0e74907964b47d1864b6d	2024-07-09 15:55:35 -07:00
Jason Lowe-Power	d20512c291	arch-riscv: add agnostic option to vector tail/mask policy for mem and arith instructions (#1135 ) These two commits add agnostic capability for both tail/mask policies, for vector memory and arithmetic instructions respectively. The common policy for instructions is to act as undisturbed if one is (i.e. tail or mask), or write all 1s if none. For those instructions in which multiple micro instructions are instantiated to write to the same register (`VlStride` and `VlIndex` for memory, and `VectorGather`, `VectorSlideUp` and `VectorSlideDown` for arithmetic), a (new) micro instruction named `VPinVdCpyVsMicroInst` has been used to pin the destination register so that there's no need to copy the partial results between them. This idea is similar to what's on ARM's SVE code. This micro also implements the tail/mask policy for this cases. Finally, it's worth noting that while now using an agnostic policy for both tail/mask should remove all dependencies with old destination registers, there's an exception with `VectorSlideUp`. The `vslideup_{vx,vi}` instructions need the elements in the offset to be unchanged. The current implementation overrides the current vta/vma and makes them act as undisturbed, since they require the old destination register anyways. There's a minor issue with this though, as `v{,f}slide1up` variants do not need this, but since they share the same constructor, will act all the same. Related issue #997.	2024-07-08 11:47:11 -07:00
Giacomo Travaglini	d825103df2	arch-arm: Implement FEAT_TTST (#1323 ) Implement small translation table extension. This feature relaxes the lower limit on the size of the translation tables, by increasing the maximum permitted values of the T1SZ and T0SZ field in: TCR_EL1, TCR_EL2, TCR_EL3,VTCR_EL2 and VSTCR_EL2 Change-Id: I4c2187815b2d7f14407edb38095c6bcc2004b62a Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com>	2024-07-04 09:37:41 +01:00
Giacomo Travaglini	c9d9108978	arch-arm: MISCREG_AT_S1E2R/W are executable from S state (#1322 ) Change-Id: Ieaebdf0d62b5115f8085f478b2da105633b6a26a Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com>	2024-07-04 09:37:17 +01:00
Giacomo Travaglini	f3e3c60805	arch-arm: Proper support for NonSecure IPA space in Secure state Change-Id: Ie2e2278ecdc5213db74999e3561b2918937c2c2e Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com>	2024-07-02 13:16:13 +01:00
Giacomo Travaglini	eb400e773b	arch-arm: Remove makeStage2 from TLBIOp Change-Id: I25276e4b5b7c491e69208044ceb193c67ddfd91c Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com>	2024-07-02 13:15:49 +01:00
Giacomo Travaglini	49ca08b01a	arch-arm: Add isStage2 qualifier to the LongDecriptor We are currently using the LongDecriptor for both stage1 and stage2 translations. There are several cases where the bitfield meaning changes depending on the translation stage. Change-Id: Ic33d9ef225a57fd79ce2b4bf47896aeb6bdd8d9c Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com>	2024-07-02 13:15:31 +01:00
Giacomo Travaglini	9cce68ca71	arch-arm: Replace isSecure boolean with SecurityState enum Change-Id: If01b8b2811b2c028e669ea3700174c7945b07a06 Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com>	2024-07-02 12:45:24 +01:00
Alexander Richardson	d5c0383887	arch-arm: support 64-bit PMCCNTR from AArch32 (#1304 ) For ARMv8 CPUs this register allows reading a 64-bit cycle counter in from 32-bit execution state. Change-Id: I7cd9e2711ada5156920440cc3c89e7a74ca54a49	2024-07-02 08:59:44 +01:00
Giacomo Travaglini	b28659d4f9	arch-arm: Implement FEAT_XS (#1303 ) This patch is adding a functional implementation of FEAT_XS. Unless we operate with DVM enabled, TLBIs broadcasting is accomplished in 0 time; so there is no timing benefit introduced by enabling FEAT_XS other than the way it affects TLB management (invalidation) Change-Id: I067cb8b7702c59c40c9bbb8da536a0b7f3337b5d Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com>	2024-07-02 08:52:59 +01:00
Rajesh Shashi Kumar	3ce5e0584a	arch-arm: This commit fixes a typo in the ARM ldaddalx instruction (#1279 ) The acquire-release flavor of the ldadd instruction should read ldaddalx (eg. ldaddalb/ldaddalh) according to specification. However, this is currently noted as ldadd"la"x (eg. ldaddlab/ldaddlah). Issue: https://github.com/gem5/gem5/issues/1224 Change-Id: Ib932fa0e572207729c923c27f24c34cc21dff0e5 Co-authored-by: Bobby R. Bruce <bbruce@ucdavis.edu>	2024-06-26 09:03:50 -07:00
Saúl Adserias	99f58d37da	arch-riscv: add agnostic opt to vector tail/mask for arith insts Change-Id: I693b5f3a6cc8a8f320be26b214fd9b359e541f14	2024-06-24 10:03:52 -07:00
Saúl Adserias	73c364519a	arch-riscv: add agnostic opt to vector tail/mask for mem insts Change-Id: I567a110806b77d5576810706bd3e30185b0e0b75	2024-06-24 10:03:52 -07:00
Jason Lowe-Power	013f773d31	arch-riscv: Fix TLB lookup with vaddrs (#1264 ) Previously, all of the TLB lookup/insert functions were using the full virtual addresses even though the variables in the functions said "vpn." This change explicitly converts the virtual address to the VPN without any least significant zeros for the offset. I.e., vpn >> page_size. The main bug solved in this changeset is the asid was \|'d with the upper bits of the virtual address, but sometimes there were all 1's. Therefore, you could get a TLB hit even if the ASID was different. Interestingly, the page that seemed to cause these issues was a 1 GiB page. This change also starts refactoring some of the page table details to support sv46 and sv57 page table formats. In my testing, the Linux kernel boot uses large pages (even OpenSBI uses large pages), so it seems that large pages also work. However, this seems like magic to me, so I'm not sure if it's correct. This change also updates some asserts, and debug statements with more useful debugging information. Partially fixes #1235. More testing needs to be done to be confident.	2024-06-20 13:24:50 -07:00
Hoa Nguyen	15e0236a8b	arch,cpu,sim: Add mechanism to partially print vector regs (#1234 ) Currently, gem5's inst tracer prints the whole vector register container by default. The size of vector register containers in gem5 is the maximum size allowed by the ISA. For vector-length agnostic (VLA) vector registers, this means ARM SVE vector container is 2048 bits long, and RISC-V vector container is 65535 bits long. Note that VLA implementation in gem5 allows the vector length to be varied within the limit specified by the ISAs. However, in most use cases of gem5, the vector length is much less than 65535 bits. This causes two issues: (1) the vector container requires allocating and moving around a large amount of unused data while only a fraction of it is used, and (2) printing the execution trace of a vector register results in a wall of text with a small amount of useful data. This change addresses the problem (2) by providing a mechanism to limit the amount data printed by the instruction tracer. This is done by adding a function printing the first X bits of a vector register container, where X is the vector length determined at runtime, as opposed to the vector container size, which is determined at compilation time. Change-Id: I815fa5aa738373510afcfb0d544a5b19c40dc0c7 --------- Signed-off-by: Hoa Nguyen <hn@hnpl.org>	2024-06-17 14:05:47 -07:00
Hoa Nguyen	500da4306b	arch: Mark FailUnimplemented instructions as Invalid instructions (#1247 ) This is a follow-up on the discussion here [1]. The IsInvalid flag was previously defined as an instruction that does not appear in the ISA. However, a micro-architecture can choose to not recognize an instruction in and raise illegal instruction fault even if the instruction is in the ISA. This change modifies the definition of a Invalid instruction such that, if a StaticInst instruction is marked as IsInvalid, it means the instruction is not recognized by the decoder. This means that any instruction recognized by the decoder are not invalid, even if the instruction is not in the official ISA spec; e.g., m5 pseudo-instructions. Note that instructions that are recognized by the decoder but are chosen to act as a nop are not invalid. This applies to WarnUnimplemented instructions, e.g. hint instructions. [1] https://github.com/gem5/gem5/pull/1071 Change-Id: I1371b222d8b06793d47f434d0f148c5571672068 Signed-off-by: Hoa Nguyen <hn@hnpl.org>	2024-06-17 12:44:05 -07:00
Matthew Poremba	2f5842d253	arch-vega: Add valid flag to ds_swizzle_b32 Currently the flag is just Load and there is a long comment explaining why. This does not meet any of the scoreboard check requirements: https://github.com/gem5/gem5/blob/develop/src/gpu-compute/scoreboard_check_stage.cc#L230-L241 Add a generic ALU flag as well so the instruction executes instead of panicking. Change-Id: I54b2d20d47fad5e8f05f927328433aab7db7d862	2024-06-15 14:28:59 -07:00
Matthew Poremba	42369eab2c	arch-vega: Implement MI300 FLAT SVE bit For scratch instructions only, this bit specifies if an offset in a VGPR should be used for address calculation. This is new in MI300 and was previously the LDS bit. The LDS bit is rarely used and in fact gem5 does not even check this bit. This fixes a bug when SADDR == 0x7f (i.e., no SGPR should be used) where a VGPR was being added to the address when it should have been ignored. Change-Id: I9864379692df6795b25b58b98825da05d18fc5db	2024-06-15 14:28:59 -07:00
Matthew Poremba	1dab4be002	arch-vega: Implement VOP3 V_FMAC_F32 A version of V_FMAC_F32 with extra modifiers from VOP3 format. Change-Id: Ib6b41b0a3ceb91269b91a0287dfc94bc73e4d217	2024-06-15 14:28:58 -07:00
Matthew Poremba	f91d14fe46	gpu-compute: Add MFMA stats (#1248 ) Add dynamic instruction counts for MFMAs. Change-Id: I976b01344577cf011aeb3dd648a8c0017281c4e3	2024-06-15 13:04:00 -07:00
Hoa Nguyen	d528a6bd2d	arch: Flag all ISAs Unknown instruction as IsInvalid Change-Id: I096138a157c4e2063c5f4f4324c21c1463dddb65 Signed-off-by: Hoa Nguyen <hn@hnpl.org>	2024-06-11 18:48:29 +00:00
Alexander Richardson	3cfc550fc0	arch-arm,mem: Don't hardcode secure mode accesses for semihosting (#1200 ) When accessing memory using functionalAccess(), the MMU could tell us to use a nonsecure access even though the CPU is operating in secure mode. I noticed this while trying to run a simple semihosting hello world with the MMU+caches enabled and the semihosting calls ended up reading from memory instead of the caches due to an S/NS mismatch. See also https://github.com/gem5/gem5/pull/1198 which happens to also mask the issue I saw, but I believe both changes are needed. Change-Id: I9e6b9839b194fbd41938e2225449c74701ea7fee	2024-06-09 14:08:54 -07:00
Saúl	5cfad84a98	arch-riscv: correctly set dynamic VLEN for all arith instructions (#1187 ) Some arithmetic instructions of the riscv vector extension where still using the default VLEN=256 instead of the dynamic one through the inherited `vlen` attribute. Most of them only use this to calculate the effective index for the mask element like so: ``` uint32_t ei = i + vtype_VLMAX(vtype, vlen, true) * this->microIdx; if (this->vm \|\| elem_mask(v0, ei)) { ... ``` This means that instructions will wrongly compute the mask index in the second and subsequent micro instructions (`microIdx` > 0). This commit fixes this by adding the corresponding `set_vlen` snippet to the affected instruction formats. Change-Id: Ib041de972d6938490741a9fb4c214a6a5172c34e	2024-06-07 22:33:56 -07:00
Alexander Richardson	ec5881ec4e	arch-arm: avoid using an uninitialized variable use in MMU walks (#1198 ) While running a simple Arm32 binary, I noticed that all memory transactions were being marked as NS instead of S once I turn on the MMU (even though the page tables have the NS bit set to zero). The result of this was that semihosting calls were failing since they were using functional accesses with the SECURE flag set, but the caches only contained NS tagged entries so these accesses always read stale values from DRAM. Digging through the Arm MMU code it appears that the NS bit lookup was being keyed of the `secureLookup` flag which is only used for long descriptors. I believe `0c28712f51` should have used isSecure instead of secureLookup. To avoid using these uninitialized values in the future I wrapped the LPAE state in a std::optional to ensure that it is only accessed once initialized. Change-Id: Ibc406ed3f4cfa768f470e34a5eca3c1a2bf45cd8	2024-06-07 08:59:28 +01:00
Alexander Richardson	8e5fbcbbbb	arch-generic: flush streams after semihosting write calls (#1202 ) The SYS_WRITEC and SYS_WRITE0 calls are specified as writing to the debug channel, so it is a reasonable expectation for these messages to be visibile immediately after the semihosting call. Change-Id: I8e6e9a7aab593a59e82ecb9cf4603c18c7a8acbe	2024-06-06 09:57:36 +01:00
Yu-Cheng Chang	5d3f1c3316	arch-riscv: Add rvZext to BranchTarget (#1173 ) Ensure the upper xlen bits are all zeros Change-Id: Id81330eced907d21320bc1af85ad38fb6e95f6b1	2024-06-03 10:03:51 -07:00
Matthew Poremba	00dcd5b0bc	arch-vega: Implement literals for 64b dest operands This feature has been available since Vega10 but was never implemented. MI300 adds a few new instructions that make use of this more often (e.g., v_mov_b64). Change-Id: Ieeb7834462b76d77c0030f49622d0de09f90c9e4	2024-05-31 13:41:46 -07:00
Matthew Poremba	6c8caf83c6	arch-vega: Implement V_ACCVGPR_MOV_B32 instruction This instruction is a simple move from accumulation register to accumulation register. It is essentially a move with the accumulation offset added to the register index. Change-Id: Ic93ae72599b75c91213f56ebafe5bbd7b2867089	2024-05-31 09:32:35 -07:00
Matthew Poremba	7cdb69bf21	arch-vega: Fill in scratch insts to match flat/global Flat, scratch, and global share the same instruction implementation with different address calculations essentially. These instructions were already implemented but not added to the decoder. This commit adds the remaining scratch instructions which have a shared instruction implementation. Change-Id: I8f2e9ceb221294dce1b81c45745b642f0592d985	2024-05-31 09:32:34 -07:00
Bobby R. Bruce	a0de33110b	arch-vega: Fix clang comp error due to constant exp (#1183 ) The lines `constexpr int B_I = std::ceil(64.0f / (N * M / H));` caused the following compilation error in clang Version 16: ``` error: constexpr variable 'B_I' must be initialized by a constant expression ``` `std::ceil` is not a const expression. Therefore instances of this expression in instructions.hh have been replaced with a constant expression friendly alternative. This is calling our compiler tests to fail: https://github.com/gem5/gem5/actions/runs/9288296434/job/25559409142 Change-Id: I74da1dab08b335c59bdddef6581746a94107f370	2024-05-30 09:44:34 -07:00
Bobby R. Bruce	b161172f65	arch-arm: Fix memory attributes of table walks (#1180 ) This PR is doing the following: 1) Fixing memory attributes of partial translation entries (table walks) 2) Properly setting the cacheability of table walks	2024-05-29 08:07:44 -07:00
Nicholas Mosier	9027d5c3e2	arch-x86: set AF=0 when logical instructions execute (#1171 ) Fix #1168. Prevent logical instructions like AND, OR, and TEST from having input dependencies on the previous value of the Zaps register (ZF+AF+PF+SF) by having them set AF=0, rather than not modifying AF.	2024-05-29 08:04:44 -07:00
Nicholas Mosier	a54d3198a8	arch-x86: break 32/64-bit mov's input dependency on prior dest value (#1172 ) Fix #1169. Break the input dependency of 32-bit and 64-bit 'mov' micro-ops on the prior value in the destination register. Such a dependency is required for 8-bit and 16-bit moves, as they do not completely overwrite the value in the destination register. However, it is unnecessary for 32-bit moves (which implicitly zero the upper 32 bits) and 64-bit moves. This patch implements the fix by adding a new code template field inside the generated constructors of X86StaticInst's, called `invalidate_srcs`, which instruction implementations like `mov` can use to conditionally invalidate particular source registers as needed. In `mov`'s case, this is when the data size is 32 or 64 bits. Change-Id: Ib2aef6be6da08752640ea3414b90efb7965be924	2024-05-29 07:54:03 -07:00
Giacomo Travaglini	c4ed23a10b	arch-arm: Implement HCR_EL2 force broadcast for EL1&0 TLBIs (#1175 ) According to the Arm architecture reference manual, it is possible to force the broadcast of the following TLBIs: AArch64: TLBI VMALLE1, TLBI VAE1, TLBI ASIDE1, TLBI VAAE1, TLBI VALE1, TLBI VAALE1, IC IALLU, TLBI RVAE1, TLBI RVAAE1, TLBI RVALE1, and TLBI RVAALE1. AArch32: BPIALL, TLBIALL, TLBIMVA, TLBIASID, DTLBIALL, DTLBIMVA, DTLBIASID, ITLBIALL, ITLBIMVA, ITLBIASID, TLBIMVAA, ICIALLU, TLBIMVAL, and TLBIMVAAL. Via the HCR_EL2.FB bit Change-Id: Ib11aa05cd202fadfbd9221db7a2043051196ecbd Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com>	2024-05-29 11:54:24 +01:00
Giacomo Travaglini	e9dcb906b4	arch-arm: Set memory attributes for partial table entries Change-Id: I80adcead410f226c323e4d781adb1ff17a386986 Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com>	2024-05-29 09:30:58 +01:00
Giacomo Travaglini	09f0c20be2	arch-arm: Use HCR_EL2.CD for stage2 table walks When determining the cacheability of table walks, SCTLR.C should only be used in stage1 EL1&0 translations. Stage2 translations should rely on HCR_EL2.CD instead Change-Id: I1b0830bc3fb5086f68d7a7a1560c7fed5d126d28 Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com>	2024-05-29 09:30:58 +01:00
Giacomo Travaglini	854662f48f	arch-arm: Check OSH domain as well for cacheability attribute Make table walks uncacheable if marked as uncacheable in either inner or outer shareable domain Change-Id: I5898a3b91b5b919e0beda6c6fe896394e3ab94df Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com>	2024-05-29 09:30:58 +01:00
Ivana Mitrovic	5ec1acaf5f	arch-arm: TLBIs targeting EL2 regime are executable from S state (#1176 ) Those AArch64 instructions/registers were labelled as executable from EL3 only if SCR_EL3.NS == 1. This is not valid anymore after the introduction of FEAT_SEL2	2024-05-28 10:54:18 -07:00
Matthew Poremba	1dfaa224ff	arch-vega: Fix GCC 13 build errors (#1162 ) The new static analysis in GCC 13 finds issues with operand.hh. This commit fixes the error so that gem5 compiles when BUILD_GPU is true. Change-Id: I6f4b0d350f0cabb6e356de20a46e1ca65fd0da55	2024-05-28 07:58:28 -07:00
Giacomo Travaglini	27c7647fee	arch-arm: Use monWrite a shorter version Change-Id: I8da8a39238eb100315d3df496f55a6bf3da948c6 Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com>	2024-05-28 11:20:52 +01:00
Giacomo Travaglini	6995a99d77	arch-arm: TLBIs targeting EL2 regime are executable from S state Those AArch64 instructions/registers were labelled as executable from EL3 only if SCR_EL3.NS == 1. This is not valid anymore after the introduction of FEAT_SEL2 Change-Id: Ie7b56f3fe779c3a99d4f0ef937c7c8ec0530b00e Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com>	2024-05-28 11:20:32 +01:00
Giacomo Travaglini	10dbfb8bb7	arch-arm: Rewrite performTlbi to use map instead of switch (#1166 ) This is making it easier for TLBI instructions to share code. Common code (under the form of tlbi* functions) are closely matching the instruction description in the Arm pseudocode Change-Id: If10c22fb4a7df2bcd0335e9761286ad3c458722b Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com>	2024-05-28 11:03:07 +01:00
Yu-Cheng Chang	4f6fdbf8bf	arch-riscv: Fix c.jalr and c.jr instruction (#1163 ) The bit 0 of register should be 0 for jump address. Wrong handling the jump address may cause infinite run or segment fault. gem5 issue: https://github.com/gem5/gem5/issues/981	2024-05-25 20:18:42 -07:00
Matthew Poremba	1616d34003	arch-vega: Template MFMA instructions (#1128 ) templated - v_mfma_f64_16x16x4f64 added support for - v_mfma_f32_32x32x2f32 - v_mfma_f32_4x4x1_16b_f32 - v_mfma_f32_16x16x4f32 [formula for gprs needed](https://github.com/ROCm/amd_matrix_instruction_calculator) [formulas for register layouts and lanes used in computation](https://www.amd.com/content/dam/amd/en/documents/instinct-tech-docs/instruction-set-architectures/amd-instinct-mi300-cdna3-instruction-set-architecture.pdf) Change-Id: I15d6c0a5865d58323ae8dbcb3f6dcb701a9ab3c7	2024-05-22 08:53:25 -07:00
Robert Hauser	688f8fb03b	arch-riscv: add exception code to DPRINTFS msg (#1153 ) Change-Id: Ib5d1dc991f18256ec634c604c776629ea31317a9	2024-05-21 09:59:25 -07:00
Yu-Cheng Chang	5e20438c1c	arch-riscv: Fix GDB connection failed after #1099 (#1152 ) GDB connection failed after the PR[1] changed the index of CSR_FCSR to MISCREG_FCSR itself. It cause the out of bound error. [1]: https://github.com/gem5/gem5/pull/1099 gem5 issue: https://github.com/gem5/gem5/issues/1151 Change-Id: I402febe5a3a9addf3d4821ad716ade14e227d5d7	2024-05-21 09:58:15 -07:00
Giacomo Travaglini	6f4ba0b422	arch-arm: Add missing outer-shareable TLBIs to the list (#1147 ) Those were not part of the performTlbi switch and simulation was therefore panicking when they were encountered Change-Id: Ifbe0b89e45539df4abc147ac5970b0caf0d9dfdc Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com>	2024-05-20 19:24:45 -07:00
Chong-Teng Wang	13924336b1	arch-riscv: Fix viota instruction (#1137 ) This commit fixes and refactors the implementation of viota. It also overrides the generateDisassembly function in viota's macro/micro to correctly print out the instruction when tacing/debugging. For example, it changes from: viota_m vd, vd, vs2, v0.t to: viota_m vd, vs2, v0.t	2024-05-20 12:19:22 -07:00

1 2 3 4 5 ...

5979 Commits