derek/gem5 - gem5 - Gitea: Git with a cup of tea

derek/gem5

Author	SHA1	Message	Date
Saúl	7d80658a39	arch-riscv: fix vl in mask load/store (i.e vlm.v/vsm.v) (#830 ) The vlm.v and vsm.v unit-stride mask load/store instructions are constructed with an incorrect VL when the current one is larger than than VLEN/EEW (i.e. when LMUL > 1). This commit fixes the issue for both instructions.	2024-02-08 14:06:49 -08:00
Bobby R. Bruce	7fe1588546	arch-riscv: Fix load and store to use EEW instead of SEW (#859 ) Vector unit-stride instructions have an EEW encoded directly in the instruction, We should use that instead of SEW in vtype. Ref: https://github.com/riscv/riscv-v-spec/blob/master/v-spec.adoc#73-vector-loadstore-width-encoding	2024-02-08 12:14:11 -08:00
Saúl	804f137325	arch-riscv: add unit-stride fault-only-first loads (i.e. vleff) (#794 ) This patch provides unit-stride fault-only-first loads (i.e. vleff) for the RISC-V architecture. They are implemented within the regular unit-stride load (i.e. vle). A snippet named `fault_code` is inserted with templating to change their behaviour to fault-only-first. A part from this, a new micro based on the vset\vl\* instructions (VlFFTrimVlMicroOp) is inserted as the last micro in the macro constructor to trim the VL to it's corresponding length based on the faulting index. This trimming micro waits for the load micros to finish (via data dependency) and has a reference to the other micros to check whether they faulted or not. The new VL is calculated with the VL of each micro, stopping on the first faulting one (if there's such a fault). I've tested this with VLEN=128,256,...,16384 and all the corresponding SEW+LMUL configurations. Change-Id: I7b937f6bcb396725461bba4912d2667f3b22f955	2024-02-08 09:15:58 -08:00
Minje Jun	db5c71a919	mem-ruby: Pass UD on ReadShared hit only if SD is not allowed This commit allows CompData_SD be sent when ReadShared hits on UD line and the local cache keeps the line, unless the request doesn't allow SD. Change-Id: I337f24c871cc4c19c5b5fb11f9b35c0a8eb7911c	2024-02-08 18:47:44 +09:00
Minje Jun	628be390a0	mem-ruby: Fix ReadShared hit handling on UD line In case ReadShared hit on a UD line and there's no sharers, this chage makes the downstream respond with Unique even though it doesn't deallocate the line. This will make the requestor to UD and the downstream to UD_RU. In the previous implementation, loosely exclusive intermediate cache can cause loss of dirty data. Example sequence is as below. Configurations L2 cache: Roughly inclusive to L1 without back-invalidation - dealloc_on_* = false - dealloc_backinv_* = false L3 cache: Roughly exclusive to L2 without back-invalidation - alloc_on_readshared = tue - alloc_on_readunique = false - dealloc_on_shared = false - dealloc_on_unique = true - dealloc_backinv_* = false - is_HN = false LLC: Same clusivity as L3 except is_HN = true For all caches, allow_SD = true and fwd_unique_on_readshared = false Example problem sequence: 1. L1 sends ReadUnique then becomes UD. L2 is UC_RU. L3 and LLC are RU. 2. L1 evicts the line to L2 by WriteBackFull (UD_PD). L2 becomes UD. 3. L2 evicts the line to L3 using WriteBackFull (UD_PD). L3 becomes UD. 4. L1 reads the line with ReadShared which misses on L2. 5. L2 reads the line with ReadShared which hits on L3. L3 becomes UD_RSC because it doesn't deallocate the line (dataToBeInvalid=false) 6. L3 evicts the line to LLC by WriteCleanFull (UD_PD) because L3 doesn't back-invalidate and still has sharer. The local cache line is invalidated by Deallocate_CacheBlock. L3 becomes RUSC and LLC becomes UD_RU. 7. When UD_RU is evicted at LLC, the UD_RU line is dropped expecting the upstream to writeback, causing loss of dirty data. Change-Id: Ic9bee27f2ec8906dd5df8bd3be60e5a9a76c782f	2024-02-08 18:47:44 +09:00
Minje Jun	1b5d92ee9c	mem-ruby: Revert Writeback CHI UD_RU line at local evict This reverts commit d613d814a431525e122552a667eed653a057f2be. Change-Id: I50e218b7debf3a2836ce12515d8fcb6c0b38df53	2024-02-08 18:47:44 +09:00
Minje Jun	e141d9e4d0	mem-ruby: Writeback CHI UD_RU line at local evict In Ruby CHI protocol UD_RU state means the line is in UD state in the local cache and the upstream may have it in UD or UC state. In the previous implementation UD_RU line was just dropped without WriteBack which can cause loss of dirty data when the upstream has it in UC state. This commit fixes it by performing WriteBack when evciting UD_RU line. Change-Id: I1db9b4f95cc576e71dcef38b01de24775df514ba	2024-02-08 18:47:44 +09:00
QQeg	e685c072d1	arch-riscv: Remove micro_elems in VleMicro template Change-Id: I91267de8b1142075aa2873bfcedfd8b15c6863d4	2024-02-08 07:24:55 +00:00
QQeg	7eeac98b8d	arch-riscv: Fix load and store to use EEW instead of SEW Vector unit-stride instructions have an EEW encoded directly in the instruction, We should use that instead of SEW in vtype. Change-Id: I282041ce8ed57fbcca899f7497ef6c6fb2dfcf85	2024-02-07 21:11:28 +00:00
Jason Lowe-Power	4aecf9d35c	stdlib: fix typo in error message (#855 ) Change-Id: I28f1881d207caa36c6101eef221ef4cdd229da57 Signed-off-by: Jason Lowe-Power <jason@lowepower.com>	2024-02-06 09:50:01 -08:00
Robert Hauser	f289f9e8b5	arch-riscv: adding support for local interrupts (#813 ) Besides the standard RISC-V interrupts software, timer, and external interrupt, the RISC-V specification also offers the possibility to implement local interrupts. With this patch, we contribute an extension of RiscvInterrupts that enables connecting interrupt sources to the local interrupt controller. We assigned the local interrupts to machine-level and gave them the highest priority. If two local interrupts are pending, there exception code will be the tie-breaker (higher ID > lower ID). 32 Bit systems only recognize the local interrupts 16 to 31, 64 Bit systems 16 to 63. Change-Id: Iff8d34e740b925dce351c0c6f54f4bd37a647e0c --------- Co-authored-by: Robert Hauser <robert.hauser@uni-rostock.de>	2024-02-06 09:38:50 -08:00
Yu-Cheng Chang	ba6c569b8d	arch-riscv: Add BasePMAChecker to support customized PMA (#846 ) The RISC-V privilege spec don't specify the implementation of PMA(physical memory attribute), which is addressed in the previous CL[1]. This CL creates the BasePMAChecker to support customized PMA so that we can only focus on the features wanted in the study. The CL also leaves the common methods `check` and `takeOverFrom` to make MMU easy to interact with PMA. [1] https://gem5-review.googlesource.com/c/public/gem5/+/40596 Change-Id: I9725e3a8f7f9276e41f0d06988259456149d2a77	2024-02-06 05:38:34 -08:00
Giacomo Travaglini	a60d6960c7	arch-arm: Remove unused/unimplemented TLB methods (#849 ) Change-Id: I3a76a914df1ba65ec5200f11111cf20f3e1eb924 Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com>	2024-02-06 09:18:06 +00:00
Mahyar Samani	8efe6dc1bc	sim: Updating Process::Map (#835 ) Changing size from int to int64_t to allow for mapping regions bigger than 2GB.	2024-02-05 12:17:05 -08:00
Giacomo Travaglini	05f93175a7	arch-arm: Crypto instruction execution requires SIMD to be enabled (#848 ) Crypto instructions will cause an undefined instruction when executed with SIMD disabled. The PR is also refactoring their implementation by checking the release object instead of the ID register field. This is improving readability	2024-02-05 19:22:04 +00:00
wmin0	e4e359135e	systemc: Reduce unnecessary backdoor request in atomic transaction (#795 ) The backdoor request in b_transport is only used for hinting the dmi capability. Since most of traffic patterns are continous, we can cache the previous backdoor request result to spare the backdoor inspect of next request. Change-Id: I53c47226f949dd0be19d52cad0650fcfd62eebbc	2024-02-05 11:08:20 -08:00
Chong-Teng Wang	40ecdf5fb4	arch-riscv: Fix RVV instructions vmv.s.x/vfmv.s.f (#843 ) This commit fixes the implementation of vmv.s.x and vfmv.s.f. When vl = 0, no elements are updated in the destination vector register group, regardless of vstart. Change-Id: Ib21b3125da8009325743ec70ca0874704328356c Reference: [Integer Scalar Move Instructions](https://github.com/riscv/riscv-v-spec/blob/master/v-spec.adoc#161-integer-scalar-move-instructions) [Floating-Point Scalar Move Instructions](https://github.com/riscv/riscv-v-spec/blob/master/v-spec.adoc#162-floating-point-scalar-move-instructions)	2024-02-05 08:51:42 -08:00
Chong-Teng Wang	85059a369e	arch-riscv: Fix control flow in VectorFloatMaskMacroConstructor (#844 ) This commit adjusts the logic in VectorFloatMaskMacroConstructor to ensure the %(copy_old_vd)s section is not skipped when vl = 0, ensuring correct values in destination vector register. Change-Id: I2478722d6f003a0f2e4b3cd0ba3e845bed938ee6 This is the same problem as #715 .	2024-02-05 06:29:05 -08:00
Giacomo Travaglini	16e06bad0c	arch-arm: Exec Crypto instructions only if SIMD&FP enabled We not only check for the presence of the relative FEAT_*, we also check if AdvSIMD is enabled; we throw an undefined instruction otherwise. Change-Id: I1fd0cdc8057c5a7901774802dc076817f06c8e66 Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Reviewed-by: Andreas Sandberg <andreas.sandberg@arm.com>	2024-02-05 12:56:48 +00:00
Giacomo Travaglini	ebef2fc4b1	arch-arm: Crypto instructions checking release object Check directly if extension is enabled instead of looking for ID register field value. This makes the code more readable Change-Id: If0b882ac3464c3587731b72a7edb3b8b65ea86c7 Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Reviewed-by: Andreas Sandberg <andreas.sandberg@arm.com>	2024-02-05 12:56:48 +00:00
Giacomo Travaglini	4eb0cd44fc	cpu-o3: Restrict constraint on number of physical registers Having the number of physical registers matching exactly the number of architectural ones does not guarantee a proper execution as it means the freeList would have 0 registers available for renaming. In this case the worst would happen: renaming would silently stall execution indefinitely. With this change we report the issue to the user and fail execution Change-Id: I1eb968802f1a1a5115012f44b541542a682f887d Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com>	2024-02-02 21:18:32 +00:00
Giacomo Travaglini	1fb7c1ad7e	cpu-o3: Rename numFreeEntries into minFreeEntries Change-Id: I89faeb001ebdcbc90ea88508f8d231ec6e7fe197 Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com>	2024-02-02 18:11:47 +00:00
Giacomo Travaglini	86158de220	cpu-o3: Stop using RenameMap::numFreeEntries The method is extracting the minimum number of [1] non-zero free registers/entries across all register classes. This means that if we have saturated all register storage for a particular class, renaming will stop as a whole. I believe it does make sense to keep renaming and only block renaming in case an instruction requiring the particular register type is encountered. This would happen with the Rename::renameInsts method [1]: https://github.com/gem5/gem5/blob/stable/src/cpu/o3/rename_map.hh#L269 [2]: https://github.com/gem5/gem5/blob/stable/src/cpu/o3/rename.cc#L662 Change-Id: I932826a77a5c0b2e05d8fdcab0e6ca13cf0e3d23 Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com>	2024-02-02 18:11:47 +00:00
Giacomo Travaglini	d031244ca7	misc: When unused, set #MatRegClass registers to 0 This is working around an existing SMT issue [1]. The BaseO3CPU uses two physical matrix registers [2]. This is enough for a single threaded CPU which as of now uses 1 architectural matrix only. The problem arises when SMT is enabled. As 2 architectural matrices need to be supported by a single CPU, the O3CPU won't have any available register in the freeList for renaming. This causes the SMT O3CPU to indefinitely stall renaming [3] If the archtectural number of registers is seto to 0, the regclass won't be taken into consideration when evaluating if we can rename instructions. This issue has been implicitly fixed for RISCV by a preceding PR [4] [1]: https://github.com/gem5/gem5/issues/668 [2]: https://github.com/gem5/gem5/blob/stable/src/cpu/o3/BaseO3CPU.py#L170 [3]: https://github.com/gem5/gem5/blob/stable/src/cpu/o3/rename.cc#L1228 [4]: https://github.com/gem5/gem5/pull/83 Change-Id: I99bfdefff11a246b1f191251dc67689e95b3f0db Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com>	2024-02-02 18:11:47 +00:00
Giacomo Travaglini	33e62b8e8a	arch-arm: Adopt new TranslationRegime data type in MMU translations (#829 ) This is more complaint with the VMSAv8-64, which is using Translation Regimes instead of historical (Armv7) isHyp tagging and the ExceptionLevel managing the translation. This greatly simplifies translation code, specially with FEAT_VHE where the managing el (EL2) could handle to different translation regimes (EL and EL2&0).	2024-02-02 11:54:38 +00:00
kroarty-lanl	197be3a0dd	dev: Fix off-by-one in IDE controller PCI register allocation (#824 ) The PCI configuration space is 256 bytes, yet because the PCI_CONFIG_SIZE macro is 0xff, the final register allocation in the IDE controller only allocated up to byte 255. Change-Id: I1aef2cad9df366ee8425edb410037061eb29ae33	2024-02-01 10:14:28 -08:00
Mahyar Samani	b79fe82e5c	cpu,stdlib: Updating strided generator (#762 ) This change improves the functionality of strided generator to create trace with better flexibility. It allows the user to manually set offset and stride size instead of calculating it based on a "gen_id". This way different patterns could be created with the same SimObject. In addition, this change adds stdlib components for strided generator.	2024-02-01 09:08:42 -08:00
Giacomo Travaglini	3a2c8feca8	arch-arm: MMU aarch64EL is not used in AArch64 only anymore We therefore rename it to exceptionLevel Change-Id: I2a3aabaefa315d95bd034b13d95d5a5b0b8e9319 Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com>	2024-02-01 13:45:06 +00:00
Giacomo Travaglini	3737e8b6df	arch-arm: Use MAIR_EL2 mem attribute register when in EL0 host With the old code, the MAIR_EL1 register was checked when inserting an EL2&0 TLB entry Change-Id: I064032fb2946777c2f4c50c06a124f828245e18a Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com>	2024-02-01 13:44:16 +00:00
Giacomo Travaglini	d42ef792bf	arch-arm: Check ELIs64 for EL2 when in EL2&0 regime The problem with: ELIs64(tc, aarch64EL == EL0 ? EL1 : aarch64EL); Is that when we are executing at EL0 in host (EL2&0 translation regime), the execution mode (AArch32 vs AArch64) is dictated by EL2 and not by EL1 (which is the guest) Change-Id: I463a2a9461c94d0886990ae3d0a6e22aeb4b9ea3 Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com>	2024-02-01 13:43:59 +00:00
Giacomo Travaglini	458c98082c	arch-arm: Replace EL based translation with regimes This is the final step in the transformation process. We limit the use of the "managing Exception Level" for a translation in favour of the more standard "Translation Regime" This greatly simplifies our code, especially with VHE where the managing el (EL2) could handle to different translation regimes (EL and EL2&0). We can therefore remove the isHost flag wherever it got used. That case is automatically handled by the proper regime value (EL2&0) Change-Id: Iafd1d2ce4757cfa6598656759694e5e7b05267ad Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com>	2024-02-01 13:43:47 +00:00
Giacomo Travaglini	e333a77c12	arch-arm: Remove _Xt postfix from TLBI instructions The Xt is not part of the architectural name of the register and it was likely added with the introduction of extended register (Xt) TLBIs in Armv8 to differentiate them with the old Armv7 ones. The use of _Xt was not consistent anyway: newer TLBIs were already omitting it. Change-Id: Ic805340ffa7b5770e3b75a71bfb76e055e651f8b Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com>	2024-02-01 13:43:26 +00:00
Giacomo Travaglini	594428f010	arch-arm: Remove redundant isHyp as a TLB entry field We should stop using isHyp.. An hypervisor entry is flagged already by the EL of the entry (el == EL2) Change-Id: I20c3d06fa2b04e0b938a380ca917d0b596eddcf2 Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com>	2024-02-01 13:43:00 +00:00
Giacomo Travaglini	a6ca81906a	arch-arm: Simplify setting of isHyp for mem translations The isHyp descriptor is an old artifact of armv7 and it flags a PL2 (AArch32) or EL2 & EL2&0 (AArch64) translations. It is commonly set according to the EL/mode [1] but it may differ from the execution state in case of explicit translation requests (via the AT instruction as an example [2]). There is really no need to complicate the masking of isHyp. We should just make use of the tranType method (in charge of setting aarch64EL) to properly set aarch64EL, and make isHyp coincide with the case of aarch64EL == EL2. This is a step towards the removal of the isHyp flag. More specifically the patch does the following: * HypMode translation type moved in the EL2 case The translation is used by ATS1HR/ATS1HW: Performs stage 1 address translation as defined for PL2 and the Non-secure state * S1S2NsTran translation type moved in the EL1 case The translation is used by ATS12NSOPR/ATS12NSOPW: Performs stage 1 and 2 address translations as defined for PL1 and the Non-secure state * S1CTran translation type can be at either EL1 or EL3 The translation is used by ATS1CPR/ATS1CPW Performs stage 1 address translation as defined for PL1 and the current Security state [1]: https://github.com/gem5/gem5/blob/stable/src/arch/arm/mmu.cc#L1281 [2]: https://github.com/gem5/gem5/blob/stable/src/arch/arm/mmu.cc#L1282 Change-Id: Ie653170f6053c5d8141a2de9f50febf5bf53ab9c Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com>	2024-02-01 13:42:40 +00:00
Kaustav Goswami	b5d18b84a8	arm,stdlib: added kvm support to the ARM board (#725 ) This change adds support to use KVM cores on the ARM board. The board simulates gic to enable KVM, similar to the gem5 ARM FS configs. The limitation is that it only supports VExpress_GEM5_V1. Signed-off-by: Kaustav Goswami <kggoswami@ucdavis.edu>	2024-01-31 10:17:58 -08:00
Jason Lowe-Power	b3870ee7b0	arch-riscv: Fix fence.i instruction in O3 CPU (#816 ) arch-riscv: Fix fence.i instruction in O3 CPU	2024-01-30 15:39:32 -08:00
Roger Chang	d94ef08a36	arch-riscv: Fix fence.i instruction in O3 CPU We should clean the instruction buffer after the fence.i is execute to avoid execute old instruction for self-modifying code Change-Id: Iece0ee0d10631fcd9bd17ee67cf0c92f72acdbd8	2024-01-29 11:43:27 +08:00
QQeg	08ed87bc9d	arch-riscv: Add template Vector1Vs1VdMaskDeclare This commit adds a new template, Vector1Vs1VdMaskDeclare, to replace the use of Vector1Vs1RdMaskDeclare in Vector1Vs1VdMaskFormat. The change addresses the issue with the number of indices in srcRegIdxArr. Only two indices are available in Vector1Vs1RdMaskDeclare, but instructions that use Vector1Vs1VdMaskFormat, like 'vmsbf', require three indices (for vs1, vs2(old_vd), and vm) to function correctly. Change-Id: I0c966e11289ce07efcc3b0cc56948311289530ad	2024-01-28 09:38:11 +00:00
QQeg	31ffc11c57	arch-riscv: Fix segmentation fault in vmsbf/vmsof/vmsif This commit simplifies the conditional logic in vmsbf/vmsof/vmsif by removing an unnecessary variable and condition. The updated logic checks 'this->vm' or the result of 'elem_mask(v0, i)' directly, which prevents a segmentation fault regardless of whether 'vm' is set or not. Change-Id: I799fa7b684ff98959a64f9694ef9c854f3a1f43a	2024-01-28 09:38:11 +00:00
Giacomo Travaglini	ce32d7c523	arch-arm: Replace CRYPTO extension with canonical names (#810 ) These are: FEAT_AES, FEAT_PMULL, FEAT_SHA256, FEAT_SHA1, FEAT_CRC32 With this patch we are also enabling them by default by adding them to the Armv8 release object. Some of them are mandatory anyway since Armv8.1 Change-Id: I221ae8646d91151fdfaf97a4815168a4fe3d8c5a Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com>	2024-01-26 19:39:35 +00:00
Matthew Poremba	7f71477f15	dev-amdgpu: Limit SDMA NOP count to wptr boundary (#806 ) If the NOP count of an SDMA NOP packet goes beyond the wptr address, the queue decode method will loop infinitely. If a packet comes in with a bad count this causes gem5 to hang. This change advances the rptr one dword at a time until either reaching the NOP count or when rptr == wptr to prevent this issue. Change-Id: Ib2c0f74a477bff27890c9c064bb4190e76e513bd	2024-01-25 15:35:35 -08:00
Ivana Mitrovic	1c0127ae7c	base: Fix Integer overflow in AddrRange (#786 ) This PR fixes the bug mentioned in #240.	2024-01-25 10:18:29 -08:00
Ivana Mitrovic	24e0d71034	arch-gcn3: Remove gcn3 (#781 ) Related to issue #703 , this PR removes GCN3 related files and updates source code, documentation, and tests to switch over to Vega is that was not done already. Highlights are: - Remove all src/arch/amdgpu/gcn3 files and update Kconfigs. - Remove references to GCN3 and replace with Vega where applicable. - Update the build targets in the gcn-gpu Docker. This will need to be rebuilt but not urgently. - Remove the GCN3 tag in testlib. Most tests seem to be using Vega already, so that commit is small.	2024-01-25 10:14:46 -08:00
QQeg	7a96709b11	arch-riscv: Fix vsadd_vi and vsaddu_vi to match v-spec (#805 ) This commit fixes the implementation of two instructions, vsadd_vi and vsaddu_vi, in the OPIVI category to match the RISC-V vector specification. According to [riscv-v-spec](https://github.com/riscv/riscv-v-spec/blob/master/v-spec.adoc#101-vector-arithmetic-instruction-encoding), the immediate field of these two instructions should be sign extended. > For integer operations, the scalar can be a 5-bit immediate, imm[4:0], encoded in the rs1 field. The value is sign-extended to SEW bits, unless otherwise specified. There is an example in both [vsadd](https://github.com/QQeg/rvv_intrinsic_testcases/tree/master/vsadd_vi) and [vsaddu](https://github.com/QQeg/rvv_intrinsic_testcases/tree/master/vsaddu_vi). Change-Id: Ib877627ba01c0868b2103d41613651df488fca13	2024-01-24 17:21:26 -08:00
Yu-Cheng Chang	6dd936e5b5	arch-riscv: Simply implementation of vector multiply and divide instructions (#793 ) Align the implementation of scalar multiply and divide instructions Change-Id: I53297d4c841c41593baaae0ea140bfbbd874a1d9	2024-01-24 13:20:15 -08:00
Matthew Poremba	44c78d843c	arch-vega: Implement memory aperture operands (#803 ) Vega (gfx900) introduced new memory aperture registers to get the base address and limit for LDS and private (scratch) memory. These have not commonly been used by the compiler until ROCm 6. Now that the compiler is generating reads from these special registers, implement the support for them. Tested with LULESH which is using the SHARED_BASE register (LDS) with ROCm 6.0. This assembly seems to replace S_GETREG_B32 emitted by the ROCm 5 compiler. Change-Id: Id2bd26ce8ef687c84a647fa2ac2da54d657913e5	2024-01-24 11:19:43 -08:00
Matthew Poremba	0ac110ac95	dev-amdgpu: Check privledge bit for SDMA RLC queues (#792 ) By default all SDMA queues are privileged queues, meaning the addresses in SDMA packets use the privileged translation tables. RLC queues (sometimes called user queues) are not necessarily privileged and might use user translation tables. RLC queues are used more often in ROCm 6.0 exposing an issue with invalid translations with RLC queues. This changeset checks the priv bit in the SDMA MQD when an RLC queue is mapped. Each packet type which uses an address then checks the bit before performing translation. Tested with daily/weekly tests with a ROCm 6.0 disk image and tests are passing. Change-Id: I6122fbc194e8d6f5d38e81f1b0e11646d90e0ea0	2024-01-24 07:25:43 -08:00
Matthew Poremba	dfafc5792a	arch-vega: Remove deleted instruction.cc from build (#801 ) Change-Id: I03073d35a0d36788dfe8309e6ed466d0a496e31e	2024-01-23 18:47:01 -08:00
Harshil Patel	78613e2307	base: Add a check for edge case - Now check for the condition where the bigger address range wraps but smaller does not. Change-Id: Icc7a549afaf82a277dc2845255aa1702a1d662e0	2024-01-23 11:35:54 -08:00
Harshil Patel	7372097376	base: fix Integer overflow in AddrRange bug An issue raised in #240 where if an address range ends at the last byte of a 64 bit address space, it will be considered a subset of any other address range that starts at the first byte of the range. Change-Id: I517f4717052eda2504de971be0eb59ee9a623dd3	2024-01-22 15:43:11 -08:00

1 2 3 4 5 ...

14931 Commits