derek/gem5 - gem5 - Gitea: Git with a cup of tea

derek/gem5

Author	SHA1	Message	Date
Arnabjyoti Kalita	b826d96f40	cpu-o3: add PerThreadUnifiedThreadMap to O3 CPU (#842 ) Github issue: https://github.com/gem5/gem5/issues/373 Change-Id: I1c8aba9bc5ea4e45faa6c174780904b8bd618604	2024-02-12 09:26:31 -08:00
Matt Sinclair	a840dda23a	arch-vega,gpu-compute,mem-ruby: SQC Invalidation Support (#852 ) This PR adds support for SQC (GPU I-cache) invalidation to the GPU model. It does this by updating the GPU-VIPER-SQC protocol to support flushes, the sequencer model to send out invalidates and the gpu compute model to send invalidates and handle responses. It also adds support for S_ICACHE_INV, a VEGA ISA instruction that invalidates the entire GPU I-cache. Additionally, the PR modifies the kernel start behavior to invalidate the I-cache too. It previously invalidated only the L1 D-cache.	2024-02-09 17:29:56 -06:00
Vishnu Ramadas	8054459df6	arch-vega: Add support for S_ICACHE_INV instruction Previously, the S_ICACHE_INV instruction was unimplemented and simulation panicked if it was encountered. This commit adds support for executing the instruction by injecting a memory barrier in the scalar pipeline and invalidating the ICACHE (or SQC) Change-Id: I0fbd4e53f630a267971a23cea6f17d4fef403d15	2024-02-09 12:19:08 -06:00
Vishnu Ramadas	85680ea58e	gpu-compute: Remove unused and redundant functions In ComputeUnit, a previous commit added a SystemHubEvent event class to the SQCPort. This was found to be unnecessary during the review process and is removed in this commit. Similarly, invBuf() which was added in FetchUnit as part of an earlier commit was found to be redundant. This commit removes it Change-Id: I6ee8d344d29e7bfade49fb9549654b71e3c4b96f	2024-02-09 12:17:24 -06:00
Vishnu Ramadas	690b2b9462	gpu-compute, mem-ruby: Add comments and reformat code Change-Id: Id2b3886dce347fdcfcad22009a42b92febc00a6c	2024-02-09 12:17:24 -06:00
Vishnu Ramadas	7dae25e881	configs, gpu-compute: Add parameter in shader for CUs per SQC Change-Id: If0ae0db1b6ccc08a92f169a271b137f69f410f7b	2024-02-09 12:17:24 -06:00
Vishnu Ramadas	0e93e6142a	arch-vega, gpu-compute, mem-ruby: Remove extra empty lines Change-Id: I18770ec7e38c4a992a0ae6de95b0be49ab4426c2	2024-02-09 12:17:24 -06:00
Vishnu Ramadas	440409d807	gpu-compute: Add Icache invalidation at kernel start Previously, the data caches were invalidated at the start of each kernel. This commit adds support for invalidating instruction cache at kernel launch time Change-Id: I32e50f63fa1442c2514d4dd8f9d7689759f503d3	2024-02-09 12:16:41 -06:00
Vishnu Ramadas	03838afce0	gpu-compute: Add support for injecting scalar memory barrier This commit adds support for injecting a scalar memory barrier in the GPU. The barrier will primarily be used to invalidate the entire SQC cache. The commit also invalidates all buffers and decrements related counters upon completion of the invalidation request Change-Id: Ib8e270bbeb8229a4470d606c96876ba5c87335bf	2024-02-09 12:14:57 -06:00
Vishnu Ramadas	23dc98ea72	mem-ruby: Add SQC cache invalidation support to GPU VIPER This commit adds support for cache invalidation in GPU VIPER protocol's SQC cache. To support this, the commit also adds L1 cache invalidation framework in the Sequencer such that the Sequencer sends out an invalidation request for each line in the cache and declares completion once all lines are evicted. Change-Id: I2f52eacabb2412b16f467f994e985c378230f841	2024-02-09 12:14:57 -06:00
Hristo Belchev	fd3aac1518	mem-cache: Fix circular dependency in QoS mem (#857 ) This PR removes a circular dependency between `QoSMemSinkCtrl` and `QoSMemSinkInterface` that prevented the `controller()` function of `QoSMemSinkInterface` from being used by removing the default value for `QoSMemSinkCtrl.interface`. Change-Id: I4ecc39b974e239be1a2e9285e1f6f8ea873c018d	2024-02-09 11:32:16 +00:00
Saúl	7d80658a39	arch-riscv: fix vl in mask load/store (i.e vlm.v/vsm.v) (#830 ) The vlm.v and vsm.v unit-stride mask load/store instructions are constructed with an incorrect VL when the current one is larger than than VLEN/EEW (i.e. when LMUL > 1). This commit fixes the issue for both instructions.	2024-02-08 14:06:49 -08:00
Bobby R. Bruce	7fe1588546	arch-riscv: Fix load and store to use EEW instead of SEW (#859 ) Vector unit-stride instructions have an EEW encoded directly in the instruction, We should use that instead of SEW in vtype. Ref: https://github.com/riscv/riscv-v-spec/blob/master/v-spec.adoc#73-vector-loadstore-width-encoding	2024-02-08 12:14:11 -08:00
Bobby R. Bruce	b2d13ee63a	util: Remove action runner add-apt-repo git-core/ppa (#856 ) We were having some difficulty on a server running this `apt-apt-repository` command due to suspected firewall issues. On further inspection is appear to be superfluous as git can be obtained easily through `apt-get` without adding this repository.	2024-02-08 12:13:12 -08:00
Saúl	804f137325	arch-riscv: add unit-stride fault-only-first loads (i.e. vleff) (#794 ) This patch provides unit-stride fault-only-first loads (i.e. vleff) for the RISC-V architecture. They are implemented within the regular unit-stride load (i.e. vle). A snippet named `fault_code` is inserted with templating to change their behaviour to fault-only-first. A part from this, a new micro based on the vset\vl\* instructions (VlFFTrimVlMicroOp) is inserted as the last micro in the macro constructor to trim the VL to it's corresponding length based on the faulting index. This trimming micro waits for the load micros to finish (via data dependency) and has a reference to the other micros to check whether they faulted or not. The new VL is calculated with the VL of each micro, stopping on the first faulting one (if there's such a fault). I've tested this with VLEN=128,256,...,16384 and all the corresponding SEW+LMUL configurations. Change-Id: I7b937f6bcb396725461bba4912d2667f3b22f955	2024-02-08 09:15:58 -08:00
QQeg	e685c072d1	arch-riscv: Remove micro_elems in VleMicro template Change-Id: I91267de8b1142075aa2873bfcedfd8b15c6863d4	2024-02-08 07:24:55 +00:00
QQeg	7eeac98b8d	arch-riscv: Fix load and store to use EEW instead of SEW Vector unit-stride instructions have an EEW encoded directly in the instruction, We should use that instead of SEW in vtype. Change-Id: I282041ce8ed57fbcca899f7497ef6c6fb2dfcf85	2024-02-07 21:11:28 +00:00
Jason Lowe-Power	4aecf9d35c	stdlib: fix typo in error message (#855 ) Change-Id: I28f1881d207caa36c6101eef221ef4cdd229da57 Signed-off-by: Jason Lowe-Power <jason@lowepower.com>	2024-02-06 09:50:01 -08:00
Robert Hauser	f289f9e8b5	arch-riscv: adding support for local interrupts (#813 ) Besides the standard RISC-V interrupts software, timer, and external interrupt, the RISC-V specification also offers the possibility to implement local interrupts. With this patch, we contribute an extension of RiscvInterrupts that enables connecting interrupt sources to the local interrupt controller. We assigned the local interrupts to machine-level and gave them the highest priority. If two local interrupts are pending, there exception code will be the tie-breaker (higher ID > lower ID). 32 Bit systems only recognize the local interrupts 16 to 31, 64 Bit systems 16 to 63. Change-Id: Iff8d34e740b925dce351c0c6f54f4bd37a647e0c --------- Co-authored-by: Robert Hauser <robert.hauser@uni-rostock.de>	2024-02-06 09:38:50 -08:00
Harshil Patel	de0342128c	tests: move to obtain-resources from wget (#845 )	2024-02-06 09:34:03 -08:00
Bobby R. Bruce	c7426f9427	misc: Add 'workflow_dispatch' to daily tests (#850 ) This allows us to manually trigger daily test runs rather than wait for the scheduled time. This can be useful in cases where a fix for a broken test is pushed and we wish to verify it works as intended ASAP.	2024-02-06 09:32:31 -08:00
Suraj Shirvankar	44aaebc49a	tests: Allow pyunit tests to run on specific directories (#847 ) This change allows pyunit tests to be run on specific directories instead of the default `pyunit` directory. You can pass in the directory as follows. I have built gem5.opt for RISCV however it should work the same with other builds ``` ./build/RISCV/gem5.opt tests/run_pyunit.py --directory tests/pyunit/gem5/ ``` The default path works as it is currently ``` ./build/RISCV/gem5.opt tests/run_pyunit.py ``` Change-Id: Id9cc17498fa01b489de0bc96a9c80fc6b639a43f Signed-off-by: Suraj Shirvankar <surajshirvankar@gmail.com>	2024-02-06 09:32:12 -08:00
Yu-Cheng Chang	ba6c569b8d	arch-riscv: Add BasePMAChecker to support customized PMA (#846 ) The RISC-V privilege spec don't specify the implementation of PMA(physical memory attribute), which is addressed in the previous CL[1]. This CL creates the BasePMAChecker to support customized PMA so that we can only focus on the features wanted in the study. The CL also leaves the common methods `check` and `takeOverFrom` to make MMU easy to interact with PMA. [1] https://gem5-review.googlesource.com/c/public/gem5/+/40596 Change-Id: I9725e3a8f7f9276e41f0d06988259456149d2a77	2024-02-06 05:38:34 -08:00
Giacomo Travaglini	a60d6960c7	arch-arm: Remove unused/unimplemented TLB methods (#849 ) Change-Id: I3a76a914df1ba65ec5200f11111cf20f3e1eb924 Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com>	2024-02-06 09:18:06 +00:00
Mahyar Samani	8efe6dc1bc	sim: Updating Process::Map (#835 ) Changing size from int to int64_t to allow for mapping regions bigger than 2GB.	2024-02-05 12:17:05 -08:00
Giacomo Travaglini	05f93175a7	arch-arm: Crypto instruction execution requires SIMD to be enabled (#848 ) Crypto instructions will cause an undefined instruction when executed with SIMD disabled. The PR is also refactoring their implementation by checking the release object instead of the ID register field. This is improving readability	2024-02-05 19:22:04 +00:00
wmin0	e4e359135e	systemc: Reduce unnecessary backdoor request in atomic transaction (#795 ) The backdoor request in b_transport is only used for hinting the dmi capability. Since most of traffic patterns are continous, we can cache the previous backdoor request result to spare the backdoor inspect of next request. Change-Id: I53c47226f949dd0be19d52cad0650fcfd62eebbc	2024-02-05 11:08:20 -08:00
dependabot[bot]	61516e863f	misc: bump tqdm from 4.64.1 to 4.66.1 (#833 ) Bumps [tqdm](https://github.com/tqdm/tqdm) from 4.64.1 to 4.66.1.	2024-02-05 10:19:32 -08:00
Bobby R. Bruce	6f1d9b47e9	misc: Update actions/checkout from v3 to v4 (#836 ) The `checkout` action now has a v4. v3 utilizes Node.js 16 which is now deprecated by GitHub actions. Migrating to v4 is therefore encouraged.	2024-02-05 08:54:32 -08:00
Bobby R. Bruce	df83efe129	misc: bump mypy from 1.5.1 to 1.8.0 (#837 ) See PR #834. This was accidently closed. This dependabot was correct.	2024-02-05 08:53:02 -08:00
Chong-Teng Wang	40ecdf5fb4	arch-riscv: Fix RVV instructions vmv.s.x/vfmv.s.f (#843 ) This commit fixes the implementation of vmv.s.x and vfmv.s.f. When vl = 0, no elements are updated in the destination vector register group, regardless of vstart. Change-Id: Ib21b3125da8009325743ec70ca0874704328356c Reference: [Integer Scalar Move Instructions](https://github.com/riscv/riscv-v-spec/blob/master/v-spec.adoc#161-integer-scalar-move-instructions) [Floating-Point Scalar Move Instructions](https://github.com/riscv/riscv-v-spec/blob/master/v-spec.adoc#162-floating-point-scalar-move-instructions)	2024-02-05 08:51:42 -08:00
Chong-Teng Wang	85059a369e	arch-riscv: Fix control flow in VectorFloatMaskMacroConstructor (#844 ) This commit adjusts the logic in VectorFloatMaskMacroConstructor to ensure the %(copy_old_vd)s section is not skipped when vl = 0, ensuring correct values in destination vector register. Change-Id: I2478722d6f003a0f2e4b3cd0ba3e845bed938ee6 This is the same problem as #715 .	2024-02-05 06:29:05 -08:00
Giacomo Travaglini	16e06bad0c	arch-arm: Exec Crypto instructions only if SIMD&FP enabled We not only check for the presence of the relative FEAT_*, we also check if AdvSIMD is enabled; we throw an undefined instruction otherwise. Change-Id: I1fd0cdc8057c5a7901774802dc076817f06c8e66 Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Reviewed-by: Andreas Sandberg <andreas.sandberg@arm.com>	2024-02-05 12:56:48 +00:00
Giacomo Travaglini	ebef2fc4b1	arch-arm: Crypto instructions checking release object Check directly if extension is enabled instead of looking for ID register field value. This makes the code more readable Change-Id: If0b882ac3464c3587731b72a7edb3b8b65ea86c7 Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Reviewed-by: Andreas Sandberg <andreas.sandberg@arm.com>	2024-02-05 12:56:48 +00:00
Bobby R. Bruce	f0ee1db19f	Merge branch 'develop' into mypy-1.8.0	2024-02-02 10:40:50 -08:00
Bobby R. Bruce	ea3face87b	misc: bump pre-commit from 2.20.0 to 3.6.0 (#832 ) Bumps [pre-commit](https://github.com/pre-commit/pre-commit) from 2.20.0 to 3.6.0.	2024-02-02 10:39:56 -08:00
Harshil Patel	858acacb20	tests: fix wget link for gpu tests (#840 )	2024-02-02 10:34:41 -08:00
Giacomo Travaglini	33e62b8e8a	arch-arm: Adopt new TranslationRegime data type in MMU translations (#829 ) This is more complaint with the VMSAv8-64, which is using Translation Regimes instead of historical (Armv7) isHyp tagging and the ExceptionLevel managing the translation. This greatly simplifies translation code, specially with FEAT_VHE where the managing el (EL2) could handle to different translation regimes (EL and EL2&0).	2024-02-02 11:54:38 +00:00
dependabot[bot]	234d63db6f	misc: bump pre-commit from 2.20.0 to 3.6.0 Bumps [pre-commit](https://github.com/pre-commit/pre-commit) from 2.20.0 to 3.6.0. - [Release notes](https://github.com/pre-commit/pre-commit/releases) - [Changelog](https://github.com/pre-commit/pre-commit/blob/main/CHANGELOG.md) - [Commits](https://github.com/pre-commit/pre-commit/compare/v2.20.0...v3.6.0) Change-Id: I421f6d08fa370562a4310b2010d3d5071498bd6e --- updated-dependencies: - dependency-name: pre-commit dependency-type: direct:production update-type: version-update:semver-major ... Change-Id: Ifcf6ecdfdbdd465c1e1cd58506c21445dbe747f0 Signed-off-by: dependabot[bot] <support@github.com>	2024-02-01 15:51:24 -08:00
Bobby R. Bruce	80a7dfc300	misc: bump mypy from 1.5.1 to 1.8.0 See PR #834. This was accidently closed. This dependabot was correct. Change-Id: I63a337b6f3cc4ae06bdfb28976605a9682fc236a	2024-02-01 15:44:37 -08:00
kroarty-lanl	197be3a0dd	dev: Fix off-by-one in IDE controller PCI register allocation (#824 ) The PCI configuration space is 256 bytes, yet because the PCI_CONFIG_SIZE macro is 0xff, the final register allocation in the IDE controller only allocated up to byte 255. Change-Id: I1aef2cad9df366ee8425edb410037061eb29ae33	2024-02-01 10:14:28 -08:00
Mahyar Samani	b79fe82e5c	cpu,stdlib: Updating strided generator (#762 ) This change improves the functionality of strided generator to create trace with better flexibility. It allows the user to manually set offset and stride size instead of calculating it based on a "gen_id". This way different patterns could be created with the same SimObject. In addition, this change adds stdlib components for strided generator.	2024-02-01 09:08:42 -08:00
Harshil Patel	b5fae2f620	tests: Switch to vega_x86 from gcn3_x86 in daily tests (#817 ) Change-Id: Ic2ed8cc4488ddd361b5773b91100d806b94f1b8a	2024-02-01 09:06:04 -08:00
Giacomo Travaglini	3a2c8feca8	arch-arm: MMU aarch64EL is not used in AArch64 only anymore We therefore rename it to exceptionLevel Change-Id: I2a3aabaefa315d95bd034b13d95d5a5b0b8e9319 Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com>	2024-02-01 13:45:06 +00:00
Giacomo Travaglini	3737e8b6df	arch-arm: Use MAIR_EL2 mem attribute register when in EL0 host With the old code, the MAIR_EL1 register was checked when inserting an EL2&0 TLB entry Change-Id: I064032fb2946777c2f4c50c06a124f828245e18a Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com>	2024-02-01 13:44:16 +00:00
Giacomo Travaglini	d42ef792bf	arch-arm: Check ELIs64 for EL2 when in EL2&0 regime The problem with: ELIs64(tc, aarch64EL == EL0 ? EL1 : aarch64EL); Is that when we are executing at EL0 in host (EL2&0 translation regime), the execution mode (AArch32 vs AArch64) is dictated by EL2 and not by EL1 (which is the guest) Change-Id: I463a2a9461c94d0886990ae3d0a6e22aeb4b9ea3 Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com>	2024-02-01 13:43:59 +00:00
Giacomo Travaglini	458c98082c	arch-arm: Replace EL based translation with regimes This is the final step in the transformation process. We limit the use of the "managing Exception Level" for a translation in favour of the more standard "Translation Regime" This greatly simplifies our code, especially with VHE where the managing el (EL2) could handle to different translation regimes (EL and EL2&0). We can therefore remove the isHost flag wherever it got used. That case is automatically handled by the proper regime value (EL2&0) Change-Id: Iafd1d2ce4757cfa6598656759694e5e7b05267ad Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com>	2024-02-01 13:43:47 +00:00
Giacomo Travaglini	e333a77c12	arch-arm: Remove _Xt postfix from TLBI instructions The Xt is not part of the architectural name of the register and it was likely added with the introduction of extended register (Xt) TLBIs in Armv8 to differentiate them with the old Armv7 ones. The use of _Xt was not consistent anyway: newer TLBIs were already omitting it. Change-Id: Ic805340ffa7b5770e3b75a71bfb76e055e651f8b Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com>	2024-02-01 13:43:26 +00:00
Giacomo Travaglini	594428f010	arch-arm: Remove redundant isHyp as a TLB entry field We should stop using isHyp.. An hypervisor entry is flagged already by the EL of the entry (el == EL2) Change-Id: I20c3d06fa2b04e0b938a380ca917d0b596eddcf2 Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com>	2024-02-01 13:43:00 +00:00
Giacomo Travaglini	a6ca81906a	arch-arm: Simplify setting of isHyp for mem translations The isHyp descriptor is an old artifact of armv7 and it flags a PL2 (AArch32) or EL2 & EL2&0 (AArch64) translations. It is commonly set according to the EL/mode [1] but it may differ from the execution state in case of explicit translation requests (via the AT instruction as an example [2]). There is really no need to complicate the masking of isHyp. We should just make use of the tranType method (in charge of setting aarch64EL) to properly set aarch64EL, and make isHyp coincide with the case of aarch64EL == EL2. This is a step towards the removal of the isHyp flag. More specifically the patch does the following: * HypMode translation type moved in the EL2 case The translation is used by ATS1HR/ATS1HW: Performs stage 1 address translation as defined for PL2 and the Non-secure state * S1S2NsTran translation type moved in the EL1 case The translation is used by ATS12NSOPR/ATS12NSOPW: Performs stage 1 and 2 address translations as defined for PL1 and the Non-secure state * S1CTran translation type can be at either EL1 or EL3 The translation is used by ATS1CPR/ATS1CPW Performs stage 1 address translation as defined for PL1 and the current Security state [1]: https://github.com/gem5/gem5/blob/stable/src/arch/arm/mmu.cc#L1281 [2]: https://github.com/gem5/gem5/blob/stable/src/arch/arm/mmu.cc#L1282 Change-Id: Ie653170f6053c5d8141a2de9f50febf5bf53ab9c Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com>	2024-02-01 13:42:40 +00:00

1 2 3 4 5 ...

21304 Commits