derek/gem5 - gem5 - Gitea: Git with a cup of tea

derek/gem5

Author	SHA1	Message	Date
Bobby R. Bruce	6057de452b	tests,gpu-compute: Fix incorrect options handling Change-Id: Ica845ad7c4a49fe2636df3bf184220a33557bc5e	2024-08-22 05:49:07 -07:00
Bobby R. Bruce	0a188850fe	tests,gpu-compute: Fix artifact upload for GPU tests actions/upload-artifact@v4 does not understand periods in artifact names. Change-Id: Ia272f9dcf9cb2213fb78b1814007921232395914	2024-08-22 05:49:07 -07:00
Bobby R. Bruce	28a6ca201b	misc,tests: Remove Gerrit ID check from CI Workflow Change-Id: I86933f3b315f3233e135de2e32498c1641f7443e	2024-08-22 04:24:56 -07:00
Bobby R. Bruce	868e287e71	stdlib: Give user's disk_device priority when setting root val (#1467 ) In `get_default_kernel_root_val()`, now prioiritizes the explicit disk_device passed from the user over the default implemented by the board. Also adjusts syntax for selecting this value in `set_kernel_disk_workload()` for consistency. It seems that the common use case for setting `disk_device` is that there is a mismatch between where the disk image is mounted and where the board expects it by default. In this case, it also seems common that the root partition will be on this explicit device as well. In cases where this is not true, explicit kernel arguments can be used to define the distinct disk device apart from the root. However, this seems less common than the above so, in that case, it would be easier to tie these together.	2024-08-22 03:38:56 -07:00
Junshi Wang	387caa0075	arch-arm: Add place holder of registers. Add declaration of HAFGRTR_EL2 registers and read/write as GPR. Change-Id: I87570d1e87d479f4530cf2c6e05931cdc26ee361 Reviewed-by: Giacomo Travaglini <giacomo.travaglini@arm.com>	2024-08-22 08:21:28 +01:00
Setu	f6010439fe	mem: Fixed implementation of Best Offset Prefetcher (#1403 ) This PR fixes the issues with the implementation of the Best Offset Prefetcher described in issue #1402 On branch bop Changes to be committed: modified: src/mem/cache/prefetch/bop.cc modified: src/mem/cache/prefetch/bop.hh --------- Co-authored-by: Setu Gupta <setu.gupta.2020@gamil.com> Co-authored-by: Abhishek Shailendra Singh <abs218@leigh.edu> Co-authored-by: Setu Gupta <setu.gupta@partner.samsung.com>	2024-08-21 09:54:20 -07:00
Bobby R. Bruce	3429da7787	python,tests,misc: Remove Gerrit-ID insertion from pre-commit Change-Id: I4db06415c9d0bbba7a6db56d7e9febf6491003bf	2024-08-20 15:40:55 -07:00
Noah Krim	dfa4bbd7a4	Merge branch 'develop' into fix-kernel-workload-root-val	2024-08-20 15:12:10 -07:00
Bobby R. Bruce	e7442036a5	tests,gpu-compute: Fix Daily/Weekly GPU tests failures (#1485 ) Without specifying the "gem5/gpu" directory, this test attempted to run the entire test suite. This caused the daily and weekly tests to fail. This change fixes this.	2024-08-20 14:18:51 -07:00
Ali Nezhadi Khelejani	1512eddd43	misc: Update on-create.sh (#1477 ) After merging the old personal gem5 repository with the stable version v24, I tried to run the project inside the `.devcontainer` environment. During the image build process, I encountered the following error: ```sh [7683 ms] Start: Run in container: /bin/sh -c ./.devcontainer/on-create.sh fatal: detected dubious ownership in repository at '/workspaces/gem5' To add an exception for this directory, call: git config --global --add safe.directory /workspaces/gem5 [7724 ms] onCreateCommand failed with exit code 128. Skipping any further user-provided commands. ``` This error occurred due to an ownership permission problem, which I resolved by adding the following line.	2024-08-20 11:15:33 -07:00
Bobby R. Bruce	0857442e44	util-docker: Cleanup, refactor, better document Dockerfiles (#1292 ) * Removes the "docker-compose.yaml" in favor of "docker-bake.hcl". This uses the `docker buildx` tool which has the advantage of enabling multi-platformm builds where desired. By default all images are built targeting `linux/arm64`, `linux/amd64` and `linux/riscv64` as targets with the exception of the GPU images where only `linux/amd64` makes sense. * Remove unused/older Docker build targets (these can easily be re-added but they were not regularly built or have any current usage). * Update "README.md" to better describe these Dockerfiles and how they are built. * Simplify GCC and Clang compiler images. Each uses the Ubuntu 24.04 All Deps image as a base then specialized the compiler on top. * To simply things, all compiler versions are built from 24.04. This means narrowing the supported versions from GCC v10 to v14 and Clang v14 to v18. * Fix some bugs in the "docker-bake.hcl" thus ensuring all targets may be built from it. * Cleanup the systemc and sst images: reducing their size and building them off the common 24.04 ubuntu base image.	2024-08-20 09:45:47 -07:00
Tiberiu Bucur	88de81f167	arch-arm, sim-se: Fix VPtr bug Some syscalls were incorrectly using 64 bit integers instead of VPtr's guest pointers, causing parameter value corruption. This commit addresses this issue. Change-Id: If9e27a7c776b802dda18979d1a83a76c23557359 Reviewed-by: Giacomo Travaglini <giacomo.travaglini@arm.com>	2024-08-20 16:18:24 +01:00
Tiberiu Bucur	107e8f3d17	arch, sim-se: Fix size_t size mismatch bug Same as with the off_t, some syscalls were using incorrect size parametres in place of a guest-defined size_t. This commit changes the signature of said syscalls and adds the size_t typedef to the arch-dependent Linux OSs. Change-Id: Iece43814971a8e6275d25f6789e41528d241d1f4 Reviewed-by: Giacomo Travaglini <giacomo.travaglini@arm.com>	2024-08-20 16:18:24 +01:00
Tiberiu Bucur	f74260c552	arch, sim-se: Fix off_t size mismatch bug Some system calls were using incorrect sizing for offset parametres, which was causing the ABI to pass wrong values due to size mismatches. One such syscall is lseek, which in the Arm syscall table was incorrectly marked as llseek, which does not exist in aarch64 Linux. In addition, the off_t alias for general Linux was changed from an unsigned to a signed type, to accurately reflect the behaviour in the real-life Linux operating system. Change-Id: Iada4b66a8933466c162ba9ec901dbdae73c73a18 Reviewed-by: Giacomo Travaglini <giacomo.travaglini@arm.com>	2024-08-20 16:18:24 +01:00
Tiberiu Bucur	9b9b9ffbff	arch-arm: Ignore/implement several syscalls This commit either adds the implementation or the ignoreFunc to the corresponding entry in the syscall table for some Arm syscalls that were required in order to test the fix for the incorrect parameter size bug in se mode. Change-Id: Ifc6d87e2decf1bf96ecd81de6690f92927377bf8 Reviewed-by: Giacomo Travaglini <giacomo.travaglini@arm.com>	2024-08-20 16:18:24 +01:00
Tiberiu Bucur	fe6ef662d1	configs: Add --param to starter_se This commit adds the --param option to the starter_se configuration script for the Arm ISA. This is in order to support attaching remote debugger sessions. Change-Id: I2d8cc9f677f731948872003cca6066d1072ad570 Reviewed-by: Giacomo Travaglini <giacomo.travaglini@arm.com>	2024-08-20 16:18:24 +01:00
Noah Krim	cb79596036	Merge branch 'develop' into fix-kernel-workload-root-val	2024-08-19 12:40:03 -07:00
Harshil Patel	ce4c2c6495	dev,arch-x86: Added softstrobe mode to intel8254 timer (#1447 ) This PR should fix the #1195	2024-08-19 12:21:31 -07:00
Bobby R. Bruce	7413d3217c	docs,misc: RELEASE-NOTES.md updates for v24.1 (#1460 )	2024-08-19 10:58:29 -07:00
Bobby R. Bruce	f600db4a98	gpu-compute,tests: Move GPU tests to testlib (#1270 ) A new host tag `gcn_gpu` has been added. This allows for selection of those GPU tests which depend upon the gcn-gpu docker image to run. In addition to this, the square GPU tests has been moved to the CI tests. This ensures some GPU code is compiled and run on every PR.	2024-08-19 10:58:06 -07:00
Yangyu Chen	b0d81ec8a2	arch-riscv: fix GDB breakpoint issue for RV32 (#1470 ) Since PR #1316, we use sign-extend for all address generation, including PC, to match the ISA specification for modifiable XLEN. However, when we set a breakpoint using remote GDB, our address is not sign-extended. This causes the breakpoint to be set at the wrong address, as specified in Issue #1463. This PR fixes the issue by sign-extending the address when setting a breakpoint. This also matches the RISC-V ISA Specification that "must sign-extend results to fill the entire widest supported XLEN in the destination register." Change-Id: I9b493bf8ad5b1ef45a9728bb40fc5e38250fe9c3 Signed-off-by: Yangyu Chen <cyy@cyyself.name>	2024-08-19 10:25:39 -07:00
Bobby R. Bruce	cad4307951	util-docker: Re-add env variables to SST Change-Id: I653baeb69f8be1501766b57337f6643e00d7dd60	2024-08-19 10:08:20 -07:00
Yu-Cheng Chang	aa4fe362a5	arch-riscv: Sign-extend the address in newPCState (#1471 ) From #1316, creating the new PCState should sign-extend the address to avoid wrong address issue. Change-Id: I884b4e3708f5f1cc49cfd44d51bec5a2b63cc47a	2024-08-19 08:21:42 -07:00
Giacomo Travaglini	280871245b	arch-arm: Redirect VHE for ZCR_EL1 (#1472 ) Change-Id: Iff83d25257065503dc02728461823bc9985dbab3 Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com>	2024-08-16 22:49:49 +01:00
Noah Krim	eca0059654	stdlib: Give user's disk_device priority when setting root val In `get_default_kernel_root_val()`, now prioiritizes the explicit disk_device passed from the user over the default implemented by the board. Also adjusts syntax for selecting this value in `set_kernel_disk_workload()` for consistency. Change-Id: Icddcf438f5b96c2288c3cc608782f191df2c394e	2024-08-15 13:34:03 -07:00
Bobby R. Bruce	646df63e56	misc: Fix typos in util/dockerfiles/README.md Change-Id: I5488301543bfff21279b6c0b1aae841574efee95 Co-authored-by: Harshil Patel <harshilp2107@gmail.com>	2024-08-15 10:45:53 -07:00
Alexander Richardson	646f994efb	arch-arm: Fix incorrect operation of VRINT* instructions (#1325 ) After a lot of debugging and comparing traces I noticed that vrintp was giving different results from QEMU. An input of 0x3f800000 (1.0) was being passed to the fplib helpers as (uint32_t)1 which has a completely different floating-point interpretation and the result was therefore completely wrong. I've fixed this as well as all remaining implicit float-to-int conversions in the ARM instruction execution. There are more -W(implicit-)float-conversion warnings in the other executors, but for now this fixes the issue I was seeing. Change-Id: Ifdeee745ca155d7f4504ac4c54235ac431acdeb9	2024-08-15 11:01:48 +01:00
Bobby R. Bruce	0c26ee5f71	util-docker: Replace gem5 v24.0 clone with wget This is more efficient. Change-Id: Idd57343183a8667425dbc036ad0c7c18581898f5	2024-08-14 14:08:44 -07:00
Setu	629bf84e10	mem: Stride Prefetcher Fix (#1449 ) This PR fixes the issues mentioned in #1448. Note that this contribution is the result of a joint collaboration with @AbhishekUoR This PR introduces the following 4 changes: 1. It changes the addresses which are used to compute the stride to cache line aligned addresses (the current version uses word aligned addresses) 2. It correctly returns if the stride does not match (as opposed to issuing prefetches using the new stride incorrectly) 3. It returns if the new stride is 0, indicating multiple reads from the same cache line. 4. It removes code which is no longer necessary after the addition of changes number 1 and 3. Change-Id: Ic346d0e15df6d07e2b93289c8d6b89b4c2f45a34 --------- Co-authored-by: Abhishek Shailendra Singh <abs218@leigh.edu>	2024-08-14 07:16:10 -07:00
Bobby R. Bruce	dcb04a72fc	util-docker,tests: Remove Ubuntu 20.04 Docker Change-Id: I1d4bbebaa4b6f064b5f40a95d066bbf092cf103f	2024-08-13 16:15:49 -07:00
Bobby R. Bruce	9f93c8ac9c	util-docker: Revert docker image tag to 'latest' Change-Id: Iafe92716725e6b3cecfeba57098c3a7efaf73d97	2024-08-13 16:13:33 -07:00
Bobby R. Bruce	59455daa85	util-docker: Fix correct common platform comment Change-Id: Ifc703b47b1e59522ba01f4c2b59a4863779eefb1	2024-08-13 16:12:45 -07:00
Bobby R. Bruce	8b61490df1	util-docker: Update dockerfiles README Change-Id: I39bca04b3770bd51203944d69d0fbecff85055f8	2024-08-13 16:09:08 -07:00
Bobby R. Bruce	bef452ce72	misc,tests: Update supported GCC and Clang compilers - GCC: v10 to v14 - Clang: v14 to v18 Change-Id: I6cd1686ffff0f08686a231b6b4936da343d53831	2024-08-13 16:09:06 -07:00
Bobby R. Bruce	b68c2ef37f	util-docker: Add vim to 24.04-all-deps Ubuntu Docker Change-Id: I898a0fddcdcf8a876fcbbe11795e858395ad9740	2024-08-13 16:08:05 -07:00
Bobby R. Bruce	3875dcdfd7	util-docker: Update the sst Dockerfile 1. Builds on top of the Ubuntu 24.04 all-deps image. 2. Unify the download, build, install, and cleanup steps. Change-Id: I4c2bf8e571dfd228f7df8372cda0f428de59af51	2024-08-13 16:08:05 -07:00
Bobby R. Bruce	2c0c933a3a	util-docker: Cleanup the systemc docker 1. Uses the ubuntu-24.04_all-deps as the base image. 2. Unifies the build and cleanup into a single step, thus reducing the size of the image. Change-Id: I63b5dad2af0e8b1f6be8ad1f28321c743f36b2dc	2024-08-13 16:08:05 -07:00
Bobby R. Bruce	58aad68329	util-docker: Order targets in docker-bake This improves readbility. The targets order matches that in the default group. Change-Id: I1102aeb48bc256df9b58032a327ec663e5733a98	2024-08-13 16:08:03 -07:00
Bobby R. Bruce	9978b4ea4c	util-docker: Add 'devcontainer' to default bake group Change-Id: I4b245cabd6e384cab780bd22b0f8b40d9819b92b	2024-08-13 16:07:22 -07:00
Bobby R. Bruce	4956c475f4	util-docker: Set the GPU Docker images to build only to x86 These images won't work and make no sense compiling to any platform other than X86. These are used in SE mode simulations where the host platform matters. Change-Id: I47405e930bf511fabcbc93d0b08ee2fb2c556869	2024-08-13 16:07:22 -07:00
Bobby R. Bruce	c1a562083d	util-docker: Set 'pull' to 'always' This ensures the a `docker pull` command is always run before building. Change-Id: If1a66b9b426d5843459e0308a64f13a11c0c6ed2	2024-08-13 16:07:21 -07:00
Bobby R. Bruce	9d635dea55	util-docker: Improve Docker gcc and clang builder 1. Uses the all-dependencies image as the base image. 2. Has all compilers use Ubuntu 24.04. Notes: This change implitly changes our supported compilers to GCC v10 to v13 and Clang v14 to v18. This will be fully incorporated into the project later. Change-Id: Id8e2141ea64a34c7e3532605f6ecb7d9ccb76951	2024-08-13 16:07:15 -07:00
Bobby R. Bruce	a1eefb6ed8	util-docker: Remove old/unsed/unecessary Dockerfiles * Unsupported compilers. * Unsed cross compilers. * The gem5-all-min-deps image. Change-Id: Iaab64e5e6685b0a538c38b2979fae86f01bc53e8	2024-08-13 16:05:47 -07:00
Bobby R. Bruce	e03a20bdb4	util-docker: Remove version from systemc Docker context This simplifies things slightly. Change-Id: I1263e385f7adeb2b83cdc09f7f6903be9193c467	2024-08-13 16:05:47 -07:00
Bobby R. Bruce	c291678881	util-docker: Fix docker-bake.hcl sst context The context for sst is the "sst" directory. Change-Id: Ic120cca13a9e4df02b98d101ad8e16c296807c2d	2024-08-13 16:05:47 -07:00
Bobby R. Bruce	e82e824f08	util-docker: Breakup long (>79 char) lines in docker-bake Change-Id: I5488301543bfff21279b6c0b1aae841574efee95	2024-08-13 16:05:46 -07:00
Bobby R. Bruce	3640559a12	misc,tests: Fix compiler tests (add missing `,`) (#1459 )	2024-08-13 06:54:12 -07:00
Alexander Richardson	f6f547fb62	arch-arm: Fix incorrect behaviour of VFNMS and VFNMA (#1420 ) This was found while comparing a diverging execution against QEMU traces and checking for the first mismatched program counter. Fortunately this was caused by a branch shortly after this incorrect computation but still took a long time to track down. There are two issues here: the decoder had inverted the cases for S and A, and the sign bit was wrong for VFN*.	2024-08-13 09:05:52 +01:00
Matthew Poremba	c359b53a19	arch-vega: Update microscaling format scaling and denorm handling (#1451 ) This PR has 3 commits: - Update scaling methods to scale by multiplication or division when upcasting or downcasting respectively. - Preserve the sign when a microscaling conversion results in NaN or infinity to match hardware. - Rework rounding to handle cases where conversion results in a denormal number in the output type so that the value is correct.	2024-08-12 07:00:26 -07:00
Matthew Poremba	7d46c50663	arch-vega: Swizzle multi-dword scratch requests (#1445 ) Scratch memory requests that are larger than one dword are using a different memory layout than global instructions. Rather than being placed contiguously, each dword is interleaved 64 lanes * 4 bytes away as described in Section 9.1.5.2. "Swizzled Buffer Addressing" in the MI300 specification. This was verified by comparing MI300 output (which uses scratch_ instructions) with MI200 (which uses buffer instructions). MI300 FashionMNIST bs=1 now matches CPU reference. This requires several changes to the instruction implementations: - For stores, data in the GPUDynInst can be swizzled before the data is written to memory. This is easy to do using a helper method. This is done in the template<int N> variant of initMemWrite. To use this x2 stores are changed to use template<int N> rather than loading a U64. The swizzle function is renamed to swizzleAddr to avoid confusion with swizzleData. - For loads, data is unswizzled in completeAcc when writing register values. This is not as easy to implement as a helper and is thus implemented for the three load instructions that load more than one dword. - Accessing swizzled data requires at least one packet per dword. A new GPU memory helper is added to create these packets for scratch requests specifically. This is called in the template<int N> variant of initMemRead / initMemWrite. Loads and stores of x2 are changed to use this variant instead of accessing a U64. The GPUDynInst status vector restrictions are increased to allow for swizzled x4 accesses. For simplicity this does not currently support misaligned swizzled accesses and will panic upon seeing such a case. Change-Id: Ic686c51e28e0af029a043d5a5b3d4069f2cb94f9	2024-08-12 06:58:48 -07:00

... 2 3 4 5 6 ...

22088 Commits