derek/gem5 - gem5 - Gitea: Git with a cup of tea

derek/gem5

Author	SHA1	Message	Date
Yu-Cheng Chang	6b4dbdcedb	tests,arch-riscv: update bitmanip asmtest binaries (#931 ) Gem5 resource update: https://github.com/gem5/gem5-resources/pull/25 Gem5 issue: https://github.com/gem5/gem5/issues/883 Change-Id: I1892d7591d6fa49d0563623fd90292e0d38d9ba3	2024-04-16 09:51:32 -07:00
Matthew Poremba	9b463dbdfd	util-docker: Bump gpu-fs build docker to ROCm 6.0.2 (#1025 ) This bumps the docker image used to build GPU applications for input to GPUFS simulations from ROCm 5.4.2 to ROCm 6.0.2 and Ubuntu from 20.04 to 22.04. This matches the versions in gem5-resources#29 . Several notes were added to the Dockerfile to describe where the RUN commands come from. A README.md is also added to clarify that this is not a disk image for GPUFS and is only used to build applications. Change-Id: I9ada99e2ed1854cb7adb76f2a1fa662bab398f86	2024-04-15 13:36:06 -07:00
Bobby R. Bruce	1aa0bf8ec6	tests,github: Update CI Tests' GitHub Actions versions (#1021 )	2024-04-15 13:35:33 -07:00
Bobby R. Bruce	56a2346b8d	tests,util-docker,github: Add Ubuntu 24.04 Docker image & updated tests/actions to use it (#1018 ) This ensures gem5 compiles and runs in 24.04 environments. A necessary PR, for ensuring gem5 support Ubuntu 24.04 (related issue: #909)	2024-04-15 13:34:22 -07:00
Matthew Poremba	a03319bef7	arch-vega: Fix output warnings, gem5.fast (#1023 ) Fix gem5.fast build not building when using gpu model. Removes very spammy stat distribution bucket size prints when running gpu model.	2024-04-15 13:18:27 -07:00
Matthew Poremba	7e2d8dee42	mem,gpu-compute: Implement GPU TCC directed invalidate (#1011 ) The GPU device currently supports large BAR which means that the driver can write directly to GPU memory over the PCI bus without using SDMA or PM4 packets. The gem5 PCI interface only provides an atomic interface for BAR reads/writes, which means the values cannot go through timing mode Ruby caches. This causes bugs as the TCC cache is allowed to keep clean data between kernels for performance reasons. If there is a BAR write directly to memory bypassing the cache, the value in the cache is stale and must be invalidated. In this commit a TCC invalidate is generated for all writes over PCI that go directly to GPU memory. This will also invalidate TCP along the way if necessary. This currently relies on the driver synchonization which only allows BAR writes in between kernels. Therefore, the cache should only be in I or V state. To handle a race condition between invalidates and launching the next kernel, the invalidates return a response and the GPU command processor will wait for all TCC invalidates to be complete before launching the next kernel. This fixes issues with stale data in nanoGPT and possibly PENNANT.	2024-04-15 13:18:01 -07:00
Bobby R. Bruce	630f3822b8	github: Update 'ubuntu-22.04' to 'ubuntu-latest' (#1022 ) There was some inconsistency in the GitHub Workflow files on using 'ubuntu-latest' (which gets the latest Ubuntu version) or 'ubuntu-22.04'. To keep things consistent 'ubuntu-latest' is now used in all cases. This also saves us updating workloads upon release of a new Ubuntu version.	2024-04-15 09:55:56 -07:00
Giacomo Travaglini	bdcffdd0f0	dev-arm: Do not mark the MpamMSC as abstract (#1030 ) This prevents its instantiation Change-Id: I775a64904a01cf36e4cc1e0cd45765f03325c5ca Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com>	2024-04-15 09:40:22 -07:00
Bobby R. Bruce	a7330ac4fb	misc: bump dnspython in /util/gem5-resource-manager (#1027 ) Bumps [dnspython](https://github.com/rthalley/dnspython) from 2.3.0 to 2.6.1. - [Release notes](https://github.com/rthalley/dnspython/releases) - [Changelog](https://github.com/rthalley/dnspython/blob/main/doc/whatsnew.rst) - [Commits](https://github.com/rthalley/dnspython/compare/v2.3.0...v2.6.1) Change-Id: Iaa0ed0fa68220fd8b52eb62c0089831b253e17d0 --- updated-dependencies: - dependency-name: dnspython dependency-type: direct:production ... Change-Id: I6e3ed8287f5fd60e7bd1c0a3e565db94ef8627a9 Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2024-04-15 08:41:14 -07:00
Ivana Mitrovic	dbb71948ce	util: Update resource manager dependencies (#1015 ) This PR combines the changes from these dependabot PRs: #1008 and #1012.	2024-04-15 08:35:47 -07:00
Matthew Poremba	3db6e86fea	arch-vega: Fix string check warnings on fast build gem5.fast does not currently build if the GPU model is built. This fixes the array-bounds warnings allowing gem5.fast to build again. Change-Id: I463c2847c3ecfd2257a70418fa247090b0493f9b	2024-04-14 12:22:57 -07:00
Bobby R. Bruce	b986c542ca	tests,misc: Set pre-commit/action to v3.0.1 v3.0.0 of pre-commit/action caused a deprecation warning in actions. v3.0.1 was released to deal with this. Change-Id: Ib5654e465565ad4356754ac097983aec4166b98f	2024-04-13 20:30:34 -07:00
Bobby R. Bruce	3f45a2d08d	misc,tests: Up actions/setup-python version to v5 This was causing a deprecation warning in GitHub Actions. Change-Id: I9ab147acf12e3763ab731769468ce5b1dc5e4dea	2024-04-13 20:20:26 -07:00
Matthew Poremba	01f2df4b8a	gpu-compute: Fix stat bucket sizes Change-Id: If30505515867a866c631cb117d3d22e19814a2f2	2024-04-13 15:51:41 -07:00
Bobby R. Bruce	ccd9beb661	util-docker: Remove 22.04 min-dep Dockerfile We only test the latest LTS Ubuntu release with min-deps. With 24.04, we no longer require the 22.04 min dependencies image. Change-Id: I4b3d668c1f9d10c2b6071848e6daada6c763b5e7	2024-04-13 14:16:41 -07:00
Bobby R. Bruce	05bc85aa9b	misc: Update Update GitHub Actions to use 24.04 over 22.04 This change ensures all our tests run on our most recent supported LTS release of Ubuntu. In the case of compiler tests we still test 22.04 all-dep but test 24.04 all-dep and min-dep (i.e., we drop 22.04 min-dep as it's somewhat redundant). Change-Id: I63666d1017594b496523a48e5112a8994f57885f	2024-04-13 14:13:35 -07:00
Bobby R. Bruce	d091c64db1	util-docker: Add Ubuntu 24.04 min-dep Docker Change-Id: Ia5cb4f2fd54ce53494ab95705b4f6006648d7eba	2024-04-13 14:08:13 -07:00
Bobby R. Bruce	3962fca2e3	util-docker: Add ubuntu-24.04_all-deps Docker Change-Id: I5917c446cacc25d1a333b5cf8147ee78b112aeb3	2024-04-13 14:08:13 -07:00
Bobby R. Bruce	bdaeb082c3	util-docker: Update docker-compose URLs to 'ghcr.io/gem5' (#1017 ) 'gcr.io/test-gem5' was the registry we used when hosting them on Google Cloud services. We now use the GitHub container registries.	2024-04-13 14:05:34 -07:00
Bobby R. Bruce	392a2b4ffa	misc: Add a DevContainer specification to the gem5 repo (#911 ) Speciftying a DevContainer in gem5 allows for users to quickly create an environment in which they can develop, build, and run gem5. The ".devcontainer/devcontainer.json" file specifies the properties of the container. In this commit they are as follows: 1. The Docker image ghcr.io/gem5/devcontainer. This is built from "util/dockerfiles/devcontainer". This Dockerfile provides all dependencies and a pre-built gem5 binary from the current main branch (added to "/usr/local/bin"). In order to support this Docker container on different platforms we use the Docker multi-platform feature. As such, this must be built using `docker buildx bake devcontainer --push` which reads the `docker-bake.hcl file for the specification of the multi-platform image. 2. Visual Studio extensions. This is a list of Visual Studio Code extensions useful when developing gem5. They are automatically added the Visual Studio dev container. 3. Features. Features are enhancemets that can be added to a DevContainer. Normally they are libraries and other commonly used tools to be included in the Container. As we have our dependencies specified in the Dockerfile here we select one to enable Docker inside the container, one to enable the Github CLI, one to improve Linting, and finally one to enable the vscode CLI. 4. The On Create Command : This command allows us to specify commands to be run after the DevContainer is created. In this case we execute ".devcontainer/on-create.sh" which, right now, refreshes the git index and installed the pre-commit checks.	2024-04-12 10:37:17 -07:00
Yu-Cheng Chang	ebb70dea99	cpu: Fix KVM false negative warning after Kconfig transition (#1013 ) When we start to build gem5. We will read and process all of SConsopts files, and process the after_sconsopts_callbacks after all of SConsopts files read. In the KVM_ISA env setting, the KVM_ISA env can be set in the different files, take x86 and arm as example: KVM_ISA default value: `bc39283451/src/cpu/kvm/SConsopts` x86 KVM_ISA: `bc39283451/src/arch/x86/kvm/SConsopts (L39-L45)` arm KVM_ISA: `bc39283451/src/arch/arm/kvm/SConsopts (L35-L36)` We should move the kvm warning after all of SConsopts env read issue: https://github.com/gem5/gem5/issues/686 Change-Id: I096c6bebaaec18f9b2af93191d0dd23c65084eda	2024-04-12 09:23:56 -07:00
Nicholas Mosier	bc39283451	cpu-o3, arch-x86: initialize interrupts for all SMT threads (#1007 ) Fix issue #1004. When enabling SMT with the O3 cpu, only the first interrupts object was getting initialized properly. This patch initializes all interrupts objects, one per SMT thread. Change-Id: I300782b645bd8ea3ef2497278fb73125ab4bf495	2024-04-11 11:17:24 -07:00
Ivana Mitrovic	db1c336237	cpu,arch-arm,arch-riscv: adding new instruction types to RISC-V (#589 ) This commit adds more detailed instruction types for RISC-V Vector. Concretely, it substitutes VectorIntegerArith, VectorFloatArith, VectorIntegerReduce and VectorFloatReduce with more specific types related to the operation that each instruction (e.g., VectorIntegerAdd or VectorIntegerMult). Additionaly, fixes two RISC-V instruction types (VectorXXX) that were used in ARM SVE, placing the proper SimdXXX ones. Change-Id: I31774fa6a7cd249abfffec68d11d3d77f08ad70b CC @adriaarmejach	2024-04-11 10:15:56 -07:00
Giacomo Travaglini	3b5ae7b4d1	Add a generic cache template library (#745 ) Add a generic cache template to construct internal storage structures. Also add some example use cases by converting the prefetcher tables to use this new library.	2024-04-11 08:00:34 +01:00
Pranith Kumar	769f750eb9	mem-cache: Implement AssociativeSet from AssociativeCache AssociativeSet can reuse most of the generic cache library code with the addition of a secure bit. This reduces duplicated code. Change-Id: I008ef79b0dd5f95418a3fb79396aeb0a6c601784	2024-04-10 16:17:57 -04:00
Pranith Kumar	f3bc10c168	mem-cache: Derive tagged entry from cache entry The tagged entry can be derived from the generic cache entry and add the secure flag that it needs. This reduces code duplication. Change-Id: I7ff0bddc40604a8a789036a6300eabda40339a0f	2024-04-10 16:17:57 -04:00
Pranith Kumar	8fb3611614	mem-cache: prefetch: Implement DCPT tables using cache library The DCPT table is better built using the generic cache library since we do not need the secure bit. Change-Id: I8a4a8d3dab7fbc3bbc816107492978ac7f3f5934	2024-04-10 16:17:57 -04:00
Pranith Kumar	2c7d4bed66	mem-cache: Implement VFT tables using cache library The frequency table is better built using the generic cache library instead of the AssociativeSet since the secure bit is not needed for this structure. Change-Id: Ie3b6442235daec7b350c608ad1380bed58f5ccf4	2024-04-10 16:17:57 -04:00
Pranith Kumar	2cc2ad5097	misc: Add a generic cache library Add a generic cache library modeled after AssociativeSet that can be used for constructing internal caching structures. Change-Id: I1767309ed01f52672b32810636a09142ff23242f	2024-04-10 16:17:57 -04:00
Giacomo Travaglini	4b98551aaf	Update src/python/gem5/components/cachehierarchies/abstract_cache_hierarchy.py Co-authored-by: Bobby R. Bruce <bbruce@ucdavis.edu>	2024-04-10 16:17:56 -04:00
Giacomo Travaglini	efe397ca92	stdlib: Add DTB generation capabilites to AbstractCacheHierarchy Now that we are able to provide a view of the cache hierarchy from the python world, we can start generating DTB entries for caches and more specifically to properly fill the next-level-cache and cache-level properties Change-Id: Iba9ea08fe605f77a353c9e64d62b04b80478b4e2 Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com>	2024-04-10 16:17:56 -04:00
Giacomo Travaglini	e6637fc852	stdlib: Use newly defined tree for PrivateL1PrivateL2 hierarchy Change-Id: I803c6118c4df62484018f9e4d995026adb1bbc2c Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com>	2024-04-10 16:17:56 -04:00
Giacomo Travaglini	d67672facc	stdlib: Add tree structure to the AbstractCacheHierarchy One of things we miss in gem5 is the capability to neatly compose the cache hierarchy of CPUs and clusters of CPUs. The BaseCPU addPrivateSplitL1Caches and addTwoLevelCacheHierarchy APIs have historically been used to bind cache levels together. These APIs have been superseeded by the introduction of the Cache hierarchy abstraction in the standard library. The standard library makes it cleaner for a user to quickly instantiate a hierarchy of caches with few lines of code. While this removes a lot of complexity for a user, the Hierarchy objects still have little information about their internal topology. To address this problem, this patch adds a tree data structure to the AbstractCacheHierarchy class, where every node of the tree represent a cache in the hierarchy. In this way we will expose APIs for traversing and querying the tree. For example a 2 CPUs system with private L1, private L2 and shared L3 will contain the following tree: [root] \| [L3] /\ / \ [L2] [L2] \| \| [L1] [L1] Change-Id: I78fe6ad094f0938ff9bed191fb10b9e841418692 Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com>	2024-04-10 16:17:56 -04:00
Matthew Poremba	1d64669473	mem,gpu-compute: Implement GPU TCC directed invalidate The GPU device currently supports large BAR which means that the driver can write directly to GPU memory over the PCI bus without using SDMA or PM4 packets. The gem5 PCI interface only provides an atomic interface for BAR reads/writes, which means the values cannot go through timing mode Ruby caches. This causes bugs as the TCC cache is allowed to keep clean data between kernels for performance reasons. If there is a BAR write directly to memory bypassing the cache, the value in the cache is stale and must be invalidated. In this commit a TCC invalidate is generated for all writes over PCI that go directly to GPU memory. This will also invalidate TCP along the way if necessary. This currently relies on the driver synchonization which only allows BAR writes in between kernels. Therefore, the cache should only be in I or V state. To handle a race condition between invalidates and launching the next kernel, the invalidates return a response and the GPU command processor will wait for all TCC invalidates to be complete before launching the next kernel. This fixes issues with stale data in nanoGPT and possibly PENNANT. Change-Id: I8e1290f842122682c271e5508a48037055bfbcdf	2024-04-10 11:35:25 -07:00
Matthew Poremba	833392e7b2	mem-ruby,gpu-compute: Allow memory reqs without inst The GPUDynInst for sending memory requests through the CUs data port is required but only used for DPRINTFs. Relax this constraint so that the methods can be reused for requests such as probes generated by the GPU device. Change-Id: I16094e400968225596370b684d6471580888d98a	2024-04-10 11:35:24 -07:00
Yu-Cheng Chang	116c483a42	arch-riscv: Make c.flwsp destination register more maintainable (#1006 ) RISC-V C.FLWSP format: ![image](https://github.com/gem5/gem5/assets/32214817/f4c8d114-cd6b-4946-afff-fa752b31e337) The name FC1 and FD share the same bits, change to FC1 to make it better `ee6f1377d7/src/arch/riscv/isa/bitfields.isa (L110)` `ee6f1377d7/src/arch/riscv/isa/operands.isa (L84)` `ee6f1377d7/src/arch/riscv/isa/bitfields.isa (L85)` `ee6f1377d7/src/arch/riscv/isa/operands.isa (L76)`	2024-04-10 08:11:51 -07:00
Hoa Nguyen	bc3627d682	arch-riscv: Remove a tab character (#1010 ) Change-Id: Id54ae8ba37faba11cf4055ddaf7e6b99cfd51999 Signed-off-by: Hoa Nguyen <hn@hnpl.org>	2024-04-10 08:08:57 -07:00
Giacomo Travaglini	5641c5e464	stdlib: Add tree structure to the AbstractCacheHierarchy (#967 ) One of things we miss in gem5 is the capability to neatly compose the cache hierarchy of CPUs and clusters of CPUs. The BaseCPU addPrivateSplitL1Caches and addTwoLevelCacheHierarchy APIs have historically been used to bind cache levels together. These APIs have been superseded by the introduction of the Cache hierarchy abstraction in the standard library. The standard library makes it cleaner for a user to quickly instantiate a hierarchy of caches with few lines of code. While this removes a lot of complexity for a user, the Hierarchy objects still have little information about their internal topology. To address this problem, this patch adds a tree data structure to the AbstractCacheHierarchy class, where every node of the tree represent a cache in the hierarchy. In this way we will expose APIs for traversing and querying the tree. For example a 2 CPUs system with private L1, private L2 and shared L3 will contain the following tree: [root] \| [L3] /\ / \ [L2] [L2] \| \| [L1] [L1]	2024-04-09 09:16:37 +01:00
Bobby R. Bruce	3af15a535e	mem-cache, configs, arch-arm: Handle partitioning policies through a PartitionManager (#966 ) This PR is offloading some of the partitioning logic to the partitioning manager, effectively changing the partitioning interface. Rather than always relying on the PartitionFieldExtention data structure to convey partition IDs, we make it implementation defined by introducing the partitioning manager abstraction. We want user to be able to extract the partitionId more flexibly and this requires using a SimObject. Users can extend the PartitioningManager, overriding the readPacketPartitionId, therefore providing their own mean of injecting/extracting partitioning data from a packet	2024-04-08 16:05:17 -07:00
Ivana Mitrovic	a8d778516d	arch-riscv,sim: m5ops argument / return fix for 32 bit RISC-V (#900 ) M5Ops C / C++ functions partially use 64 bit arguments and return value. In general, 64 bit arguments and return values are possible for 32 bit RISC-V systems as well, since the arguments and the return value is split into two registers. However, at the moment, this does not work for 32 bit RISC-V systems on the simulator side, since there is a one to one mapping between argument registers and m5op function parameters. To solve this problem, the get() function of the RISC-V reg_abi is updated. It now will merge two registers if there is a 64 bit argument. For this, the function code has to be passed to the get() function. The default value of this function code is set to 0xF00, since 0x00 is already used for M5_ARM. The parameter list of other get() functions for argument return is also extended by this function code parameter with the keyword [[maybe_unused]]. To enable a return value of size 64 bit, a0 is assigned with the lower 32 bit and a1 with the higher 32 bit. Related Issue: https://github.com/gem5/gem5/issues/881	2024-04-08 10:09:17 -07:00
Robert Hauser	841b821261	arch-riscv: fix c.fswsp source register (#998 ) RISC-V C.FSWSP format (RISC-V Unprivileged ISA V20191213, page 102): \|15-13\|12-7\|6-2\|1-0\| \|-------\|----\|----\|----\| \|funct3\|imm\|rs2\|op\| Source register is bit 2-6, not bit 20-24 `ee6f1377d7/src/arch/riscv/isa/bitfields.isa (L111)` `ee6f1377d7/src/arch/riscv/isa/operands.isa (L86)` `ee6f1377d7/src/arch/riscv/isa/bitfields.isa (L87)` `ee6f1377d7/src/arch/riscv/isa/operands.isa (L80)`	2024-04-08 08:41:11 -07:00
Yu-Cheng Chang	71b0b1f2b6	arch-riscv: Fix c.fsw source register (#1005 ) RISC-V C.FSW format: ![image](https://github.com/gem5/gem5/assets/32214817/31f46525-23e1-4b36-91ee-968f18b9d32a) Source register is bit 2-4, not bit 20-24 `ee6f1377d7/src/arch/riscv/isa/bitfields.isa (L112)` `ee6f1377d7/src/arch/riscv/isa/operands.isa (L88)` `ee6f1377d7/src/arch/riscv/isa/bitfields.isa (L87)` `ee6f1377d7/src/arch/riscv/isa/operands.isa (L80)`	2024-04-08 08:30:54 -07:00
Giacomo Travaglini	bdb08a5b6c	arch-arm, dev-arm: Fix typo in PartitionFieldExtention name Rename PartitionFieldExtention into PartitionFieldExtension Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Change-Id: I8072adf78d81b94c5b8bc61a317c0238cf0a9fd9	2024-04-07 11:45:57 +01:00
Giacomo Travaglini	dd45e1c319	misc: Make PartitionFieldExtention private to Arm The new ISA-agnostic interface is the PartitionManager. We therefore make the PartitionFieldExtention private to the Arm implementation of memory partitioning (FEAT_MPAM) Any other partitioning implementation should override the PartitionManager::readPacketPartitionID to provide a mean for extracting partitioning data (partition_id) from the incoming Packet. With this commit we also define an MPAM MSC which is supposed to be the partitioning manager for the Memory System Component Change-Id: I6959ace0c0cbca549dcc1aacd53dff223b5fe328 Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com>	2024-04-07 11:45:57 +01:00
Giacomo Travaglini	82a82c8793	configs: Change cache_partitioning.py to use PartitionManager Change-Id: I891cc4967dc5483313bcb1179d19b37123a37ba0 Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com>	2024-04-05 10:09:46 +01:00
Giacomo Travaglini	6c2ac8e641	Update src/python/gem5/components/cachehierarchies/abstract_cache_hierarchy.py Co-authored-by: Bobby R. Bruce <bbruce@ucdavis.edu>	2024-04-05 08:53:07 +01:00
Bobby R. Bruce	c65071282d	stdlib,tests: Add StatTester SimObject and Scalar tests (#973 ) This SimObject can be used to quickly test the statistics are functioning correctly. The SimObject schedules a single event which sets the statistics to values dependent on the SimObject params. With this commit the "Scalar" stats have a StatTester subclass that can be used for testing. More can be added as required. Tests are included to check our Scalar SimStat functionality.	2024-04-04 19:12:44 -07:00
Bobby R. Bruce	504005da87	scons,tests: Add 'USE_TEST_OBJECTS' kconfig This has the SimObjects defined in "src/test_objects" only be compiled into the gem5 binary if the Kconfig 'USE_TEST_OBJECTS" == 'y'. This happens in two cases: 1. When 'ALL/gem5' is compiled via "build_opts". 2. When tests are run via "./tests/main.py". Change-Id: I2330008fd7c7900de5f4de142b8ac89ef4e351ce	2024-04-04 13:04:21 -07:00
Bobby R. Bruce	2771061207	stdlib,tests: Add StatTester SimObject and Scalar tests This SimObject can be used to quickly test the statistics are functioning correctly. The SimObject schedules a single event which sets the statistics to values dependent on the SimObject params. With this commit the "Scalar" stats have a StatTester subclass that can be used for testing. More can be added as required. Tests are included to check our Scalar SimStat functionality. Change-Id: I78fa5d9a0c3fc7115bd6c6d3410a5436aaa47f55	2024-04-04 13:04:20 -07:00
Bobby R. Bruce	be00691cd3	scons: Disable Address Sanitizer for GCC (#951 ) This removes the '--with-asan' as it's are not compatible with GCC. The `--with-asan` incompatibility with GCC is discussed in Issue #916.	2024-04-04 11:48:34 -07:00

1 2 3 4 5 ...

21499 Commits