derek/gem5 - gem5 - Gitea: Git with a cup of tea

derek/gem5

Author	SHA1	Message	Date
Hoa Nguyen	628826896f	arch-riscv: Use TeX's escape seq in Python instead of Unicode (#985 ) Currently, the citation string has a Unicode character. This works well in gem5, but it breaks the gem5+SST simulation [1]. This change modifies the letter "u" with umlaut to use TeX's escape sequence for this letter instead of using the UTF-8 character. [1] https://github.com/gem5/gem5/issues/982 Signed-off-by: Hoa Nguyen <hn@hnpl.org>	2024-04-02 08:42:21 -07:00
Matthew Poremba	78cf39bf63	arch-vega: Operand selectors for accumulation registers (#955 ) AMD's MI100 introduced a new register file called accumulation registers for the matrix cores. In MI200 these were recombined into the same register file according to the documentation. The accumulation register file is the same size as the architectural register file, hence the size is doubled. The ISA spec does not explicitly state the register selector values, however it does say that the accumulation offset from the kernel dispatch packet should be added to the architecture register file selector number when an instruction sets the ACC bit. Therefore we can infer that the value must simply be an extension beyond the architectural VGPRs. This fixes errors of the form "invalid register selector: 512" (or higher value). This was tested with the Learn the Basics tutorial example on pytorch.org Change-Id: I48ced1532fc166d2f5032fe21fbeba70ac77f258	2024-04-01 08:45:37 -07:00
Nicholas Mosier	00d4b6825c	sim-se: Implement statx system call for Linux x86-64 (#887 ) Implement the statx Linux-specific system call for x86-64. statx is used by LLVM's libc. Change-Id: Ic000a36a5e5c1399996f520fa357b9354c73c864	2024-04-01 08:23:39 -07:00
Debjyoti Bhattacharjee	ec690de0da	arch-riscv: This commit fixes bug in vfmv.f.s impl. in riscv (#863 ) The existing implementation of vfmv instruction did not type cast the first element of the source vector, which caused the "freg" to interpret the result as a NaN. With the type cast to f32, the value is correctly recognized as float and sign extended to be stored in the fd register. Git issue: https://github.com/gem5/gem5/issues/827 Change-Id: Ibe9873910827594c0ec11cb51ac0438428c3b54e --------- Co-authored-by: Debjyoti B <bhatta53@imec.be> Co-authored-by: Tommaso Marinelli <tommarin@ucm.es>	2024-03-29 08:23:14 -07:00
Harshil Patel	9207458fd7	stdlib: add socks proxy to atlas client (#864 )	2024-03-28 14:30:02 -07:00
Giacomo Travaglini	63706f04b5	dev: Remove duplicate virtio files (#976 ) Remove the following files: * src/dev/virtio/rng 2.cc * src/dev/virtio/rng 2.hh Which were a copy of rng.hh and rng.cc. Probably added to the repository by accident. They were not compiled by scons Change-Id: I9d1da19cc243c513ab7af887b1b6260d8e361b57 Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com>	2024-03-28 14:32:11 +00:00
Yu-Cheng Chang	896c32cd0d	arch: Add getIsaName in BaseISA (#975 ) Change-Id: I81bfcd691d570430f7011f0d5023e5ea613e0dd9	2024-03-28 13:27:32 +00:00
Carson Molder	dd5a30d41e	sim-se,cpu-kvm: Fix SE workload setup on KVM CPUs (#956 ) This PR fixes #948 in which running KVM CPUs through the updated gem5 interface in SE mode causes an immediate crash. To fix this, I added a check to set_se_binary_workload that checks if any of the cores are KVM, and if so, sets a couple of knobs for the board and process that are required to make KVM work. The depecated se.py script, which sets these knobs, is able to run KVM in SE mode just fine, so doing the same here fixed the bug.	2024-03-23 15:15:11 -07:00
Ivan Fernandez	1e743fd85a	arch-riscv: adding vector unit-stride segment stores to RISC-V (#913 ) This commit adds support for vector unit-stride segment store operations for RISC-V (vssegXeXX). This implementation is based in two types of microops: - VsSegIntrlv microops that properly interleave source registers into structs. - VsSeg microops that store data in memory as contiguous structs of several fields. Change-Id: Id80dd4e781743a60eb76c18b6a28061f8e9f723d Gem5 issue: https://github.com/gem5/gem5/issues/382	2024-03-22 15:45:58 -07:00
Matthew Poremba	7d62da6d10	dev-amdgpu: Support for ROCm 6.0 (#926 ) Implement several features new in ROCm 6.0 and features required for future devices. Includes the following: - Support for multiple command processors - Improve handling of unknown register addresses - Use AddrRange for MMIO address regions - Handle GART writes through SDMA copy - Implement PCIe indirect reads and writes - Improve PM4 write to check dword count - Implement common MI300X instruction	2024-03-21 21:12:09 -07:00
Matthew Poremba	dca040983b	arch-vega: Various vega fixes to enable nanogpt (#950 ) This PR fixes some issues observed that were needed to get nanogpt working.	2024-03-21 21:11:44 -07:00
Michael Boyer	803dbbfdac	arch-vega: Implement flat_load_sbyte instruction (#953 ) Change-Id: I642a71c504e2d3afecd5d2dfd9db016945aed21b	2024-03-21 21:11:10 -07:00
Matthew Poremba	823b5a6eb8	dev-amdgpu: Support multiple CPs and MMIO AddrRanges Currently gem5 assumes that there is only one command processor (CP) which contains the PM4 packet processor. Some GPU devices have multiple CPs which the driver tests individually during POST if they are used or not. Therefore, these additional CPs need to be supported. This commit allows for multiple PM4 packet processors which represent multiple CPs. Each of these processors will have its own independent MMIO address range. To more easily support ranges, the MMIO addresses now use AddrRange to index a PM4 packet processor instead of the hard-coded constexpr MMIO start and size pairs. By default only one PM4 packet processor is created, meaning the functionality of the simulation is unchanged for devices currently supported in gem5. Change-Id: I977f4fd3a169ef4a78671a4fb58c8ea0e19bf52c	2024-03-21 10:13:55 -05:00
Matthew Poremba	39153cd234	dev-amdgpu: Implement PCIe indirect read/write PCIe can read/write to any 32-bit address using the PCI index/index2 registers as an address and then reading/writing the corresponding data/data2 register. This commit adds this functionality and removes one magic value being written to support GPU POST. This feature is disabled for Vega10 which relies on an MMIO trace for too many values to implement in the MMIO interface. Change-Id: Iacfdd1294a7652fc3e60304b57df536d318c847b	2024-03-21 10:13:55 -05:00
Matthew Poremba	047c194780	dev-amdgpu: Implement SRBM write The SRBM write packets where previously not required. This commit implements SRBM writes to set a register by using the new setRegVal interface. SRBM writes seem to be used for SRIOV enabled devices. Change-Id: I202653d339e882e8de59d69a995f65332b2dfb8c	2024-03-21 10:10:01 -05:00
Matthew Poremba	6bbde8fbb8	dev-amdgpu: Rework handling of unknown registers The top level AMDGPUDevice currently reads/writes all unknown registers to/from a map containing the previously written value. This is intended as a way to handle registers that are not part of the model but the driver requires for functionality. Since this is at the top level, it can mask changes to register values which do not go through the same interface. For example, reading an MMIO, changing via PM4 queue, and reading again returns the stale cached value. This commit removes the usage of the regs map in AMDGPUDevice, implements some important MMIOs that were previously handled by it, and moves the unknown register handling to the NBIO aperture only. To reduce the number of additional MMIOs to implement, the display manager in vega10 is now disabled. Change-Id: Iff0a599dd82d663c7e710b79c6ef6d0ad1fc44a2	2024-03-21 10:10:01 -05:00
Matthew Poremba	009cec56e0	dev-amdgpu: Check for SDMA copies to GART range The SDMA engine can potentially be used to write to the GART address range. Since gem5 has a shadow copy of the GART table to avoid sending functional reads to device memory, the GART table must be updated when copying to the GART range. This changeset adds a check in the VM for GART range and implements the SDMA copy packet writing to the GART range. A fatal is added to write and ptePde, which are the only other two ways to write to memory, as using these packets to update the GART table has not been observed. Change-Id: I1e62dfd9179cc9e987659e68414209fd77bba2bd	2024-03-21 10:10:01 -05:00
Matthew Poremba	998709d4fc	dev-amdgpu: Improve PM4 write data packet The write data packet can write multiple dwords but currently always assumes there is one dword, which can cause some write data to be missed. This case is not common, but the number of dwords is implicitly defined in the PM4 header. This changeset passes the PM4 header to write data so that the correct number of dwords can be determined. For now we assume no page crossing when writing multiple dwords as the driver should be checking for that. Change-Id: I0e8c3cbc28873779f468c2a11fdcf177210a22b7	2024-03-21 10:10:01 -05:00
Matthew Poremba	c045c68540	dev-amdgpu: Add node_id to interrupt handler The ROCm 6.0 driver adds a node_id field to interrupts which must match before passing on the interrupt to be cleared by the cookie from gem5's interrupt handler implementation. Add this field and enable for gfx942. The usage of the field can be seen in event_interrupt_isr_v9_4_3 at https://github.com/ROCm/ROCK-Kernel-Driver/blob/roc-6.0.x/drivers/ gpu/drm/amd/amdkfd/kfd_int_process_v9.c#L449 Change-Id: Iae8b8f0386a5ad2852b4a3c69f2c161d965c4922	2024-03-21 10:10:01 -05:00
Matthew Poremba	9ab004cccc	arch-vega: Implement V_LSHL_ADD_U64 This is a new instruction in MI300 and operates similar to V_LSHL_ADD_U32 but on 64-bit values. Change-Id: Ia4ac65160bdad748fccdcb28286ba03157cc4046	2024-03-21 10:10:01 -05:00
Matthew Poremba	f36be791aa	arch-vega: Expand FLAT subDecode range in main decoder The main decoder for GPU instructions looks at the first 9 bits of a dword to determine either the instruction or a subDecode table with more information for specific instructions types. For flat instructions the first 9 bits currently consist of 6 fixed encoding bits, a reserved bit, and the first two bits of the opcode. Hence to support all opcodes there are four indirections to the flat subDecode table. In MI300 the reserved bit is part of a field to determine memory scope and therefore may be non-zero. This commit adds four addition calls to the subDecode table for the cases where the scope bit is 1. See page 468 (PDF page 478) below: https://www.amd.com/content/dam/amd/en/documents/instinct-tech-docs/ instruction-set-architectures/ amd-instinct-mi300-cdna3-instruction-set-architecture.pdf Change-Id: Ic3c786f0ca00a758cbe87f42c5e3470576f73a32	2024-03-21 10:10:01 -05:00
Michael Boyer	acd9d3ff94	gpu-compute: Add support for skipping GPU kernels (#940 ) gpu-compute: Add support for skipping GPU kernels This commit adds two new command-line options: --skip-until-gpu-kernel N Skips (non-blit) GPU kernels until the target kernel is reached. Execution continues normally from there. Blit kernels are not skipped because they are responsible for copying the kernel code and metadata for the non-blit kernels. Note that skipping kernels can impact correctness; this feature is only useful if the kernel of interest has no data-dependent behavior, or its data-dependent behavior is not based on data generated by the skipped kernels. --exit-after-gpu-kernel N Ends the simulation after completing (non-blit) GPU kernel N. This commit also renames two existing command-line options: --debug-at-gpu-kernel -> --debug-at-gpu-task --exit-at-gpu-kernel -> --exit-at-gpu-task These were renamed because they count GPU tasks, which include both kernels launched by the application as well as blit kernels. Change-Id: If250b3fd2db05c1222e369e9e3f779c4422074bc	2024-03-21 07:46:27 -07:00
Matthew Poremba	e02f329d5d	arch-vega: Fix VOP3 decode table off-by-one There is no VOP3 opcode 667. Mark that invalid and move the opcodes after down by one. Change-Id: Ia4ccda91f6f501c1ce7c5898d7d0e924604a459a	2024-03-20 16:41:31 -05:00
Matthew Poremba	457d97ea52	arch-vega: Implement V_XNOR_B32 Change-Id: Id23a8d984f227ca23a92adb6c7fde3b4627af054	2024-03-20 16:37:37 -05:00
Matthew Poremba	1b15b2cc4b	arch-vega: Support negative modifiers for packed F32 math MI200 adds support for four FP32 packed math instructions. These are VOP3P instructions which have a negative input modifier field. The description made it unclear if these were used for F32 packed math however the assembly of some Tensile kernels are using these modifiers therefore adding support for them. Tested with PyTorch nn.Dropout kernel which is using negative modifiers. Change-Id: I568a18c084f93dd2a88439d8f451cf28a51dfe79	2024-03-20 16:37:23 -05:00
Matthew Poremba	3f8d0e1ef8	arch-vega: Fix V_FMAC_F32 data type The datatype is U32 but should be F32. This is causing an implicit cast leading to incorrect results. This fixes nn.Dropout in PyTorch. Change-Id: I546aa917fde1fd6bc832d9d0fa9ffe66505e87dd	2024-03-20 16:37:23 -05:00
Michael Boyer	ba2f5615ba	gpu-compute: Support cache line sizes >64B in GPUFS (#939 ) This change fixes two issues: 1) The --cacheline_size option was setting the system cache line size but not the Ruby cache line size, and the mismatch was causing assertion failures. 2) The submitDispatchPkt() function accesses the kernel object in chunks, with the chunk size equal to the cache line size. For cache line sizes >64B (e.g. 128B), the kernel object is not guaranteed to be aligned to a cache line and it was possible for a chunk to be partially contained in two separate device memories, causing the memory access to fail. Change-Id: I8e45146901943e9c2750d32162c0f35c851e09e1 Co-authored-by: Michael Boyer <Michael.Boyer@amd.com>	2024-03-20 11:09:25 -07:00
Giacomo Travaglini	2b67d0eba6	stdlib, tests, configs: Add a new PrivateL1PrivateL2WalkCache hierarchy (#935 ) From [1] The PrivateL1PrivateL2Cache hierarchy has been amended with an MMUCache, which is basically a small cache in front of the page table walker. Not every ISA makes use of it. Arm for example already implements caching of page table walks, via the partial_levels parameter in the ArmTLB. With this patch we define a new module which explicitly makes use of the WalkCache. Configurations that do not require another cache in the first level of the memsys (for the ptw) can use the PrivateL1PrivateL2CacheHierarchy [1]: https://gem5-review.googlesource.com/c/public/gem5/+/49364	2024-03-19 09:04:32 +00:00
Yu-Cheng Chang	dbae09e4d9	arch-riscv: Move alignment check to Physical Memory Attribute(PMA) (#914 ) In the RISC-V unprivileged spec[1], the misaligned load/store support is depend on the EEI. In the RISC-V privileged spec ver1.12[2], the PMA specify wether the misaligned access is support for each data width and the memory region. In the [3] of `mcause` spec, we cloud directly raise misalign exception if there is no memory region misalignment support. If the part of memory region support misaligned-access, we need to translate the `vaddr` to `paddr` first then check the `paddr` later. The page-fault or access-fault is rose before misalign-fault. The benefit of moving check_alignment option from ISA option to PMA option is we can specify the part region of memory support misalign load/store. MMU will check alignment with virtual addresss if there is no misaligned memory region specified. If there are some misaligned memory region supported, translate address first and check alignment at final. [1] https://github.com/riscv/riscv-isa-manual/blob/main/src/rv32.adoc#base-instruction-formats [2] https://github.com/riscv/riscv-isa-manual/blob/main/src/machine.adoc#physical-memory-attributes [3] https://github.com/riscv/riscv-isa-manual/blob/main/src/machine.adoc#machine-cause-register-mcause	2024-03-18 12:59:13 -07:00
Yan Lee	84da503d37	mem: Fix callback of functional access in port wrapper (#938 ) In previous implementation of port_wrapper, recvFunctional() will call timing request callback. This should be a typo and this change fix the typo.	2024-03-18 08:21:43 -07:00
Giacomo Travaglini	d32a438913	stdlib: Add a new private_l1_private_l2_walk_cache_hierarchy.py module From [1] The PrivateL1PrivateL2Cache hierarchy has been amended with an MMUCache, which is basically a small cache in front of the page table walker. Not every ISA makes use of it. Arm for example already implements caching of page table walks, via the partial_levels parameter in the ArmTLB. With this patch we define a new module which explicitly makes use of the WalkCache. Configurations that do not require another cache in the first level of the memsys (for the ptw) can use the PrivateL1PrivateL2CacheHierarchy [1]: https://gem5-review.googlesource.com/c/public/gem5/+/49364 Change-Id: I17f7e68940ee947ca5b30e6ab3a01dafeed0f338 Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com>	2024-03-18 09:42:05 +00:00
Giacomo Travaglini	0ec8cf8d05	dev-arm: Fix SMMUv3 DTB autogen (#934 ) Replacing FdtProperyWords (expecting an integer) with FdtPropertyStrings Change-Id: Icd1cf00704e253c88ac9b1d69c3cf946d2a8ca70 Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com>	2024-03-14 15:42:57 +00:00
Tiago Mück	942979162a	READ_MODIFY_WRITE flag fix (#922 ) Change bit for Request::READ_MODIFY_WRITE, which was the same as Request::ACQUIRE. Signed-off-by: Tiago Mück <tiago.muck@arm.com>	2024-03-11 08:32:11 -07:00
Giacomo Travaglini	5161195db5	dev-arm: Remove the SMMUv3 irq_interface_enable parameter The SMMU_IRQ_CTRL had been made optionally writeable by a prior patch [1] even if interrupts were not supported in the SMMUv3 model. As we are partially enabling IRQ support, we remove this option and we make the SMMU_IRQ_CTRL always writeable [1]: https://gem5-review.googlesource.com/c/public/gem5/+/38555 Change-Id: Ie1f9458d583a5d8bcbe450c3e88bda6b3c53cf10 Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com>	2024-03-08 13:53:44 +00:00
Giacomo Travaglini	d63282a9da	dev-arm: Implement wired interrupt for SMMU event queue See https://github.com/orgs/gem5/discussions/898 The SMMUv3 Event Queue is basically unused at the moment. Whenever a transaction fails we actually abort simulation. The sendEvent method could be used to actually report the failure to the driver but it is lacking interrupt support to notify the PE there is an event to handle. The SMMUv3 spec allows both wired and MSI interrupts to be used. We add the eventq_irq SPI param to the SMMU object and we draft an initial sendInterrupt utility that makes use of it whenever it is needed. Change-Id: I6d103919ca8bf53794ae4bc922cbdc7156adf37a Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com>	2024-03-08 13:53:21 +00:00
Giacomo Travaglini	63c815b5fc	dev-arm: Do not panic in the SMMUv3 for fauting transactions Rely on the architected solution instead of aborting simulation. This means handling writes to the Event queue to signal managing software there was a fault in the SMMU Change-Id: I7b69ca77021732c6059bd6b837ae722da71350ff Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com>	2024-03-08 11:29:22 +00:00
Giacomo Travaglini	7d5d1cd9c8	dev-arm: Rewrite SMMUEvent The struct fields of the SMMUEvent were not matching the SMMUv3 specs. This was "not an issue" as events have been implicitly disabled until now (every translation error was aborting simulation) With generateEvent we automatically construct a SMMU event from a translation result. Change-Id: Iba6a08d551c0a99bb58c4118992f1d2b683f62cf Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com>	2024-03-08 11:29:22 +00:00
Giacomo Travaglini	ef10db5a3e	dev-arm: Record additional information in the TranslResult A faulting translation should return additional information (other than the fault type). This will be used by future patches to properly populate the SMMU event record of the event queue As we currenlty support two faults only: 1) F_TRANSLATION 2) F_PERMISSION We add to TranslResult the relevant fault information only: type, class, stage and ipa Change-Id: I0a81d5fc202e1b6135cecdcd6dfd2239c2f1ba7e Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com>	2024-03-08 11:29:22 +00:00
Giacomo Travaglini	3d1f68f205	dev-arm: Return translation fault in doReadCD Reading the Context Descriptor (CD) might require a stage2 translation. At the moment doReadCD does not check for the return value of the translateStage2. This means that any stage2 fault will be silently discarded and an invalid address will be used/returned. By returning a translation result we make sure any error happening in the second stage of translation will be properly flagged Change-Id: I2ecd43f7e23080bf8222bc3addfabbd027ee8feb Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com>	2024-03-08 11:29:22 +00:00
Giacomo Travaglini	4a4b775985	dev-arm: Provide encapsulation by adding TranslResult::isFaulting We don't check the fault type directly. This will improve readability once the TranslResult class will be augmented with extra fields Change-Id: I5acafaabf098d6ee79e1f0c384499cc043a75a9d Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com>	2024-03-08 11:29:22 +00:00
Ivan Fernandez	f6c61836b3	arch-riscv: adding vector unit-stride segment loads to RISC-V (#851 ) This commit adds support for vector unit-stride segment load operations for RISC-V (vlseg<NF>e<X>). This implementation is based in two types of microops: - VlSeg microops that load data as it is organized in memory in structs of several fields. - VectorDeIntrlv microops that properly deinterleave structs into destination registers. Gem5 issue: https://github.com/gem5/gem5/issues/382	2024-03-06 11:27:06 -08:00
Giacomo Travaglini	3d2052bc03	misc: Serialize the ISA as a string in the checkpoint With the introduction of multi-ISA gem5, we don't store the TARGET_ISA anymore as a string in the root section of the checkpoint [1]. There is therefore no way at the moment to asses the ISA of a CPU/ThreadContext. This is a problem when it comes to checkpoint updates which are ISA specific. By explicitly serializing the ISA as a string under the cpu.isa section we avoid this problem and we let cpt_upgraders be aware of the ISA in use. [1]: https://gem5-review.googlesource.com/c/public/gem5/+/48884 Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Change-Id: I1e75230cbc370cab84f4a54141b1e425af2dbfac Reviewed-by: Andreas Sandberg <andreas.sandberg@arm.com>	2024-03-04 17:51:40 +00:00
Nitish Arya	676d571009	arch-riscv: adding stats to show completed page walks (#869 ) This commit adds statistics showing completed page walks for 4KB and 2MB pages. This will add to stats.txt the variables num_4kb_walks, num_2mb_walks and the corresponding values. This is done based on the level of page table walk traversed specific to Sv39 Virtual Memory System.	2024-03-04 08:38:28 -08:00
Giacomo Travaglini	c57a6b0d59	mem-cache: Add support for partitioning caches (#765 ) * Add Cache partitioning policies to manage and enforce cache partitioning: * Add Way partition policy * Add MaxCapacity partition policy * Add PartitionFieldsExtension Extension class for Packets to store Partition IDs for cache partitioning and monitoring * Modify Cache SimObjects to store partition policies * Modify Cache block eviction logic to use new partitioning policies Co-authored-by: Adrian Herrera <adrian.herrera@arm.com> Change-Id: Ib35153a8b46803c22a433926270d82e5e19ce544 Reviewed-by: Giacomo Travaglini <giacomo.travaglini@arm.com>	2024-03-04 09:44:01 +00:00
Giacomo Travaglini	c1d5ffe7c7	mem-cache: Prefetchers Improvements (#872 ) This pull request contains a set of small patches which fix some bugs in the gem5 prefetchers, and aligns out-of-the box prefetcher performance more closely with that which a typical user would expect. The performance patches have been tested with an out-of-the-box (untuned) Stride prefetcher configuration against a set of SPEC 2017 SimPoints, and show a small-to-modest IPC uplift across the about half the benchmarks, with no significant IPC degradation. The new defaults were identified as part of work on gem5 prefetchers undertaken by Nikolaos Kyparissas while on internship at Arm. This PR is an updated version of PR #564, which was reverted due to Bug #580. Bug #580 was fixed in PR #871. This PR updates #564 to the latest state of the develop branch, and should be applied after PR #871.	2024-03-04 09:09:47 +00:00
Ivana Mitrovic	fae5f5e00b	sim-se: Catch None value if binary is not compatible with gem5 (#903 ) Adding an error message in case the binary is not compatible with gem5. This PR is addressing the comments in issue #807. Change-Id: I66466ed6f657276c13d237fde3b1ec12c20cfe91	2024-03-01 16:41:18 -08:00
Ivana Mitrovic	61adfa38b2	stdlib: Fix initialization for self.pic.hart_config in lupv_board (#904 ) Previously merged PR #886 created pic.hart_config, but it was not initialized properly in lupv_board.py. This issue is causing daily tests to fail. Change-Id: I193ff4a3e5ef787eefcf066404e762f024fa6603 --------- Co-authored-by: Yu-Cheng Chang <aucixw45876@gmail.com>	2024-03-01 11:25:00 -08:00
Giacomo Travaglini	c0e5d58a96	dev: RegisterBank addRegistersAt for fragmented reg banks (#902 ) One of the limitations of the RegBank class is that it does not allow you to pass a non-contiguous set of registers. Its simplest form will just accept an initializer list of registers and it will store them in sequence. A more refined version [1] will optionally accept an offset value to be passed alongside the register reference. This is not meant to be used by the register bank to store the register at the provided offset. It is rather used by the bank to sanity check the register sits exactly at the provided range. The way to work around this for a fragemented register space is to explicitly allocate RAZ/RAO blocks as registers and to pass them to addRegisters together with the others. (See the SysSecCtrl [2] as an example) This makes it a bit tedious to model a register bank with gaps between its registers. First, the exact number and position of the gaps needs to be extraced from a spec. These sometimes report only implemented registers and their offset, and omit to document gaps/reserved space. So a developer needs to manually add register offset and size to check if all registers are contiguous. Second, these reserved register blocks need to be instantiated in the bank adding boilerplate code and affecting readibility. For these reasons we add a new registration method, called addRegistersAt. It reuses the RegisterAdder class but this time the offset field is really used to instruct the bank where the register should be mapped. The method is templated and the template parameter tells the bank which register type should be used to fill the remaining space. We make the RegBank the owner of this filler space (registers are generated internally within addRegistersAt). [1]: https://github.com/gem5/gem5/blob/stable/src/dev/reg_bank.hh#L106 [2]: https://github.com/gem5/gem5/blob/stable/src/dev/arm/ssc.cc#L48 Change-Id: I614ae6e9eeb40b365ac9b6dd8b75abbfdb9cb687 Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com>	2024-03-01 15:32:40 +00:00
Hristo Belchev	27c8355565	mem-cache: Add support for partitioning caches * Add Cache partitioning policies to manage and enforce cache partitioning: * Add Way partition policy * Add MaxCapacity partition policy * Add PartitionFieldsExtension Extension class for Packets to store Partition IDs for cache partitioning and monitoring * Modify Cache Tags SimObjects to store partition policies * Modify Cache Tags block eviction logic to use new partitioning policies * Add example system and TrafficGen configurations for testing Cache Partitioning Policies Change-Id: Ic3fb0f35cf060783fbb9289380721a07e18fad49 Co-authored-by: Adrian Herrera <adrian.herrera@arm.com> Reviewed-by: Giacomo Travaglini <giacomo.travaglini@arm.com>	2024-03-01 15:26:38 +00:00
Mahyar Samani	9bd71bff0c	python: Adding fatal statement to notify user mistakes. (#826 ) This change adds a fatal statement to check all params for all SimObjects have been unproxied before C++ object are created. The fatal statement notifies the user of a mistake that could possibly lead to a SimObject to not have its params unproxied. This mistake could be made by adding a child SimObject with a name that starts with an underscore.	2024-02-29 10:47:26 -08:00

1 2 3 4 5 ...

14971 Commits