derek/gem5 - gem5 - Gitea: Git with a cup of tea

derek/gem5

Author	SHA1	Message	Date
dependabot[bot]	f35815cd48	misc: bump pre-commit from 3.6.0 to 3.6.2 (#905 ) Bumps [pre-commit](https://github.com/pre-commit/pre-commit) from 3.6.0 to 3.6.2. Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2024-03-06 14:20:42 -08:00
dependabot[bot]	ceee8fed29	misc: bump tqdm from 4.66.1 to 4.66.2 (#906 ) Bumps [tqdm](https://github.com/tqdm/tqdm) from 4.66.1 to 4.66.2. Co-authored-by: Bobby R. Bruce <bbruce@ucdavis.edu>	2024-03-06 14:20:03 -08:00
Ivan Fernandez	f6c61836b3	arch-riscv: adding vector unit-stride segment loads to RISC-V (#851 ) This commit adds support for vector unit-stride segment load operations for RISC-V (vlseg<NF>e<X>). This implementation is based in two types of microops: - VlSeg microops that load data as it is organized in memory in structs of several fields. - VectorDeIntrlv microops that properly deinterleave structs into destination registers. Gem5 issue: https://github.com/gem5/gem5/issues/382	2024-03-06 11:27:06 -08:00
Giacomo Travaglini	b930c57d54	misc: Tag checkpoints with the ISA of the CPUs (#908 ) With the introduction of multi-ISA gem5, we don't store the TARGET_ISA anymore as a string in the root section of the checkpoint [1]. There is therefore no way at the moment to asses the ISA of a CPU/ThreadContext. This is a problem when it comes to checkpoint updates which are ISA specific. By explicitly serializing the ISA as a string under the cpu.isa section we avoid this problem and we let cpt_upgraders be aware of the ISA in use. [1]: https://gem5-review.googlesource.com/c/public/gem5/+/48884	2024-03-05 10:04:06 +00:00
Giacomo Travaglini	5bce5673b0	util: Fix recent cpt_upgraders not checking for ISA A set of cpt_upgraders was patching old checkpoints regardless of the ISA in use. Thanks to the previous patch, we can now retrieve the ISA of the CPU from the isa section. Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Change-Id: Ia110068c06453796cefac028ee13f21667e5371a Reviewed-by: Richard Cooper <richard.cooper@arm.com>	2024-03-04 17:51:40 +00:00
Giacomo Travaglini	3d2052bc03	misc: Serialize the ISA as a string in the checkpoint With the introduction of multi-ISA gem5, we don't store the TARGET_ISA anymore as a string in the root section of the checkpoint [1]. There is therefore no way at the moment to asses the ISA of a CPU/ThreadContext. This is a problem when it comes to checkpoint updates which are ISA specific. By explicitly serializing the ISA as a string under the cpu.isa section we avoid this problem and we let cpt_upgraders be aware of the ISA in use. [1]: https://gem5-review.googlesource.com/c/public/gem5/+/48884 Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Change-Id: I1e75230cbc370cab84f4a54141b1e425af2dbfac Reviewed-by: Andreas Sandberg <andreas.sandberg@arm.com>	2024-03-04 17:51:40 +00:00
Nitish Arya	676d571009	arch-riscv: adding stats to show completed page walks (#869 ) This commit adds statistics showing completed page walks for 4KB and 2MB pages. This will add to stats.txt the variables num_4kb_walks, num_2mb_walks and the corresponding values. This is done based on the level of page table walk traversed specific to Sv39 Virtual Memory System.	2024-03-04 08:38:28 -08:00
Giacomo Travaglini	c57a6b0d59	mem-cache: Add support for partitioning caches (#765 ) * Add Cache partitioning policies to manage and enforce cache partitioning: * Add Way partition policy * Add MaxCapacity partition policy * Add PartitionFieldsExtension Extension class for Packets to store Partition IDs for cache partitioning and monitoring * Modify Cache SimObjects to store partition policies * Modify Cache block eviction logic to use new partitioning policies Co-authored-by: Adrian Herrera <adrian.herrera@arm.com> Change-Id: Ib35153a8b46803c22a433926270d82e5e19ce544 Reviewed-by: Giacomo Travaglini <giacomo.travaglini@arm.com>	2024-03-04 09:44:01 +00:00
Giacomo Travaglini	c1d5ffe7c7	mem-cache: Prefetchers Improvements (#872 ) This pull request contains a set of small patches which fix some bugs in the gem5 prefetchers, and aligns out-of-the box prefetcher performance more closely with that which a typical user would expect. The performance patches have been tested with an out-of-the-box (untuned) Stride prefetcher configuration against a set of SPEC 2017 SimPoints, and show a small-to-modest IPC uplift across the about half the benchmarks, with no significant IPC degradation. The new defaults were identified as part of work on gem5 prefetchers undertaken by Nikolaos Kyparissas while on internship at Arm. This PR is an updated version of PR #564, which was reverted due to Bug #580. Bug #580 was fixed in PR #871. This PR updates #564 to the latest state of the develop branch, and should be applied after PR #871.	2024-03-04 09:09:47 +00:00
Ivana Mitrovic	fae5f5e00b	sim-se: Catch None value if binary is not compatible with gem5 (#903 ) Adding an error message in case the binary is not compatible with gem5. This PR is addressing the comments in issue #807. Change-Id: I66466ed6f657276c13d237fde3b1ec12c20cfe91	2024-03-01 16:41:18 -08:00
Ivana Mitrovic	61adfa38b2	stdlib: Fix initialization for self.pic.hart_config in lupv_board (#904 ) Previously merged PR #886 created pic.hart_config, but it was not initialized properly in lupv_board.py. This issue is causing daily tests to fail. Change-Id: I193ff4a3e5ef787eefcf066404e762f024fa6603 --------- Co-authored-by: Yu-Cheng Chang <aucixw45876@gmail.com>	2024-03-01 11:25:00 -08:00
Giacomo Travaglini	c0e5d58a96	dev: RegisterBank addRegistersAt for fragmented reg banks (#902 ) One of the limitations of the RegBank class is that it does not allow you to pass a non-contiguous set of registers. Its simplest form will just accept an initializer list of registers and it will store them in sequence. A more refined version [1] will optionally accept an offset value to be passed alongside the register reference. This is not meant to be used by the register bank to store the register at the provided offset. It is rather used by the bank to sanity check the register sits exactly at the provided range. The way to work around this for a fragemented register space is to explicitly allocate RAZ/RAO blocks as registers and to pass them to addRegisters together with the others. (See the SysSecCtrl [2] as an example) This makes it a bit tedious to model a register bank with gaps between its registers. First, the exact number and position of the gaps needs to be extraced from a spec. These sometimes report only implemented registers and their offset, and omit to document gaps/reserved space. So a developer needs to manually add register offset and size to check if all registers are contiguous. Second, these reserved register blocks need to be instantiated in the bank adding boilerplate code and affecting readibility. For these reasons we add a new registration method, called addRegistersAt. It reuses the RegisterAdder class but this time the offset field is really used to instruct the bank where the register should be mapped. The method is templated and the template parameter tells the bank which register type should be used to fill the remaining space. We make the RegBank the owner of this filler space (registers are generated internally within addRegistersAt). [1]: https://github.com/gem5/gem5/blob/stable/src/dev/reg_bank.hh#L106 [2]: https://github.com/gem5/gem5/blob/stable/src/dev/arm/ssc.cc#L48 Change-Id: I614ae6e9eeb40b365ac9b6dd8b75abbfdb9cb687 Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com>	2024-03-01 15:32:40 +00:00
Hristo Belchev	27c8355565	mem-cache: Add support for partitioning caches * Add Cache partitioning policies to manage and enforce cache partitioning: * Add Way partition policy * Add MaxCapacity partition policy * Add PartitionFieldsExtension Extension class for Packets to store Partition IDs for cache partitioning and monitoring * Modify Cache Tags SimObjects to store partition policies * Modify Cache Tags block eviction logic to use new partitioning policies * Add example system and TrafficGen configurations for testing Cache Partitioning Policies Change-Id: Ic3fb0f35cf060783fbb9289380721a07e18fad49 Co-authored-by: Adrian Herrera <adrian.herrera@arm.com> Reviewed-by: Giacomo Travaglini <giacomo.travaglini@arm.com>	2024-03-01 15:26:38 +00:00
Mahyar Samani	9bd71bff0c	python: Adding fatal statement to notify user mistakes. (#826 ) This change adds a fatal statement to check all params for all SimObjects have been unproxied before C++ object are created. The fatal statement notifies the user of a mistake that could possibly lead to a SimObject to not have its params unproxied. This mistake could be made by adding a child SimObject with a name that starts with an underscore.	2024-02-29 10:47:26 -08:00
Matthew Poremba	db42aeb630	arch-vega: Implement accumulation offset (#895 ) This PR implements a few changes related to the accumulation offset which is new in MI200. Previously MI100 contained two vector register files: the architectural and accumulation register files. These have now been unified and the architectural register file is twice the size. As a result of this the dispatch packet set an offset into the unified vector register file for where the former accumulation registers would go. The changes are: - Calculate the accumulation offset from dispatch packet and store in HSA task. - Update the accumulation move instructions (v_accvgpr_read/write) to use it. - Update the current MFMA instructions to use it. - Make the MFMA examples more clean.	2024-02-29 09:05:39 -08:00
Nicholas Mosier	69762e272e	sim-se, arch-x86: initialize max stack size from parameter (#892 ) Initialize x86 process' max stack size to the value given in the process params, rather than hard-coding it to 8 MB, which made it impossible to run x86 programs requiring more than 8 MB of stack. Change-Id: I0b17fe60b016b1e4a82d704ef7ad367974ea6a08	2024-02-29 08:15:43 -08:00
amatabsc	0d79b5098b	Increased packets sanity check limit to 1024 (#797 ) For some simulations with big values for VLEN (e.g. 8k and 16k) there were more packets created on the fly and, as a consequence, failing the simulations. The sanity check has been increased in order to solve this high VLEN cases. Supervised by [@aarmejach](https://github.com/aarmejach) Change-Id: I137b0f3113687b3fc9c4154d19ca5e8017e6e992 Co-authored-by: Adrià Armejach <adria.armejach@bsc.es>	2024-02-29 08:12:59 -08:00
Matt Sinclair	777ac91bb0	mem-ruby: Add categorization of bypassed atomics in TCC (#899 ) Adds categorization of bypassed atomics in TCC to the TBE as either return or no-return, which gets consumed in pa_performAtomic to determine if atomic logs should be stored. Reestablishes TCC bypassed atomics after #546. Change-Id: Ibc1fa2b795ef1c47c3893a0b1911fa7993522d38	2024-02-28 14:26:09 -06:00
Matt Sinclair	8a28ca8ffb	mem-ruby: Add missing transition for SLC writes to VIPER TCC (#894 ) Bypassed write though requests on invalid lines in the TCC should be written though to the directory. This transition was previously missing. Change-Id: I16b117c4e085ce6be0ed5297aa0129d52cd35a51	2024-02-28 00:13:07 -06:00
Daniel Kouchekinia	de615836f0	mem-ruby: Add categorization of bypassed atomics in TCC Adds categorization of bypassed atomics in TCC to the TBE as either return or no-return, which gets consumed in pa_performAtomic to determine if atomic logs should be stored. Reestablishes TCC bypassed atomics after #546. Change-Id: Ibc1fa2b795ef1c47c3893a0b1911fa7993522d38	2024-02-27 23:12:45 -06:00
Daniel Kouchekinia	0fd73f4e05	Merge branch 'develop' into missing-tcc-transition	2024-02-27 16:46:30 -06:00
Richard Cooper	4e12f2486b	util: update list_changes.py to support multiple Change-Ids (#861 ) The original version of `list_changes.py` assumed no more than one `Change-Id` tag per commit. However, since transitioning to GitHub, the repository now contains some merge commits containing multiple `Change-Id`s. This patch updates `list_changes.py` to support commits with any number of `Change-Id` tags.	2024-02-27 11:10:31 -08:00
Giacomo Travaglini	e5eea7efcc	mem: QoS q_policy assertions fix (#889 ) Fix QoS Memory Queue Policies * Fix assertions in LRG policy to correctly assert requestor and list validity * Fix `selectPacket()` in LIFO Queue Policy to correctly return the end of the `deque` backing store for its packet queue	2024-02-27 13:32:19 +00:00
Hristo Belchev	e78a6b71fe	Merge branch 'develop' into qos-qpolicy-assertions-fix	2024-02-27 09:38:34 +00:00
Harshil Patel	920497c19f	tests: Add compiler test for gcc 13 (#858 ) Change-Id: I41bdf3ab7ffff21c4148ef17fc5229b5597ec953	2024-02-26 18:03:14 -05:00
Matthew Poremba	2ca7f48828	arch-vega: Accumulation offset for existing MFMA insts This commit update the two exiting MFMA instructions to support the accumulation offset for A, B, and C/D matrix. Additionally uses array indexed C/D matrix registers to reduce duplicate code. Future MFMA instructions have up to 16 registers for C/D and this reduces the amount of code being written. Change-Id: Ibdc3b6255234a3bab99f115c79e8a0248c800400	2024-02-26 14:30:50 -06:00
Daniel Kouchekinia	6374697a20	mem-ruby: Add missing transition for SLC writes to VIPER TCC Bypassed write though requests on invalid lines in the TCC should be written though to the directory. This transition was previously missing. Change-Id: I16b117c4e085ce6be0ed5297aa0129d52cd35a51	2024-02-26 13:13:06 -06:00
Matthew Poremba	e0e65221b4	arch-vega: Use accum offset for v_accvgpr_read/write The accum offset is used as an index into the unified VGPR register file in MI200 and is not the same as a move if accum_offset in the dispatch packet is non-zero. Change these instructions to use the stored accum_offset value. Change-Id: Ib661804f8f5b8392e4c586082c423645f539e641	2024-02-26 12:57:09 -06:00
Matthew Poremba	8722aef2e2	gpu-compute: Store accum_offset from code object in WF The accumulation offset is needed for some instructions. In order to access this value we need to place it somewhere instruction definitions can access. The most logical place is in the wavefront. This commit simply copies the value from the HSA task to the wavefront object. Change-Id: I44ef62ef32d2421953f096c431dd758e882245b4	2024-02-26 12:54:37 -06:00
Nicholas Mosier	1990186170	configs: Ensure m5ops base doesn't overlap physical mem in KVM (#875 ) Fix #874, in which running se.py with 4GB or more memory (via option --mem-size=4GB) causes all KVM programs to crash or hang. This occurred because the m5ops address range (set to 0xFFFF0000-0x100000000) overlapped with physical memory under such a configuration. This patch fixes the bug by moving the m5ops address range if phyiscal memory is >=4GB. Change-Id: Ic8a004517bc2be2c27860ed314460be749a11dc1	2024-02-26 10:33:48 -08:00
Yu-Cheng Chang	bcf455755e	arch-riscv,dev: Update the PLIC implementation (#886 ) Update the PLIC based on the [riscv-plic-spec](https://github.com/riscv/riscv-plic-spec) in the PR: - Support customized PLIC hardID and privilege mode configuration - Backward compatable with the n_contexts parameter, will generate the config like {0,M}, {0,S}, {1,M} ... Change-Id: Ibff736827edb7c97921e01fa27f503574a27a562	2024-02-26 10:32:53 -08:00
Yu-Cheng Chang	521a7c1de0	tests: Exit riscv_asmtest script with simulator status code (#891 ) It will be helpful to check if the instruction simulate well Change-Id: I5faa435fad79601682126ee7978d8444093df900	2024-02-26 10:31:18 -08:00
Ivana Mitrovic	61ee36eee6	mem-ruby: Fix possible dirty line loss in CHI when ReadShared hit on UD line (#791 ) In case ReadShared hit on a UD line and there's no sharers, this chage makes the downstream passes Dirty to the requestor whenever possible even though it doesn't deallocate the line. This will make the requestor to SD and the downstream to UD_RSD. In the previous implementation, loosely exclusive intermediate cache can cause loss of dirty data. Example error condition is as below. Configurations L2 cache: Roughly inclusive to L1 without back-invalidation - dealloc_on_* = false - dealloc_backinv_* = false L3 cache: Roughly exclusive to L2 without back-invalidation - alloc_on_readshared = tue - alloc_on_readunique = false - dealloc_on_shared = false - dealloc_on_unique = true - dealloc_backinv_* = false - is_HN = false LLC: Same clusivity as L3 except is_HN = true For all caches, allow_SD = true and fwd_unique_on_readshared = false Example problem sequence: 1. L1 sends ReadUnique then becomes UD. L2 is UC_RU. L3 and LLC are RU. 2. L1 evicts the line to L2 by WriteBackFull (UD_PD). L2 becomes UD. 3. L2 evicts the line to L3 using WriteBackFull (UD_PD). L3 becomes UD. 4. L1 reads the line with ReadShared which misses on L2. 5. L2 reads the line with ReadShared which hits on L3. L3 becomes UD_RSC because it doesn't deallocate the line (dataToBeInvalid=false) 6. L3 evicts the line to LLC by WriteCleanFull (UD_PD) because L3 doesn't back-invalidate and still has sharer. The local cache line is invalidated by Deallocate_CacheBlock. L3 becomes RUSC and LLC becomes UD_RU. 7. When UD_RU is evicted at LLC, the UD_RU line is dropped expecting the upstream to writeback, causing loss of dirty data	2024-02-26 10:06:17 -08:00
wmin0	00ed1d30cf	python,util: Fix SimObjectParams default constructor and destructor (#880 ) The empty constructor prevent zero-initialization working correctly. In this change we fix the issue by removing the unwanted empty constructor. We also change the default destructor specification with c++11 style. Change-Id: I869a93ca5283f811c2aa58406f1478459e0d7022	2024-02-26 06:42:27 -08:00
Giacomo Travaglini	1d5be8d9e5	mem-cache: Optimize strided prefetcher address generation This commit optimizes the address generation logic in the strided prefetcher by introducing the following changes (d is the degree of the prefetcher) * Evaluate the fixed prefetch_stride only once (and not d-times) * Replace 2d multiplications (d * prefetch_stride and distance * prefetch_stride) with additions by updating the new base prefetch address while looping Change-Id: I3ec0c642bc9ec7635b0d38308797e99b645304bb Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com>	2024-02-26 10:40:45 +00:00
Nikolaos Kyparissas	a5fece3b91	mem: added distance parameter to stride prefetcher The Stride Prefetcher will skip this number of strides ahead of the first identified prefetch, then generate `degree` prefetches at `stride` intervals. A value of zero indicates no skip (i.e. start prefetching from the next identified prefetch address). This parameter can be used to increase the timeliness of prefetches by starting to prefetch far enough ahead of the demand stream to cover the memory system latency. [Richard Cooper <richard.cooper@arm.com>: - Added detail to commit comment and `distance` Param documentation. - Changed `distance` Param from `Param.Int` to `Param.Unsigned`. ] Change-Id: I4ce79c72d74445b12acf68e0a54e13966e30041c Co-authored-by: Richard Cooper <richard.cooper@arm.com> Signed-off-by: Richard Cooper <richard.cooper@arm.com> Reviewed-by: Andreas Sandberg <andreas.sandberg@arm.com>	2024-02-26 10:40:45 +00:00
Nikolaos Kyparissas	1ccdf407cb	mem-cache: Added clean eviction check for prefetchers. pkt->req->isCacheMaintenance() would not include a check for clean eviction before notifying the prefetcher, causing gem5 to crash. Change-Id: I4a56c7384818c63d6e2263f26645e87cef1243cb Reviewed-by: Andreas Sandberg <andreas.sandberg@arm.com>	2024-02-26 10:40:45 +00:00
Richard Cooper	9fe998a8c0	mem-cache: Update default prefetch options. Update the default prefetch options to achieve out-of-the box prefetcher performance closer to that which a typical user would expect. Configurations that set these parameters explicitly will be unaffected. The new defaults were identified as part of work on gem5 prefetchers undertaken by Nikolaos Kyparissas while on internship at Arm. Change-Id: Ia6c1803c86e42feef01de40c34d928de50fe0bed Reviewed-by: Andreas Sandberg <andreas.sandberg@arm.com>	2024-02-26 10:40:45 +00:00
Richard Cooper	05f33fbef5	mem-cache: Squash prefetch queue entries by block address. Prefetch queue entries were being squashed by comparing the address of each queued prefetch against the block address of the demand access. Only prefetches that happen to fall on a cache-line block boundary would be squashed. This patch converts the prefetch addresses to block addresses before comparison. Change-Id: I3a80a1e3d752f925595e33edebf5359d2cc67182 Reviewed-by: Andreas Sandberg <andreas.sandberg@arm.com>	2024-02-26 10:40:45 +00:00
Yu-Cheng Chang	47f3ad45d3	stdlib: Add get_last_exit_event_code to get m5 exit status code (#890 ) Change-Id: I7319437dff24e31f343e71b6b8993f833b62147c	2024-02-23 09:09:28 -08:00
Hristo Belchev	2138a4ec92	mem: Fix LIFO q_policy and add assetions * Fix selectPacket() in LIFO Queue Policy to correctly return the end of the `deque` backing store for its packet queue * Move selectPacket() implementations for FIFO and LIFO queues into `q_policy.cc` file Change-Id: I8c35e5fc83dc380b19f52be14c18b1f414f9e141	2024-02-22 21:57:08 +00:00
Yu-Cheng Chang	816ef46c78	arch-riscv: Fix fflags behavior of float inst. in O3 CPU (#868 ) According to the RISC-V spec [1]. Any float-point instructions accumulate FFLAGS register rather than write it to reflect the CSR behavior. In the previous implementation. We read the FFLAGS, set the exception flags, and write the result back to the FFLAGS. This works in the gem5 simple and minor CPU model as they are actually written to `regFile` after executing the instructions. However, in the gem5 O3 CPU model, it will record in the `destMiscReg` buffer until the commit stage when writing to the `miscReg` in the execution stage. The next instruction will get the old FFLAGS and cause the incorrect result. The CL introduced the `MISCREG_FFLAGS_EXE` and used the same size of `miscRegFile` because the `MISCREG_FFLAGS_EXE` and `MISCREG_FFLAGS` shared the same space. When executing the float-pointing instruction, any exception flags should be updated via `MISCREG_FFLAGS_EXE` to accumulate the FFLAGS in `setMiscReg` method. For the MISCREG_FFLAGS, it should only be called in the CSROp. [1] Syntactic Dependencies: Appendix A `c80ecada1c/src/mm-eplan.adoc (syntactic-dependencies-rules-9-11)` gem5 issue: https://github.com/gem5/gem5/issues/755 Change-Id: Ib7f13d95b8a921c37766a54a217a5a4b1ef17c6f	2024-02-22 08:33:34 -08:00
Hristo Belchev	f20ac07dde	mem: Fix assertions in LRG Q policy Fix assertions in LRG Queue Policy to correctly assert requestor and list validity Change-Id: I84e3f5b8936b74e7ac675faf7a3e6b9999026781	2024-02-22 14:16:20 +00:00
Harshil Patel	0f79b15b2f	tests: Update checkpoint tests to new checkpoints (#888 ) Change-Id: I1bf6d47017bcf77a4f93341c73de355372e1dea7	2024-02-21 16:37:28 -08:00
Jason Lowe-Power	c719ea960a	arch-arm: Add FEAT_FGT trapping for debug registers (#873 ) We already implemented FEAT_FGT but we were missing trapping capabilities for trapping debug registers accesses	2024-02-21 11:27:43 -08:00
Nicholas Mosier	7ac9733199	arch-x86, cpu-kvm: initialize x87 FCW (#877 ) Fix #876. The x87 floating-point control word (FCW) was not initialized at process startup in syscall emulation mode. This resulted in floating point exceptions in KVM mode when executing x87 floating-point instructions. This patch fixes the bug by initializing FCW to its reset value, 0x37F. Change-Id: Idd1573c6951524ef59466cc5c9f1e640ea7658ae	2024-02-20 07:46:44 -08:00
wmin0	4e75e35a33	dev-arm: Remove the dependency of Platform for ArmSigInterruptPin (#878 ) ArmSigInterruptPin don't send the interrupt to GIC. Instead it sends the interrupt to the irq specified in Param. When using ArmSigInterruptPin, we shouldn't ask users to provide "Platform" since it doesn't need it. To reduce the confusion, this change removes the dependency of Platform for ArmSigInterruptPin. Change-Id: I0ee507ed1c08b4fa6d3e384e28732f3acb4f6892	2024-02-20 08:50:27 +00:00
Giacomo Travaglini	8759131df3	cpu-o3, arch: Fix SMT bug arising from v23.0 and make gem5 more robust with SMT (#828 ) This PR is fixing https://github.com/gem5/gem5/issues/668. It fixes it for all ISAs other than Arm with the first commit, which is setting the number of architectural Matrix registers to 0 for those ISA which are not using them. It then partly fixes it for Arm as well with the 2nd commit: by removing RenameMap::numFreeEntries we don't stall renaming unless a matrix instruction is encountered... This means most binaries will run with SMT as long as they don't use FEAT_SME instructions. Please note: this is not simply a SMT fix, it will generally address a shortcoming in the way we were renaming instructions. If an Arm binary wants to use SMT with FEAT_SME, the 4th commit will make sure the lack of physical registers is notified explicitly at the beginning of simulation, rather than silently blocking renaming	2024-02-19 08:52:31 +00:00
Richard Cooper	308fef6b46	mem-cache: Fix possible crash in base prefetcher (#871 ) When processing memory Packets for prefetch, the `PrefetchInfo` class constructor will attempt to copy the `Packet` data. In cases where the `Packet` under consideration does not contain data, an assertion will be triggered in the Packet's `getConstPtr` method, causing the simulation to crash. This problem was first exposed by Bug #580 when processing an `UpgradeReq` memory packet. This patch addresses the problem by suppressing the copying of the `Packet` data during construction of a `PrefetchInfo` object in cases where the `Packet` has no data. This patch addresses Bug #580 [1], which was exposed by PR #564 [2], subsequently reverted by PR #581 [3] [1] https://github.com/gem5/gem5/issues/580 [2] https://github.com/gem5/gem5/pull/564 [3] https://github.com/gem5/gem5/pull/581 Change-Id: Ic1e828c0887f4003441b61647440c8e912bf0fbc Reviewed-by: Andreas Sandberg <andreas.sandberg@arm.com>	2024-02-17 14:14:57 -08:00
Giacomo Travaglini	2c0cc0040b	arch-arm: Implement FEAT_FGT Debug trapping Change-Id: I30af2b49ee604bcaa43fd419f6bc69e9ee6d9350 Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Reviewed-by: Richard Cooper <richard.cooper@arm.com>	2024-02-15 15:58:34 +00:00

1 2 3 4 5 ...

21364 Commits