derek/gem5 - gem5 - Gitea: Git with a cup of tea

derek/gem5

Author	SHA1	Message	Date
Bobby R. Bruce	2b791ff556	misc: fix g++13 overloaded-virtual warning (#363 ) There are two overloaded-virtual issues reported by g++13. 1. Copy assignment and move assignment overload is hidden in the derived class [ CXX] src/mem/cache/replacement_policies/weighted_lru_rp.cc -> ALL/mem/cache/replacement_policies/weighted_lru_rp.o In file included from src/mem/cache/base.hh:61, from src/mem/cache/base.cc:46: src/mem/cache/cache_blk.hh:172:5: error: ‘virtual gem5::CacheBlk& gem5::CacheBlk::operator=(gem5::CacheBlk&&)’ was hidden [-Werror=overloaded-virtual=] 172 \| operator=(CacheBlk&& other) \| ^~~~~~~~ src/mem/cache/cache_blk.hh:518:19: note: by ‘gem5::TempCacheBlk& gem5::TempCacheBlk::operator=(const gem5::TempCacheBlk&)’ 518 \| TempCacheBlk& operator=(const TempCacheBlk&) = delete; \| ^~~~~~~~ In this case, we can exiplict using parent operator= to keep the function overload. 2. Intended overload hidden in SystemC is reported as error. In file included from src/systemc/ext/tlm_utils/simple_initiator_socket.h:24, from src/systemc/tlm_bridge/gem5_to_tlm.hh:72, from build/ALL/python/_m5/param_Gem5ToTlmBridge256.cc:17: src/systemc/ext/tlm_utils/../tlm_core/2/sockets/initiator_socket.hh: In instantiation of ‘class tlm::tlm_base_initiator_socket<256, tlm::tlm_fw_transport_if<>, tlm::tlm_bw_transport_if<>, 1, sc_core::SC_ONE_OR_MORE_BOUND>’: src/systemc/ext/tlm_utils/../tlm_core/2/sockets/initiator_socket.hh:185:7: required from ‘class tlm::tlm_initiator_socket<256, tlm::tlm_base_protocol_types, 1, sc_core::SC_ONE_OR_MORE_BOUND>’ src/systemc/ext/tlm_utils/simple_initiator_socket.h:37:7: required from ‘class tlm_utils::simple_initiator_socket_b<sc_gem5::Gem5ToTlmBridge<256>, 256, tlm::tlm_base_protocol_types, sc_core::SC_ONE_OR_MORE_BOUND>’ src/systemc/ext/tlm_utils/simple_initiator_socket.h:156:7: required from ‘class tlm_utils::simple_initiator_socket<sc_gem5::Gem5ToTlmBridge<256>, 256, tlm::tlm_base_protocol_types>’ src/systemc/tlm_bridge/gem5_to_tlm.hh:147:46: required from ‘class sc_gem5::Gem5ToTlmBridge<256>’ /usr/include/c++/13/type_traits:1411:38: required from ‘struct std::is_base_of<sc_gem5::Gem5ToTlmBridgeBase, sc_gem5::Gem5ToTlmBridge<256> >’ ext/pybind11/include/pybind11/detail/../detail/common.h:880:59: required from ‘struct pybind11::class_<sc_gem5::Gem5ToTlmBridge<256>, sc_gem5::Gem5ToTlmBridgeBase, std::unique_ptr<sc_gem5::Gem5ToTlmBridge<256>, pybind11::nodelete> >::is_valid_class_option<sc_gem5::Gem5ToTlmBridgeBase>’ ext/pybind11/include/pybind11/detail/../detail/common.h:719:35: required by substitution of ‘template<class ... Ts> using pybind11::detail::all_of = pybind11::detail::bool_constant<(Ts::value && ...)> [with Ts = {pybind11::class_<sc_gem5::Gem5ToTlmBridge<256>, sc_gem5::Gem5ToTlmBridgeBase, std::unique_ptr<sc_gem5::Gem5ToTlmBridge<256>, pybind11::nodelete> >::is_valid_class_option<sc_gem5::Gem5ToTlmBridgeBase>, pybind11::class_<sc_gem5::Gem5ToTlmBridge<256>, sc_gem5::Gem5ToTlmBridgeBase, std::unique_ptr<sc_gem5::Gem5ToTlmBridge<256>, pybind11::nodelete> >::is_valid_class_option<std::unique_ptr<sc_gem5::Gem5ToTlmBridge<256>, pybind11::nodelete> >}]’ ext/pybind11/include/pybind11/pybind11.h:1506:70: required from ‘class pybind11::class_<sc_gem5::Gem5ToTlmBridge<256>, sc_gem5::Gem5ToTlmBridgeBase, std::unique_ptr<sc_gem5::Gem5ToTlmBridge<256>, pybind11::nodelete> >’ build/ALL/python/_m5/param_Gem5ToTlmBridge256.cc:34:179: required from here src/systemc/ext/tlm_utils/../core/sc_port.hh:125:18: error: ‘void sc_core::sc_port_b<IF>::bind(sc_core::sc_port_b<IF>&) [with IF = tlm::tlm_fw_transport_if<>]’ was hidden [-Werror=overloaded-virtual=] 125 \| virtual void bind(sc_port_b<IF> &p) { sc_port_base::bind(p); } \| ^~~~ In file included from src/systemc/ext/tlm_utils/simple_initiator_socket.h:27: src/systemc/ext/tlm_utils/../tlm_core/2/sockets/initiator_socket.hh:133:18: note: by ‘tlm::tlm_base_initiator_socket<256, tlm::tlm_fw_transport_if<>, tlm::tlm_bw_transport_if<>, 1, sc_core::SC_ONE_OR_MORE_BOUND>::bind’ 133 \| virtual void bind(bw_interface_type &ifs) { (get_base_export())(ifs); } \| ^~~~ src/systemc/ext/tlm_utils/../core/sc_port.hh:124:18: error: ‘void sc_core::sc_port_b<IF>::bind(IF&) [with IF = tlm::tlm_fw_transport_if<>]’ was hidden [-Werror=overloaded-virtual=] 124 \| virtual void bind(IF &i) { sc_port_base::bind(i); } \| ^~~~ src/systemc/ext/tlm_utils/../tlm_core/2/sockets/initiator_socket.hh:133:18: note: by ‘tlm::tlm_base_initiator_socket<256, tlm::tlm_fw_transport_if<>, tlm::tlm_bw_transport_if<>, 1, sc_core::SC_ONE_OR_MORE_BOUND>::bind’ 133 \| virtual void bind(bw_interface_type &ifs) { (get_base_export())(ifs); } \| ^~~~ From the code comment, it's intended in SystemC header. // The overloaded virtual is intended in SystemC, so we'll disable the warning. // Please check section 9.3 of SystemC 2.3.1 release note for more details. The issue is we should move the skip to the base class.	2023-09-29 10:53:52 -07:00
Harshil Patel	8182f8084b	stdlib, resources, tests: Introduce Suite of Workloads This patch introduces a new category called "suite". A suite is a collection of workloads. Each workload in a SuiteResource has a tag that can be narrowed down through the function with_input_group. Also, the set of input groups can be seen through list_input_groups. Added unit tests to test all functions of SuiteResource class. Change-Id: Iddda5c898b32b7cd874987dbe694ac09aa231f08 Co-authored-by: Kunal Pai <kunpai@ucdavis.edu>	2023-09-29 10:50:09 -07:00
Bobby R. Bruce	3a35bdf57a	arch-riscv: Update FS bits when doing floating point loads (#370 ) This problem is similar to the problem described in [1]. This problem produces symptoms as described in [2]. In short, the Linux kernel relies on the CSR_STATUS's FS bits to decide whether to save the floating point registers. If the FS bits are set to DIRTY, the floating point registers will be saved during context switching / task switching. Currently, with the patch in [1], we only change the FS bits upon every floating arithmetic instruction. However, since floating load instructions also mutate the state of floating point registers, the FS bits should be updated to DIRTY. The problem in [2] arose when the program populates the content of one floating register to an array by repeatedly using `fld fa5, EA`. A context switch occured upon a page fault, and while handling that page fault, the kernel might have to handle an interrupt. This caused the kernel to task switch between handling page fault and handling interrupt. This caused __switch_to() to be called, which will save the floating point registers only if the SD (indirectly set by FS) bits are set to DIRTY, while restoring the floating point registers to the switch-to task [3]. This caused the floating point registers to be zeroed out when it was restored as it was never saved before. [1] https://gem5-review.googlesource.com/c/public/gem5/+/65272 [2] https://github.com/gem5/gem5/issues/349 [3] https://github.com/torvalds/linux/blob/v6.5/arch/riscv/include/asm/switch_to.h#L56	2023-09-29 10:47:05 -07:00
Melissa Jost	a79dc3f23c	util: Add steps to compile clang-15 and clang-16 This updates the dockerfiles for ubuntu 22.04 to include the steps necessary to compile clang-15 and clang-16. Change-Id: I2bba6393ab93a6ce05a2c3ce31f3bbc71bcdca7c	2023-09-29 08:32:01 -07:00
Hoa Nguyen	6640447c1e	arch-riscv: Update FS bits when doing floating point loads This problem is similar to the problem described in [1]. This problem produces symptoms as described in [2]. In short, the Linux kernel relies on the CSR_STATUS's FS bits to decide whether to save the floating point registers. If the FS bits are set to DIRTY, the floating point registers will be saved during context switching / task switching. Currently, with the patch in [1], we only change the FS bits upon every floating arithmetic instruction. However, since floating load instructions also mutate the state of floating point registers, the FS bits should be updated to DIRTY. The problem in [2] arose when the program populates the content of one floating register to an array by repeatedly using `fld fa5, EA`. A context switch occured upon a page fault, and while handling that page fault, the kernel might have to handle an interrupt. This caused the kernel to task switch between handling page fault and handling interrupt. This caused __switch_to() to be called, which will save the floating point registers only if the SD (indirectly set by FS) bits are set to DIRTY, while restoring the floating point registers to the switch-to task [3]. This caused the floating point registers to be zeroed out when it was restored as it was never saved before. [1] https://gem5-review.googlesource.com/c/public/gem5/+/65272 [2] https://github.com/gem5/gem5/issues/349 [3] https://github.com/torvalds/linux/blob/v6.5/arch/riscv/include/asm/switch_to.h#L56 Change-Id: Ia5656da5a589a8e29fb699d2ee12885b8f3fa2d2 Signed-off-by: Hoa Nguyen <hn@hnpl.org>	2023-09-28 19:14:29 -07:00
Jason Lowe-Power	aaad79cf51	python: Add importer to standalone gem5py_m5 I believe the point of this binary was to allow people to use the m5 objects without the entire gem5 binary. However, without adding the importer call, this did not work. Unfortunately, with the importer call there is a circular dependence on the original gem5py.cc file. Therefore, this change creates a new file that has the importer call. Now, with the `gem5py_m5` binary you can run python code that references modules in `src/python`. Note that `_m5` is not available, so anything that depends on the gem5 SimObjects' implementation will not work. However, thic can still be useful for things like getting Resources, processing stats, etc. Change-Id: I5c0e5d1a669fe5ce491458df916f2049c81292eb Signed-off-by: Jason Lowe-Power <jason@lowepower.com>	2023-09-28 11:22:33 -07:00
Bobby R. Bruce	62d34ef374	misc: 'sim{out/err}' -> 'sim{out/err}.txt' (#250 ) By default, the `--stderr-file` and `--stdout-file` arguments were directing the simulator to output files named "simerr" and "simout" respectively if an output redirect was requested. A small annoyance is these files lack an extension meaning programs refuse to open them, or don't do so withou additional effort. On many systems they are assumed to scripts. This patch adds the `.txt` extension to both, thus clearly indicating to other programs these are text files and can be opened and read as such.	2023-09-27 17:36:03 -07:00
Bobby R. Bruce	5d254ffb02	stdlib, resources: Added pretty printing resource (#323 ) - Implemented a __str__ for AbstractResource __str__ prints resource category, id and version. link to resources website is also printed.	2023-09-27 17:32:35 -07:00
Bobby R. Bruce	14b928f77c	base: Add a warning when failing to insert a whole symbol table (#361 ) Currently we drop the insertion of a whole symbol table if the name of one symbol already exists in the base table. Having similar symbols across different binaries is common. This change adds a warning and recommends a fix instead of silently dropping the table.	2023-09-27 17:26:03 -07:00
Bobby R. Bruce	074fa4c604	misc,ext,tests: Automatically split CI TestLib tests across GitHub Action jobs (#263 ) This PR utilizes GitHub Action's matrix's to automatically distribute the CI testlib gem5 build and test jobs across available GitHub Action Runners. The CI tests (the `quick` testlib tests, i.e. those run with `./main.py run`) are distributed across the runners on a per directory basis --- all directories under "tests/gem5" are run as their own jobs. The necessary gem5 builds for each workflow are now automatically inferred via the introduction of `./main.py list`'s `--build-targets` flag which returns the gem5 build target for a given test or collection of tests. E.g., `./main.py list --build-targets` will return the build targets for all the `quick` testlib tests and `./main.py list --build-target --uid=<id>` will return the build targets the test suite `<id>` requires. Moving from monolithic jobs to fine-grained ones will make the locaiton of test failures more obvious. Each job has it's own artifact containing "test/testing-results" for the tests run in that job. In addition, maintenance of these files should become less burdensome due to less hardcoding.	2023-09-27 14:32:16 -07:00
Bobby R. Bruce	3a0f4598b9	cpu-o3: Mark getWritableRegOperand() in O3CPU as a regwrite (#360 ) As discussed here, [1], O3CPU counts getWritableRegOperand() as a reg read, while SimpleCPU variants count getWriableRegOperand() as a reg write. This patch fixes this inconsistency. Here, I assume that if getWritableRegOperand() is used, setReg() will not be used again to write to the destination register. [1] https://github.com/gem5/gem5/pull/341	2023-09-27 14:31:38 -07:00
Bobby R. Bruce	49a1d48264	arch-x86: properly initialize the auxv platform string (#347 ) The auxv platform string was not copied to the same location that was pointed to by the value of AT_PLATFORM; instead, it was copied over the auxv random buffer. This patch fixes this by copying the auxv platform string to the right offset in the initial program stack. GitHub issue: https://github.com/gem5/gem5/issues/346	2023-09-27 14:31:19 -07:00
Bobby R. Bruce	4638434b97	arch-x86: make popx87 micro-op actually pop st(0) (#345 ) The popx87 micro-op did not in fact pop the st(0) floating-point register off the stack; it acted as a no-op. This patch fixes the bug by passing the spm=1 argument to PopX87's superclass to indicate the floating-point stack pointer should be incremented. GitHub issue: https://github.com/gem5/gem5/issues/344	2023-09-27 14:31:00 -07:00
Harshil Patel	633bdc08f2	stdlib: Addressed requested changes - Added mulitline string for print message - Added get_category_name method instead of having category as variable Change-Id: I51e0e14a70e802453c21070711b200bc47994ba3	2023-09-27 11:36:51 -07:00
Melissa Jost	34c3676105	misc: Update gem5 to use clang-15 and clang-16 This introduces the changes necessary for clang-15 and clang-16 to run within gem5, and adds them to the compiler tests. Change-Id: If809eae1bd8c366b4d62476891feff0625bdf210	2023-09-27 09:35:18 -07:00
Yu-hsin Wang	9ca2672cab	misc: fix g++13 overloaded-virtual warning There are two overloaded-virtual issues reported by g++13. 1. Copy assignment and move assignment overload is hidden in the derived class [ CXX] src/mem/cache/replacement_policies/weighted_lru_rp.cc -> ALL/mem/cache/replacement_policies/weighted_lru_rp.o In file included from src/mem/cache/base.hh:61, from src/mem/cache/base.cc:46: src/mem/cache/cache_blk.hh:172:5: error: ‘virtual gem5::CacheBlk& gem5::CacheBlk::operator=(gem5::CacheBlk&&)’ was hidden [-Werror=overloaded-virtual=] 172 \| operator=(CacheBlk&& other) \| ^~~~~~~~ src/mem/cache/cache_blk.hh:518:19: note: by ‘gem5::TempCacheBlk& gem5::TempCacheBlk::operator=(const gem5::TempCacheBlk&)’ 518 \| TempCacheBlk& operator=(const TempCacheBlk&) = delete; \| ^~~~~~~~ In this case, we can exiplict using parent operator= to keep the function overload. 2. Intended overload hidden in SystemC is reported as error. In file included from src/systemc/ext/tlm_utils/simple_initiator_socket.h:24, from src/systemc/tlm_bridge/gem5_to_tlm.hh:72, from build/ALL/python/_m5/param_Gem5ToTlmBridge256.cc:17: src/systemc/ext/tlm_utils/../tlm_core/2/sockets/initiator_socket.hh: In instantiation of ‘class tlm::tlm_base_initiator_socket<256, tlm::tlm_fw_transport_if<>, tlm::tlm_bw_transport_if<>, 1, sc_core::SC_ONE_OR_MORE_BOUND>’: src/systemc/ext/tlm_utils/../tlm_core/2/sockets/initiator_socket.hh:185:7: required from ‘class tlm::tlm_initiator_socket<256, tlm::tlm_base_protocol_types, 1, sc_core::SC_ONE_OR_MORE_BOUND>’ src/systemc/ext/tlm_utils/simple_initiator_socket.h:37:7: required from ‘class tlm_utils::simple_initiator_socket_b<sc_gem5::Gem5ToTlmBridge<256>, 256, tlm::tlm_base_protocol_types, sc_core::SC_ONE_OR_MORE_BOUND>’ src/systemc/ext/tlm_utils/simple_initiator_socket.h:156:7: required from ‘class tlm_utils::simple_initiator_socket<sc_gem5::Gem5ToTlmBridge<256>, 256, tlm::tlm_base_protocol_types>’ src/systemc/tlm_bridge/gem5_to_tlm.hh:147:46: required from ‘class sc_gem5::Gem5ToTlmBridge<256>’ /usr/include/c++/13/type_traits:1411:38: required from ‘struct std::is_base_of<sc_gem5::Gem5ToTlmBridgeBase, sc_gem5::Gem5ToTlmBridge<256> >’ ext/pybind11/include/pybind11/detail/../detail/common.h:880:59: required from ‘struct pybind11::class_<sc_gem5::Gem5ToTlmBridge<256>, sc_gem5::Gem5ToTlmBridgeBase, std::unique_ptr<sc_gem5::Gem5ToTlmBridge<256>, pybind11::nodelete> >::is_valid_class_option<sc_gem5::Gem5ToTlmBridgeBase>’ ext/pybind11/include/pybind11/detail/../detail/common.h:719:35: required by substitution of ‘template<class ... Ts> using pybind11::detail::all_of = pybind11::detail::bool_constant<(Ts::value && ...)> [with Ts = {pybind11::class_<sc_gem5::Gem5ToTlmBridge<256>, sc_gem5::Gem5ToTlmBridgeBase, std::unique_ptr<sc_gem5::Gem5ToTlmBridge<256>, pybind11::nodelete> >::is_valid_class_option<sc_gem5::Gem5ToTlmBridgeBase>, pybind11::class_<sc_gem5::Gem5ToTlmBridge<256>, sc_gem5::Gem5ToTlmBridgeBase, std::unique_ptr<sc_gem5::Gem5ToTlmBridge<256>, pybind11::nodelete> >::is_valid_class_option<std::unique_ptr<sc_gem5::Gem5ToTlmBridge<256>, pybind11::nodelete> >}]’ ext/pybind11/include/pybind11/pybind11.h:1506:70: required from ‘class pybind11::class_<sc_gem5::Gem5ToTlmBridge<256>, sc_gem5::Gem5ToTlmBridgeBase, std::unique_ptr<sc_gem5::Gem5ToTlmBridge<256>, pybind11::nodelete> >’ build/ALL/python/_m5/param_Gem5ToTlmBridge256.cc:34:179: required from here src/systemc/ext/tlm_utils/../core/sc_port.hh:125:18: error: ‘void sc_core::sc_port_b<IF>::bind(sc_core::sc_port_b<IF>&) [with IF = tlm::tlm_fw_transport_if<>]’ was hidden [-Werror=overloaded-virtual=] 125 \| virtual void bind(sc_port_b<IF> &p) { sc_port_base::bind(p); } \| ^~~~ In file included from src/systemc/ext/tlm_utils/simple_initiator_socket.h:27: src/systemc/ext/tlm_utils/../tlm_core/2/sockets/initiator_socket.hh:133:18: note: by ‘tlm::tlm_base_initiator_socket<256, tlm::tlm_fw_transport_if<>, tlm::tlm_bw_transport_if<>, 1, sc_core::SC_ONE_OR_MORE_BOUND>::bind’ 133 \| virtual void bind(bw_interface_type &ifs) { (get_base_export())(ifs); } \| ^~~~ src/systemc/ext/tlm_utils/../core/sc_port.hh:124:18: error: ‘void sc_core::sc_port_b<IF>::bind(IF&) [with IF = tlm::tlm_fw_transport_if<>]’ was hidden [-Werror=overloaded-virtual=] 124 \| virtual void bind(IF &i) { sc_port_base::bind(i); } \| ^~~~ src/systemc/ext/tlm_utils/../tlm_core/2/sockets/initiator_socket.hh:133:18: note: by ‘tlm::tlm_base_initiator_socket<256, tlm::tlm_fw_transport_if<>, tlm::tlm_bw_transport_if<>, 1, sc_core::SC_ONE_OR_MORE_BOUND>::bind’ 133 \| virtual void bind(bw_interface_type &ifs) { (get_base_export())(ifs); } \| ^~~~ From the code comment, it's intended in SystemC header. // The overloaded virtual is intended in SystemC, so we'll disable the warning. // Please check section 9.3 of SystemC 2.3.1 release note for more details. The issue is we should move the skip to the base class. Change-Id: I6683919e594ffe1fb3b87ccca1602bffdb788e7d	2023-09-27 13:43:28 +08:00
Andreas Sandberg	cfa13f9feb	sim: Probe listener template with lambda (#356 ) Adds a new probe listener template which can be used to instantiate with a lambda function that is called by notify(). It is similar to ProbeListenerArg with class but provides more flexibility. I.e. the can be another object than the one instantiating the lambda which allows to listen to any object. Furthermore additional parameters can be passed in easily. Change-Id: Iba451357182caf25097b9ae201cd5c647aff3a4f	2023-09-26 10:08:24 +01:00
Giacomo Travaglini	f5968da41c	mem-ruby: start using txnid and DBID identifiers in CHI transactions (#288 ) With this PR our CHI implementation starts making use of the txnid and DBID identifiers. Note: we were already making use of the txnId for DVM messages to convey the DVM address. This is still the case. In the future we should realign the DVM logic so that the txnId is solely used as a transaction identifier.	2023-09-26 09:51:47 +01:00
Hoa Nguyen	91e55d9c60	base: Add warning when failing to insert a whole symbol table Current we drop the insertion of a whole symbol table if the name of one symbol already exists in the base table. Having similar symbols across different binaries is very common. This change adds a warning and recommends a fix instead of silently dropping the table. This is useful for debugging when there are two or more workloads, e.g. bootloader + kernel, are added separately. Change-Id: I9e4cf06037cd70926fb5cee3c4dab464daf0912e Signed-off-by: Hoa Nguyen <hn@hnpl.org>	2023-09-25 16:27:21 -07:00
Hoa Nguyen	b759f22cc9	cpu-o3: Mark getWritableRegOperand() in O3CPU as a regwrite As discussed here, [1], O3CPU counts getWritableRegOperand() as a reg read, while SimpleCPU variants count getWriableRegOperand() as a reg write. This patch fixes this inconsistency. Here, I assume that if getWritableRegOperand() is used, setReg() will not be used again to write to the destination register. [1] https://github.com/gem5/gem5/pull/341 Change-Id: If00049eb598f6722285e9e09419aef98ceed759f Signed-off-by: Hoa Nguyen <hn@hnpl.org>	2023-09-25 12:29:28 -07:00
Jason Lowe-Power	010ac43369	arch-riscv: Make RISC-V decodeInst overridable (#350 ) The change will allow developers to implement and decode their non-standard instructions to the CPU models	2023-09-25 06:43:56 -07:00
David Schall	7cb308db90	sim: Probe listener template with lambda Adds a new probe listener template which can be used to instantiate with a lambda function that is called by notify(). It is similar to ProbeListenerArg with class but provides more flexibility. I.e. the can be another object than the one instantiating the lambda which allows to listen to any object. Furthermore additional parameters can be passed in easily. Change-Id: Iba451357182caf25097b9ae201cd5c647aff3a4f Signed-off-by: David Schall <david.schall@ed.ac.uk>	2023-09-25 08:51:00 +00:00
Giacomo Travaglini	9d63a1492a	cpu: Add override to TraceCPU init function (#348 ) This introduces a fix that caused the clang compiler tests to fail here: https://github.com/gem5/gem5/actions/runs/6195015407 Change-Id: I48c61539f497976c038c6e8e379d00285e1c39c7	2023-09-25 09:10:33 +01:00
Giacomo Travaglini	83224e2c85	arch: Enable customized decoder class name (#351 ) Developers can make the own ISADesc action in the SConscript with their decoder class name. Change-Id: I011cf059642e178913e1f62df4e5c02401cc132e	2023-09-25 09:10:06 +01:00
Giacomo Travaglini	df60b0f5c9	arch-arm: Implement FEAT_FGT Change-Id: I89391f17f353ab6ce555d65783977c1f30f64fc5 Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com>	2023-09-22 16:33:58 +01:00
Giacomo Travaglini	37b6824c4c	arch-arm: Fix disassembly for NZCV read/writes At the moment the instruction is disassembled as an integer operation: msrNZCV x547, x0 Instead of msr nzcv x0 Change-Id: I3f6576dccbe86db401c73747750ca3cfdf4055d5 Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com>	2023-09-22 16:33:58 +01:00
Roger Chang	d55f8f2716	arch: Enable customized decoder class name Developers can make the own ISADesc action in the SConscript with their decoder class name. Change-Id: I011cf059642e178913e1f62df4e5c02401cc132e	2023-09-22 15:45:56 +08:00
Roger Chang	5b41112e03	arch-riscv: Make RISC-V decodeInst overridable The change will allow developers to implement and decode their non-standard instructions to the CPU models Bug: 289467440 Test: None Change-Id: I67f4abc71596f819c1265e325784f51c8e9bb359	2023-09-22 11:38:22 +08:00
Bobby R. Bruce	391f62b213	misc: 'sim{out/err}' -> 'sim{out/err}.txt' By default, the --stderr-file and --stdout-file arguments were directing the simulator output to files named "simerr" and "simout" respectively if an output redirect was requested. A small annoyance is these files lack an extension meaning programs refuse to open them, or to do so without some additional effort. On many systems they are assumed to scripts. This patch adds the .txt extension to both, thus clearly indicating to other programs these are text files and can be opened to be read as such. Change-Id: Iff5af4a9e6966b4467d005a029dbf401099fbd35	2023-09-21 12:57:43 -07:00
Bobby R. Bruce	f5a255c68d	configs: Fixed Typo (#337 ) Fixed a typo importing obtain_resource	2023-09-21 11:58:49 -07:00
Bobby R. Bruce	3f9afe96c6	python,util: Add Python MyPy Stubgen to enable Pylance IntelliSense (#307 ) This allows us to generate stubs for the modules in gem5. The output will be a "typings" directory which can be used by Pylance (Python IntelliSense) to infer typings in Visual Studio Code. Note: A "typings" directory in the root of the workspace is the default location for Pylance to look for typings. This can be changed via `python.analysis.stubPath` in "settings.json". Usage ===== ``` pip3 install -r requirements.txt scons build/ALL/gem5.opt -j$(nproc) ./build/ALL/gem5.opt util/gem5-stubgen.py ```	2023-09-21 11:52:16 -07:00
Melissa Jost	d297da3654	cpu: Add override to TraceCPU init function This introduces a fix that caused the clang compiler tests to fail here: https://github.com/gem5/gem5/actions/runs/6195015407 Change-Id: I48c61539f497976c038c6e8e379d00285e1c39c7	2023-09-21 10:11:39 -07:00
Nicholas Mosier	7298ebd49b	arch-x86: properly initialize the auxv platform string The auxv platform string was not copied to the same location that was pointed to by the value of AT_PLATFORM; instead, it was copied over the auxv random buffer. This patch fixes this by copying the auxv platform string to the right offset in the initial program stack. GitHub issue: https://github.com/gem5/gem5/issues/346 Change-Id: Ied4b660d5fc444a94acb97b799be0a3722438b5e	2023-09-21 05:16:17 +00:00
Nicholas Mosier	5697bf26a8	arch-x86: make popx87 micro-op actually pop st(0) The popx87 micro-op did not in fact pop the st(0) floating-point register off the stack; it acted as a no-op. This patch fixes the bug by passing the spm=1 argument to PopX87's superclass to indicate the floating-point stack pointer should be incremented. GitHub issue: https://github.com/gem5/gem5/issues/344 Change-Id: I6e731882b6bcf8f0e06ebd2f66f673bf9da80717	2023-09-21 04:29:05 +00:00
Bobby R. Bruce	958eda6961	arch-riscv: Fix inst flags for jal and jalr (#325 ) The jal and jalr share the same instruction format JumpConstructor, which sets the IsCall and IsReturn flags by the register ID. However, it may cause wrong instruction flags set for jal because the section "handle the 'Jalr' instruction" misses the opcode checking. The PR fix the issue to ensure the IsReturn can be only set in Jalr.	2023-09-20 16:25:21 -07:00
Bobby R. Bruce	aa0702c6eb	dev-amdgpu: Handle GPU atomics on host memory addresses (#328 ) It is possible to execute a GPU atomic instruction using a memory address that is in the host memory space (e.g, HMM, __managed__, hipHostMalloc'd address). Since these are in host memory they are passed to the SystemHub DmaDevice. However, this currently executes as a write packet without modifying data. This leads to hangs in applications that use atomics for forward progress (e.g., HeteroSync). It is not clear where these are handled on a real GPU, but they are certainly not handled by the software stack nor driver, so they must be handled in hardware and therefore implemented in gem5. Handling for atomics in the SystemHub makes the most sense. To make atomics work a few extra changes need to be made to the SystemHub. (1) The atomic is implemented as a host memory read, followed by calling the AtomicOpFunctor, followed by a write. This requires a second event to handle read response, performing atomic, and issuing a write. (2) Atomics must be serialized otherwise two atomics might return the same value which is incorrect. This patch adds serialization logic for all request types to the same address to handle this. (3) With the added complexity of the SystemHub, a new debug flag explicitly for SystemHub is added. Testing done: The heterosync application with input "sleepMutex 10 16 4" previously hung before this patch. It passes with the patch applied. This application tests both (1) and (2) above, as it allocates locks with hipHostMalloc and has multiple workgroups sending an atomic request in the same Tick, verifying the serialization mechanism.	2023-09-20 16:24:56 -07:00
Bobby R. Bruce	4526a314a9	arch-x86: fix negative overflow check bug in PACK micro-op (#332 ) The implementation of the x86 PACK micro-op had a logical bug that caused the `PACKSSWB` and `PACKSSDW` instructions to produce incorrect results. Specifically, due to a signedness error, the overflow check for negative integers being packed always evaluated to true, resulting in all negative integers being packed as -1 in the output. This patch fixes the signedness error that causes the bug. GitHub issue: https://github.com/gem5/gem5/issues/331	2023-09-20 16:18:16 -07:00
Leo Redivo	83374bdf99	misc: changed name get_default_disk_device to get_disk_device Change-Id: Ida9673445a4426ddedc8221010204bd2b71103a5	2023-09-20 15:28:49 -07:00
Marco Kurzynski	516dcf3bcd	configs: Fixed Typo Fixed a typo importing obtain_resource Change-Id: I5792ca161187c6576e2501e5aaea610d8b8ee5ea	2023-09-20 21:42:56 +00:00
Pu (Luke) Yi	3c38d4952a	mem: fix bug in 3-level cache Change-Id: I5b875908ac8f81180d781e609869e2f6fe1a8dc4	2023-09-20 12:15:33 -07:00
Matthew Poremba	63cabf2848	dev-amdgpu: Handle GPU atomics on host memory addresses It is possible to execute a GPU atomic instruction using a memory address that is in the host memory space (e.g, HMM, __managed__, hipHostMalloc'd address). Since these are in host memory they are passed to the SystemHub DmaDevice. However, this currently executes as a write packet without modifying data. This leads to hangs in applications that use atomics for forward progress (e.g., HeteroSync). It is not clear where these are handled on a real GPU, but they are certianly not handled by the software stack nor driver, so they must be handled in hardware and therefore implemented in gem5. Handling for atomics in the SystemHub makes the most sense. To make atomics work a few extra changes need to be made to the SystemHub. (1) The atomic is implemented as a host memory read, followed by calling the AtomicOpFunctor, followed by a write. This requires a second event to handle read response, performing atomic, and issuing a write. (2) Atomics must be serialized otherwise two atomics might return the same value which is incorrect. This patch adds serialization logic for all request types to the same address to handle this. (3) With the added complexity of the SystemHub, a new debug flag explicitly for SystemHub is added. Testing done: The heterosync application with input "sleepMutex 10 16 4" previously hung before this patch. It passes with the patch applied. This application tests both (1) and (2) above, as it allocates locks with hipHostMalloc and has multiple workgroups sending an atomic request in the same Tick, verifying the serialization mechanism. Change-Id: Ife84b30037d1447dd384340cfeb06fdfd472fff9	2023-09-20 13:52:25 -05:00
Bobby R. Bruce	6eb7c10eb9	misc: Add HACC GPU tests (#258 ) This adds the HACC GPU tests to be run weekly	2023-09-20 11:26:54 -07:00
Roger Chang	70c1d762c7	arch-riscv: Fix inst flags for jal and jalr The jal and jalr share the same instruction format JumpConstructor, which sets the IsCall and IsReturn flags by the register ID. However, it may cause wrong instruction flags set for jal because the section "handle the 'Jalr' instruction" misses the opcode checking. The PR fix the issue to ensure the IsReturn can be only set in Jalr. Change-Id: I9ad867a389256f9253988552e6567d2b505a6901	2023-09-20 14:27:23 +08:00
Nicholas Mosier	741a901d8d	arch-x86: fix negative overflow check bug in PACK micro-op The implementation of the x86 PACK micro-op had a logical bug that caused the `PACKSSWB` and `PACKSSDW` instructions to produce incorrect results. Specifically, due to a signedness error, the overflow check for negative integers being packed always evaluated to true, resulting in all negative integers being packed as -1 in the output. This patch fixes the signedness error that causes the bug. GitHub issue: https://github.com/gem5/gem5/issues/331 Change-Id: I44b7328a8ce31742a3c0dfaebd747f81751e8851	2023-09-20 05:09:32 +00:00
Bobby R. Bruce	561f3bd75b	misc,tests: Split testlib CI Tests to one dir-per-job This splits the CI Tests to one job per sub-directory in "tests/gem5" via a matrix. Advantages: * We can utilize more runners to run the quick tests. This should mean tests run quicker. * This approach does not require editing of the workflow as more tests are added or taken away. * There is now an output artifact for each directory in "tests/gem5" instead of one for the entriety of every quick test in "tests". In addition: * The artifact retention for the test outputs has been increased to 30 days. * The output test artifacts have been renamed to be more descriptive of the job, run, attempt, directory run, and the status. * The 'tar' step has been removed. GitHub's 'action/artifact' can handle directories. Change-Id: I5b3132b424e3769d81d9cd75db2a8c59dbe4a7e5	2023-09-19 19:35:58 -07:00
Bobby R. Bruce	6921d94373	python: Recursively create checkpoint dir While there was code present in "serialize.cc" to create the checkpoint directory, it did not do recursively. This patch ensures all the directories are created in a path to the checkpoint directory. Change-Id: Ibcf7f800358fd89946f550b8cfb0cef8b51fceac	2023-09-19 15:48:11 -07:00
Bobby R. Bruce	efd58f9b72	tests: Remove ":" from testing results output dir name Colons in path names is not advisable. Change-Id: I7748a36cabafde69759f7a9892f7b8910470b85e	2023-09-19 15:48:11 -07:00
Bobby R. Bruce	0337613afc	ext,tests: Add `--build-targets` option to `./main.py list` This allows for build target information (i.e., the gem5 binary to be built for the tests) to be returned. Change-Id: I6638b54cbb1822555f58e74938d36043c11108ba	2023-09-19 15:48:10 -07:00
Bobby R. Bruce	13b77b3e41	ext,tests: Allow passing of `--uid` to `./main.py list` This is useful for listing the fixtures of a Suite. Change-Id: Id2f1294cc7dea03a6b26e8abc5083886fe0299d9	2023-09-19 15:48:10 -07:00
Bobby R. Bruce	43226004a1	ext,tests: Fix `--figures` flag when using `./main.py list` Now the "tests/main.py" script will accept the `--fixtures` flag when using the `list` command. This will only list the fixtures needed. To have this implemented `__str__` for the `Fixture` class has been implemented. Change-Id: I4bba26e923c8b0001163726637f2e48c801e92b1	2023-09-19 15:48:10 -07:00

1 2 3 4 5 ...

20732 Commits