derek/gem5 - gem5 - Gitea: Git with a cup of tea

derek/gem5

Author	SHA1	Message	Date
Tommaso Marinelli	b5e27f5ed8	configs: Generalize class types in CHI RNF/MN generators (#1851 ) Classes CHI_RNF and CHI_MN can be specialized to override base class/subclass attributes, like it happens in CustomMesh with router_list (see configs/example/noc_config/2x4.py). To avoid missing these attributes, it is needed to generalize the class types when instantiating the objects in the recently added generators.	2024-12-18 21:16:26 -08:00
2channelkrt	f799d91309	ruby-chi: fix wrong ruby-CHI base class name (#1817 ) fix ruby-CHI base class name so it actually runs previously was combined with PR #1797	2024-12-04 15:47:44 -08:00
Harshil Patel	e51bc00dc7	misc: revert riscvmatched-fs.py due to a bug - link to issue https://github.com/gem5/gem5/issues/1554 Change-Id: Ic9cf6e5166eeee2226b6022e6f7c971d4e7caaeb	2024-12-02 08:41:58 -08:00
Erin (Jianghua) Le	e221a70355	Add ExitEvent import to arm-ubuntu-run.py	2024-12-02 08:41:58 -08:00
Harshil Patel	630173a845	misc: update fs examples to use ubuntu 24.04 boot workloads Change-Id: I7e16f69eff3a7ff0ab16c18e6d35e846d07ac829	2024-12-02 08:41:55 -08:00
Mahyar Samani	2fca39cec7	dev-amdgpu: Separating gpu_memory from gpu_cache. This change separates the instantiation of gpu memory from instantiatiing the gpu cache. Prior to this change, the gpu cache instantiated the memories for the gpu by receiving number of channels as a parameter. With this change, the gpu memory should be constructed outside the gpu, without being added as a child to any other object, and passed to the constructor of the gpu.	2024-12-02 08:33:12 -08:00
Matthew Poremba	2105dc47a9	stdlib: Add viper board, viper cache, and gpu components Adds GPU_VIPER protocol related caches to stdlib components: CorePair cache, TCP, SQC, TCC, Directory, and DMA controllers. Adds GPU related components in a new components/devices/gpus/ directory. Adds prebuilt GPU and CPU cache hierarchies, GPU and CPU network classes, and a board overriding the X86Board to provide helper methods for disk image root, the complex kernel parameter list, and method to provide functionality to the current GPUFS scripts to load in applications and handle loading the GPU driver. The new GPU components can be used as follows: - Create a GPU device before the CPU cache hierarchy is created. - Add the GPU's CPU-side DMA controllers to the list of CPU cache controllers. - Use GPU device method to connect to an AbstractBoard. Each GPU components has it's own RubySystem, PCI device ID, and address ranges for VBIOS and legacy PCI BARs. Therefore, in theory, multiple GPUs can be created. This requires PR #1453 . An example of using this board is added to configs/example/gem5_library under x86-mi300x-gpu.py. It is designed to work with the disk image, kernel, and applications provided in the gem5-resources repository. Change-Id: Ie65ffcfee5e311d9492de935d6d0631260645cd3	2024-12-02 08:33:12 -08:00
Giacomo Travaglini	76541929c9	configs: Instantiate RNFs and MN via callbacks This commit allows top level configs making use of the Ruby module to define node generation callbacks. The config_ruby function will check the system object for two factory methods 1) _rnf_gen, if defined, will be called to generate RNFs 2) _mn_gen, if defined, will be called to generate MNs Change-Id: I9daeece646e7cdb2d3bfefa761a9650562f8eb4b Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com>	2024-12-02 08:33:11 -08:00
Erin Le	bcfa988a67	tests, scons: Fix Testlib test failures This commit changes the fs/linux/arm and learning_gem5 tests as they were previously failing with the Ruby change. The fs/linux/arm long tests require the addition of a new gem5 build, ARM_X86, which builds the ARM and X86 ISAs with the MESI_Two_Level cache hierarchy.	2024-11-19 11:00:37 -08:00
Jason Lowe-Power	97542c1a4c	mem-ruby,scons: Add scons option for multiple protocols This change does many things, but they must all be atomically done. USER FACING CHANGE: The Ruby protocols in Kconfig have changed names (they are now the same case as the SLICC file names). So, after this commit, your build configurations need to be updated. You can do so by running `scons menuconfig <build dir>` and selecting the right ruby options. Alternatively, if you're using a `build_opts` file, you can run `scons defconfig build/<ISA> build_opts/<ISA>` which should update your config correctly. Detailed changes are described below. Kconfig changes: - Kconfig files in ruby now must all be declared in the ruby/Kconfig file - All of the protocol names are changed to match their slicc file names including the case - A new option is available called "Use multiple protocols" which should be selected if multiple protocols are selected. This is only used to set the PROTOCOL variable to "MULTIPLE" when in multiple mode. - The PROTOCOL variable can now be "MULTIPLE" which means it will be ignored. If it's not "MULTIPLE" then it holds the "main" protocol, which is necessary for backwards compatibility with the Ruby.py files. Ruby config changes: To make this change backwards compatible with Ruby.py, this change adds a new "protocol" config called MULTIPLE.py which is used to allow the user to set a "--protocol" option on the command line. This is only needed if you are using a gem5 binary with multiple protocols but need to use Ruby.py. stdlib changes: - Make the coherence protocol file behave like the ISA file - Add a function to get the coherence protocol from the `CacheHierarchy` like we do with the ISA in the `Processor`. - Use this function where `get_runtime_coherence_protocol` was used - Update the requires code to work with the ne CoherenceProtocol - Fix a typo in the AMD Hammer name and also add the missing MSI protocol Scons changes: - In Ruby we now gather up all of the protocols and build them all if there are multiple protocols - There's some bending over backwards to tell the user if they are using an out of date gem5.build/config file and how to update it - Note that multiple ruby protocols adds a significant amount of time to the build since we have to run slicc twice for each file. build_opts: - Update all files with new names - Add a new NULL_All_Ruby that will be used for testing Signed-off-by: Jason Lowe-Power <jason@lowepower.com>	2024-11-19 11:00:34 -08:00
Jason Lowe-Power	4f53451073	mem-ruby,configs: Update AMD protos with new names Update the MOESI_AMD_Base and GPU_VIPER configuration files with the new full protocol-specific names for the controllers instead of the deprecated names. Note: If you have any files which use the `CntrlBase` base, you will likely need to update the class names that you are inheriting from. Change-Id: I623fea7dd4cd151f7b15fe7cb43f8a4c45492d89 Signed-off-by: Jason Lowe-Power <jason@lowepower.com>	2024-11-19 10:53:59 -08:00
Jason Lowe-Power	42fe5accea	configs,mem-ruby: Procotol-spec. names in CHI Use the protocol-specific controller names in CHI. Important: This could change some scripts. As long as people use CHI_config (likely), this shouldn't be a problem, but if you have a different version of CHI_config.py locally, you will need to make the following updates: `Cache_Controller` -> `CHI_Cache_Controller` `Memory_Controller` -> `CHI_Memory_Controller` Website updates coming soon! Change-Id: I7afdcede884ac5f9a9a76cc3d3dd35941e4e2faa Signed-off-by: Jason Lowe-Power <jason@lowepower.com>	2024-11-19 10:53:59 -08:00
Jason Lowe-Power	d56d561102	configs,mem-ruby: Protocol-spec. in learning gem5 Use protocol-specific names in Learning gem5 configs. Now, we should no longer use the generic names for the controllers (it's deprecated). This updates Learning gem5. Website changes coming soon. (Hopefull before I push this...) Change-Id: I18fc5b8bb0fef7c3b8b5cea8de4f73fc0f66a1b3 Signed-off-by: Jason Lowe-Power <jason@lowepower.com>	2024-11-19 10:53:58 -08:00
studyztp	0d16c92341	cpu: add comments and change input type to list Change inst_threshold param to inst_thresholds, which it is now expecting a list of thresholds instead of one threshold. Add more getter and setter functions: addThreshold: it is for adding new thresholds getCounter: it is for getting the current counter getThresholds: it returns the list of targeted thresholds resetThresholds: it clears all the targeted thresholds Change-Id: I48d022effe7b315112ac150e6a4eaf5aab41c514	2024-11-18 11:24:26 -08:00
studyztp	9fede07f44	cpu: modified with review feedback x86-global-inst-tracker.py: - change the incorrect use of comment styly - add more comments about the usage of the script and the purpose of the script src/cpu/probes/inst_tracker.cc: - change the way of stopListening to use the manager function to remove listeners. If in the future, the ProbeListner object does not call the manager to remove itself in the destruction, then we should call it here. - fix stlying src/cpu/probes/inst_tracker.hh: - fix stlying Change-Id: I6f3d745e15883a8a702593f72f984e0d4cc4c526	2024-11-18 11:24:04 -08:00
studyztp	6e39b737c8	cpu: add an example config script Change-Id: Id24f60d43d61766526bd45086f9aeda02fe24822	2024-11-18 11:23:58 -08:00
Tommaso Marinelli	7f50372979	configs: Update legacy RISC-V FS Linux script (#1753 ) This PR improves the legacy RISC-V FS Linux script in the following ways: - Adds an argument to specify the bootloader, to (optionally) use the `RiscvBootloaderKernelWorkload` class. - Updates the DTB generation function adding the Chosen node. This fixes the execution with recent Linux kernels. - Checks if the `--kernel` required argument is set.	2024-11-05 10:57:57 -08:00
Matt Sinclair	853f2ea012	configs,scons: Update scripts and build_opts to make GPU-FS simulations more configurable (#1693 ) This PR adds support for command line arguments in GPU-FS runs to allow the user to configure several parts of the GPU. It also increases the bits per set in the build_opts/VEGA_X86 file to enable GPU-FS simulations to use 64 directories or more.	2024-10-28 17:19:18 -05:00
Erin (Jianghua) Le	f01d68bf96	stdlib, configs: Add RiscvDemoBoard (#1490 ) This PR adds a RiscvDemoBoard that can be used with both SE and FS mode.This was tested using the workloads riscv-matrix-multiply-run for SE and riscv-ubuntu-20.04-boot for FS. Two example config scripts have also been added.	2024-10-22 10:13:22 -07:00
Matthew Poremba	16217f843f	mem-ruby: Fix issues in protocols due to multi-RubySystem (#1690 ) Starting with https://github.com/gem5/gem5/pull/1453 , some Ruby structures require a block size be set and other require a pointer to the Ruby system. This fixes some cases which were not covered by the per-checkin tests but seen in daily+ tests. In particular: - WriteMasks and PerfectCacheMemory must explicitly set a block size. - NetDest and RubyProxyPort require RubySystem pointer. - Classes inheriting Message now have a setRubySystem collecting all objects that need a RubySystem pointer and this should be called in the constructor of the Message. This commit makes sure all of these happen. This should fix daily arm_boot_tests and daily learning_gem5 tests.	2024-10-21 12:30:03 -07:00
Bobby R. Bruce	b705629b83	learning-gem5: Add `ruby_system` param set to `RubyPortProxy` (#1686 ) This missing parameter causing the Learning gem5 tests to fail. Note: We need to update the website's learning gem5 examples to reflect this change.	2024-10-20 13:04:47 -07:00
Nagendra-KJ	a443b5cbb8	configs: Added command line arguments to gpufs config scripts This commit adds command line arguments to the scripts that GPU-FS mode uses. Change-Id: I5514e77e699b9144461bbd2be6e267e7d44a6fb2	2024-10-20 11:53:21 -05:00
Harshil Patel	946bf83b75	arch-arm: Add arm demo board (#1478 ) This demo board is a preset arm board, that can be used to run example gem5 simulations. This board doesnt simulate any known hardware. The board will be used to run benchmarks such as gapbs and npb to collect stats. The plan is to show these stats on the gem5 resources website to provide more details about the resources.	2024-10-18 05:36:31 -07:00
Bobby R. Bruce	0341c5a502	SE script and tests for risc-v's vector extension (#1542 ) This two commits add the SE config and test script, respectively, to run the rvv tests mentioned in #1246.	2024-10-17 10:26:30 -07:00
Ivana Mitrovic	20965f571b	stdlib: Extend `AbstractBoard` pre_instantiation functionality (#1497 ) * Deprecates the setting of FS/SE mode via the `Simulator` module. * Moved the creation of the `Root` object from the `Simulator` to the board. * Moved the setting of `sim_quantum` from the `Simulator` to the processor. * Allows for easier development of boards which support both SE and FS mode simulation by moving board setup function calls to occur after the set_workload function is call which sets a boards stats `is_fs` status.	2024-10-14 10:12:41 -07:00
Saúl Adserias	a35f146ba2	configs: add example RVV SE parametrized config Change-Id: I0776c5751da8b80340166ab518593686d141a4dd	2024-10-11 17:32:09 +02:00
Pranith	50f652a2ee	Implement BTB using the cache library (#1537 ) This enables the BTB to be associative and use various replacement policies.	2024-10-10 17:05:22 +01:00
pre-commit-ci[bot]	54487d3bf6	[pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci	2024-10-09 14:04:56 +00:00
Matthew Poremba	4f7b3ed827	mem-ruby: Remove static methods from RubySystem (#1453 ) There are several parts to this PR to work towards #1349 . (1) Make RubySystem::getBlockSizeBytes non-static by providing ways to access the block size or passing the block size explicitly to classes. The main changes are: - DataBlocks must be explicitly allocated. A default ctor still exists to avoid needing to heavily modify SLICC. The size can be set using a realloc function, operator=, or copy ctor. This is handled completely transparently meaning no protocol or config changes are required. - WriteMask now requires block size to be set. This is also handled transparently by modifying the SLICC parser to identify WriteMask types and call setBlockSize(). - AbstractCacheEntry and TBE classes now require block size to be set. This is handled transparently by modifying the SLICC parser to identify these classes and call initBlockSize() which calls setBlockSize() for any DataBlock or WriteMask. - All AbstractControllers now have a pointer to RubySystem. This is assigned in SLICC generated code and requires no changes to protocol or configs. - The Ruby Message class now requires block size in all constructors. This is added to the argument list automatically by the SLICC parser. (2) Relax dependence on common functions in src/mem/ruby/common/Address.hh so that RubySystem::getBlockSizeBits is no longer static. Many classes already have a way to get block size from the previous commit, so they simply multiple by 8 to get the number of bits. For handling SLICC and reducing the number of changes, define makeCacheLine, getOffset, etc. in RubyPort and AbstractController. The only protocol changes required are to change any "RubySystem::foo()" calls with "m_ruby_system->foo()". For classes which do not have a way to get access to block size but still used makeLineAddress, getOffset, etc., the block size must be passed to that class. This requires some changes to the SimObject interface for two commonly used classes: DirectoryMemory and RubyPrefecther, resulting in user-facing API changes User-facing API changes: - DirectoryMemory and RubyPrefetcher now require the cache line size as a non-optional argument. - RubySequencer SimObjects now require RubySystem as a non-optional argument. - TesterThread in the GPU ruby tester now requires the cache line size as a non-optional argument. (3) Removes static member variables in RubySystem which control randomization, cooldown, and warmup. These are mostly used by the Ruby Network. The network classes are modified to take these former static variables as parameters which are passed to the corresponding method (e.g., enqueue, delayHead, etc.) rather than needing a RubySystem object at all. Change-Id: Ia63c2ad5cf0bf9d1cbdffba5d3a679bb4d3b1220 (4) There are two major SLICC generated static methods: getNumControllers() on each cache controller which returns the number of controllers created by the configs at run time and the functions which access this method, which are MachineType_base_count and MachineType_base_number. These need to be removed to create multiple RubySystem objects otherwise NetDest, version value, and other objects are incorrect. To remove the static requirement, MachineType_base_count and MachineType_base_number are moved to RubySystem. Any class which needs to call these methods must now have a pointer to a RubySystem. To enable that, several changes are made: - RubyRequest and Message now require a RubySystem pointer in the constructor. The pointer is passed to fields in the Message class which require a RubySystem pointer (e.g., NetDest). SLICC is modified to do this automatically. - SLICC structures may now optionally take an "implicit constructor" which can be used to call a non-default constructor for locally defined variables (e.g., temporary variables within SLICC actions). A statement such as "NetDest bcast_dest;" in SLICC will implicitly append a call to the NetDest constructor taking RubySystem, for example. - RubySystem gets passed to Ruby network objects (Network, Topology).	2024-10-08 08:14:50 -07:00
Matthew Poremba	f5858fe81f	dev-amdgpu: Deprecate rom and mmio trace params (#1633 ) The ROM field was originally intended as a future alternate way to load VBIOS without the ROM being on the disk image. This code path is never taken for the devices gem5 supports and there is no gem5 implementation. Deprecate the rom_binary field for this reason. Similarly, MMIO traces were only used for Vega10. Deprecate this as Vega10 is now deprecated. The MMIO trace reader is kept as it may still be useful in the future. It is still the primary way to handle devies which have graphics capability. None of the devices supported by gem5 have graphics now that Vega10 is deprecated.	2024-10-07 07:12:07 -07:00
Bobby R. Bruce	4bdcb040d0	stdlib: Move Root obj creation from Simulator to Board It makes much more sense for the Root Object to be create within the board and passed where required. Creating it in the Simulator class is not required. For this to work the signuature of the `_pre_instantiate` function in `AbstractBoard` has been updated to return the Root object.	2024-10-04 11:40:13 -07:00
Matthew Poremba	24504c9a3e	dev-amdgpu: Use GPU specific cache line size (#1621 ) Invalidate requests align to system cache line size. This causes problems if the GPU cache hierarchy's cache line size is different than the system as the unlaigned requests never return, leading to deadlock on deferred dispatch. This commit uses the cache line size from the GPU memory manager and makes the cache line size there non-optional. Tested with multiple RubySystems where CPU side was 64B and GPU side was 128B cache lines.	2024-10-03 08:47:08 -07:00
Matthew Poremba	c8c75959ad	configs: Deprecate Vega10 (#1619 ) Vega10 is no longer officially supported by ROCm and ROCm is starting to use some packet types not supported. These were originally kept to allow users to use older disk images with newer gem5. Going forward the gem5 version and gem5-resources releases will be required to be the same to prevent lingering old configs. As a replacement for vega10*.py, mi300.py or mi200.py should be used. HIP examples, cookbook, and rodinia configs can be replaced with the standard flow of building / obtaining the GPU application and running using mi300.py or mi200.py as they do not require any input options and therefore do not require changes to the disk image.	2024-10-02 14:18:41 -07:00
Erin (Jianghua) Le	c10feed524	tests, configs, util, mem, python, systemc: Change base 10 units to base 2 (#1605 ) This commit changes metric units (e.g. kB, MB, and GB) to binary units (KiB, MiB, GiB) in various files. This PR covers files that were missed by a previous PR that also made these changes.	2024-10-01 11:18:05 -07:00
Bobby R. Bruce	f2f86a3e42	stdlib, python: Add warning message and clarify binary vs metric units (#1479 ) This PR changes memory and cache sizes in various parts of the gem5 codebase to use binary units (e.g. KiB) instead of metric units (e.g. kB). This makes the codebase more consistent, as gem5 automatically converts memory and cache sizes that are in metric units to binary units. This PR also adds a warning message to let users know when an auto-conversion from base 10 to base 2 units occurs. There were a few places in configs and in the comments of various files where I didn't change the metric units, as I couldn't figure out where the parameters with those units were being used.	2024-09-17 17:32:27 +00:00
Daniel Carvalho	51863d322f	gpu-compute: Reuse RP list in GPU_VIPER (#1530 ) It is safer to reuse the dynamic list than manually listing all possible replacement policies. --------- Signed-off-by: odanrc <odanrc@yahoo.com.br>	2024-09-09 09:18:01 -07:00
Giacomo Travaglini	57d82fdbb4	sim-se, arch: Fix syscall parametre sizes for 32-bit OSs (#1482 ) A bug was uncovered in that for various syscalls that used 64bit parametres, the ABI for 32bit operating systems was passing the wrong values to the syscalls, due to discrepancies between the target and guest OS. This commit fixes that by replacing 64-bit types, or types that are platform specific in size, with the exact correspondent for the guest OS, thus producing the correct signature for the respective syscalls. On top of this, the --param argument is added to the starter_se script, in order to support attachment of remote debuggers.	2024-09-03 09:49:59 +01:00
Marco Kurzynski	a8447b7fc0	arch-vega: Pass s_memtime through smem pipe (#1350 ) The Vega ISA's s_memtime instruction is used to obtain a cycle value from the GPU. Previously, this was implemented to obtain the cycle count when the memtime instruction reached the execute stage of the GPU pipeline. However, from microbenchmarking we have found that this under reports the latency for memtime instructions relative to real hardware. Thus, we changed its behavior to go through the scalar memory pipeline and obtain a latency value from the the SQC (L1 I$). This mirrors the suggestion of the AMD Vega ISA manual that s_memtime should be treated like a s_load_dwordx2. The default latency was set based on microbenchmarking. Change-Id: I5e251dde28c06fe1c492aea4abf9f34f05784420	2024-08-26 19:47:04 -07:00
Erin Le	e1db67c4bd	configs, dev, learning-gem5, python, tests: more clarification This commit contains the rest of the base 2 vs base 10 cache/memory size clarifications. It also changes the warning message to use warn(). With these changes, the warning message should now no longer show up during a fresh compilation of gem5. Change-Id: Ia63f841bdf045b76473437f41548fab27dc19631	2024-08-23 18:02:42 -07:00
Tiberiu Bucur	fe6ef662d1	configs: Add --param to starter_se This commit adds the --param option to the starter_se configuration script for the Arm ISA. This is in order to support attaching remote debugger sessions. Change-Id: I2d8cc9f677f731948872003cca6066d1072ad570 Reviewed-by: Giacomo Travaglini <giacomo.travaglini@arm.com>	2024-08-20 16:18:24 +01:00
Bobby R. Bruce	f600db4a98	gpu-compute,tests: Move GPU tests to testlib (#1270 ) A new host tag `gcn_gpu` has been added. This allows for selection of those GPU tests which depend upon the gcn-gpu docker image to run. In addition to this, the square GPU tests has been moved to the CI tests. This ensures some GPU code is compiled and run on every PR.	2024-08-19 10:58:06 -07:00
Matt Sinclair	03ddd0b75f	gpu-compute: fix GPU TLB outstandingReqs vs. associativity The GPU TLB maxOutstandingReqs field gets limited by the associativity. In the current setup, this means that the max outstanding requests is 32 even though the setup is for 64 entries. Update the associativity to all 64 entries. Change-Id: I2104e4647d97bf4d1cf5ac447e38ad6ac6a1a0d8	2024-08-07 16:16:01 -05:00
Matthew Poremba	ddc9a18536	configs: GPUFS: Disable KVM perf counters by default (#1391 ) This is on by default in gem5 (see src/cpu/kvm/BaseKvmCPU.py), however the perf counters only measure host instruction counters and GPUFS is not concerned about accuracy of KVM CPU stats. There are also a larger set of users who have access to KVM, but do not have the paranoid level low enough to attach performance counters. Therefore, make the performance counters OFF by default. They can still be enabled, but this will allow for a larger set of users to follow the upcoming GPUFS documentation without needing to read through a troubleshooting section after seeing a gem5 error about the KVM paranoid level. Change-Id: I6b465559edf3ce17e7117ada049c60bd39aecd83	2024-07-29 12:26:10 -07:00
Yangyu Chen	2b902b0aec	arch-riscv: add rv32 option to FS Linux config file (#1312 ) Since we have supported RISC-V 32, add this option to allow the RISC-V 32 full system to run easily. Signed-off-by: Yangyu Chen <cyy@cyyself.name>	2024-07-10 11:41:48 -07:00
Mahyar Samani	590bb1fbbb	Adding an example for Spatter (#1272 ) This change adds a new utility function for processing Spatter traces into SpatterKernels under parse_kernels. Additionally, it adds documentation for all the utility functions in spatter_kernel.py. Lastly, it adds an example script for running one spatter trace using SpatterGenerator to the examples.	2024-06-21 02:23:41 -07:00
Matthew Poremba	ed860dfe54	configs: Check before use replacement policy options (#1261 ) Rather than adding the options to every config that might be using GPU_VIPER.py, just change the Ruby config to check if the option is available before trying to use it. Otherwise, reverts to what was the default on stable. Change-Id: Ia6f1d0827d489ee2a35c598b644461cbff59e247	2024-06-20 09:50:29 -07:00
Bobby R. Bruce	1a00ecfaf9	stdlib,configs,tests: Add gem5 MultiSim (MultiProcessing for gem5) (#1167 ) This allows for multiple gem5 simulations to be spawned from a single parent gem5 process, as defined in a simgle gem5 configuration. In this design _all_ the `Simulator`s are defined in the simulation script and then added to the mutlisim module. For example: ```py from gem5.simulate.Simulator import Simulator import gem5.utils.multisim as multisim # Construct the board[0] and board[1] as you wish here... simulator1 = Simulator(board=board[0], id="board-1") simulator2 = Simulator(board=board[1], id="board-2") multisim.add_simulator(simulator1) multisim.add_simulator(simulator2) ``` This specifies that two simulations are to be run in parallel in seperate threads: one specified by `simulator1` and another by `simulator2`. They are then added to MultiSim via the `multisim.add_simulator` function. The user can specify an id via the Simulator constructor. This is used to give each process a unique id and output directory name. Given this, the id should be a helpful name describing the simulation being specified. If not specified one is automatically given. To run these simulators we use `<gem5 binary> -m gem5.utils.multisim <script> -p <num_processes>`. Note: multisim is an executable module in gem5. This is the same module we input into our scripts to add the simulators. This is an intentionally modular encapsulated design. When the module processes a script it will schedule multiple gem5 jobs and, dependent on the number of processes specified, will create child gem5 processes to processes tjese jobs (jobs are just gem5 simulations in this case). The `--processes` (`-p`) argument is optional and if not specified the max number of processes which can be run concurrently will be the number of available threads on the host system. The id for each process is used to create a subdirectory inside the `outputdor` (`m5out`) of that id name. E.g, in the example above the ID's are `board-1` and `board-2`. Therefore the m5 out directory will look as follows: ```sh - m5out - board-1 - stats.txt - config.ini - config.json - terminal.out - board-2 - stats.txt - config.ini - config.json - terminal.out ``` Each simulations output is encapsulated inside the subdirectory of the id name. If the multisim configuation script is passed directly to gem5 (like a traditional gem5 configuraiton script, i.e.: `<gem5 binary> <script>`), the user may run a single simulation specified in that script by passing its id as an argument. E.g. `<gem5 binary> <script> board-1` will run the `board-1` simulation specified in `script`. If no argument is passed an Exception is raised asking the user to either specify or use the MultiSim module if multiprocessing is needed. If the user desires a list of ids of the simulations specified in a given MultiSim script, they can do so by passing the `--list` (`-l`) parameter to the config script. I.e., `<gem5 binary> <script> --list` will list all the IDs for all the simulations specified in`script`. This change comes with two new example scripts found in 'configs/example/gem5_library/multsim" to demonstrate multisim in both an SE and FS mode simulation. Tests have been added which run these scripts as part of gem5' Daily suite of tests. Notes ===== * Bug fixed: The `NoCache` classic cache hierarchy has been modified so the Xbar is no longet set with a `__func__` call. This interfered with MultiProcessing as this structure is not serializable via Pickle. This was quite bad design anyway so should be changed * Change: `readfile_contents` parameter previously wrote its value to a file called "readfile" in the output dorectory. This has been changed to write to a file called "readfile_{hash}" with "{hash}" being a hash of the `readfile_contents`. This ensures that, during multisim running, this file is not overwritten by other processes. * Removal note: This implementation supercedes the functionality outlined in 'src/python/gem5/utils/multiprocessing'. As such, this code has been removed. Limitations/Things to Fix/Improve ================================= * Though each Simulator process has its own output directory (a subdirectory within m5out, with an ID set by the user unique to that Simulator), the stdout and stderr are still output to the terminal, not the output directory. This results in: 1. stdout and stderr data lost and not recorded for these runs. 2. An incredibly noisy terminal output. * Each process uses the same cached resources. While there are locks on resources when downloading, each processes will hash the resources they require to ensure they are valid. This is very inefficient in cases where resources are common between processes (e.g., you may have 10 processes each using the same disk image with each processes hashing the disk images independently to give the same result to validate the resources). Change-Id: Ief5a3b765070c622d1f0de53ebd545c85a3f0eee --------- Signed-off-by: Jason Lowe-Power <jason@lowepower.com> Co-authored-by: Jason Lowe-Power <jason@lowepower.com>	2024-06-18 09:34:39 -07:00
Matthew Poremba	3cf638e217	gpu-compute, util-m5: add GPU kernel exit events (#1217 ) The GPUFS scripts include support for dumping and resetting stats at kernel boundaries by identifying specific GPU kernel exit events. This commit extends that support to work with GPU SE-mode support. Change-Id: I662233ae71e2987d90af1fd0100e29036b2ef1c6	2024-06-14 08:13:27 -07:00
Matthew Poremba	b3d9dc42d4	configs: Add replacement policy options for GPUFS (#1230 ) GPU_VIPER.py was modified to use these options but they did not exist, breaking GPUFS. This commit adds them to fix the issue. Change-Id: I0095f400ea606c4e8d91a41870ef208465cef803	2024-06-13 11:23:50 -07:00
Jarvis Jia	b6b2e8c6c5	Black format Change-Id: If224c106262bae25127675160ea78386eedace3b	2024-06-12 15:57:04 -05:00

1 2 3 4 5 ...

1473 Commits