derek/gem5 - gem5 - Gitea: Git with a cup of tea

derek/gem5

Author	SHA1	Message	Date
Bobby R. Bruce	5204b58e19	stdlib: Rename JsonSerializable to SerializableStat As this abstract class now allows the output of text stats, it's more appropriate to rename it. It no longer handles processing just for JSON output Change-Id: Ia9a1e3ef4029de45a11ac261fb14c9bdfa412cdd Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/59273 Reviewed-by: Bobby Bruce <bbruce@ucdavis.edu> Tested-by: kokoro <noreply+kokoro@google.com> Maintainer: Bobby Bruce <bbruce@ucdavis.edu>	2022-11-03 22:32:36 +00:00
Bobby R. Bruce	2211fdb607	tests,stdlib: Add a test for JsonSerializable Change-Id: I9e1fa6ee67f5d73d41fa9972bdb9a3da7dda8957 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/58729 Maintainer: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-11-03 22:32:36 +00:00
Melissa Jost	23d405ea55	tests, resources: CVE-2007-4559 Patch Hi, we are security researchers from the Advanced Research Center at Trellix. We have began a campaign to patch a widespread bug named CVE-2007-4559. CVE-2007-4559 is a 15 year old bug in the Python tarfile package. By using extract() or extractall() on a tarfile object without sanitizing input, a maliciously crafted .tar file could perform a directory path traversal attack. We found at least one unsantized extractall() in your codebase and are providing a patch for you via pull request. The patch essentially checks to see if all tarfile members will be extracted safely and throws an exception otherwise. We encourage you to use this patch or your own solution to secure against CVE-2007-4559. If you have further questions you may contact us through this projects lead researcher Kasimir Schulz. Change-Id: I891ac6652cfbd479aed51d64ef6d4e0fe740e06d Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/65271 Reviewed-by: Bobby Bruce <bbruce@ucdavis.edu> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com>	2022-11-03 22:17:50 +00:00
Jui-Min Lee	f61a640d30	mem: Fix SHM server path cleanup logic Previously, shared memory server remove old socket before filling the target path into API's data structure. However, the target path might get truncated hence the path we check against might not be the one we will be using in the end. In a case where the path specified by user is free while the truncated path is in used, gem5 will get a mysterious EADDRINUSE. We swap the two steps in the CL, so we'll be checking against the actual path we use, instead of the path user request to use. Change-Id: Ib34f8b00ea1d2f15dcd4e7b6d2d4a6d6ddc4e411 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/65153 Reviewed-by: Gabe Black <gabeblack@google.com> Tested-by: kokoro <noreply+kokoro@google.com> Maintainer: Gabe Black <gabeblack@google.com>	2022-11-03 07:31:36 +00:00
Earl Ou	5bf88bf7a1	sim: allow specifying remote gdb port for each workload In a platform with multiple systems, we may want to specify the remote gdb port for each system. This change makes it possible to specify the port number at each Workload instance. Change-Id: I755b3960ee920ae5289819aa05d98902614a5615 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/65151 Maintainer: Earl Ou <shunhsingou@google.com> Reviewed-by: Yu-hsin Wang <yuhsingw@google.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-11-03 05:34:09 +00:00
Yu-hsin Wang	540e6515de	ext: upgrade to googletest 1.12.0 Upgrade googletest to 1.12.0 upstream commit: 15460959cbbfa20e66ef0b5ab497367e47fc0a04 sha1sum e1e4ab7f4add6d403c37970a83a596b3081077d6 generated by command: find . -type f ! -name SConscript ! -path "./.*" -print0 \ \| sort -z \| xargs -0 sha1sum \| sha1sum This upgrade is mainly for solving the infinite-recursion warning from g++12 ext/googletest/googlemock/include/gmock/internal/gmock-internal-utils.h: In function ‘testing::internal::Invalid<gem5::Port&>()gem5::Port&’: ext/googletest/googlemock/include/gmock/internal/gmock-internal-utils.h:296:10: error: infinite recursion detected [-Werror=infinite-recursion] 296 \| inline T Invalid() { \| ^~~~~~~ ext/googletest/googlemock/include/gmock/internal/gmock-internal-utils.h:301:20: note: recursive call 301 \| return Invalid<T>(); \| ~~~~~~~~~~^~ ext/googletest/googlemock/include/gmock/internal/gmock-internal-utils.h: In function ‘testing::internal::Invalid<std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> > const&>()std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> > const&’: ext/googletest/googlemock/include/gmock/internal/gmock-internal-utils.h:296:10: error: infinite recursion detected [-Werror=infinite-recursion] 296 \| inline T Invalid() { \| ^~~~~~~ ext/googletest/googlemock/include/gmock/internal/gmock-internal-utils.h:301:20: note: recursive call 301 \| return Invalid<T>(); \| ~~~~~~~~~~^~ Change-Id: I14594f7bc148281784043b3f715173316e6d62d4 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/65211 Maintainer: Gabe Black <gabeblack@google.com> Reviewed-by: Gabe Black <gabeblack@google.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-11-03 02:44:51 +00:00
Jason Lowe-Power	c6918c8f74	python,stdlib: Add multiprocessing module This changeset replicates some of the multiprocessing module implementation from the python standard library in gem5. The goal of this and following changesets is to enable users to use a single set of python scripts to run and analyze a suite of gem5 simulations. We must reimplement some of the multiprocessing module becaue it is not flexible enough to allow for customized command line parameter to the "python" executable (gem5 in our case). To get around this, I extended the Process and context objects to be gem5 specific. The next steps is to wrap the Process and Pool types with gem5-specific versions that will improve their usability for our needs. With this changeset, these objects are usable, but it will require significant user effort to reach the goal of running/analyzing many different gem5 simulations. There are some limitation: - The pool will only work if the max tasks per child is 1 - The functions that are executed must come from another module As an example, the following code should work after applying this change. test.py: ```python from gem5.utils.multiprocessing import Process, Pool from sim import info, run_sim if __name__ == '__m5_main__' or __name__ == '__main__': info('main line') p1 = Process(target=run_sim, args=('bob',)) p2 = Process(target=run_sim, args=('jane',)) p1.start() p2.start() p2.join() p1.join() with Pool(processes=4, maxtasksperchild=1) as pool: pool.map(run_sim, range(10)) ``` sim.py: ``` import os def info(title): print(title) print('module name:', __name__) print('parent process:', os.getppid()) print('process id:', os.getpid()) def run_sim(name): info('function g') from gem5.prebuilt.demo.x86_demo_board import X86DemoBoard from gem5.resources.resource import Resource from gem5.simulate.simulator import Simulator board = X86DemoBoard() board.set_kernel_disk_workload( kernel=Resource("x86-linux-kernel-5.4.49"), disk_image=Resource("x86-ubuntu-18.04-img"), ) simulator = Simulator(board=board) simulator.run(max_ticks=10000000) ``` Change-Id: I4348ebaa75d006949ec96e732f5dc2a5173c6048 Signed-off-by: Jason Lowe-Power <jason@lowepower.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/63432 Tested-by: kokoro <noreply+kokoro@google.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Bobby Bruce <bbruce@ucdavis.edu> Reviewed-by: Richard Cooper <richard.cooper@arm.com>	2022-11-02 20:05:49 +00:00
Giacomo Travaglini	aead8fb0fd	arch-arm: Remove ISA::haveGICv3CpuIfc method The method is really not needed as we do not implement GICv3 in legacy mode... Therefore when we want to check if the GICv3 cpu interface is present, we can just check for GICv3 being present Change-Id: I264f887392d188a515480c2e31a4a4da3e67c498 Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/65173 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-11-02 08:32:44 +00:00
Giacomo Travaglini	d348df8763	arch-arm: Fix GICv3 List register mapping Change-Id: I870104cf27cc9ba28763adc5b43ff850c1ea279f Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/65172 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-11-02 08:32:44 +00:00
Giacomo Travaglini	ba8f59ff17	arch-arm: Fix access permissions for GICv3 cpu registers * ICC_SRE_EL3/ICC_CTLR_EL3/MISCREG_ICC_IGRPEN1_EL3 are accessible at EL3 only * ICH_LR<n>_EL2 are accessible at EL2 and EL3 only Change-Id: Idcd9656abafc3014d2715cd6f138a6d786bc6c34 Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/65171 Tested-by: kokoro <noreply+kokoro@google.com>	2022-11-02 08:32:44 +00:00
Earl Ou	e65adccd6f	util: update termios to replace nl with cr-nl This change enables OPOST to enable output post-processing. It then enables ONLCR to prepend newline characters with carriage return so that start of each line is always left aligned. Note that on some terminals this might display a redundant ^M. Change-Id: Ia0b4c61725ab7478e7341273a8279b96e53d9f26 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/65152 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-11-02 06:26:03 +00:00
Bobby R. Bruce	19bad67a6e	tests: Remove ARM compilation requirement for quick/Kokoro The ARM Boot Tests required the compilation of ARM/gem5.opt to run a quick test that the CHI protocol was functioning correctly with ARM and the ArmBoard. This test has been removed and the test refactored slightly to use the ALL/gem5.opt. The CHI protocol is already tested nightly. Change-Id: Ibe406348caefa2493860036eb89a20681478ea66 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/65195 Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Bobby Bruce <bbruce@ucdavis.edu> Maintainer: Bobby Bruce <bbruce@ucdavis.edu>	2022-11-02 05:57:41 +00:00
Bobby R. Bruce	54322872dd	tests: Remove unneeded build step from nightly.sh The `main.py` script will build the ISAs required to run tests. Our compiler tests (see "tests/compiler-tests.sh") are run nightly and already test to ensure these ISAs are compiled correctly. Compiling these ISAs as part of this script is therefore redundant. This patch removes this step to save testing time. Change-Id: I58636acfd5512886ac11ca84ee96cbdc9e344c68 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/65175 Maintainer: Bobby Bruce <bbruce@ucdavis.edu> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-11-02 05:57:41 +00:00
Bobby R. Bruce	3c171a1ae4	tests: Remove tests requiring comp of novel ISAs in long The only tests requiring the the compilation of SPARC, MIPS, and POWER for the long/nightly suite were "tests/gem5/multi_isa/test_multi_isa.py" and "gem5/stdlib/test_requires.py". As compilation of gem5 is quite costly, it'd best we simply remove these tests. They are minor and not very important. Compilation of these ISAs will continue to be tested via the compilation tests. Change-Id: I98b33eec5d0adb144109d32851033380f1641ad4 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/65193 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Bobby Bruce <bbruce@ucdavis.edu> Tested-by: kokoro <noreply+kokoro@google.com>	2022-11-02 05:57:41 +00:00
Bobby R. Bruce	e07ca68ed7	tests: Remove long/nightly X86 Boot tests The long/nightly tests are failing due to timeout (e.g.,: https://jenkins.gem5.org/job/nightly/398/). We must therefore be more careful about what we test on a nightly basis. Each of these X86 boot tests takes an hour and, generally, are largely the same (just with different CPU cores, cache hierarchy, and memory system). Given this is largely redundant, some of these tests have been remove dto save on testing time. Change-Id: I761fca1aa5e111a03183f83d4e326aaea1bdbc3a Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/65192 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Bobby Bruce <bbruce@ucdavis.edu> Tested-by: kokoro <noreply+kokoro@google.com>	2022-11-02 05:57:41 +00:00
Bobby R. Bruce	4145d9fb91	tests: Remove long/nightly ARM Boot tests The long/nightly tests are failing due to timeout (e.g.: https://jenkins.gem5.org/job/nightly/398/). We must therefore be more careful about what we test on a nightly basis. Each of these ARM boot tests takes an hour and, generally, are largely the same (just with different CPU cores, cache hierarchy, and memory system). Given this is largely redundant, some of these tests have been removed to save on testing time. Change-Id: I8d80d3e0869aca67aa7279a164bdce85d20f3682 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/65191 Maintainer: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-11-02 05:57:41 +00:00
Bobby R. Bruce	8ffecdd966	stdlib,configs,tests: Rename config to arm-ubuntu-run.py The "config/example/gem5_library/arm-ubuntu-boot-exit.py" script is renamed to "config/example/gem5_library/arm-ubuntu-run.py". This makes it more consistent with similar scripts in the "config/example/gem5_library" directory: "x86-ubuntu-run.py" and "riscv-ubuntu-run.py". Change-Id: I9d96fd68e122f2841573b1717b0969cd44972771 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/65132 Maintainer: Bobby Bruce <bbruce@ucdavis.edu> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com>	2022-11-01 18:57:36 +00:00
Bobby R. Bruce	26858db854	stdlib: Refactor the ArmBoard for _connect_things move Since moving `_connect_things` to a pre-init step, the ArmBoard can now be refactored to set up things in a more logical manner. In particular, this patch moves activity out of the `_add_disk_to_board` function and into the `_pre_initialization` function. Change-Id: I5d40267f28ae87cd483a0396739c09b8b2b46383 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/65052 Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Bobby Bruce <bbruce@ucdavis.edu> Maintainer: Bobby Bruce <bbruce@ucdavis.edu>	2022-11-01 18:57:36 +00:00
Bobby R. Bruce	25d4fb2d91	stdlib: Move `_connect_things` to run as pre_instantiation Through working with the gem5 stdlib there have been instances where connecting the memory, processor, and cache hierarchy to the board (via the AbstractBoard's `_connect_things` function) at the point of the AbstractBoard's construction is problematic as the memory, processor, and cache hierarchy may require information to connect correctly that is only known to the AbstractBoard after construction. In particular this can occur when a Workload contains information needed to configure correctly. To resolve this problem the `_connect_things` function has been moved to run as a pre-initialization step. That is, run immediately before `m5.instantiate`. This is done in the Simulator module. This will break cases where a user utilizes the stdlib AbstractBoard but does not use the stdlib Simulator module. As such, an Exception is raised in these cases explaining the fix to the user. This is done via a hack where the boards' `createCCObject` function (inheritted from SimObject) is overriden with a check to ensure `_connect_things` has been run. To fix the `_pre_instantiate` function must be executed prior to `m5.instantiate` in the Python configuration script. Test and config scripts in the gem5 repo have been updated accordingly. Change-Id: Ibaef36eb7433ce104b861b1da80fc600f08f715a Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/65051 Maintainer: Bobby Bruce <bbruce@ucdavis.edu> Reviewed-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-11-01 18:57:36 +00:00
Kaustav Goswami	75c1df0d06	stdlib,arch-arm: Add ruby cache support to the ArmBoard This change adds ruby cache support to the ArmBoard. Previously only classic caches were supported by the ArmBoard. The ArmBoard was tested with CHI, MESI_Two_Level and MI_example caches from the gem5's stdlib. Change-Id: I480fe6ae13e3bd8438a425548ed113d443fcee40 Signed-off-by: Kaustav Goswami <kggoswami@ucdavis.edu> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/64011 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Maintainer: Bobby Bruce <bbruce@ucdavis.edu>	2022-11-01 18:57:36 +00:00
Jason Lowe-Power	04ac9d9f4f	stdlib: Give board interface for mem ports It is possible that the board has more than just a "main" memory. For instance, the ArmBoard has a boot memory which is separate from the `get_memory` function. This moves the `get_mem_ports` function to the board so that the board can optionally override it. Change-Id: I05e388cc93e691e9a4fa674023f158af447349f9 Signed-off-by: Jason Lowe-Power <jason@lowepower.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/64631 Reviewed-by: Bobby Bruce <bbruce@ucdavis.edu> Maintainer: Bobby Bruce <bbruce@ucdavis.edu> Tested-by: kokoro <noreply+kokoro@google.com>	2022-11-01 18:57:36 +00:00
Matthew Poremba	27da9b3576	configs: GPUFS: use multiple event queues for >1 CPU The KVM CPU hangs if there are not multiple event queues when more than one CPU is created. Since GPUFS primarily relies on the KVM CPU, support for multiple event queues is needed. Some GPU libraries, such as AMD Research's ATMI library, assume more than one CPU. This changeset adds support for multiple CPUs and was tested for up to four CPUs. Change-Id: Ia354e02209d0fa18195f3ad44f4fb1d58e93b5ca Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/65131 Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com>	2022-11-01 15:34:29 +00:00
Matthew Poremba	489074fbfd	dev-amdgpu: Fix issues with PM4 queue map, fences The PM4 release_mem packet is used as a DMA fence in the driver. It specifies which queue the interrupt came from by encoding the me, pipe, and queue fields from the map_queue packet into the interrupt ring ID. Currently these fields are incorrect because (1) the order in the bitfield is backwards, (2) the queue constructor assigns a pointer to the PM4MapQueue packet containing this data to the dmaBuffer which gets deleted in short order, and (3) the order of the encoding of ring ID is incorrect. This change fixes these issues by (1) placing the struct vales in correct order, (2) creating a const copy of the dmaBuffer on construction, and (3) using the ring ID encoding expected by the driver: https://github.com/RadeonOpenCompute/ROCK-Kernel-Driver/blob/roc-4.3.x/ drivers/gpu/drm/amd/amdgpu/gfx_v9_0.c#L5989 Change-Id: I72c382980e57573f8a8a6879912c4139c7e2f505 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/65095 Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com>	2022-11-01 15:34:17 +00:00
Matthew Poremba	c5feca8251	dev-amdgpu: Rework PM4 NOP packet The PM4 NOP header is used to insert spaces in the PM4 ring and can therefore be any size. This includes zero. A size of zero is denoted by a value of 0x3fff in the NOP packet header. Currently we assume this means the remainder of the PM4 queue up to the wptr is empty/NOPs. This is not always true. This changeset reworks the PM4 NOP packet to handle the value of 0x3fff as a special value and advances the rptr by 0 bytes. This fixes issues where there were additional packets in the queue which were being skipped over by fast forwarding. Since those packets could be anything, that leads to undefined behavior afterwards. Change-Id: I3f5c3f4b7dd50f93ba503fea97454a9d41771e30 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/65094 Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com>	2022-11-01 15:34:08 +00:00
Matthew Poremba	752b696883	dev-amdgpu: Fix SDMA trap ring ID, context SDMA traps are used in the driver as a DMA fence. To pass a fence, the SDMA sends the driver the interrupt context from a trap packet and the ring ID which specifies which queue in the SDMA engine is passing a fence. Currently the interrupt context is using the wrong value in the packet and the ring ID is hard-coded to always be the gfx queue. This changeset uses the correct interrupt context from the SDMA packet and sets the ring ID to either 0 if the gfx queue is currently being processed or 3 if the page queue is being processed. The relevant interrupt service routine in the driver can be found at: https://github.com/RadeonOpenCompute/ROCK-Kernel-Driver/blob/roc-4.3.x/ drivers/gpu/drm/amd/amdgpu/sdma_v4_0.c#L2129 Change-Id: Ie4a4a9d6ab1d3bf83bf76bb57a02a91100217b51 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/65093 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-11-01 15:34:08 +00:00
Matthew Poremba	8899291db6	dev-amdgpu: Fix interrupt handler address assignment The interrupt handler's base address is sent via MMIO and must be shifted by 8 bits to convert to a byte address. The current code is shifting the MMIO dword first then assigning, resulting in the top 8 bits being shifted out. This changeset fixes the issue by assigning the dword to the 64-bit address first then shifting after. Similarly, the upper dword is cast to a 64-bit value first before shifting. This fixes some "fence fallback timeout" errors in the m5term output. These timeouts become a problem because the driver will reset after a few hundred of them, killing any running GPU applications as part of the process. Change-Id: I0beec313f533765c94063bcf4de8c65aacf2986b Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/65092 Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com>	2022-11-01 15:34:08 +00:00
Yu-hsin Wang	91cd599f05	systemc: sync the response error between gem5 packet and tlm payload For now we don't return the correct error status to the upstream of the protocol conversion bridge. This prevents from the requestor to distinguish if the response is good or not. This change fixes the issue. Change-Id: Iec2a388b50fb1bd4fd97ece19e9061138b0b0a1f Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/64591 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Earl Ou <shunhsingou@google.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-11-01 05:51:27 +00:00
Yu-hsin Wang	80c3bd3bdf	mem: introduce bad command error to packet commands The bad command is used to model a request is sent to target but the target cannot make it. The bad command error is designed to model AXI SLVERR. Change-Id: I8142df36a5ed3e461493796266821a2b30a3415e Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/64872 Maintainer: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Earl Ou <shunhsingou@google.com>	2022-11-01 05:51:20 +00:00
Matthew Poremba	144ce7f12c	dev-amdgpu: Fix GART PTE size The GART table is a legacy 1-level page table primarily used for supervisor mode accesses to GPUs. The PTE size is 64-bits, not 32-bit. This causes memory sizes >3GB (in X86) to fail loading amdgpu driver. This changeset fixes the issue by setting the GART mappings to the correct data type. Change-Id: Ibfba2443675fe28316d26afa5f1a14885fdce40c Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/65091 Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com>	2022-10-31 14:40:30 +00:00
Matthew Poremba	7b16b17e61	dev-amdgpu: Chunkify SDMA copies that use device memory The current implementation of SDMA copy calls the GPU memory manager's read/write method one time passing a physical address as the source/destination. This implicitly assumes the physical addresses are contiguous which is generally not true for large allocations. This results in reading from/writing to the wrong address. This changeset fixes the problem by copying large copies in chunks of the minimum possible page size on the GPU (4kB). Each page is translated seperately to ensure the correct physical address. The final copy "done" callback is only used for the last transfer. The transfers should complete in order so the copy command will not complete until all chunks have been copied. Tested and verified on an application with a large allocation (~5GB). Change-Id: I27018a963da7133f5e49dec13b0475c3637c8765 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/64752 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-10-31 14:30:24 +00:00
Matthew Poremba	6a4a12ebbd	arch-vega: Improve non-native page size support Vega allows for any integer multiple of 4kB pages. However, the current implementation is designed for 4kB page primarily. In order to support variable page sizes, the physical address calculation needs to be updated to add the virtual page offset to the base physical address rather than bitwise-OR. Bitwise-OR assumes physical pages are at aligned to the page size which is generally not the case for very large pages (1GB+). This changeset changes all of the physical address computations to add the virtual offset to the physical page address. This fixes many GPUFS applications which use larger pages. The support was tested by hipMalloc'ing ~5GB to induce a large page being created. The test application now passes verification with this change. Change-Id: Ic8d1475e001def443f3e4ab609449bca0c40b638 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/64751 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-10-31 14:30:13 +00:00
Earl Ou	6fd2a64656	mem: implement ThreadBridge ThreadBridge is used for communication between two SimObjects from different threads (EventQueue). Change-Id: I3e00df9184404599dfacef64b505cd0b64ee46aa Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/65071 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Yu-hsin Wang <yuhsingw@google.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-10-31 01:05:16 +00:00
Bobby R. Bruce	3ab6a7496b	stdlib: Move setting of checkpoints to set_workload funcs It never made much sense to set checkpoint via the Simulator module as Checkpoints are very tightly coupled with the Workload being run. This change therefore moves the checkpoint to the set_workload functions. Setting checkpoints via the Simulator is deprecated and will be removed in a future release. Change-Id: I24d2133b38a86423d3553ec888c917c5fe47b93d Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/64571 Maintainer: Bobby Bruce <bbruce@ucdavis.edu> Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com>	2022-10-28 00:16:44 +00:00
Melissa Jost	ceac4b8f1a	stdlib,configs: Update simpoint example to use the Workload With the inclusion of the "x86-print-this-15000-with-simpoints" workloads (introduced here: https://gem5-review.googlesource.com/c/public/gem5-resources/+/64531) This patch utilizes this workload for the simpoint examples. Change-Id: I5e2c4a48206fd7108a33a4a64ac64235ea9f1f33 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/64552 Maintainer: Bobby Bruce <bbruce@ucdavis.edu> Reviewed-by: Bobby Bruce <bbruce@ucdavis.edu> Tested-by: kokoro <noreply+kokoro@google.com>	2022-10-28 00:16:44 +00:00
Bobby R. Bruce	36e5feb0de	stdlib: add 'get_simpoint' function to se_binary_workload.py This function is necessary to obtain the workload from a board once set. This is a stop-gap solution to get SimPoints working with SE workloads but will need revision when implementing this functionality for FS. Change-Id: Ided2b1a5867655a98730879524e0be61c3f20295 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/64551 Maintainer: Bobby Bruce <bbruce@ucdavis.edu> Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Melissa Jost <mkjost@ucdavis.edu>	2022-10-28 00:16:44 +00:00
Melissa Jost	f1be0c808a	stdlib: Added set_se_simpoint_workload to SEBinaryWorkload Change-Id: I815d4aff655e96619a44fc6fc04b674a794056a2 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/64432 Reviewed-by: Melissa Jost <mkjost@ucdavis.edu> Tested-by: kokoro <noreply+kokoro@google.com> Maintainer: Bobby Bruce <bbruce@ucdavis.edu>	2022-10-28 00:16:44 +00:00
Bobby R. Bruce	87e774c1d5	arch-x86,sim-se: Add clone3 syscall This also expands the syscall table for x86 to 450. The clone3 version of the syscall puts the parameters in a struct. This pulls out the parameters that gem5 uses and updates a couple of other places with new flags and structs. Reference: https://github.com/torvalds/linux/blob/master/arch/x86/entry/syscalls/syscall_64.tbl This is part of the reason for the failing Nightly tests: https://jenkins.gem5.org/job/nightly/392/, triggered by updating the tests to use Ubuntu 22.04 Change-Id: Ia934d54c391c6bd4f655bf65538d85371b6dbfb9 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/64931 Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Bobby Bruce <bbruce@ucdavis.edu> Maintainer: Bobby Bruce <bbruce@ucdavis.edu>	2022-10-27 23:57:43 +00:00
Giacomo Travaglini	c2541a8175	arch-arm: Use ThreadContext in ArmISA::currEL implementation This is partly reverting a previous patch [1] which was moving most functionalities within the ISA class. This evidently does not work well with thread context implementations which are bypassed by the ISA objects as noted by [2] [1]: https://gem5-review.googlesource.com/c/public/gem5/+/53624 [2]: https://gem5-review.googlesource.com/c/public/gem5/+/64653 Change-Id: I0c91c76f690542219ffbbf53359531d9dea9e86d Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/64914 Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Yu-hsin Wang <yuhsingw@google.com> Maintainer: Andreas Sandberg <andreas.sandberg@arm.com> Reviewed-by: Andreas Sandberg <andreas.sandberg@arm.com>	2022-10-27 15:33:18 +00:00
Giacomo Travaglini	506bd9d9e7	dev-arm: Use ThreadContext instead if ISA in GICV3 cpu interface Some CPU wrappers like the Fastmodel one do extend the ThreadContext interface in order to retrieve system register state... By bypassing the TC interface and by using the ISA instead, we are basically forcing users to extend the ISA as well to intercept these calls. So with this patch we are making sure every system register is accessed (like HCR_EL2 or SCR_EL3) through the thread context. This of course does not apply to the CPU interface registers as we still use the ISA storage for them. In the future we should probably move that storage from the ISA class to the Gicv3CPUInterface class itself This is also simplifying Gicv3CPUInterface::isEL3OrMon: currEL already covers the AArch32 case so no need to differentiate between AArch32 and AArch64 Change-Id: I446a14a6e12b77e1a62040b3422f79ae52cc9eec Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/64913 Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Andreas Sandberg <andreas.sandberg@arm.com> Maintainer: Andreas Sandberg <andreas.sandberg@arm.com>	2022-10-27 15:33:18 +00:00
Giacomo Travaglini	8a78358a30	arch-arm: Allow TarmacTracer to dump trace to a file This patch is adding an outfile parameter to the TarmacTracer This has 3 options: 1) stdoutput = dump to standard output (default behaviour) 2) stderror = dump to standard error 3) file = dump to a file. As there is one tracer per CPU, this means every CPU will dump its trace to a different file, named after the tracer name (e.g. cpu0.tracer, cpu1.tracer) It is still possible to redirect to a file with option 1 and 2 thanks to common bash redirection. What the third option is really buying us is the capability to dump CPU traces on separate files, and to separate the trace output from the debug-flag output Change-Id: Icd2bcc721f8598d494c9efabdf5e092666ebdece Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/63892 Maintainer: Andreas Sandberg <andreas.sandberg@arm.com> Reviewed-by: Richard Cooper <richard.cooper@arm.com> Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Andreas Sandberg <andreas.sandberg@arm.com>	2022-10-27 15:33:06 +00:00
Giacomo Travaglini	9a9de78811	dev-arm: Implement System Security Control registers This block of system registers is part of the N1 SDP [1] [1]: https://developer.arm.com/documentation/101489/0000/\ Programmers-model/System-Security-Control-registers Change-Id: I2ecf5cd247bd68eddcd359e91f3954070dbffaa8 Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Reviewed-by: Richard Cooper <richard.cooper@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/64951 Maintainer: Andreas Sandberg <andreas.sandberg@arm.com> Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Andreas Sandberg <andreas.sandberg@arm.com>	2022-10-27 10:33:38 +00:00
Giacomo Travaglini	4db981576e	arch-arm: Setup ThreadContext in GICv3 cpu interface Change-Id: If019b4b114031f880dff43e05658a162c201ea6a Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/64912 Reviewed-by: Andreas Sandberg <andreas.sandberg@arm.com> Maintainer: Andreas Sandberg <andreas.sandberg@arm.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-10-27 10:31:10 +00:00
Giacomo Travaglini	62ce119139	arch-arm: Always use AArch64 version of HCR/SCR We are slowly replacing AArch32 code in favour of AArch64. Change-Id: I2857a198a0169e882e5f997debc76808244ab42d Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/64911 Reviewed-by: Andreas Sandberg <andreas.sandberg@arm.com> Tested-by: kokoro <noreply+kokoro@google.com> Maintainer: Andreas Sandberg <andreas.sandberg@arm.com>	2022-10-27 10:31:10 +00:00
handsomeliu	a13549ebef	scons: Capture TMPDIR envvar During gem5 build, the compiler may produce some large intermediate files. The default path is /tmp, but in some usecase, it's under a small file system, and we may want to change the storage path to a sufficient large file system. This CL captures TMPDIR environment variables, to allow users change the default temporary directory. Change-Id: Ib3fad301f36df9f3f08eb9b6cfeb4df1b7df5d1a Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/64873 Reviewed-by: Gabe Black <gabeblack@google.com> Maintainer: Gabe Black <gabeblack@google.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-10-27 09:17:41 +00:00
Mahyar Samani	32e4ac65a8	tests: Adding trusted stats for stats tests. This changes adds the trusted stats required for comparing the output stats from each test. Change-Id: Id85f84f3086481f5d4b8eb7ee36e7a6f862782cc Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/58569 Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Bobby Bruce <bbruce@ucdavis.edu> Maintainer: Bobby Bruce <bbruce@ucdavis.edu>	2022-10-25 01:08:11 +00:00
Mahyar Samani	a3d2f68120	tests: Adding tests for gem5stats. This change adds the first of many tests for the gem5stats. The current test only considers a setup consisiting of traffic generator, cache and memory. Change-Id: I3ebe16131aed5bf6dad68de673f9491f1bea1c78 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/55743 Maintainer: Bobby Bruce <bbruce@ucdavis.edu> Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Bobby Bruce <bbruce@ucdavis.edu>	2022-10-25 01:08:11 +00:00
Zhantong Qiu	84acdd4836	stdlib: add SimPoint checkpoint generator The previous SimPoint warmup length was limited by the gaps between the starting instruction of one SimPoint and the ending instruction of the SimPoint before it. This was to prevent duplicate SimPoints, but it can significantly limit the warmup length. In this commit, the warmup length limitation will be extended to the starting instruction of one SimPoint regardless of the gap between SimPoints. A SimPoint checkpoint generator is created to help taking checkpoints for SimPoints and make sure multiple SimPoint checkpoints are taken when there are multiple SimPoints sharing the same starting instruction Change-Id: If95f6813e8cbf5c01e41135c1b1bb91ed2e950ad Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/64351 Maintainer: Bobby Bruce <bbruce@ucdavis.edu> Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Bobby Bruce <bbruce@ucdavis.edu>	2022-10-22 00:23:19 +00:00
Sascha Bischoff	eed80d083e	arch-arm: Set ZCR_ELx before updating vector length in decoder For SVE it is possible to override the run-time vector length (VL) per exception level by setting the value in the appropriate ZCR_ELx registers. In general instructions query the appropriate registers during execute() to determine the actual vector length. The exception to this rule are the SVE Macromem instructions which use the VL to determine the number of micro-ops to crack into during decode. However, as there is no available ExecContext during the decode stage these instructions rely on a cached value stored in the decoder. Previously we were updating the VL in the decoder using potentially stale values of ZCR_ELx by calling the update before actually setting the registers themselves. We now set the registers before updating the decoder. Change-Id: I0167095699f7f950ee99fc42c7c8292fe8938d28 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/64331 Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Maintainer: Giacomo Travaglini <giacomo.travaglini@arm.com>	2022-10-21 16:07:54 +00:00
Bobby R. Bruce	c021bb6099	tests: Update presubmit.sh to use Clang-14 As of https://gem5-review.googlesource.com/c/public/gem5/+/64176 we now support Clang-14. This patch therefore updates the Kokoro tests to utilize this for the `ALL/gem5.fast` compilation. This should help catch Clang compilation errors earlier. Change-Id: I6b5bac47565823bbd413e006cfe577468b1bbb70 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/64292 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Bobby Bruce <bbruce@ucdavis.edu> Tested-by: kokoro <noreply+kokoro@google.com>	2022-10-20 19:10:03 +00:00
Bobby R. Bruce	21cb85f201	tests: Updating the tests to use Ubuntu 22.04 docker image As of https://gem5-review.googlesource.com/c/public/gem5/+/64177 we support version 22.04. This patch therefore updates the testing infrastructure (kokoro/quick, nightly/long, weekly/extra-long) to use the Ubuntu 22.04 docker image. The "jenkins/gem5art-tests.sh" test script has been updated to no longer require the `pip upgrade`. This was needed for Ubuntu 20.04 as it utilized an older version of pip which did not have all the dependencies these tests requried. As of Ubuntu 20.04 this is no longer required. Change-Id: Ia8f8b1b2c62ad5d5a8419cb31b6a1d2b6dff7ac9 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/64291 Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com>	2022-10-20 19:10:03 +00:00

1 2 3 4 5 ...

19572 Commits