derek/gem5 - gem5 - Gitea: Git with a cup of tea

derek/gem5

Author	SHA1	Message	Date
Yu-Cheng Chang	2825bc1d55	misc: Add missing RISCV valid ISA option to README.md (#462 ) The list of valid ISA options should be same as the website: https://www.gem5.org/documentation/general_docs/building Change-Id: Id5ace5b0356ec35634caec5b11159551801c0615	2023-10-16 09:45:28 -07:00
Hoa Nguyen	9b2b6cd8d2	arch-riscv: Mark vector configuration insts as vector insts (#463 )	2023-10-16 09:40:09 -07:00
Bobby R. Bruce	a9464a41f5	stdlib,resources: Generalize exception for request retry (#466 ) In commit `bbc301f2f0` the generalized `Exception` was changed back to the more specific `HTTPError`. In this case we do not desire specific error handling. If the connection to the database fails I want the exception handled in the way outlined: i.e., i want the connection to be retried 4 times before giving up. With `HTTPError`, only `HTTPError`s warrent a retry. Changing this to `HTTPError` cause tests to fail due to a failure to retry downloading of a resource. Here is an example: https://github.com/gem5/gem5/actions/runs/6521543885/job/17710779784 In this case `request.urlopen` raised a `URLError`. I suspect this was some issued to do with reaching the DNS servers. It likely would've succeeded if it had just tried again.	2023-10-16 09:39:44 -07:00
Bobby R. Bruce	5240c07d3c	util: Fix runners to extent to max disk size (#460 ) THe `lvextend` command extends the logical volume. However, the `resize2fs` command is needed to extend the filesystem to fill the logical volume. Prior to this patch the filesystem ran out of space despite there being enough room in the volume. This was just wasted free space.	2023-10-16 09:20:13 -07:00
Bobby R. Bruce	97f4b44dd3	arch-arm: Fix line-length error in misc.cc (#459 )	2023-10-16 08:35:54 -07:00
Giacomo Travaglini	f9cf8bf8a2	cpu, arch-arm: Add IsPseudo tag for gem5 pseudo instructions (#465 ) This only applies to pseudo instructions with their own encoding (m5 ops)... In other words memory mapped m5 operations are not supported. This make sense as they should rather be treated as device accesses... Though it is something to take into consideration when relying on the flag	2023-10-16 16:15:05 +01:00
Bobby R. Bruce	d42eeb6b68	cpu: Explicitly define cache_line_size -> 64-bit unsigned int (#329 ) While it's plausible to define the cache_line_size as a 32-bit unsigned int, the use of cache_line_size is way out of its original scope. cache_line_size has been used to produce an address mask, which masking out the offset bits from an address. For example, [1], [2], [3], and [4]. However, since the cache_line_size is an "unsigned int", the type of the value is not guaranteed to be 64-bit long. Subsequently, the bit twiddling hacks in [1], [2], [3], and [4] produce 32-bit mask, i.e., 0x00000000FFFFFFC0. This behavior at least caused a problem in LLSC in RISC-V [5], where the load reservation (LR) relies on the mask to produce the cache block address. Two distinct 64-bit addresses can be mapped to the same cache block using the above mask. This patch explicitly defines cache_line_size as a 64-bit unsigned int so the cache block mask can be produced correctly for 64-bit addresses. [1] `3bdcfd6f7a/src/cpu/simple/atomic.hh (L147)` [2] `3bdcfd6f7a/src/cpu/simple/timing.hh (L224)` [3] `3bdcfd6f7a/src/cpu/o3/lsq_unit.cc (L241)` [4] `3bdcfd6f7a/src/cpu/minor/lsq.cc (L1425)` [5] `3bdcfd6f7a/src/arch/riscv/isa.cc (L787)`	2023-10-16 07:50:35 -07:00
Jason Lowe-Power	d702d3b90a	misc: fix clang13 overloaded-virtual warning (#454 ) Like #363 clang is also unhappy about the overloaded virtual. However, clang needs to have the diagnostic in a different place Fixes #437	2023-10-16 07:23:08 -07:00
Giacomo Travaglini	3f925c4084	arch-arm: Mark gem5 pseudo-ops with IsPseudo flag Change-Id: I9c8a146d73596597f28cdeca22ad7b7b01b381a7 Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com>	2023-10-16 13:42:23 +01:00
Giacomo Travaglini	a3b1bfdbf0	cpu: Add a IsPseudo StaticInstFlag for gem5 pseudo-ops Being able to recognise pseudo ops from the static instruction pointer is actually quite useful in several circumstances Change-Id: Ib39badf9aabba15ab3ebe7a8e9717583412731e4 Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com>	2023-10-16 13:41:04 +01:00
Giacomo Travaglini	2e85c95f4b	arch-arm: Remove Jazelle state + ThumbEE support (#364 ) This PR removes Jazelle state (while still keeping a "Trivial Jazelle implementation", see Arm Architecture Reference Manual) and ThumbEE support	2023-10-16 09:41:44 +01:00
Jason Lowe-Power	20f5555f30	python: Enable -m switch on gem5 binary (#453 ) With -m, you can now run a module from the command line that is embedded in the gem5 binary. This will allow us to put some common "scripts" in the stdlib instead of in the "configs" directory.	2023-10-14 20:08:06 -07:00
Matthew Poremba	ca2592d3ba	configs: Fix missing param exchange for GPUFS (#457 ) PR #367 adds an option to configs/ruby/GPU_VIPER.py that was not added to the corresponding dGPU equal for GPUFS and thus all GPUFS runs are failing. Fixed in this patch.	2023-10-14 20:07:39 -07:00
Daniel Kouchekinia	4931fb0010	mem-ruby: Always pass on GPU atomics to dir in write-through TCC (#367 ) Added checks to ensure that atomics are not performed in the TCC when it is configured as a write-through cache. Also added SLC bit overwrite to ensure directory preforms atomics when there is a write-through TCC. Change-Id: I4514e6c8022aeb7785f2c59871cd9acec8161ed8	2023-10-14 06:39:50 -07:00
Yu-Cheng Chang	a3c51ca38c	arch-riscv: Fix write back register issue of vmask_mv_micro (#443 ) After removing the setRegOperand in VecRegOperand https://github.com/gem5/gem5/pull/341. The vmask_vm_micro will not write back to register because tmp_d0 is not the reference type. The PR will make tmp_d0 as reference of regFile. Change-Id: I2a934ad28045ac63950d4e2ed3eecc4a7d137919	2023-10-13 15:20:42 -07:00
Matthew Poremba	7706e958e5	mem-ruby: Update cache recorder to use RubyPort and remove BUILD_GPU guards (#448 ) This PR updates cache recorder to use a vector of RubyPorts for cache cooldown and warmup instead of Sequencer or GPUCoalescer vectors (refer to issue #403 for more details). It also removes the extra guards that were added in #377 to prevent compile-time failures in non-GPU builds.	2023-10-13 14:36:45 -07:00
Kaustav Goswami	68af3f45c9	tests: updated the nightly tests to use SST 13.0.0 (#441 ) PR https://github.com/gem5/gem5/pull/396 updates the gem5 SST bridge to use StandardMem in SST. This change updates the nightly tests to use SST 13.0.0 instead of SST 11.1.0. It also updates the dockerfile. Change-Id: I5c109c40379d2f09601a1c9f19c51dd716c6582e --------- Signed-off-by: Kaustav Goswami <kggoswami@ucdavis.edu> Co-authored-by: Bobby R. Bruce <bbruce@ucdavis.edu>	2023-10-13 14:31:35 -07:00
Andreas Sandberg	59f96deb0f	cpu: Refactor indirect predictor (#429 )	2023-10-13 11:35:02 +01:00
Giacomo Travaglini	1c45cdcc41	arch-arm: Remove legacy ThumbEE references ThumbEE had already been removed but there were still some references to it dangling around. We were also signaling ThumbEE as being available through HWCAPS in SE which was not correct. This patch is fixing it Change-Id: I8b196f5bd27822cd4dd8b3ab3ad9f12a6f54b047 Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com>	2023-10-13 09:25:48 +01:00
Giacomo Travaglini	a33f3d3967	arch-arm: Remove Jazelle state support Jazelle state has been officially removed in Armv8. Every AArch32 implementation must still support the "Trivial Jazelle implementation", which means that while the instruction set has been removed, it is still possible for privileged software to access some Jazelle registers like JIDR,JMCR, and JOSCR which are just treated as RAZ Change-Id: Ie403c4f004968eb4cb45fa51067178a550726c87 Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com>	2023-10-13 09:25:48 +01:00
Vishnu Ramadas	8d54a5cbab	mem-ruby: Remove BUILD_GPU guards from ruby coalescer models A previous commit added BUILD_GPU guards to gpu coalescer models since a related cache recorder commit added GPU support. This is no longer needed since the cache recorder moved to using a vector of RubyPorts instead of Sequencer/GPUCoalescer pointers. This commit removes BUILD_GPU guards from the Ruby coalescer models Change-Id: I23a7957d82524d6cd3483d22edfb35ac51796eca	2023-10-12 14:53:29 -05:00
Vishnu Ramadas	08c1af1b16	mem-ruby: Use RubyPort vector to access Ruby in cache recorder Previously, the cache recorder used a vector of sequencer pointers to access Ruby objects. A recent commit updated the cache recorder to also maintain a vector of GPUCoalescer pointers in order for GPUs to support flushin. This added redundant code to the cache recorder. This commit replaces the sequencer and GPUCoalescer vectors with a vector of RubyPort pointers so that the code does not contain redundant lines Change-Id: Id5da33fb870f17bb9daef816cc43c0bcd70a8706	2023-10-12 14:49:06 -05:00
Bobby R. Bruce	3455d9e68d	misc,tests: Add dummy jobs to workflows for status checks (#444 ) With this change we can block PRs if the compiler, daily, and/or weekly tests are failing by setting these dummy jobs as require status checks.	2023-10-12 11:14:08 -07:00
Bobby R. Bruce	bf1c10d4b2	tests,misc: Update CI Tests 'testlib-quick' runs-on Here it's more sensible to use a GitHub hosted runner. This job is miniscule and is used to check the other tests have completed successfully. It makes sense for this not to be on our own self-hosted runner. Change-Id: I5377e025334d43eaedd0fc61e5c708ba61255d28	2023-10-12 07:37:11 -07:00
Bobby R. Bruce	3816ea5633	misc,tests: Add dummy jobs to workflows for status checks Change-Id: I52e42b6f93cfbb1a8e4800a3f6e264d49bebb06c	2023-10-12 07:37:04 -07:00
Matthew Poremba	4d336c0636	arch-vega: Implement buffer_atomic_cmpswap (#439 ) This is a standard compare and swap but implemented on vector memory buffer instructions (i.e., it is the same as FLAT_ATOMIC_CMPSWAP with MUBUF's special address calculation). This was tested using a Tensile kernel, a backend for rocBLAS, which is used by PyTorch and Tensorflow. Prior to this patch both ML frameworks crashed. With this patch they both make forward progress. Change-Id: Ie76447a72d210f81624e01e1fa374e41c2c21e06	2023-10-12 07:33:40 -07:00
Matthew Poremba	7bae5464dc	arch-vega: Ignore s_setprio instruction instead of panic (#438 ) This instruction is used by ML frameworks to prioritize certain wavefronts. Since gem5 does not have any support for wavefront scheduling based on priority (besides wavefront age), we ignore this instruction and warn_once rather than calling panic. Since hardware can override this priority anyways, we can be sure that ignoring the value will not inhibit forward progress resulting in application hangs. Change-Id: Ic5eef14f9685dd2b316c5cf76078bb78d5bfe3cc	2023-10-12 07:32:58 -07:00
Matthew Poremba	f7ad8fe435	configs: GPUFS option to disable KVM perf counters (#433 ) Add a --no-kvm-perf option to disable KVM perf counters for GPUFS scripts. This is useful for users who have KVM enabled but configured with more restrictive settings, which seems to be the default in newer Linux distros. Change-Id: I7508113d0f7c74deb21ea7b2770522885a0ec822	2023-10-11 14:20:27 -07:00
Matthew Poremba	4b7f25fcb6	arch-vega: Ignore s_setprio instruction instead of panic This instruction is used by ML frameworks to prioritize certain wavefronts. Since gem5 does not have any support for wavefront scheduling based on priority (besides wavefront age), we ignore this instruction and warn_once rather than calling panic. Since hardware can override this priority anyways, we can be sure that ignoring the value will not inhibit forward progress resulting in application hangs. Change-Id: Ic5eef14f9685dd2b316c5cf76078bb78d5bfe3cc	2023-10-11 15:55:16 -05:00
Matthew Poremba	4b85a1710e	arch-vega: Implement buffer_atomic_cmpswap This is a standard compare and swap but implemented on vector memory buffer instructions (i.e., it is the same as FLAT_ATOMIC_CMPSWAP with MUBUF's special address calculation). This was tested using a Tensile kernel, a backend for rocBLAS, which is used by PyTorch and Tensorflow. Prior to this patch both ML frameworks crashed. With this patch they both make forward progress. Change-Id: Ie76447a72d210f81624e01e1fa374e41c2c21e06	2023-10-11 15:42:50 -05:00
Bobby R. Bruce	c855dbf7c5	configs,ext: Updated the gem5 SST Bridge to use SST 13.0.0 (#396 ) This change updates the gem5 SST Bridge to use SST 13.0.0. Changes are made to replace SimpleMem class to StandardMem class as SimpleMem will be deprecated in SST 14 and above. In addition, the translator.hh is updated to translate more types of gem5 packets. A new parameter `ports` was added on SST's side when invoking the gem5 component which does not require recompiling the gem5 component whenever a new outgoing bridge is added in a gem5 config.	2023-10-11 13:34:48 -07:00
Bobby R. Bruce	70b6b53e54	misc,python: Add `pyupgrade` to pre-commit (#424 ) This adds the [pyupgrade](https://github.com/asottile/pyupgrade) hook to pre-commit. This hook automatically upgrades the syntax to the recommended standards for the newer version of the language.	2023-10-11 09:07:09 -07:00
Matthew Poremba	da11427ba6	gpu-compute: Update tokens for flat global/scratch (#408 ) Memory instructions acquire coalescer tokens in the schedule stage. Currently this is only done for buffer and flat instructions, but not flat global or flat scratch. This change now acquires tokens for flat global and flat scratch instructions. This provides back-pressure to the CUs and helps to avoid deadlocks in Ruby. The change also handles returning tokens for buffer, flat global, and flat scratch instructions. This was previously only being done for normal flat instructions leading to deadlocks in some applications when the tokens were exhausted. To simplify the logic, added a needsToken() method to GPUDynInst which return if the instruction is buffer or any flat segment. The waitcnts were also incorrect for flat global and flat scratch. We should always decrement vmem and exp count for stores and only normal flat instructions should decrement lgkm. Currently vmem/exp are not decremented for flat global and flat scratch which can lead to deadlock. This change set fixes this by always decrementing vmem/exp and lgkm only for normal flat instructions. Change-Id: I673f4ac6121e4b5a5e8491bc9130c6d825d95fc5	2023-10-11 09:00:10 -07:00
Andreas Sandberg	891250192d	arch-arm: Implement FEAT_TCR2 and FEAT_SCTLR2 (#416 ) This is simply adding the new Armv8.9 registers defined in the related features: - FEAT_TCR2 - FEAT_SCTLR2	2023-10-11 10:14:31 +01:00
David Schall	f65df9b959	cpu: Refactor indirect predictor Simplify indirect predictor interface. Several of the existing functions where merged together into four clear once. Those four are similar to the main direction predictor interface. 'lookup', 'update', 'squash' and 'commit'. This makes the interface much more clear, allows better functionality isolation and makes it simpler to develop new predictor models. A new parameter is added to allow additional buffer space for speculative path history. Change-Id: I6d6b43965b2986ef959953a64c428e50bc68d38e Signed-off-by: David Schall <david.schall@ed.ac.uk>	2023-10-11 07:50:32 +00:00
Bobby R. Bruce	c4156b06fb	python: Fix `base` logic in `MetaSimObject` This ensures `class Foo` is considered equivalent to `class Foo(object)`. Change-Id: I65a8aec27280a0806308bbc9d32281dfa6a8f84e	2023-10-10 21:47:08 -07:00
Bobby R. Bruce	298119e402	misc,python: Run `pre-commit run --all-files` Applies the `pyupgrade` hook to all files in the repo. Change-Id: I9879c634a65c5fcaa9567c63bc5977ff97d5d3bf	2023-10-10 21:47:07 -07:00
Bobby R. Bruce	83af4525ce	misc,python: Add `pyupgrade` hook to pre-commit This hook automatically upgrades the syntax to recommended standards for new versions of the language. These are numerous and are outlined here: https://github.com/asottile/pyupgrade Change-Id: I73fc58a08160ed9a21cfa3b3e023c259a84592ba	2023-10-10 21:43:39 -07:00
Bobby R. Bruce	3f5d7d647a	misc: Run `pre-commit autoupdate` (#419 ) 1. Runs `pre-commit autoupdate`. 2. Runs `pre-commit run --all-files`. 3. Adds (2.) to ".git-blame-ignore-rev".	2023-10-10 21:41:33 -07:00
Bobby R. Bruce	d559c24ac2	stdlib: Improve handing of errors in Atlas request failures (#404 ) Now: * The Atlas Client will attempt a connection 4 times, using an exponential backoff approach between attempts. * When a failure does arise a rich output is given so problems can be easily diagnosed. Addresses: #340	2023-10-10 21:34:24 -07:00
Bobby R. Bruce	ad2fe42686	Learning-gem5: fix formatting (#401 ) Using f-strings instead of % for formatting.	2023-10-10 16:47:37 -07:00
Harshil Patel	bbc301f2f0	stdlib, tests: Fixed bugs and tests - Fixed bugs rekated to retrying on request faliure. - Updated the pyunit tests. Change-Id: Ia484690267bf27018488324f3408f7e47c59bef3	2023-10-10 15:54:20 -07:00
Bobby R. Bruce	25b2786db8	misc,python: Add `requirements-txt-fixer` to pre-commit (#422 )	2023-10-10 14:30:39 -07:00
Kaustav Goswami	937b829e8f	configs,ext: Updated the gem5 SST Bridge to use SST 13.0.0 This change updates the gem5 SST Bridge to use SST 13.0.0. Changes are made to replace SimpleMem class to StandardMem class as SimpleMem will be deprecated in SST 14 and above. In addition, the translator.hh is updated to translate more types of gem5 packets. A new parameter `ports` was added on SST's side when invoking the gem5 component which does not require recompiling the gem5 component whenever a new outgoing bridge is added in a gem5 config. Change-Id: I45f0013bc35d088df0aa5a71951422cabab4d7f7 Signed-off-by: Kaustav Goswami <kggoswami@ucdavis.edu>	2023-10-10 14:16:29 -07:00
Bobby R. Bruce	1502f7c09f	misc: Add black update change to .git-blame-ignore-rev Change-Id: Ief04aec128bc48e66b79fc2f5c474948dd5eb9eb	2023-10-10 14:02:37 -07:00
Bobby R. Bruce	ddf6cb88e4	misc: Run `pre-commit run --all-files` This is reflect the updates made to black when running `pre-commit autoupdate`. Change-Id: Ifb7fea117f354c7f02f26926a5afdf7d67bc5919	2023-10-10 14:01:58 -07:00
Bobby R. Bruce	317d2fb5b8	misc: Run `pre-commit autoupdate` This updates the pre-commit utility from v4.3.0 to v4.5.0 and updates black from 22.6.0 to 23.9.1. Change-Id: I7ebb551f30e617059ce49f89a30207f739b1cb14	2023-10-10 14:00:57 -07:00
ivanaamit	486763b671	learning-gem5: use f-string for print Change-Id: If27af6524af4e4a6a59e914e9e40ba10de24adf4	2023-10-10 13:54:07 -07:00
Bobby R. Bruce	58140bba1f	tests: Update test workflows for new runners (#417 ) #371 Updates the runners. This PR updates the tests to: 1. Drop the 'run' and 'build' labels (all runners are now of the same type). 2. Utilize the threading where possible (runners now have 4 cores minimum).	2023-10-10 12:03:00 -07:00
Bobby R. Bruce	0ec1fb167b	stdlib: Fix use internal _hashlib in md5_utils.py (#427 ) Removes the use of the internal _hashlib, which is an internal Python API This is a fix for issue #383	2023-10-10 08:32:45 -07:00

1 2 3 4 5 ...

20777 Commits