derek/gem5 - gem5 - Gitea: Git with a cup of tea

derek/gem5

Author	SHA1	Message	Date
Matthew Poremba	4b7f25fcb6	arch-vega: Ignore s_setprio instruction instead of panic This instruction is used by ML frameworks to prioritize certain wavefronts. Since gem5 does not have any support for wavefront scheduling based on priority (besides wavefront age), we ignore this instruction and warn_once rather than calling panic. Since hardware can override this priority anyways, we can be sure that ignoring the value will not inhibit forward progress resulting in application hangs. Change-Id: Ic5eef14f9685dd2b316c5cf76078bb78d5bfe3cc	2023-10-11 15:55:16 -05:00
Bobby R. Bruce	c855dbf7c5	configs,ext: Updated the gem5 SST Bridge to use SST 13.0.0 (#396 ) This change updates the gem5 SST Bridge to use SST 13.0.0. Changes are made to replace SimpleMem class to StandardMem class as SimpleMem will be deprecated in SST 14 and above. In addition, the translator.hh is updated to translate more types of gem5 packets. A new parameter `ports` was added on SST's side when invoking the gem5 component which does not require recompiling the gem5 component whenever a new outgoing bridge is added in a gem5 config.	2023-10-11 13:34:48 -07:00
Bobby R. Bruce	70b6b53e54	misc,python: Add `pyupgrade` to pre-commit (#424 ) This adds the [pyupgrade](https://github.com/asottile/pyupgrade) hook to pre-commit. This hook automatically upgrades the syntax to the recommended standards for the newer version of the language.	2023-10-11 09:07:09 -07:00
Matthew Poremba	da11427ba6	gpu-compute: Update tokens for flat global/scratch (#408 ) Memory instructions acquire coalescer tokens in the schedule stage. Currently this is only done for buffer and flat instructions, but not flat global or flat scratch. This change now acquires tokens for flat global and flat scratch instructions. This provides back-pressure to the CUs and helps to avoid deadlocks in Ruby. The change also handles returning tokens for buffer, flat global, and flat scratch instructions. This was previously only being done for normal flat instructions leading to deadlocks in some applications when the tokens were exhausted. To simplify the logic, added a needsToken() method to GPUDynInst which return if the instruction is buffer or any flat segment. The waitcnts were also incorrect for flat global and flat scratch. We should always decrement vmem and exp count for stores and only normal flat instructions should decrement lgkm. Currently vmem/exp are not decremented for flat global and flat scratch which can lead to deadlock. This change set fixes this by always decrementing vmem/exp and lgkm only for normal flat instructions. Change-Id: I673f4ac6121e4b5a5e8491bc9130c6d825d95fc5	2023-10-11 09:00:10 -07:00
Andreas Sandberg	891250192d	arch-arm: Implement FEAT_TCR2 and FEAT_SCTLR2 (#416 ) This is simply adding the new Armv8.9 registers defined in the related features: - FEAT_TCR2 - FEAT_SCTLR2	2023-10-11 10:14:31 +01:00
Bobby R. Bruce	c4156b06fb	python: Fix `base` logic in `MetaSimObject` This ensures `class Foo` is considered equivalent to `class Foo(object)`. Change-Id: I65a8aec27280a0806308bbc9d32281dfa6a8f84e	2023-10-10 21:47:08 -07:00
Bobby R. Bruce	298119e402	misc,python: Run `pre-commit run --all-files` Applies the `pyupgrade` hook to all files in the repo. Change-Id: I9879c634a65c5fcaa9567c63bc5977ff97d5d3bf	2023-10-10 21:47:07 -07:00
Bobby R. Bruce	83af4525ce	misc,python: Add `pyupgrade` hook to pre-commit This hook automatically upgrades the syntax to recommended standards for new versions of the language. These are numerous and are outlined here: https://github.com/asottile/pyupgrade Change-Id: I73fc58a08160ed9a21cfa3b3e023c259a84592ba	2023-10-10 21:43:39 -07:00
Bobby R. Bruce	3f5d7d647a	misc: Run `pre-commit autoupdate` (#419 ) 1. Runs `pre-commit autoupdate`. 2. Runs `pre-commit run --all-files`. 3. Adds (2.) to ".git-blame-ignore-rev".	2023-10-10 21:41:33 -07:00
Bobby R. Bruce	d559c24ac2	stdlib: Improve handing of errors in Atlas request failures (#404 ) Now: * The Atlas Client will attempt a connection 4 times, using an exponential backoff approach between attempts. * When a failure does arise a rich output is given so problems can be easily diagnosed. Addresses: #340	2023-10-10 21:34:24 -07:00
Bobby R. Bruce	ad2fe42686	Learning-gem5: fix formatting (#401 ) Using f-strings instead of % for formatting.	2023-10-10 16:47:37 -07:00
Harshil Patel	bbc301f2f0	stdlib, tests: Fixed bugs and tests - Fixed bugs rekated to retrying on request faliure. - Updated the pyunit tests. Change-Id: Ia484690267bf27018488324f3408f7e47c59bef3	2023-10-10 15:54:20 -07:00
Bobby R. Bruce	25b2786db8	misc,python: Add `requirements-txt-fixer` to pre-commit (#422 )	2023-10-10 14:30:39 -07:00
Kaustav Goswami	937b829e8f	configs,ext: Updated the gem5 SST Bridge to use SST 13.0.0 This change updates the gem5 SST Bridge to use SST 13.0.0. Changes are made to replace SimpleMem class to StandardMem class as SimpleMem will be deprecated in SST 14 and above. In addition, the translator.hh is updated to translate more types of gem5 packets. A new parameter `ports` was added on SST's side when invoking the gem5 component which does not require recompiling the gem5 component whenever a new outgoing bridge is added in a gem5 config. Change-Id: I45f0013bc35d088df0aa5a71951422cabab4d7f7 Signed-off-by: Kaustav Goswami <kggoswami@ucdavis.edu>	2023-10-10 14:16:29 -07:00
Bobby R. Bruce	1502f7c09f	misc: Add black update change to .git-blame-ignore-rev Change-Id: Ief04aec128bc48e66b79fc2f5c474948dd5eb9eb	2023-10-10 14:02:37 -07:00
Bobby R. Bruce	ddf6cb88e4	misc: Run `pre-commit run --all-files` This is reflect the updates made to black when running `pre-commit autoupdate`. Change-Id: Ifb7fea117f354c7f02f26926a5afdf7d67bc5919	2023-10-10 14:01:58 -07:00
Bobby R. Bruce	317d2fb5b8	misc: Run `pre-commit autoupdate` This updates the pre-commit utility from v4.3.0 to v4.5.0 and updates black from 22.6.0 to 23.9.1. Change-Id: I7ebb551f30e617059ce49f89a30207f739b1cb14	2023-10-10 14:00:57 -07:00
ivanaamit	486763b671	learning-gem5: use f-string for print Change-Id: If27af6524af4e4a6a59e914e9e40ba10de24adf4	2023-10-10 13:54:07 -07:00
Bobby R. Bruce	58140bba1f	tests: Update test workflows for new runners (#417 ) #371 Updates the runners. This PR updates the tests to: 1. Drop the 'run' and 'build' labels (all runners are now of the same type). 2. Utilize the threading where possible (runners now have 4 cores minimum).	2023-10-10 12:03:00 -07:00
Bobby R. Bruce	0ec1fb167b	stdlib: Fix use internal _hashlib in md5_utils.py (#427 ) Removes the use of the internal _hashlib, which is an internal Python API This is a fix for issue #383	2023-10-10 08:32:45 -07:00
Yu-Cheng Chang	141b06d335	arch,arch-riscv: Remove setRegOperand in VecRegOperand (#341 ) The RISC-V vector instructions still work without setRegOperand. We should fix the register statistic issue by https://github.com/gem5/gem5/pull/360 to avoid duplicate statistic register write count Change-Id: Ib6a52935e00c3e557b366abfcf60450dca05614d	2023-10-10 08:00:10 -07:00
Matthew Poremba	9f4d334644	gpu-compute: Update tokens for flat global/scratch Memory instructions acquire coalescer tokens in the schedule stage. Currently this is only done for buffer and flat instructions, but not flat global or flat scratch. This change now acquires tokens for flat global and flat scratch instructions. This provides back-pressure to the CUs and helps to avoid deadlocks in Ruby. The change also handles returning tokens for buffer, flat global, and flat scratch instructions. This was previously only being done for normal flat instructions leading to deadlocks in some applications when the tokens were exhausted. To simplify the logic, added a needsToken() method to GPUDynInst which return if the instruction is buffer or any flat segment. The waitcnts were also incorrect for flat global and flat scratch. We should always decrement vmem and exp count for stores and only normal flat instructions should decrement lgkm. Currently vmem/exp are not decremented for flat global and flat scratch which can lead to deadlock. This change set fixes this by always decrementing vmem/exp and lgkm only for normal flat instructions. Change-Id: I673f4ac6121e4b5a5e8491bc9130c6d825d95fc5	2023-10-10 09:48:16 -05:00
Matt Sinclair	ec633b3d68	dev-amdgpu,mem-ruby: Add support to checkpoint and restore between kernels in GPUFS (#377 ) Earlier, GPU checkpointing was working only if a checkpoint was created before the first kernel execution. This pull request adds support to checkpoint in-between any two kernel calls. It does so by doing the following. - Adds flush support in the GPU_VIPER protocol - Adds flush support in the GPUCoalescer - Updates cache recorder to use the GPUCoalescer during simulation cooldown and cache warmup times.	2023-10-10 09:41:21 -05:00
Giacomo Travaglini	d9fe0cfe1c	arch-arm: Make interrupt masking handle VHE/SEL2 cases (#430 ) The new implementation matches the table in the ARM Architecture Reference Manual (version DDI 0487J.a, section D1.3.6, table R_SXLWJ) It takes into consideration features like FEAT_SEL2 (scr.eel2 bit) and FEAT_VHE (hcr.e2h bit) which affect the masking of interrupts under certain circumstances	2023-10-10 15:22:34 +01:00
Giacomo Travaglini	8acf49b6fa	arch-arm: Revamp takeInt to take VHE/SEL2 into account The new implementation matches the table in the ARM Architecture Reference Manual (version DDI 0487J.a, section D1.3.6, table R_SXLWJ) It takes into consideration features like FEAT_SEL2 (scr.eel2 bit) and FEAT_VHE (hcr.e2h bit) which affect the masking of interrupts under certain circumstances Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Change-Id: I07ebd8d859651475bd32fd201eea0f4e64a7dd5f	2023-10-10 09:46:47 +01:00
Giacomo Travaglini	e412ddddbd	arch-arm: Split takeInt into AArch64/32 versions We pay a small duplication cost but we make the code more readable and we enable further modifications to the AArch64 code without forcing the same code on the AArch32 method Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Change-Id: I1efa33cf19f91094fd33bd48b6a0a57d8df8f89f	2023-10-10 09:45:59 +01:00
root	05ebbd2184	stdlib: Fix use internal _hashlib in md5_utils.py Removes the use of the internal _hashlib, which is an internal Python API Change-Id: Id4541a143adb767ca7d942c0fd8a1cf1a08a04ab	2023-10-10 06:18:59 +00:00
Bobby R. Bruce	dc38a801b7	Merge branch 'develop' into workflows-for-new-runners	2023-10-09 23:10:18 -07:00
Bobby R. Bruce	da212e04b5	Merge branch 'develop' into requirements-fixer-hook	2023-10-09 22:37:25 -07:00
Bobby R. Bruce	486916b5d4	configs,tests: Remove `mkdir` in simpoint-se-checkpoint.py (#425 ) This `mkdir` is problematic as it doesn't create the directory recursively. This casues errors if `dir` is `X/Y/Z` and both `Y` and `Z` has not been created. An error will be returned (`No such file or directory`). This issue was fixed with: https://github.com/gem5/gem5/pull/263. The checkpointing code already recursively creates directories as needed. Ergo was can remove this `mkdir` statement.	2023-10-09 22:34:19 -07:00
Bobby R. Bruce	c5f06265bb	misc,python: Add yaml formatter to pre commit (#423 )	2023-10-09 17:55:25 -07:00
Bobby R. Bruce	51c881d0f1	stdlib: Improve handing of errors in Atlas request failures Now: * The Atlas Client will attempt a connection 4 times, using an exponential backoff approach between attempts. * When a failure does arise a rich output is given so problems can be easily diagnosed. Change-Id: I3df332277c33a040c0ed734b9f3e28f38606af44	2023-10-09 16:30:02 -07:00
Bobby R. Bruce	93704a81f1	dev-amdgpu,gpu-compute: Implement GPU and HSA timestamps (#410 ) This PR adds two commit to handle timestamps in the ROCm runtime. ROCr uses a mix of GPU timestamp reads and HSA packet timestamps to output profiling information for a task dispatch. The first patch added timestamps to the HSA completion signal indicating when the task started and ended and require changing the flow of completion signal DMAs to ensure the DMA of the timestamp values completed before writing the completion signal value. Second commit adds MMIOs for reading the GPU's timestamp counter. This MMIO resides in the GFX MMIO space so a new class is added to handle MMIOs in that address range.	2023-10-09 14:11:52 -07:00
Bobby R. Bruce	21c5d77000	configs: Add an example elastic trace generation script (#415 ) Current [TraceCPU documentation](https://www.gem5.org/documentation/general_docs/cpu_models/TraceCPU) still references the deprecated se.py/fs.py scripts for elastic trace generation (script paths are also outdated). With this PR we provide a simpler Arm based elastic trace generation script that can be used out of the box by a user or that can be extended as needed.	2023-10-09 14:11:33 -07:00
Bobby R. Bruce	1fe0056d3b	configs,tests: Remove `mkdir` in simpoint-se-checkpoint.py This `mkdir` is problematic as it doesn't create the directory recursively. This casues errors if `dir` is `X/Y/Z` and both `Y` and `Z` has not been created. An error will be returned (`No such file or directory`). This issue was fixed with: https://github.com/gem5/gem5/pull/263. The checkpointing code already recursively creates directories as needed. Ergo was can remove this `mkdir` statement. Change-Id: Ibae38267c8ee1eba76d7834367aa1c54013365bc	2023-10-09 14:00:21 -07:00
Bobby R. Bruce	fa8c9414b2	misc,python: Run `pre-commit run --all-files` This applies the automatical formatting to the .yaml files. Change-Id: I10d067ba65722aca8aaf64a62b42ae57de468e75	2023-10-09 13:20:25 -07:00
Bobby R. Bruce	5b09777011	misc,python: Add `pre-commit-hook-yamlfmt` to pre-commit This automatically formats .yaml files. By deault has the following parameters: * `mapping`: 4 spaces. * `sequence`: 6 spaces. * `offset`: 4 spaces. * `colons`: do not align top-level colons. * `width`: None. Change-Id: Iee5194cd57b7b162fd7e33aa6852b64c5578a3d2	2023-10-09 13:16:52 -07:00
Bobby R. Bruce	402ec3a57c	misc,python: Run `pre-commit run --all-files` This applies the `requirement-txt-fixer` to the repo. Change-Id: I23b1d26ad8deb49ec0019095efc6d253ac1c817c	2023-10-09 13:10:23 -07:00
Bobby R. Bruce	c53529783b	misc,python: Add `requirements-txt-fixer` to pre-commit This sorts entries in requirements.txt files. Change-Id: I7ee6e31f3cbe5078f24d13471a6aa9edc482cecd	2023-10-09 13:09:21 -07:00
Bobby R. Bruce	bbe05b0cba	tests,misc: Fix compilation tests failures (#400 ) Exposed in our failing compiler tests: https://github.com/gem5/gem5/actions/runs/6348223508, this PR: * Adds missing overrides to `PCState`'s `set` function. * Removes `std::binary_function` from DramPower (it was deprecated in CPP-11 and officially removed in CPP-17).	2023-10-09 11:20:52 -07:00
Harshil Patel	452a600c49	New function to kernel_disk_workload to allow new disk device location (#151 ) Added a parameter (_disk_device) to kernel_disk_workload which allows users to change the disk device location. get_disk_device() now chooses between the parameter and, if no parameter was passed, it calls a new function _get_default_disk_device() which is implemented by each board and has a default disk device according to each board, eg /dev/hda in the x86_board. The previous way of setting a disk device still exists as a default, however, with the new function users can now override this default	2023-10-09 10:33:45 -07:00
Harshil Patel	79f40ffdab	stdlib: Del comment stating SE mode limited to single thread (#402 ) This comment was left in the codebase in error. The `set_se_binary_workload` function works fine with multi-threaded applications. This hasn't been a restriction for some time.	2023-10-09 10:30:32 -07:00
Harshil Patel	d8fc0180a5	cpu: Restructure BTB (#412 ) This is the first PR in a series of enhancements to the BPU proposed in #358. However, I think putting everything into one PR is not nice to review and prone to oversee I might did. This PR restructures the BTB: - A new abstract BTB class is created to enable different BTB implementations. The new BTB class gets its own parameter and stats. - An enum is added to differentiate branch instruction types. This enum is used to enhance statistics and BPU management. - The existing BTB is moved into `simple_btb` as default. - An additional function is added to store the static instruction in the BTB. This function is used for the decoupled front-end. - Update configs to match new BTB parameters.	2023-10-09 10:13:00 -07:00
Bobby R. Bruce	d5e454138a	util: Remove 'run' and 'build' tags from runners Change-Id: Ib7b2eba5f08a1d8a311dc20cb55f540a5cd7dc7b	2023-10-09 09:56:32 -07:00
Bobby R. Bruce	243a261491	tests: Update Testlib CI tests to use multiheading These were previously only running on single-threaded machines. Now they'll be running on 4-core VMs so may as well run tests in parallel. Change-Id: I7ee86512dc72851cea307dfd800dcf9a02f2f738	2023-10-09 09:56:32 -07:00
Bobby R. Bruce	70f8c49e8b	tests,misc: Remove 'run' and 'build' labels All runners are now equal, these labels are pointless. Change-Id: I9d5fb31e20e95d30e9726d4bf0353dc87af614d7	2023-10-09 09:56:25 -07:00
Giacomo Travaglini	eac5a8b215	arch-arm: Implement FEAT_TCR2 Change-Id: I0396f5938c09b68fcc3303a6fdda1e4dde290869 Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com>	2023-10-09 17:19:57 +01:00
Giacomo Travaglini	49cbb24351	arch-arm: Implement FEAT_SCTLR2 Change-Id: Ifb8c8dc1729cc21007842b950273fe38129d9539 Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com>	2023-10-09 17:12:53 +01:00
Giacomo Travaglini	c4c5d2e172	arch-arm: Implement ID_AA64MMFR3_EL1 register Change-Id: If8c37bdccf35a070870900c06dc4640348f0f063 Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com>	2023-10-09 17:12:53 +01:00
Andreas Sandberg	ec7921305b	arch-arm: Implement FEAT_TLBIRANGE extension (#414 )	2023-10-09 17:09:31 +01:00

1 2 3 4 5 ...

20744 Commits