derek/gem5

Author	SHA1	Message	Date
Kyle Roarty	078dc689b9	sim-se: Fix execve syscall There were three things preventing execve from working Firstly, the entrypoint for the new program wasn't correct. This was fixed by calling Process::init, which adds a bias to the entrypoint. Secondly, the uname string wasn't being copied over. This meant when the new executable tried to run, it would think the kernel was too old to run on, and would error out. This was fixed by copying over the uname string (the `release` string in Process) when creating the new process. Additionally, this patch also ensures we copy over the uname string in the clone implementation, as otherwise a cloned thread that called execve would crash. Finally, we choose to not delete the new ProcessParams or the old Process. This is done both because it matches what is done in cloneFunc, but also because deleting the old process results in a segfault later on. Change-Id: I4ca201da689e9e37671b4cb477dc76fa12eecf69 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/48345 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Reviewed-by: Bobby R. Bruce <bbruce@ucdavis.edu> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-07-26 18:36:55 +00:00
Kyle Roarty	1577897265	arch-gcn3: Validate if scalar sources are scalar gprs Scalar sources can either be a general-purpose register or a constant register that holds a single value. If we don't check for if the register is a general-purpose register, it's possible that we get a constant register, which then causes all of the register mapping code to break, as the constant registers aren't supposed to be mapped like the general-purpose registers are. This fix adds an isScalarReg check to the instruction encodings that were missing it. Change-Id: I3d7d5393aa324737301c3269cc227b60e8a159e4 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/48344 Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Reviewed-by: Bobby R. Bruce <bbruce@ucdavis.edu> Reviewed-by: Matthew Poremba <matthew.poremba@amd.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com>	2021-07-26 18:36:24 +00:00
Kyle Roarty	9a7fc4ff69	arch-gcn3: Implement LDS accesses in Flat instructions Add support for LDS accesses by allowing Flat instructions to dispatch into the local memory pipeline if the requested address is in the group aperture. This requires implementing LDS accesses in the Flat initMemRead/Write functions, in a similar fashion to the DS functions of the same name. Because we now can potentially dispatch to the local memory pipeline, this change also adds a check to regain any tokens we requested as a flat instruction. Change-Id: Id26191f7ee43291a5e5ca5f39af06af981ec23ab Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/48343 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Reviewed-by: Matthew Poremba <matthew.poremba@amd.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-07-26 18:36:16 +00:00
Kyle Roarty	523a92f7f0	arch-gcn3: Implement large ds_read/write instructions This implements the 96 and 128b ds_read/write instructions in a similar fashion to the 3 and 4 dword flat_load/store instructions. These instructions are treated as reads/writes of 3 or 4 dwords, instead of as a single 96b/128b memory transaction, due to the limitations of the VecOperand class used in the amdgpu code. In order to handle treating the memory transaction as multiple dwords, the patch also adds in new initMemRead/initMemWrite functions for ds instructions. These are similar to the functions used in flat instructions for the same purpose. Change-Id: I0f2ba3cb7cf040abb876e6eae55a6d38149ee960 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/48342 Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Alex Dutu <alexandru.dutu@amd.com> Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com>	2021-07-24 17:27:02 +00:00
Kyle Roarty	1415308d10	mem-ruby: Account for misaligned accesses in GPUCoalescer Previously, we assumed that the maximum number of requests that would be issued by an instruction was equal to the number of threads that were active for that instruction. However, if a thread has an access that crosses a cache line, that thread has a misaligned access, and needs to request both cache lines. This patch takes that into account by checking the status vector for each thread in that instruction to determine the number of requests. Change-Id: I1994962c46d504b48654dbd22bcd786c9f382fd9 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/48341 Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Reviewed-by: Matthew Poremba <matthew.poremba@amd.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com>	2021-07-24 17:27:02 +00:00
Kyle Roarty	f8578e4b05	gpu-compute: Fix TLB coalescer starvation Currently, we are storing coalesced accesses in an std::unordered_map indexed by a tick index, i.e. issue tick / coalescing window. If there are multiple coalesced requests, at different tick indexes, to the same virtual address, then the TLB coalescer will issue just the first one. However, std::unordered_map is not a sorted container and we issue coalesced requests by iterating through that container. This means that the coalesced request sent in TLBCoalescer::processProbeTLBEvent is not necessarily the oldest one. Because of this, in cases of high contention the oldest coalesced request will have a huge TLB access latency. To fix this issue, we will use an std::map which is a sorted container and therefore guarantees the oldest coalesced request will be sent first. Change-Id: I9c7ab32c038d5e60f6b55236266a27b0cae8bfb0 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/48340 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Reviewed-by: Matthew Poremba <matthew.poremba@amd.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-07-24 17:27:02 +00:00
Giacomo Travaglini	a10106e94a	arch-arm: Stage1&2 TableWalkers sharing same port This patch reverts part of the changes made by the removal of the Stage2MMU class [1]: Prior to that patch the stage1 and stage2 walkers were sharing the same port (which was instantiated in the Stage2MMU). By removing the Stage2MMU we provided every table walker a unique port. With this patch we are reintroducing port sharing to temporarily fix existing platforms using walker caches. (The long term design goal will be to have a unique page table walker) Those complain if we try to connect a single ported cache to 2 table walker ports (stage1 and stage2) [1]: https://gem5-review.googlesource.com/c/public/gem5/+/45780 Change-Id: Ib68ef97f1e9772a698771269c9a4ec4514f5d4d7 Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/48200 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-07-19 13:27:11 +00:00
Hoa Nguyen	a021618745	arch-riscv: Revert change-45522 This reverts change: https://gem5-review.googlesource.com/c/public/gem5/+/45522. This reverts commit `1cf41d4c54`. Reason for revert: The above commit caused booting Linux using RISCV either to hang or to take significantly more time to finish. For the v21-1 release, the above commit will be reverted. JIRA: https://gem5.atlassian.net/browse/GEM5-1043 Change-Id: I58fbe96d7ea50031eba40ff49dabdef971faf6ff Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/48099 Reviewed-by: Bobby R. Bruce <bbruce@ucdavis.edu> Maintainer: Bobby R. Bruce <bbruce@ucdavis.edu> Tested-by: kokoro <noreply+kokoro@google.com>	2021-07-16 20:50:47 +00:00
Hoa Nguyen	ccd03cf704	cpu: remove O3 dependency of CheckerCPU Currently, compiling CheckerCPU uses the dyn_inst.hh header from O3CPU. However, including this header is not required and it causes gem5 to fail to build when O3CPU is not part of CPU_MODELS. This change also involves moving the dependency on src/cpu/o3/dyn_inst.hh to src/cpu/o3/cpu.cc and src/cpu/lsq_unit.cc, which previously included src/cpu/o3/dyn_inst.hh implicitly through src/cpu/checker/cpu.hh. JIRA: https://gem5.atlassian.net/browse/GEM5-1025 Change-Id: I7664cd4b9591bf0a4635338fff576cb5f5cbfa10 Signed-off-by: Hoa Nguyen <hoanguyen@ucdavis.edu> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/48079 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-07-15 06:01:48 +00:00
Kyle Roarty	fb4a7e1e24	gpu-compute: Fix off-by-one when creating an AddrRange The end value of an AddrRange is already not included in the range, so subtracting one from the end creates an off-by-one error. This patch removes the extra -1 that was used when determining the end of an AddrRange in allocateGpuVma Change-Id: I75659e9a7fabd991bb37be9aa40f8e409eb21154 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/48020 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Reviewed-by: Bobby R. Bruce <bbruce@ucdavis.edu> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-07-14 20:47:27 +00:00
Kyle Roarty	46e62e5eb3	arch-gcn3: Free dest registers in non-memory Load DS insts Certain DS insts are classified as Loads, but don't actually go through the memory pipeline. However, any instruction classified as a load marks its destination registers as free in the memory pipeline. Because these instructions didn't use the memory pipeline, they never freed their destination registers, which led to a deadlock. This patch explicitly calls the function used to free the destination registers in the execute() method of those Load instructions that don't use the memory pipeline. Change-Id: Ic2ac2e232c8fbad63d0c62c1862f2bdaeaba4edf Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/48019 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Reviewed-by: Bobby R. Bruce <bbruce@ucdavis.edu> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-07-14 20:47:27 +00:00
Javier Garcia Hernandez	4754e32219	arch-arm: Fixes an error related to HTM error code handling Arguments of the function bits(), called in restore method, are the other way around. This leads to wrong retry handling. Jira Issue: https://gem5.atlassian.net/browse/GEM5-1041 Change-Id: I0748b1cad57bea5527ca585852d183bd75b4c9ef Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/47939 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-07-14 17:21:37 +00:00
Bobby R. Bruce	b2677990f6	gpu-compute: Add missing overrides These missing overrides were causing compilations errors with the Clang 11 compiler: https://www.mail-archive.com/gem5-dev@gem5.org/msg39683.html Change-Id: Ib5e7096ab9a7a8505bcc848ff3f08674f7f289ce Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/47899 Reviewed-by: Bobby R. Bruce <bbruce@ucdavis.edu> Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Bobby R. Bruce <bbruce@ucdavis.edu> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-07-13 00:16:51 +00:00
Bobby R. Bruce	b9df038ca8	systemc,tests,python: Updated testall.py to python3 Change-Id: I95fce9d71bf0af9cd76e8bf0dd353281cff8ed74 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/47022 Maintainer: Bobby R. Bruce <bbruce@ucdavis.edu> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com> (cherry picked from commit `f9a941524f`) Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/47919 Reviewed-by: Bobby R. Bruce <bbruce@ucdavis.edu>	2021-07-12 23:31:54 +00:00
Nikos Nikoleris	264ee10991	cpu-minor: Substitute calls to functions removed in c++-17 Change-Id: Ib15234b37e577afd7ff186f1ba7cc5896aea1430 Signed-off-by: Nikos Nikoleris <nikos.nikoleris@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/47799 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-07-10 21:07:23 +00:00
Nikos Nikoleris	d63c30df97	base: Make the random number generator public There are cases where we need a random number generator engine. The Random class has such an engine but its interface currently only allows for generating random numbers. To make sure we can reuse the same random number generator in as many places as possible this patch makes the engine in the Random class public. Change-Id: I80153dd39f5b0d12537e4c0cf54773e7725b2a94 Signed-off-by: Nikos Nikoleris <nikos.nikoleris@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/47859 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-07-10 21:07:23 +00:00
Kyle Roarty	e2e18d41e1	configs,gpu-compute: Add support for gfx902/Raven This patch adds support for a gfx902 Vega APU, ripping the appropriate values for device_id from the ROCm Thunk (src/topology.c). Note: gfx902 isn't officially supported by ROCm. This means that it may not work for all programs. In particular, rocBLAS is incompatible with gfx902, so anything that uses rocBLAS won't be able to run with gfx902. Change-Id: I48893e7cc9c7e52275fdfd22314f371a9db8e90a Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/47530 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Reviewed-by: Matthew Poremba <matthew.poremba@amd.com> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-07-10 03:42:03 +00:00
Matthew Poremba	897c0c11ed	dev,dev-hsa,gpu-compute: Refactor dmaVirt calls Remove the duplicate dmaVirt calls from HSA packet processor and GPU command processor and move them into their own class. This removes some duplicate code and allows a DmaVirtDevice to be created which will be useful for upcoming full system GPU commits. The DmaVirtDevice is an abstraction of the base DmaDevice but iterates using ChunkGenerator over virtual addresses. Classes which inherit from DmaVirtDevice must provide a translation function to translate from virtual address to physical address. Once translated, the physical address is passed to DmaDevice to do the work. Change-Id: Idd59ccb4d9ba21c0b1150ee328ededf5a88d824e Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/47179 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-07-09 22:40:18 +00:00
Nathanael Premillieu	5c7e1bd917	mem-cache: adding late prefetch stats Adding a late prefetch stat plus stats for each reason a prefetch can be detected as late Change-Id: Ia6d5294e8ce58b2b0aae2be98fd0cee83be73b8d Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/47204 Reviewed-by: Daniel Carvalho <odanrc@yahoo.com.br> Reviewed-by: Nikos Nikoleris <nikos.nikoleris@arm.com> Maintainer: Daniel Carvalho <odanrc@yahoo.com.br> Tested-by: kokoro <noreply+kokoro@google.com>	2021-07-09 22:30:45 +00:00
Nathanael Premillieu	0339f34b87	mem-cache: count pf filtered by demand to the same cache line Add a stat to count how many prefetch requests are filtered in the prefetch queue because a demand is going to the same cache line. Also adding a corresponding debug statement for when it happens Change-Id: I52475f19bd109c135b7259d08d5f5c0b5fd90ee5 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/47203 Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Nikos Nikoleris <nikos.nikoleris@arm.com> Reviewed-by: Daniel Carvalho <odanrc@yahoo.com.br> Maintainer: Jason Lowe-Power <power.jg@gmail.com>	2021-07-09 22:30:45 +00:00
Nathanael Premillieu	de80da9204	mem-cache: show in DPPRINTF if block is prefetched Add the prefetch status in the DPRINTF showing the state of a cache block. Change-Id: Ib8edf882dc17414f751cc8773d9035ee2887e971 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/47202 Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Nikos Nikoleris <nikos.nikoleris@arm.com> Reviewed-by: Daniel Carvalho <odanrc@yahoo.com.br> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com>	2021-07-09 22:30:45 +00:00
Nathanael Premillieu	b193c0adfd	mem-cache: add option to send pf on hit on pf From the point of view of the prefetchers, a hit on a prefetched block should be considered the same as a miss: a new prefetch should be generated. Change-Id: If865324502b81cfd3ae8c009666d3f498092b90f Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/47201 Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Nikos Nikoleris <nikos.nikoleris@arm.com> Reviewed-by: Daniel Carvalho <odanrc@yahoo.com.br> Maintainer: Daniel Carvalho <odanrc@yahoo.com.br>	2021-07-09 22:30:45 +00:00
Nathanael Premillieu	352ae672e2	mem-cache: accuracy and coverage stat for prefetchers Add an accuracy and coverage stat for the prefetchers. Accuracy is defined as the ratio of the number of prefetch requests that have been counted as useful over the number of prefetch requests issued. Accuracy tells whether the prefetcher is producing useful requests or not. Coverage is defined as the ratio of the number of prefetch requests that have been counted as useful over the number of demand misses if there was no prefetch, which is counted as the number of useful prefetch requests plus the remaining demand misses. Due to the way stats are defined in the cache, I have to add a stat to count the number of remaining demand misses directly in the prefetcher stat. Demand is defined as being one of these request types: ReadReq, WriteReq, WriteLineReq, ReadExReq, ReadCleanReq, ReadSharedReq. Coverage tells what part of misses are covered by the prefetcher. Change-Id: I3bb8838f87b42665fdd782889f6ba56ca2a802fc Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/47603 Reviewed-by: Daniel Carvalho <odanrc@yahoo.com.br> Maintainer: Daniel Carvalho <odanrc@yahoo.com.br> Tested-by: kokoro <noreply+kokoro@google.com>	2021-07-09 22:30:45 +00:00
Nathanael Premillieu	c66f32f24e	mem-cache: add a prefetch useful stat Count how many times a prefetch is useful, meaning a hit has happened on a prefetched cache block. Another stat (pfUsefulButMiss) has been added to count the special case where there is a hit on a prefetched block but it is counted as a miss because the block is not in the requested coherency state. Change-Id: I253216b9ac96d5f21139b710c489d6eb3fce7136 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/47602 Reviewed-by: Daniel Carvalho <odanrc@yahoo.com.br> Reviewed-by: Nikos Nikoleris <nikos.nikoleris@arm.com> Maintainer: Daniel Carvalho <odanrc@yahoo.com.br> Tested-by: kokoro <noreply+kokoro@google.com>	2021-07-09 22:30:45 +00:00
Nathanael Premillieu	f691613876	mem-cache: print when hitting on a prefetched line Only print it on the first hit on a prefetched line (as the prefetched flag is removed after the first hit). This is useful when debugging prefetchers. Change-Id: Id67cc957c7366a244bedad93824a3c4fdf2055b5 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/47601 Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Nikos Nikoleris <nikos.nikoleris@arm.com> Reviewed-by: Daniel Carvalho <odanrc@yahoo.com.br> Maintainer: Daniel Carvalho <odanrc@yahoo.com.br>	2021-07-09 22:30:45 +00:00
Nathanael Premillieu	cf0881433b	mem-cache: add pfIssued stat in MultiPrefetcher Count issued prefetches for each prefetcher in a MultiPrefetcher. Change-Id: If03fb0669af9bb92ce9cf210b6201a9719a7c771 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/47600 Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Daniel Carvalho <odanrc@yahoo.com.br> Reviewed-by: Nikos Nikoleris <nikos.nikoleris@arm.com> Maintainer: Daniel Carvalho <odanrc@yahoo.com.br>	2021-07-09 22:30:45 +00:00
Nathanael Premillieu	85a8dbf761	mem-cache: move unusedPrefetches stat to prefetcher This stat belongs to prefetchers. It has been renamed to pfUnused to match the naming of existing prefetcher stats. Change-Id: Iec350a62da544535dfc0c2527fcdf73217ae4db7 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/47599 Reviewed-by: Daniel Carvalho <odanrc@yahoo.com.br> Reviewed-by: Nikos Nikoleris <nikos.nikoleris@arm.com> Maintainer: Daniel Carvalho <odanrc@yahoo.com.br> Tested-by: kokoro <noreply+kokoro@google.com>	2021-07-09 22:30:45 +00:00
Nathanael Premillieu	f3e7d02150	mem-cache: print prefetch queues in Queued prefetcher Added to track the content of the prefetch queues in the debug output Change-Id: I49d0f4f643ec0dbd7af3087b6267d454cfccddba Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/47199 Maintainer: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Daniel Carvalho <odanrc@yahoo.com.br> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-07-09 22:30:45 +00:00
Kyle Roarty	1812041dc0	gpu-compute: Update GET_PROCESS_APERTURES IOCTLs The apertures for non-gfx801 GPUs are set differently. If the apertures aren't set properly, ROCm will error out. This change sets the apertures appropriately based on the gfx version of the simulated GPU. It also adds in new functions to set the scratch and lds apertures in GFX9 to mimic the linux kernel. Change-Id: I1fa6f60bc20c7b6eb3896057841d96846460a9f8 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/47529 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Reviewed-by: Matthew Poremba <matthew.poremba@amd.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-07-09 16:22:07 +00:00
Kyle Roarty	ab9e28ddb8	configs,gpu-compute: Set proper dGPUPoolID defaults In GPU.py, dGPUPoolID is defined as an int, but was defaulted to False. Explicitly set it to 0, instead. In apu_se.py, dGPUPoolID was being set to 1, but that was resulting in crashes. Setting it to 0 avoids those crashes. Change-Id: I0f1161588279a335bbd0d8ae7acda97fc23201b5 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/47527 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Reviewed-by: Matthew Poremba <matthew.poremba@amd.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-07-09 16:11:20 +00:00
Kyle Roarty	6d2404acc5	arch-x86: Ignore mbind syscall mbind gets called when running with a dGPU in ROCm 4, but we are able to ignore it without breaking anything Change-Id: I7c1ba47656122a5eb856981dca2a05359098e3b2 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/47526 Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com>	2021-07-09 16:11:20 +00:00
Kyle Roarty	76888a9cca	gpu-compute: Add mmap functionality to GPURenderDriver dGPUs mmap the GPURenderDriver, however it doesn't appear that they do anything with it. This patch implements the mmap function by just returning the address provided, while not doing anything else Change-Id: Ia010a2aebcf7e2c75e22d93dfb440937d1bef3b1 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/47523 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Reviewed-by: Matthew Poremba <matthew.poremba@amd.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-07-09 16:11:20 +00:00
Daniel R. Carvalho	bb596f55e3	base-stats: Use smart pointer for info's storageParams Previously the storage params were not being deallocated. Make sure this happens by managing it with smart pointers. As a side effect, encapsulate this variable to facilitate future changes. Change-Id: I4c2496d08241f155793ed35e3463512d9ea06f83 Signed-off-by: Daniel R. Carvalho <odanrc@yahoo.com.br> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/38178 Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Hoa Nguyen <hoanguyen@ucdavis.edu> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com>	2021-07-09 11:24:10 +00:00
Daniel R. Carvalho	70194795c3	base-stats: Use std vector in vector stats Use std::vector in vector based stats to avoid data management. Change-Id: I6b341f03e4861a5b8f80fa8741373065b7c755bf Signed-off-by: Daniel R. Carvalho <odanrc@yahoo.com.br> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/27085 Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com>	2021-07-09 11:24:10 +00:00
Daniel R. Carvalho	b63a802033	base-stats: Remove info dependency from stats storage Info depends on the storage type, not the other way around. Change-Id: Ie3deca17b859a217c0c7bd833c017d9436eee4b0 Signed-off-by: Daniel R. Carvalho <odanrc@yahoo.com.br> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/27083 Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com>	2021-07-09 11:24:10 +00:00
Daniel R. Carvalho	e9bdc92039	base-stats,tests: Add unit test for Stats::Group Add a unit test for Stats::Group. Three bugs were found: groups are able to add themselves/null groups as their sub-groups, and one can create a cyclic dependency of sub-groups. The ADD_STAT macro is not being tested. Change-Id: I52326994b3f75e313024f872d214e8c45943f44d Signed-off-by: Daniel R. Carvalho <odanrc@yahoo.com.br> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/43010 Maintainer: Bobby R. Bruce <bbruce@ucdavis.edu> Reviewed-by: Hoa Nguyen <hoanguyen@ucdavis.edu> Tested-by: kokoro <noreply+kokoro@google.com>	2021-07-09 11:24:10 +00:00
Daniel R. Carvalho	d001f3f575	base-stats,tests: Add unit test for Stats::Info Add a unit test for stats/info. One test has been disabled due to not knowing the expected behavior. It is important to notice that Stats::Info can have duplicate names using the new style. Stats::Group is responsible for not allowing duplicate names in this case. Change-Id: I8b169d34c1309b37ba79fa9cf6895547b7e97fc0 Signed-off-by: Daniel R. Carvalho <odanrc@yahoo.com.br> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/43009 Reviewed-by: Bobby R. Bruce <bbruce@ucdavis.edu> Maintainer: Bobby R. Bruce <bbruce@ucdavis.edu> Tested-by: kokoro <noreply+kokoro@google.com>	2021-07-09 11:24:10 +00:00
Daniel R. Carvalho	79bab1dc5d	mem: Adopt a memory namespace for memories Encapsulate every class inheriting from Abstract or Physical memories, and the memory controller in a memory namespace. Change-Id: I228f7e55efc395089e3616ae0a0a6325867bd782 Issued-on: https://gem5.atlassian.net/browse/GEM5-983 Signed-off-by: Daniel R. Carvalho <odanrc@yahoo.com.br> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/47309 Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com>	2021-07-09 11:24:10 +00:00
Daniel R. Carvalho	5635b3aaa2	mem: Adopt the memory namespace in qos files Encapsulate everything qos-related in the gem5::memory namespace. Change-Id: Ib906ddd6d76b9d4a56f2eb705efe6cd498829155 Issued-on: https://gem5.atlassian.net/browse/GEM5-983 Signed-off-by: Daniel R. Carvalho <odanrc@yahoo.com.br> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/47308 Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com>	2021-07-09 11:24:10 +00:00
Kyle Roarty	06da510020	arch-vega: Add decoding for implemented insts Certain instructions were implemented in instructions.cc, but weren't actually being decoded by the decoder, causing the decoder to return nullptr for valid instructions. This patch fixes the decoder to return the proper instruction class for implemented instructions Change-Id: I8d8525a1c435147017cb38d9df8e1675986ef04b Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/47521 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Reviewed-by: Matthew Poremba <matthew.poremba@amd.com> Reviewed-by: Alex Dutu <alexandru.dutu@amd.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-07-09 03:49:29 +00:00
Kyle Roarty	9fe9d83e5b	arch-vega: Add missing return to flat_load_dwordx4 Change-Id: Ibf56c25a3d22d3c12ae2c1bb11f00f4a44b5919a Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/47520 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Reviewed-by: Matthew Poremba <matthew.poremba@amd.com> Reviewed-by: Alex Dutu <alexandru.dutu@amd.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-07-09 03:49:29 +00:00
Kyle Roarty	cb73fe1959	arch-vega: Fix s_endpgm instruction Copy over changes that had been made to s_endpgm in GCN3 but weren't added to the Vega implementation Change-Id: I1063f83b1ce8f7c5e451c8c227265715c8f725b9 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/47519 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Reviewed-by: Matthew Poremba <matthew.poremba@amd.com> Reviewed-by: Alex Dutu <alexandru.dutu@amd.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-07-09 03:49:29 +00:00
Daniel R. Carvalho	80b5f43d29	sim: Align coding style of probes Fix coding style of probes for the following aspects: - Long lines - Return type of multi-line functions - Name of local parameters Also, added missing overrides Jira: https://gem5.atlassian.net/browse/GEM5-857 Change-Id: Ibd905d1941fc203ca8308f7a3930d58515b19a97 Signed-off-by: Daniel R. Carvalho <odanrc@yahoo.com.br> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/38697 Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Andreas Sandberg <andreas.sandberg@arm.com> Maintainer: Andreas Sandberg <andreas.sandberg@arm.com>	2021-07-08 11:58:25 +00:00
Davide Basilio Bartolini	7bdd8b56e3	base-stats: Add basic test for hdf5 stats Also rethrow hdf5 exceptions to propagate error message Related JIRA issue: https://gem5.atlassian.net/browse/GEM5-1028 Change-Id: I67badcf261f04cd446d016a4ad3d7393bad9a6ba Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/47740 Reviewed-by: Bobby R. Bruce <bbruce@ucdavis.edu> Maintainer: Bobby R. Bruce <bbruce@ucdavis.edu> Tested-by: kokoro <noreply+kokoro@google.com>	2021-07-08 10:33:35 +00:00
Davide Basilio Bartolini	86c3f08d13	base-stats: Fix stat descriptions Change-Id: Ib43dc9261287f10c615786a0d533304ab55ac232 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/47741 Reviewed-by: Bobby R. Bruce <bbruce@ucdavis.edu> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Bobby R. Bruce <bbruce@ucdavis.edu> Tested-by: kokoro <noreply+kokoro@google.com>	2021-07-08 10:33:35 +00:00
Richard Cooper	c25ef2d191	arch-arm: Update ARMv8.1-PAN to allow unprivileged instructions. Update the ARMv8.1-PAN implementation to allow specified unprivileged instructions to execute even when the cpsr.pan bit is set. The specified instructions generate memory requests with the TLB::ArmFlags::UserMode flags bit set. See sections D5.4.2 (About PSTATE.PAN) and G5.6.2 (About the PAN bit) of the Arm Architecture Reference Manual for details. https://developer.arm.com/documentation/ddi0487/latest/ Change-Id: I9e904e0154de72c2e4cc70cbc49b3c8407a3cb1d Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/47779 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Giacomo Travaglini <giacomo.travaglini@arm.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-07-08 08:48:51 +00:00
Richard Cooper	04a4fb53ad	misc: Fix nesting of namespace{} and if-endif. Fix a build error caused by the misaligned nesting of namespace{} and if-endif. Previously, if HAVE_TUNTAP is not defined then the gem5 namespace is not closed. Change-Id: I5e45d2271f97bb9e38c91565adb6aff0a7b43744 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/47761 Reviewed-by: Nikos Nikoleris <nikos.nikoleris@arm.com> Reviewed-by: Bobby R. Bruce <bbruce@ucdavis.edu> Maintainer: Bobby R. Bruce <bbruce@ucdavis.edu> Tested-by: kokoro <noreply+kokoro@google.com>	2021-07-08 08:48:08 +00:00
Richard Cooper	704ef29f4f	arch-arm: Fix for build error in recent MacOS 11. On a recent version of MacOS 11, the build fails due to the missing sysctl.h include. Updated the preprocessor macros to include this file for __APPLE__ builds. Change-Id: I985d6c2ea97b82b32750bb562b2051f87d6c2e65 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/47760 Reviewed-by: Nikos Nikoleris <nikos.nikoleris@arm.com> Reviewed-by: Bobby R. Bruce <bbruce@ucdavis.edu> Maintainer: Bobby R. Bruce <bbruce@ucdavis.edu> Tested-by: kokoro <noreply+kokoro@google.com>	2021-07-08 08:48:08 +00:00
Richard Cooper	cd76334a2a	sim-se: Fix for build error in MacOS. On MacOS the build fails because the types of readfds, writefds, and errorfds cannot be automatically converted to fd_set*. Added casts similar to the ones used for the FD_ZERO calls to help the compiler. Change-Id: I40a2268f7f2ca1bece1ecafda52dfddf2212364d Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/47759 Reviewed-by: Nikos Nikoleris <nikos.nikoleris@arm.com> Reviewed-by: Bobby R. Bruce <bbruce@ucdavis.edu> Maintainer: Bobby R. Bruce <bbruce@ucdavis.edu> Tested-by: kokoro <noreply+kokoro@google.com>	2021-07-08 08:48:08 +00:00
Kyle Roarty	ebb6c4b99b	gpu-compute: Check for WAX dependences This adds checking if the destination registers are free or busy in the operandsReady() function for both scalar and vector registers. This allows us to catch WAX dependences between instructions. Change-Id: I0fb0b29e9608fca0d90c059422d4d9500d5b2a7d Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/47539 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-07-08 01:18:01 +00:00
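
Note on f8578e4b05 (gpu-compute: Fix TLB coalescer starvation): the commit message relies on std::map iterating in ascending key order, so the entry with the smallest tick index is visited first. The following is only a minimal sketch of that property, not the gem5 code. The names TickIndex, CoalescedReqs and oldestTickIndex are hypothetical stand-ins introduced for illustration.

```cpp
#include <cassert>
#include <cstdint>
#include <map>
#include <string>
#include <vector>

// Hypothetical stand-ins for illustration; these are not the gem5 types.
using TickIndex = std::uint64_t;                // issue tick / coalescing window
using CoalescedReqs = std::vector<std::string>; // requests coalesced in one window

// Return the tick index of the oldest pending group of coalesced requests.
// std::map keeps its keys sorted, so begin() always refers to the smallest
// tick index. With std::unordered_map, begin() could refer to any entry.
TickIndex
oldestTickIndex(const std::map<TickIndex, CoalescedReqs> &table)
{
    assert(!table.empty());
    return table.begin()->first;
}
```

Inserting entries with tick indexes 42, 7 and 19 (in that order) and calling oldestTickIndex returns 7, regardless of insertion order.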

12269 Commits
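
Note on 352ae672e2 (mem-cache: accuracy and coverage stat for prefetchers): the two ratios defined in that commit message can be written out as a short sketch. This is an illustration of the definitions only, not the gem5 stats code. The struct PrefetchCounts and its field names are hypothetical, and returning 0 when a denominator is zero is an assumption made here so the functions are total.

```cpp
#include <cstdint>

// Hypothetical counters for illustration; not the gem5 stat objects.
struct PrefetchCounts
{
    std::uint64_t issued;                // prefetch requests issued
    std::uint64_t useful;                // prefetch requests counted as useful
    std::uint64_t remainingDemandMisses; // demand misses that still occurred
};

// Accuracy: useful prefetches over issued prefetches.
double
accuracy(const PrefetchCounts &c)
{
    // Assumption: report 0 when nothing has been issued.
    return c.issued ? static_cast<double>(c.useful) / c.issued : 0.0;
}

// Coverage: useful prefetches over the demand misses that would have
// happened without prefetching (useful + remaining demand misses).
double
coverage(const PrefetchCounts &c)
{
    const std::uint64_t wouldBeMisses = c.useful + c.remainingDemandMisses;
    // Assumption: report 0 when there were no misses to cover.
    return wouldBeMisses ? static_cast<double>(c.useful) / wouldBeMisses : 0.0;
}
```

For example, with 80 issued, 40 useful and 120 remaining demand misses, accuracy is 40/80 and coverage is 40/160.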