derek/gem5 - gem5 - Gitea: Git with a cup of tea

derek/gem5

Author	SHA1	Message	Date
Giacomo Travaglini	0dc2a87666	dev-arm: Fix PCI range in VExpress_GEM5_Foundation When we added the PCI mem range in the VExpress_GEM5_Foundation [1], we meant to add a 256GiB region starting at 0x40 0000 0000. By mistake the end address was set to 0x8 0000 0000 rather than 0x80 0000 0000 [1]: https://gem5-review.googlesource.com/c/public/gem5/+/44165 Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Change-Id: I848b8fee11fb742939c9343aae4ee5205aa836e4 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/62511 Maintainer: Andreas Sandberg <andreas.sandberg@arm.com> Reviewed-by: Andreas Sandberg <andreas.sandberg@arm.com> Reviewed-by: Richard Cooper <richard.cooper@arm.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-09-01 08:07:02 +00:00
Bobby R. Bruce	2bc5a8b71a	misc: Run pre-commit run on all files in repo The following command was run: ``` pre-commit run --all-files ``` This ensures all the files in the repository are formatted to pass our checks. Change-Id: Ia2fe3529a50ad925d1076a612d60a4280adc40de Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/62572 Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Andreas Sandberg <andreas.sandberg@arm.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com>	2022-08-24 21:47:07 +00:00
Gabe Black	f4209bbdee	misc: Remove lingering uses of TheISA::. Change-Id: Ie55e0d79867fbc8f75a993fb456a58c84de5def4 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/62196 Reviewed-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Tested-by: kokoro <noreply+kokoro@google.com> Maintainer: Giacomo Travaglini <giacomo.travaglini@arm.com>	2022-08-20 07:30:16 +00:00
Gabe Black	a13e3debed	misc: Stop excluding code when building the NULL ISA. The BaseCPU needs a little extra hacking because it tries to create default objects based on what the ISA is. If the ISA isn't recognized, then the types will be set to None, and some extra checks have been added as the type is set up. Change-Id: Ia3cae313e1a96a953d2316d9192f41a8fd28c141 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/62195 Maintainer: Giacomo Travaglini <giacomo.travaglini@arm.com> Reviewed-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-08-20 07:30:07 +00:00
Wei-Han Chen	b3781ce93d	configs: Add ITS in fastmodel cluster There's a gic-its domain in gem5_vexpress_v2 device tree, thus adding ITS domain in fastmodel cluster config. Change-Id: Ieb0221fec2e85710531cef1723c492a07f47290a Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/62212 Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Yu-hsin Wang <yuhsingw@google.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com>	2022-08-10 06:40:35 +00:00
Bobby R. Bruce	787204c92d	python: Apply Black formatter to Python files The command executed was `black src configs tests util`. Change-Id: I8dfaa6ab04658fea37618127d6ac19270028d771 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/47024 Maintainer: Bobby Bruce <bbruce@ucdavis.edu> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-08-03 09:10:41 +00:00
Giacomo Travaglini	ef2573bc95	arch-arm: Convert to the new faulting logic This patch is moving trapping behaviour modelled in MiscRegOp64::trap to the MiscRegLUTEntry fault callbacks. Change-Id: Idfca428e9e6669b747de0255888fc8a85a1f5d07 Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/61683 Reviewed-by: Richard Cooper <richard.cooper@arm.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-07-29 06:51:11 +00:00
Matthew Poremba	68115460d8	gpu-compute: Set LDS and Scratch apertures in FS The LDS and scratch aperture base and limits are hardcoded to some values that are useful for SE mode. In reality, these are chosen by the driver so we need to honor whatever values the driver passes so that when addresses are calculated they fall into the correct aperture to route flat instructions to those apertures. This overwrites the default hardcoded values for LDS and scratch base and limit using the values providing by the driver in a MAP_PROCESS packet. Change-Id: I0e194a26631f697819d8aaecf1bf346a7b7c7026 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/61656 Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com>	2022-07-28 14:10:33 +00:00
Matthew Poremba	f65f5a8981	gpu-compute,arch-vega: Overhaul HWRegs, setreg, getreg These instructions are supposed to be read/writing special shader hardware registers. Currently they are getting/setting to an SGPR. This results in getting incorrect registers at best and clobbering an SGPR being used by an application at worst. Furthermore, some registers need to be set in the shader and the application will never (can never) set them. This patch overhauls the getreg/setreg instructions to use different storage in the shader. The values will be updated either via setreg from an application (e.g., mode register) or set by a PM4 MAP_PROCESS. Change-Id: Ie5e5d552bd04dc47f5b35b5ee40a569ae345abac Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/61655 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com>	2022-07-28 14:10:33 +00:00
Matthew Poremba	f2949f3d03	dev-amdgpu: Set PASID in interrupt cookie The driver uses the pasid to look up events that need to be set in kfd_signal_event_interrupt (amdkfd/kfd_events.c). Currently this is uninitialized which causes the function in the driver to return without doing anything useful. This changeset initializes the cookie PASID to 0x8000. 0x8000 is always the first PASID assigned by the driver. This works since gem5 only supports one GPU process in FS mode. This would have to be changed for multi-process support, so a comment is added as a reminder. Change-Id: I7074b581f2f2f346bd910eef15d5f9253ce17e2c Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/61653 Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-07-28 14:10:33 +00:00
Matthew Poremba	23fadb7260	dev-hsa: Don't set _aqlComplete in setRdIdx method This code is unnecessary as the read index is already correct. Furthermore, it can cause hangs in some situations where the packet SHOULD be marked as not complete. This causes a bug where the read index is incremented by 1 multiple times, causing the packet processor to read an invalid packet, followed by a hang after it does nothing. Change-Id: Iceda3c9606e018f60f8902770a2d9762c1c14304 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/61650 Maintainer: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com>	2022-07-28 14:10:33 +00:00
Joël Porquet-Lupine	1c6f57cd6d	dev: update LupIO-IPI device to latest specs The specs for the LupIO-IPI device were recently updated. Instead of providing a single IPI value for each processor, the device now provides 32 individual IPI bits that can be masked and set. Update device accordingly in gem5. Change-Id: Ia47cd1c70e073686bc2009d546c80edb0ad58711 Signed-off-by: Joël Porquet-Lupine <joel@porquet.org> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/61530 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-07-21 23:48:04 +00:00
Joël Porquet-Lupine	0800c060d8	dev: Fix cpu/reg decoding logic in multi-instance LupIO devices The current decoding logic is flawed and complicated to understand. Using simple division and modulo instead; the compiler is smart enough to generate efficient code since the divisor is a power of 2. Change-Id: I95cbb4969e37132343f557e772984a48749731f0 Signed-off-by: Joël Porquet-Lupine <joel@porquet.org> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/61529 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com>	2022-07-21 23:47:30 +00:00
Alexandru Dutu	1115f81233	gpu-compute: Fix for HSA queue remapping When a queue is being remapped the write and dispatch pointers are set to the read pointer. This assumes that all packets up to the read pointer have been dispatched and completed. Change-Id: I4ed0c6c68f16f57c3fb5c3ecba182a43e74078e2 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/61429 Reviewed-by: Matt Sinclair <mattdsinclair.wisc@gmail.com> Reviewed-by: Matthew Poremba <matthew.poremba@amd.com> Maintainer: Matthew Poremba <matthew.poremba@amd.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-07-20 16:26:14 +00:00
Earl Ou	c0ca47b6ed	dev: avoid intpin to reset value at binding stage By design SimObject should initialize its state at init() stage. However, the original intpin design will try to reset the sink side when binding. This could cause unexpected issue as the other side does not init() yet. To align with the design, the call to upper()/lower() should be left to the initiator in the init() function instead of constructor. Change-Id: Iec8b228715d093381a33e747849119562bd634e1 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/60751 Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Yu-hsin Wang <yuhsingw@google.com> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com>	2022-06-29 12:42:36 +00:00
Bobby R. Bruce	7b9364b5c0	misc: Merge branch 'release-staging-v22-0' into stable	2022-06-17 20:21:37 -04:00
ntampouratzis	f3e9484969	arch-riscv,dev: Add PCI Host to RISCV Board Add GenericRiscvPciHost to RISCV Board. In addition, we connect the IGbE_e1000 ethernet card to PCI in order to verify the correct functionality. To be noticed that we build a new Linux kernel v5.10 (with Bootloader) according to these steps ( https://github.com/gem5/gem5-resources/tree/stable/src/riscv-fs) adding the the PCI and e1000 drivers: CONFIG_PCI_SYSCALL=y CONFIG_PCI_STUB=y CONFIG_PCI_HOST_GENERIC=y CONFIG_NET_VENDOR_INTEL=y CONFIG_E1000=y CONFIG_E1000E=y CONFIG_IGB=y CONFIG_NET_VENDOR_I825XX=y Here you can find the kernel.config and our prebuild kernel to verify the correct behaviour: https://www.dropbox.com/scl/fo/sz9s37vybpfecbfilxqzz/h?dl=0&rlkey=klkxh33anjqnzwj3sopucqqzx You can verify it with the following command: build/RISCV/gem5.fast configs/example/gem5_library/riscv-fs.py Dear Jason Lowe-Power, Thank you for your comments! We have addressed all of them. Best regards, Nikolaos Tampouratzis Dear Jason, I think that it is ok now! :) Thanks! Best regards, Nikolaos Tampouratzis Change-Id: Id27d84a5588648b82cbfd5c88471927157ae6759 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/59969 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-06-06 18:42:12 +00:00
Matthew Poremba	1e6ff02c25	dev-amdgpu: Allow for atomics in SystemHub It seems that applications can be compiled which issue atomics to host memory, such as heterosync. Remove the arbitrary assert to disallow them and issue atomics as a DMA write by default. Change-Id: I7812a421a9312406b3faccdc05d6c7e9fc837da0 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/59669 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-05-16 15:08:09 +00:00
Giacomo Travaglini	5d45c50b48	misc: Add VExpress_GEM5_Foundation bootloader The VExpress_GEM5_Foundation platform cannot use the VExpress_GEM5_V2 bootloader as the GIC has a different memory map A new tarball has been uploaded to dist.gem5.org with the new bootloader Change-Id: Ie0c16e623c3323b7be2a333cd6b0ffcf891b7b9b Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/59392 Tested-by: kokoro <noreply+kokoro@google.com>	2022-05-07 22:40:47 +00:00
Giacomo Travaglini	776321d2c2	dev-arm: GICD_PIDR2.ArchRev value depends on GIC version The GIC architecture specification states the GICD_PIDR2.ArchRev field is set to 3 for GICv3 and to 4 for GICv4. We bind this value to the gicv4 parameter Change-Id: I3ba34bc0b4538b4d5170915a4ee042e534f2590f Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/59391 Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Andreas Sandberg <andreas.sandberg@arm.com> Maintainer: Andreas Sandberg <andreas.sandberg@arm.com>	2022-05-07 22:40:25 +00:00
Matthew Poremba	54d2438066	dev-amdgpu: Removed hardcoded AQL queue size The AQL queue size is currently hardcoded to 64kB. For longer running applications this causes the circular queue to wrap before reaching the real end of the queue. Add the computation for queue size instead. Previously longer applications (e.g., bc in pannotia) were hanging around 4k kernels. With change the application launches 10k+ kernels. Change-Id: I6c31677c1799a3c9ce28cf4e7e79efcb987e3b7f Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/59449 Maintainer: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com>	2022-05-07 03:47:06 +00:00
Giacomo Travaglini	7a9e99400f	dev-arm: Gicv3.gicv4 parameter set to False by default GICv4 features are not currently implemented so it is more natural to set it to false by default VExpress_GEM5_V2 platform assumes a GICv4 memory map therefore sets it to True Change-Id: Ib4bd17acd56cd029aacf5578ab0259a6ea1bb30c Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/59390 Tested-by: kokoro <noreply+kokoro@google.com> Maintainer: Andreas Sandberg <andreas.sandberg@arm.com> Reviewed-by: Andreas Sandberg <andreas.sandberg@arm.com>	2022-05-06 16:29:22 +00:00
Matthew Poremba	c170994676	dev-amdgpu: Fix size issue in interrupt handler The data allocated for the DMA request used to send an interrupt cookie was too large. This was causing the memcpy to occasionally seg fault due to reading past the bounds of the source parameter (the interrupt cookie struct). Correct the size and add a compile time check to ensure it is the correct number of bytes expected by the driver. Change-Id: Ie9757cb52ce8f72354582c36cfd3a7e8a1525484 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/58969 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Alexandru Duțu <alexandru.dutu@amd.com> Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com>	2022-05-05 16:09:49 +00:00
Yu-hsin Wang	1f713320fe	dev: Expose ResetRequestPort constructor Port::Port is in protected scope. ResetRequestPort should expose the constructor by itself. Change-Id: I72ce701fca89379f90e212d7411f481ae1e1977a Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/59209 Tested-by: kokoro <noreply+kokoro@google.com> Maintainer: Gabe Black <gabe.black@gmail.com> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Gabe Black <gabe.black@gmail.com>	2022-04-29 02:42:54 +00:00
Yu-hsin Wang	8df2ebf43e	dev: Add a special reset interface to consolidate reset logic How to reset a model correctly is very different between models. Take cpu models for instance, they have different reset pins for different parts(typically one for each core, one for shared component, one for debug interface). To make users more easily to reset the model, here we want to introduce a special reset port. By implementing the port, users can simply request a whole reset to the model. If users want to do partial resets, users still can access the raw pins to achieve what they want. Change-Id: I746121d16441e021dc3392aeae1a6d9fa33d637a Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/58810 Reviewed-by: Gabe Black <gabe.black@gmail.com> Maintainer: Gabe Black <gabe.black@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-04-26 06:31:51 +00:00
Matthew Poremba	8a53add7f8	dev-amdgpu: Fix frame writes for <32-bit writes In theory a packet between one and eight bytes can be written to frame buffer memory from the driver. In gem5 pkt->getLE<utin32_t>() will assert if the packet size is <32-bits. Change to pkt->getUintX(...) to fix this issue. Change-Id: If8554013e4ea7bac90985487991d0bf8bdc765ea Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/58852 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-04-21 21:32:53 +00:00
Matthew Poremba	1562251243	dev-amdgpu: Update comments pointing to ROCK repo It seems the tag name was changed which broke a few links in some comments pointing to where definitions and struct come from. Update the URLs and also use consistent version. Change-Id: I7d6393f1f08d592989999a8a6f9c5bbdf1a9c992 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/58471 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-04-08 17:12:32 +00:00
Matthew Poremba	e3f65393fd	dev-amdgpu,arch-vega: Implement TLB invalidation logic Add logic to collect pointers to all GPU TLBs in full system. Implement the invalid TLBs PM4 packet. The invalidate is done functionally since there is really no benefit to simulate it with timing and there is no support in the TLB to do so. This allow application with much larger data sets which may reuse device memory pages to work in gem5 without possibly crashing due to a stale translation being leftover in the TLB. Change-Id: Ia30cce02154d482d8f75b2280409abb8f8375c24 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/58470 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-04-08 17:12:32 +00:00
Matthew Poremba	2af227c32a	dev-hsa: Update QCntxt readIndex in HW scheduler write The QCntxt is reused when a queue is unmapped and mapped again. This is fairly common in GPU full system. If this is not done the readIndex on the queue context is reset to 1, causing getCommandsFromHost to read from the wrong slot which is typically an old dispatch packet or an invalid packet. This causes simulation to stall as the incorrect completion signal is eventually written. Change-Id: I65541e559fe04f5eb44b936ca37e3f802262fe6a Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/57670 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-03-28 23:24:53 +00:00
Matthew Poremba	6883f12f09	dev-hsa: Properly mask HSA packet header bits The HSA packet macros were not actually masking the header bits properly. Add a mask call around the width (number of bits) of the field being masked. Change-Id: Ia5e5fb0451296e99a85fb12a5f73b27aea72fc2e Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/57669 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-03-28 23:24:53 +00:00
Gabe Black	e6c0ba97db	scons: Put all config variables in an env['CONF'] sub-dict. This makes what are configuration and what are internal SCons variables explicit and separate, and makes it unnecessary to call out what variables to export to C++. These variables will also be plumbed into and out of kconfiglib in later changes. Change-Id: Iaf5e098d7404af06285c421dbdf8ef4171b3f001 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/56892 Reviewed-by: Andreas Sandberg <andreas.sandberg@arm.com> Maintainer: Gabe Black <gabe.black@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-03-28 20:31:21 +00:00
Matthew Poremba	0255d5ea51	dev-amdgpu: Handle framebuffer reads from device cache Reads to the frame buffer are currently handled by either the MMIO trace or from the GART table if the address is in the GART aperture. In some cases the MMIO trace will not contain the address or the data may have been written previously and be different from the MMIO trace. To handle this, return the data that was written previously by the driver. The priority order from lowest to highest is: MMIO trace, device cache, special framebuffer registers. Change-Id: Ia45ae19555508fcd780926fedbd7a65c3d294727 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/57589 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-03-25 19:51:29 +00:00
Matthew Poremba	7937fe357d	dev-amdgpu: Add device memory This adds the actual backing store for the GPU framebuffer. Change-Id: I22c6dd9bd25b216c4ec99ee472c83d4cb2648efb Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/57533 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-03-25 19:51:29 +00:00
Matthew Poremba	c8518e486d	dev-amdgpu: Always mark interrupts enabled The driver will check this bit is set after initializing IPs. Currently the MMIO trace will cause this bit to be set at the correct time, however this is not portable access different ROCm versions. Therefore we modify the value to always set the bit indicating interrupts are enabled. Change-Id: Iae0baf1936720fbe9835ae4acadbf1b3bdc52896 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/57530 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-03-25 19:51:29 +00:00
Matthew Poremba	581e451723	gpu-compute,dev-hsa: Update CP and HSAPP for full-system Make the necessary changes to connect Vega pagetable walkers for full-system mode. Previously the CP and HSA packet processor could only read AQL packets from system/host memory using proxy port. This allows for AQL to be read from device memory which is used for non-blit kernels. Change-Id: If28eb8be68173da03e15084765e77e92eda178e9 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/53077 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-03-25 19:51:29 +00:00
Matthew Poremba	9b87844658	dev-amdgpu: Setup VRAM memories in device Change-Id: Ic519429f13c4ad1d42997f361cbfe0c6e9aba29a Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/53074 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-03-25 14:12:51 +00:00
Bobby R. Bruce	ea9b7ef6a2	dev-amdgpu: Add braces to stop clang compilation braces error Additional braces are needed due to a clang compilation bug that falsely throws a "suggest braces around initialization of subject" error. More info on this bug is available here: https://stackoverflow.com/questions/31555584 Change-Id: Ide5cdd260716ba06f6da4663732e39d18e00af97 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/58150 Reviewed-by: Matthew Poremba <matthew.poremba@amd.com> Maintainer: Matthew Poremba <matthew.poremba@amd.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-03-25 13:40:04 +00:00
Bobby R. Bruce	d63c640775	dev-amdgpu: Remove unused variables in src/dev/amdgpu These were causing errors to be thrown when compiling in clang-12. Change-Id: I8bd2d7e7e1d4423a54766ed906c864bb91e884f0 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/58149 Reviewed-by: Matthew Poremba <matthew.poremba@amd.com> Maintainer: Matthew Poremba <matthew.poremba@amd.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-03-25 13:40:04 +00:00
Giacomo Travaglini	4bbcd98598	dev-arm: Remove unused ELIsInHost redirection for CNTKCTL_EL1 The redirection to CNTHCTL_EL2 is already handled in ISA::redirectRegVHE Change-Id: Ia3290c5bdb75c6e45f08a47c1b75881bc52add5f Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/58115 Reviewed-by: Andreas Sandberg <andreas.sandberg@arm.com> Maintainer: Andreas Sandberg <andreas.sandberg@arm.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-03-25 09:51:56 +00:00
Giacomo Travaglini	9e65dcaeec	arch-arm, dev-arm: Implement EL2 Secure Virtual Timer Change-Id: Ie4d4ff27b6375593ca4a6f6ae2a5e428ada943be Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/58112 Reviewed-by: Andreas Sandberg <andreas.sandberg@arm.com> Maintainer: Andreas Sandberg <andreas.sandberg@arm.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-03-25 09:51:37 +00:00
Giacomo Travaglini	e6797303c4	arch-arm, dev-arm: Implement EL2 Secure Physical Timer Change-Id: I052f72695e670fad492079ab912268d05c797100 Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/58111 Reviewed-by: Andreas Sandberg <andreas.sandberg@arm.com> Maintainer: Andreas Sandberg <andreas.sandberg@arm.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-03-25 09:50:15 +00:00
Giacomo Travaglini	f1dce36f97	arch-arm, dev-arm: Implement EL2 Non-secure Virtual Timer Change-Id: I0cc499e1309c35d946c5b9231846263f97bfa2b0 Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/58110 Reviewed-by: Andreas Sandberg <andreas.sandberg@arm.com> Maintainer: Andreas Sandberg <andreas.sandberg@arm.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-03-24 16:04:58 +00:00
Giacomo Travaglini	cfc570dd1c	dev-arm: Rename GenericTimer interrupts The Arm Architecture Reference Manual has moved from "Armv7-oriented" names for generic timer interrupts to names more consistent with Armv8 (Exception Levels based). We are therefore renaming those interrupts as follows: int_phys_s -> int_el3_phys int_phys_ns -> int_el1_phys int_virt -> int_el1_virt int_hyp -> int_el2_ns_phys Change-Id: Id6e34a0e4311953938b25bca168a34357e3c8643 Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/58109 Reviewed-by: Andreas Sandberg <andreas.sandberg@arm.com> Maintainer: Andreas Sandberg <andreas.sandberg@arm.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-03-24 16:04:58 +00:00
Matthew Poremba	7511ff3126	dev-amdgpu: Add checkpoint support to AMDGPUDevice These will be needed for the second checkpoint. Change-Id: I85ee2cbc0df130868d19376c4d98dbe4d424698e Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/53069 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-03-24 14:59:57 +00:00
Matthew Poremba	1be246bbe3	dev-amdgpu: Add PM4PP, VMID, Linux definitions The PM4 packet processor is handling all non-HSA GPU packets such as packets for (un)mapping HSA queues. This commit pulls many Linux structs and defines out into their own files for clarity. Finally, it implements the VMID related functions in AMDGPU device. Change-Id: I5f0057209305404df58aff2c4cd07762d1a31690 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/53068 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-03-24 14:59:57 +00:00
Alexandru Dutu	e67e02d657	configs: Connect SDMA, IH, and memory manager in GPUFS Add the devices that have been added in previous changesets to the config file. Forward MMIO writes to the appropriate device based on the MMIO address. Connect doorbells and forward rings to the appropriate device based on queue type. Change-Id: I44110c9a24559936102a246c9658abb84a8ce07e Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/53065 Maintainer: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-03-24 14:59:57 +00:00
Alexandru Dutu	f1772d3505	dev-amdgpu: Add SDMAEngine and GPU device methods SDMAEngine handles copies to device memory. This commit updates sdma_packets.hh style as well. Added several methods needed by SDMAEngine to GPU device including GART table, various getters, and aperture range checkers. Move the MMIO interface from GPUController to SDMAEngine. Create an SDMA MMIO and commands header with only the macros we use so that we don't need to check in multi-thousand line header files from the linux kernel. Keep SOC15 IH client ID macros as that file is small. Change-Id: I986fede90cc1bc16ee56d4e8598cf9283bde034e Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/53064 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-03-24 14:59:57 +00:00
Matthew Poremba	9cbdf75295	dev-amdgpu: Add VM class for apertures, TranslationGens Create a VM class to reduce clutter in the amdgpu_device.* files. This new file is in charge of reading/writting MMIOs related to VM contexts and apertures. It also provides ranges checks for various apertures and breaks out the MMIO interface so that there are not overloaded macro definitions in the device MMIO methods. The new translation generator classes for the various apertures are also added to this class. Change-Id: Ic224c1aa485685685b1136a46eed50bcf99d2350 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/53066 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-03-24 14:59:57 +00:00
Matthew Poremba	2390cd1143	dev-amdgpu: Add SystemHub for GPU load/store to host In a dGPU configuration, vector and scalar loads/stores can either be requests to device memory or host memory depending on if the system bit is set in the PTE when the request's virtual address is translated. This object is used to send/receive those requests to the host via DMA. This object will be used in a later changeset by the compute unit and fetch units to issue data and instruction loads from the GPU which translate to physical addresses on the host/cpu memory. Change-Id: I4537059f90ebc03f3b2e6b8b631b4c452841f83f Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/51851 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-03-22 17:35:59 +00:00
Matthew Poremba	42b56ceb7b	dev-amdgpu: Add memory manager for GPU VRAM The memory manager is responsible for reading and writes to VRAM memory for direct requests that bypass GPU caches. Change-Id: I4aa1e77737ce52f2f2c01929b58984126bdcb925 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/51850 Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-03-22 17:35:59 +00:00

1 2 3 4 5 ...

1338 Commits