derek/gem5 - gem5 - Gitea: Git with a cup of tea

derek/gem5

Author	SHA1	Message	Date
Giacomo Travaglini	05d733d0cd	arch-arm: Generalize KVM Gic state copying logic By moving the Gic state copying logic from the MuxingKvmGic to the BaseGic we allow different Gic releases (e.g Gicv2, Gicv3) to override the implementation accoding to their personal architectural state It is also possible to use the same logic outside of the KVM context Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Change-Id: I88d6fca69a9b61a889c5ec53221404b8396cc12d Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/55607 Reviewed-by: Andreas Sandberg <andreas.sandberg@arm.com> Maintainer: Andreas Sandberg <andreas.sandberg@arm.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-02-01 10:42:57 +00:00
Alex Richardson	d5e734c540	arch-riscv: Fix (c.)addiw sign-extension behaviour Previously calling a function with an INT_MAX argument would result in the following (incorrectly extended) trace: ``` lui a1, 524288 : IntAlu : D=0xffffffff80000000 c_addiw a1, -1 : IntAlu : D=0xffffffff7fffffff ``` I noticed this due to a kernel assertion that checked the second argument was bigger than the first. Since INT_MAX was incorrectly being extended to 0xffffffff7fffffff, the generated slt comparison instruction was returning 1 instead of the expected zero (which would have happened with 0x7fffffff). The problem in the current addiw logic is that the immediate value is an int64_t, so the 32-bit Rs1/Rc1 values are promoted to 64-bit for the aritmetic operation, thereby making the current cast redundant. Fix this by placing parens around the whole expression and truncating that to 32 bits. Change-Id: I7b18a8101b1c2614b9f056004e6a7f87b66b64c9 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/56103 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-01-28 10:39:09 +00:00
Giacomo Travaglini	d657c28279	arch-arm: Add a reverse map MiscRegIndex -> MiscRegNum64 Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Change-Id: I63cdcdfca610cfd37a03769e077388a193510bc7 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/55606 Reviewed-by: Richard Cooper <richard.cooper@arm.com> Reviewed-by: Andreas Sandberg <andreas.sandberg@arm.com> Maintainer: Andreas Sandberg <andreas.sandberg@arm.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-01-25 16:28:55 +00:00
Giacomo Travaglini	8f199c9b7c	arch-arm: Reimplement decodeAArch64SysReg using new decode map Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Change-Id: Ief6c9d666b01248ea4e01414f575a5c5758618ba Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/55605 Reviewed-by: Richard Cooper <richard.cooper@arm.com> Reviewed-by: Andreas Sandberg <andreas.sandberg@arm.com> Maintainer: Andreas Sandberg <andreas.sandberg@arm.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-01-25 16:28:55 +00:00
Giacomo Travaglini	167fb09aaf	arch-arm: Generate a decode map for AArch64 MiscRegs The map is translating AArch64 system register numbers (op0, op1, crn, crm, op2) into a MiscRegIndex Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Change-Id: I359f5d97b248ffafa9cf461d98339175fdf9688f Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/55604 Reviewed-by: Richard Cooper <richard.cooper@arm.com> Reviewed-by: Andreas Sandberg <andreas.sandberg@arm.com> Maintainer: Andreas Sandberg <andreas.sandberg@arm.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-01-25 16:28:55 +00:00
Giacomo Travaglini	b982437b6e	arch-arm: Define MiscRegNum64 data structure Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Change-Id: Ia635bc068751edd9305a6e493e38e1a49aa64c4d Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/55603 Reviewed-by: Andreas Sandberg <andreas.sandberg@arm.com> Maintainer: Andreas Sandberg <andreas.sandberg@arm.com> Reviewed-by: Richard Cooper <richard.cooper@arm.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-01-25 16:28:55 +00:00
Gabe Black	c537d9ad10	arch-arm,cpu: Add a class for ops for vec reg elements. This lets a caller print the name of a register in a friendly way without having to know how many elements go with each vector register. Change-Id: I85598c078c604f1bebdba797308102482639c209 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/49163 Reviewed-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Maintainer: Giacomo Travaglini <giacomo.travaglini@arm.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-01-24 22:18:17 +00:00
Gabe Black	528d184ac7	misc: Linearlize VecElem indexing. These registers used to be accessed with a two dimensional index, with one dimension specifying the register, and the second index specifying the element within that register. This change linearizes that index down to one dimension, where the elements of each register are laid out one after the other in sequence. Change-Id: I41110f57b505679a327108369db61c826d24922e Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/49148 Reviewed-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Maintainer: Giacomo Travaglini <giacomo.travaglini@arm.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-01-21 23:05:47 +00:00
Luming Wang	73267e67c4	arch-riscv: reduced lr/sc implementation In gem5::RiscvISA::ISA, handleLocked* functions maintain an address stack(i.e. locked_addrs) to check whether each SC matches the most recent LR. However, there are some problems with this implementation. First, the elements in the stack may only be popped when the handleLockedSnoop function is invoked. In other cases, the elements in the stack will not be popped even if the SC and LR match. This makes the `locked_addrs` get bigger and bigger as gem5 runs. Second, LR/SC does not always match. For example, in Linux's __cmpxchg[1], after executing LR, if the value read is not equal to the old value, the subsequent SC is skipped. For gem5's current implementation, this would cause the address to be pushed into `locked_addrs` every time __cmpxchg is failed. But these addresses are never popped. This also makes the `locked_addrs` get bigger and bigger. Third, existing emulator implementations (spike, qemu) do not use the stack, but only record the last address accessed by LR. Afterward, when executing SC, these implementations determine whether the address accessed by SC is the same as the one recorded. This patch modifies gem5's handleLocked* function by referring to other existing RISC-V implementations. It eliminates `locked_addrs` and simplifies the related code. Thus, it fixes the "memory leak"-like error that can occur on `locked_addrs` when executing LR/SC. Related links: [1] Linux's cmpxchg implementation for RISC-V: + https://github.com/torvalds/linux/blob/master/arch/riscv/include/asm/cmpxchg.h [2] spike lr/sc implementation: + https://github.com/riscv-software-src/riscv-isa-sim/blob/master/riscv/insns/sc_d.h + https://github.com/riscv-software-src/riscv-isa-sim/blob/master/riscv/insns/lr_d.h + https://github.com/riscv-software-src/riscv-isa-sim/blob/master/riscv/mmu.h [3] rocket lr/sc implementation: + https://github.com/chipsalliance/rocket-chip/blob/master/src/main/scala/rocket/NBDcache.scala [4] QEMU lr/sc implementation: + https://gitlab.com/qemu-project/qemu/-/blob/master/target/riscv/insn_trans/trans_rva.c.inc Change-Id: Ic79444cace62e39b7fe9e01f665cb13e4d990d0a Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/55663 Reviewed-by: Bobby Bruce <bbruce@ucdavis.edu> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-01-21 13:41:23 +00:00
Matthew Poremba	faf3730559	arch-vega: Fix global 64-bit calcAddr with SGPR base Global instruction address calculation when using an SGPR or SGPR pair as a base address was being calculated incorrectly when 64-bit addresses were to be generated. From the ISA documentation, the SGPR should be read as 32-bit or 64-bit depending on "ADDRESS_MODE." The VGPR-offset (computed from the lower 32-bits of vaddr) should always be 32-bits and the offset is 12 bits from the instruction. This means the 32-bit mask should only be applied to vaddr to get the VGPU-offset rather than the final sum. The SGPR base format is being seen in more recent clang/ROCm versions to avoid unnecessary copies of SGPRs into VGPRs to use VGPRs as the base address. Change-Id: I48910611fcfac5b62bc63496bbaabd6f6e53fe0d Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/55643 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-01-20 16:03:23 +00:00
Yu-hsin Wang	2fed34d099	fastmodel: Set simulation pause when breakpoint hit The 7th parameter of breakpoint_set_code is dontStop. It seems the fastmodel would prefetch something or do some evaluation ahead with the flag set. This behavior prevents the instruction stepping feature of gdb. The implementation of the feature is creating a breakpoint on the next instruction and contining the simulation. Without stopping on the breakpoint, it wouldn't invoke the breakpoint callback, since it may evaulate the code we just want it to stop already. We should set the dontStop to false to fix this issue. Change-Id: Iaf8acd3235fa9625c1423ef34606e1fa5d0c531a Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/55484 Reviewed-by: Earl Ou <shunhsingou@google.com> Reviewed-by: Gabe Black <gabe.black@gmail.com> Maintainer: Gabe Black <gabe.black@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-01-20 01:16:20 +00:00
Matthew Poremba	3ecd28a222	arch-vega: Update FLAT memory access helpers to support LDS This patch ports the changes from a similar patch for arch-gcn3: https://gem5-review.googlesource.com/c/public/gem5/+/48343. Vega already has an helper function to send to the correct pipe depending on the scope, however the initMem helpers currently always assume global scope. In addition the MUBUF WBINVL1 instructions are updated similarly to the GCN3 patch. Change-Id: I612b9198cb56e226721a90e72bba64395c84ebcd Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/55465 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-01-18 15:20:10 +00:00
Matthew Poremba	ff17ecc177	arch-vega: Fix MUBUF out-of-bounds case 1 Ported from https://gem5-review.googlesource.com/c/public/gem5/+/51127: This patch updates the out-of-bounds check to properly check against the correct buffer_offset, which is different depending on if the const_swizzle_enable is true or false. Change-Id: I9757226e62c587b679cab2a42f3616a5dca97e60 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/55464 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-01-18 15:20:10 +00:00
Matthew Poremba	0cb64ce9f0	arch-vega: Free dest registers in non-memory Load DS insts Ported from https://gem5-review.googlesource.com/c/public/gem5/+/48019: Certain DS insts are classfied as Loads, but don't actually go through the memory pipeline. However, any instruction classified as a load marks its destination registers as free in the memory pipeline. Because these instructions didn't use the memory pipeline, they never freed their destination registers, which led to a deadlock. This patch explicitly calls the function used to free the destination registers in the execute() method of those Load instructions that don't use the memory pipeline. Change-Id: I8231217a79661ca6acc837b2ab4931b946049a1a Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/55463 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-01-17 23:55:51 +00:00
Gabe Black	d3a323a72c	arch-x86: Make x86 respect m5op_base in SE mode. In SE mode, we can reasonably hard code what virtual address the m5ops show up at since that's private to the process, but we should respect the external setting of what physical address to use. Change-Id: I2ed9e5ba8c411e22e1d5163cf2ab875f9e2fe387 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/52496 Reviewed-by: Andreas Sandberg <andreas.sandberg@arm.com> Maintainer: Gabe Black <gabe.black@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-01-15 12:27:19 +00:00
Gabe Black	1b0852ed30	arch-x86: Bare metal workload. Change-Id: I9ff6f5a9970cc7af2ba639be18f1881748074777 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/45045 Reviewed-by: Gabe Black <gabe.black@gmail.com> Maintainer: Gabe Black <gabe.black@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-01-15 08:40:06 +00:00
Gabe Black	c2c4303a07	arch-x86: Use 16 bit modRM encoding if address size is 16 bit. The modRM byte should be interpreted with 16 bit rules if the address size is 16 bits, whether that's because the address size is that by default, or because it was overridden. It should not be based on the operand size in any case. Change-Id: I8827abe1eea8905b0404f7402fb9531804d63fae Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/55503 Maintainer: Gabe Black <gabe.black@gmail.com> Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-01-15 07:59:41 +00:00
Gabe Black	c17d68f739	arch-x86: In the LVT in the local APIC, start with all entries masked. This is what the APIC is supposed to look like when coming out of reset. Change-Id: Ia9b6e13533692109849e729d9ad3b358f36e2e47 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/55451 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Reviewed-by: Gabe Black <gabe.black@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-01-15 07:59:10 +00:00
Gabe Black	7b01dbd926	arch-x86: Implement real mode far ret. Change-Id: I4fd3210f30246f19ca03906465f160bcbfbfbccc Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/55450 Reviewed-by: Gabe Black <gabe.black@gmail.com> Maintainer: Gabe Black <gabe.black@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-01-15 07:58:52 +00:00
Gabe Black	c22ec209d8	arch-x86: Split out and implement INT for real mode. The INT instruction is much simpler in real mode than it is in legacy protected mode. Change-Id: I79f5bc7ebe36726537cd61657f301905085c1199 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/55449 Reviewed-by: Gabe Black <gabe.black@gmail.com> Maintainer: Gabe Black <gabe.black@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-01-15 07:58:37 +00:00
Gabe Black	cfce0ad874	arch-x86: Implement IRET for real mode. The IRET instruction is comparitively very simple in real mode. It just pops a few values off the stack into CS, RIP, and RFLAGS, and sets the CS base. Change-Id: I2bc6015209f1beca31253e288dad7c8de5cd22fc Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/55448 Reviewed-by: Gabe Black <gabe.black@gmail.com> Maintainer: Gabe Black <gabe.black@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-01-15 07:58:25 +00:00
Gabe Black	2572b85f54	arch-x86: Hook up the PUSH segment selector insts in the decoder. Change-Id: Id4d59ced3f74a593bb6b0774b843f5dc155c49c5 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/55447 Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Reviewed-by: Gabe Black <gabe.black@gmail.com>	2022-01-15 07:58:09 +00:00
Gabe Black	75f77d8fd3	arch-x86: Implement the PUSH instruction for segment selectors. The implementation for PUSH is very simple and can be implemented trivially like the other PUSH versions. POP is more complicated since it needs to actually set up the segment being popped into. Change-Id: I4a5a4bcace15aef02186f893ccdd052083e5cb5d Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/55446 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Gabe Black <gabe.black@gmail.com>	2022-01-15 07:57:52 +00:00
Gabe Black	10118f7518	arch-x86: Add decoder syntax for fixed segment registers. There is syntax for this already for fixed integer registers, which this is patterned after. Rather than prefixing the operand descriptor with a lower case "r", fixed segment registers are prefixed with a lower case "s". Change-Id: Ic08d323bef732a62de23f77ec805c8b7cd5e2303 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/55445 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-01-15 07:57:30 +00:00
Gabe Black	25b046f4d4	arch-x86: Fix disassembly of fixed register macroops. These are mapped to instruction definitions like MOV_R_R, even though one or more of the Rs might have come from a fixed value. Because MOV_R_R (for instance) is only defined once, using a fixed text constant there won't work because that can only have one value. Instead, use a variable which will have the value of that constant so that the same disassembly code will work no matter what fixed value was used. Change-Id: Ie45181c6becce80ad44fa30fc3323757ef713d7c Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/55444 Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Reviewed-by: Gabe Black <gabe.black@gmail.com>	2022-01-15 07:57:17 +00:00
Gabe Black	864650101b	arch-x86: Handle a special case for MODRM in 16 bit mode. When the address size is 16 bit, the mod field is 0, and the rm is 6, there is no base register, only a displacement. Change-Id: Ib67a6e5ce617d08913b9ca6dee66877f0154ffe1 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/55285 Reviewed-by: Gabe Black <gabe.black@gmail.com> Maintainer: Gabe Black <gabe.black@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-01-15 01:54:09 +00:00
Gabe Black	01333c73de	arch-x86: Fix real mode far jumps with set MSB in the offset. When performing a real mode far jump, we were computing the offset into the segment more or less correctly, but then when we tried to actually set the PC using it, we used the second of the two wrip microop arguments. The first argument is an unsigned value and is intended to be a base to work from when figuring out the new IP, and the second argument is a signed offset which can be used to implement relative jumps/branches. When we used the second operand for our new value, setting the first operand to t0 (the zero register on x86), we would inadvertantly sign extend it since the wrip instruction would treat it as a signed value. Instead, we can just switch the two operands so that the wrip microop treats the desired value as the unsigned base, and then adds a signed t0 to it, which will still be 0 one way or the other. Also, while researching this bug, I found that the size used for computing the new IP is always the operand size, and never the address size. This CL fixes that problem as well by removing the faulty override. Change-Id: I96ac9effd37b40161dd8d0b634c5869e767a8873 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/55243 Reviewed-by: Gabe Black <gabe.black@gmail.com> Maintainer: Gabe Black <gabe.black@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-01-15 01:44:38 +00:00
Earl Ou	186ba92504	fastmodel: make gem5 fastmodel build hermetic This CL makes fastmodel RPATH relative to $ORIGIN instead of absolute path. In this way we can move build folder (installing), without breaking gem5 run. Change-Id: I8b16d749252b982e45dfe779a5df931015a0e07d Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/55085 Reviewed-by: Gabe Black <gabe.black@gmail.com> Maintainer: Gabe Black <gabe.black@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-01-14 14:40:14 +00:00
Alex Richardson	f9f86cc366	arch-riscv: Consistently check privilege mode for CSR accesses According to the RISC-V privileged spec (section 2.1), bits 8 and 9 of the CSR number encode the lowest privilege mode that is permitted to access the CSR. Commit `55e7d3e5b6` added this check for for CSR_MSTATUS but none of the other CSRs. Change-Id: Iecf2e387fa9ee810e8b8471341bfa371693b97c5 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/55404 Reviewed-by: Nils Asmussen <nils.asmussen@barkhauseninstitut.org> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-01-13 10:44:48 +00:00
Alex Richardson	bd687d48eb	arch-riscv: Add an ostream operator for PrivilegeMode This makes it easier to use the current privilege mode in error messages. Change-Id: I425d45d3957a70d8afb6cbde18955fae1461c960 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/55403 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-01-13 10:44:48 +00:00
Gabe Black	c498d8bced	cpu: Specialize CPUs for an ISA at the leaves, not BaseCPU. The BaseCPU type had been specializing itself based on the value of TARGET_ISA, which is not compatible with building more than one ISA at a time. This change refactors the CPU models so that the BaseCPU is more general, and the ISA specific components are added to the CPU when the CPU types are fully specialized. For instance, The AtomicSimpleCPU has a version called X86AtomicSimpleCPU which installs the X86 specific aspects of the CPU. This specialization is done in three ways. 1. The mmu parameter is assigned an instance of the architecture specific MMU type. This provides a reasonable default, but also avoids having having to use the ISA specific type when the parameter is created. 2. The ISA specific types are made available as class attributes, and the utility functions (including __init__!) in the BaseCPU class can refer to them to get the types they need to set up the CPU at run time. Because SimObjects have strange, unhelpful semantics as far as assigning to their attributes, these types need to be set up in a non-SimObject class, which is then brought in as a base of the actual SimObject type. Because the metaclass of this other type is just "type", things work like you would expect. The SimObject doesn't do any special processing of base classes if they aren't also SimObjects, so these attributes survive and are accessible using normal lookup in the BaseCPU class. 3. There are some methods like addCheckerCPU and properties like needsTSO which have ISA specific values or behaviors. These are set in the ISA specific subclass, where they are inherently specific to an ISA and don't need to check TARGET_ISA. Also, the DummyChecker which was set up for the BaseSimpleCPU which doesn't actually do anything in either C++ or python was not carried forward. The CPU type still exists, but it isn't installed in the simple CPUs. To provide backward compatibility, each ISA implements a .py file which matches the original .py for a CPU, and the original is renamed with a Base prefix. The ISA specific version creates an alias with the old CPU name which maps to the ISA specific type. This way, old scripts which refer to, for example, AtomicSimpleCPU, will get the X86AtomicSimpleCPU if the x86 version was compiled in, the ArmAtomicSimpleCPU on arm, etc. Unfortunately, because of how tags on PySource and by extension SimObjects are implemented right now, if you set the tags on two SimObjects or PySources which have the same module path, the later will overwrite the former whether or not they both would be included. There are some changes in review which would revamp this and make it work like you would expect, without this central bookkeeping which has the conflict. Since I can't use that here, I fell back to checking TARGET_ISA to decide whether to tell SCons about those files at all. In the long term, this mechanism should be revamped so that these compatibility types are only available if there is exactly one ISA compiled into gem5. After the configs have been updated and no longer assume they can use AtomicSimpleCPU in all cases, then these types can be deleted. Also, because ISAs can now either provide subclasses for a CPU or not, the CPU_MODELS variable has been removed, meaning the non-ISA specialized versions of those CPU models will always be included in gem5, except when building the NULL ISA. In the future, a more granular config mechanism will hopefully be implemented for all of gem5 and not just the CPUs, and these can be conditional again in case you only need certain models, and want to reduce build time or binary size by excluding the others. Change-Id: I02fc3f645c551678ede46268bbea9f66c3f6c74b Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/52490 Reviewed-by: Andreas Sandberg <andreas.sandberg@arm.com> Maintainer: Gabe Black <gabe.black@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-01-12 15:59:27 +00:00
Yu-hsin Wang	1e0504cf4a	fastmodel: Fix cluster build failed FastModelCortexCluster subclasses don't have `type` property. They don't need to be specified in sim_objects for generating *Params class. Change-Id: Ic09e494042e05d68c890f9603b8b78a4a8d815a9 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/55305 Reviewed-by: Gabe Black <gabe.black@gmail.com> Maintainer: Gabe Black <gabe.black@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-01-12 08:07:33 +00:00
Wing Li	ad7ff8e271	fastmodel: export wake request ports from GIC Change-Id: I561ef876a4e873501ed2e9775b5bdb59707521a9 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/54783 Reviewed-by: Gabe Black <gabe.black@gmail.com> Maintainer: Gabe Black <gabe.black@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-01-12 01:48:19 +00:00
Matthew Poremba	d6bd91a9fd	arch-vega: Implement large ds_read/write instructions Port large DS read/write instructions from https://gem5-review.googlesource.com/c/public/gem5/+/48342. This implements the 96 and 128b ds_read/write instructions in a similar fashion to the 3 and 4 dword flat_load/store instructions. These instructions are treated as reads/writes of 3 or 4 dwords, instead of as a single 96b/128b memory transaction, due to the limitations of the VecOperand class used in the amdgpu code. In order to handle treating the memory transaction as multiple dwords, the patch also adds in new initMemRead/initMemWrite functions for ds instructions. These are similar to the functions used in flat instructions for the same purpose. Change-Id: Iee2de14eb7f32b6654799d53dc97d806288af98f Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/55344 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-01-11 16:58:09 +00:00
Matthew Poremba	5a94e73d00	arch-vega: Validate if scalar sources are scalar gprs Port the fixes for scalar source checks from arch-gcn3 at https://gem5-review.googlesource.com/c/public/gem5/+/48344. Scalar sources can either be a general-purpose register or a constant register that holds a single value. If we don't check for if the register is a general-purpose register, it's possible that we get a constant register, which then causes all of the register mapping code to break, as the constant registers aren't supposed to be mapped like the general-purpose registers are. This fix adds an isScalarReg check to the instruction encodings that were missing it. Change-Id: I30dd2d082a5a1dcc3075843bcefd325113ed1df6 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/55343 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-01-11 16:58:09 +00:00
Yu-hsin Wang	614b608a08	fastmodel: Add an example reset controller for IrisCpu The example reset controller provides a register interface to config RVBAR and ability to reset the core. Change-Id: I088ddde6f44ff9cc5914afb834ec07a8f7f269fa Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/54065 Reviewed-by: Gabe Black <gabe.black@gmail.com> Maintainer: Gabe Black <gabe.black@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-01-11 02:11:40 +00:00
Alistair Delva	cb7799648b	arch-arm: Add support for initrd/initramfs Add initrd_filename and initrd_addr parameters to specify that an initrd/initramfs should be loaded into memory from a file, just like the DTB blob. The user must specify the initrd file, and they can specify the initrd load address as well. However, in practice, it's expected that the dev/machine backend will derive the initrd load address from the dtb load address, which is how a bootloader would typically do it. Change-Id: I6378927c2984b7ccdd1471486dd7803500ef5883 Signed-off-by: Alistair Delva <adelva@google.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/54184 Reviewed-by: Richard Cooper <richard.cooper@arm.com> Reviewed-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Maintainer: Giacomo Travaglini <giacomo.travaglini@arm.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-01-05 21:38:34 +00:00
Eric Ye	f894de5486	scons: Try to fix build dependency bug when generating fastmodels Bug: 201084562 Change-Id: I33cc9e09b1ce46f80864d75f088a2534949e55e1 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/55043 Reviewed-by: Gabe Black <gabeblack@google.com> Maintainer: Gabe Black <gabeblack@google.com> Tested-by: kokoro <noreply+kokoro@google.com>	2022-01-05 15:29:32 +00:00
Luming Wang	1f155ffd90	arch-riscv,sim-se: Complements the system calls on RISC-V There are many SE mode system calls that are implemented in src/sim/syscall_emul.cc or src/sim/syscall_emul.hh. And they work well under X86 and ARM platforms. However, they are not supported in se_workload.cc under the RISC-V platform. This patch adds support for all the system calls already implemented in syscall_emul.hh/cc to the RISC-V platform (in arch/riscv/linux/se_workload.cc). Change-Id: Ia47c3c113767b50412b1c8ade3c1047c894376cf Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/54803 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Bobby Bruce <bbruce@ucdavis.edu>	2022-01-05 03:25:58 +00:00
Bobby R. Bruce	065a7dbf1b	misc: Merge branch release-staging-v21-2 into develop Change-Id: I8200ac51c20117f63b51d555fa2f12e5dd35f22e	2021-12-26 23:59:41 -08:00
Cui Jin	2194aea053	arch-riscv: rvc instruction is mistaken as branch Fetch in O3CPU mistakes the normal non-branching compressed instructions, and regards it as a branch. This issue interrupts the consecutive instruction stream, thus affecting performance of cpu front-end. This fix sets the compressed for PCState during decoding. Jira Issue: https://gem5.atlassian.net/browse/GEM5-1137 Change-Id: I7607d563bba8a08869e104877fc3c11c94cbe904 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/54644 Reviewed-by: Jin Cui <cuijinbird@gmail.com> Reviewed-by: Ayaz Akram <yazakram@ucdavis.edu> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-12-22 23:51:50 +00:00
Kyle Roarty	f9deeea427	arch-gcn3,arch-vega: Select proper data on misaligned access req1->getSize() returns the size in bytes, but because we're using it in an array index, we need to scale it by the size of the data type. This ensures we give the second request the proper data. Change-Id: I578665406762d5d0c95f2ea8297c362e1cc0620b Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/54503 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Matthew Poremba <matthew.poremba@amd.com>	2021-12-20 18:28:08 +00:00
Gabe Black	c154888b2a	arch,sim-se: Handle syscall retry/suppression in the syscall desc. Rather than make each ISA include boilerplate to ignore a SyscallReturn's value when it's marked as suppressed or needing a retry, put that code into the SyscallDesc::doSyscall method instead. That has two benefits. First, it removes a decent amount of code duplication which is nice from a maintenance perspective. Second, it puts the SyscallDesc in charge of figuring out what to do once a system call implementation finishes. That will let it schedule a retry of the system call for instance, without worrying about what the ISA is doing with the SyscallReturn behind its back. Jira Issue: https://gem5.atlassian.net/browse/GEM5-1123 Change-Id: I76760cba75fd23e6e3357f6169c0140bee3f01b6 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/54204 Reviewed-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Maintainer: Giacomo Travaglini <giacomo.travaglini@arm.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-12-17 05:45:26 +00:00
Yu-hsin Wang	110e22439f	arch-arm: gdb support Thumb-2 ISA From the document*1, we should allow 2,3,4 in kind check function for supporting all kinds of ARM breakpoint. 1. https://sourceware.org/gdb/current/onlinedocs/gdb/ARM-Breakpoint-Kinds.html Change-Id: I82bcb88cfe6e80e7f17cd6bb68a26a45ace7b174 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/54124 Reviewed-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Maintainer: Giacomo Travaglini <giacomo.travaglini@arm.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-12-16 13:26:45 +00:00
Yu-hsin Wang	46957f4337	base: Correct checkBpLen naming with checkBpKind In gdb document1, the second parameter of checkpoint command(Z0, Z1) is named after kind. Although underlying implementation probably considers it as length2, it's still good to follow the name described in gdb document for avoiding any confusion. Refs: 1. https://sourceware.org/gdb/onlinedocs/gdb/Packets.html 2. https://github.com/bminor/binutils-gdb/blob/master/gdb/arch-utils.h#L41 Change-Id: Ib4b585613b8018970b16355f96cdff2ce9d5bae6 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/54123 Reviewed-by: Daniel Carvalho <odanrc@yahoo.com.br> Maintainer: Daniel Carvalho <odanrc@yahoo.com.br> Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Giacomo Travaglini <giacomo.travaglini@arm.com>	2021-12-16 13:26:39 +00:00
Giacomo Travaglini	a923674d62	arch-arm: Do not squash table walks if translation is partial As partial translations have been introduced we cannot just rely on checking if there is a valid translation when looking for translations to squash. The translation has to be complete as well. This is fixing realview-o3-checker regression Change-Id: I1ad42bd6172207a72f53b7a843c323c0eea88f06 Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/54043 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com> (cherry picked from commit `ec891adca9`) Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/54103 Reviewed-by: Bobby Bruce <bbruce@ucdavis.edu> Maintainer: Bobby Bruce <bbruce@ucdavis.edu>	2021-12-14 09:43:41 +00:00
Gabe Black	2be95c4470	arch,sim-se: Handle syscall retry/suppression in the syscall desc. Rather than make each ISA include boilerplate to ignore a SyscallReturn's value when it's marked as suppressed or needing a retry, put that code into the SyscallDesc::doSyscall method instead. That has two benefits. First, it removes a decent amount of code duplication which is nice from a maintenance perspective. Second, it puts the SyscallDesc in charge of figuring out what to do once a system call implementation finishes. That will let it schedule a retry of the system call for instance, without worrying about what the ISA is doing with the SyscallReturn behind its back. Jira Issue: https://gem5.atlassian.net/browse/GEM5-1123 Change-Id: I3732a98c8e0d0b2b94d61313960aa0782c0b971f Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/54023 Maintainer: Gabe Black <gabe.black@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Giacomo Travaglini <giacomo.travaglini@arm.com>	2021-12-13 20:31:51 +00:00
Giacomo Travaglini	ec891adca9	arch-arm: Do not squash table walks if translation is partial As partial translations have been introduced we cannot just rely on checking if there is a valid translation when looking for translations to squash. The translation has to be complete as well. This is fixing realview-o3-checker regression Change-Id: I1ad42bd6172207a72f53b7a843c323c0eea88f06 Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/54043 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-12-13 09:22:01 +00:00
Giacomo Travaglini	fcb544d569	arch-arm: Allow the L2 unified TLB to store partial translations We are allowing the L2 TLB to store partial translations from the second level of lookup JIRA: https://gem5.atlassian.net/browse/GEM5-1108 Change-Id: I1286c14a256470c2075fe5533930617139d4d087 Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/52126 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Andreas Sandberg <andreas.sandberg@arm.com> Maintainer: Andreas Sandberg <andreas.sandberg@arm.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-12-12 05:04:24 +00:00
Giacomo Travaglini	6ea9a7fe73	arch-arm: Allowing table descriptor to be inserted in TLB This patch is modifying both TableWalker and MMU to effectively store/use partial translations * TableWalker changes: If there is a TLB supporting partial translations (implemented with previous patch), the TableWalker will craft partial entries and forward them to the TLB as walks are performed * MMU changes: We now instruct the table walker to start a page table traversal even if we hit in the TLB, if the matching entry holds a partial translation JIRA: https://gem5.atlassian.net/browse/GEM5-1108 Change-Id: Id20aaf4ea02960d50d8345f3e174c698af21ad1c Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/52125 Reviewed-by: Andreas Sandberg <andreas.sandberg@arm.com> Maintainer: Andreas Sandberg <andreas.sandberg@arm.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-12-12 05:04:24 +00:00

1 2 3 4 5 ...

5050 Commits