derek/gem5

Author	SHA1	Message	Date
Bobby R. Bruce	d0772a0bfe	misc: Merge branch 'release-staging-v20.1.0.0' into develop Change-Id: I3694b251855b969c7bd3807f34e1b4241d47d586	2020-09-30 20:39:06 -07:00
Bobby R. Bruce	090fa08c14	misc: Updated version to 20.1.0.0 Change-Id: Ic7a37581c58caa354eeecab051122116177d0721 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/35456 Reviewed-by: Bobby R. Bruce <bbruce@ucdavis.edu> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Bobby R. Bruce <bbruce@ucdavis.edu> Tested-by: kokoro <noreply+kokoro@google.com>	2020-10-01 03:05:49 +00:00
Sungkeun Kim	65338a63d2	sim: Adding missing argument of panic function panic function call in panicFsOnlyPseudoInst (src/sim/pseudo_inst.cc) needs to be invoked with argument (name). Jira Issue: https://gem5.atlassian.net/jira/software/c/projects/GEM5/issues/GEM5-786?filter=allissues Change-Id: Iecacab7b9e0383373b69e9b790fa822d173d29c3 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/35040 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-09-30 22:14:51 +00:00
Matthew Poremba	dcf242d838	mem-ruby: Fixing token port responses in GPUCoalescer There is a bug in the GPUCoalescer which occurs in the following situation: 1) An instruction crosses a page boundary causing multiple TLB requests to be sent. 2) The TLB responses arrive at different times, causing the vector memory requests to be sent at different times. 3) The first vector memory request completes before the second vector memory request arrives at the coalescer. This caused the coalescer to consider the instruction sequence number done and return its token. Then the second request would arrive and complete, sending back another token. Eventually this increases the token count beyond the maximum, tripping an assert. This change keeps track of the number of per-lane requests which are expected to be sent in the vector memory request by looking at the exec mask of the instruction. The token is not returned until the expected number of per-lane requests have been coalesced. This fixes "#7" in the list of issues in JIRA-300. There are also style fixes for local variables in code near the changes in this CL. Change-Id: I152fd9397920ad82ba6079112908387e71ff3cce JIRA: https://gem5.atlassian.net/browse/GEM5-300 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/35176 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Reviewed-by: Kyle Roarty <kyleroarty1716@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-09-30 20:19:36 +00:00
Matthew Poremba	53807c8276	configs,gpu-compute: Fixes to connect gmTokenPort When the TokenPort was moved from the GCN3 staging branch to develop the TokenPort was changed from being the port connecting the ComputeUnit to Ruby's vector memory port to a sideband port which inhibits requests to Ruby's vector memory port. As such, it needs to be explicitly connected as a new port. This changes the getPort method in ComputeUnit to be aware of the port as well as modifying the example config to connect to TCPs. The iteration to connect in the config file was modified since it was not properly connecting to TCPs each time and Ruby.py does not explicitly return a list of each MachineType. Change-Id: Ia70a6756b2af54d95e94d19bec5d8aadd3c2d5c0 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/35096 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-09-30 20:19:21 +00:00
Giacomo Travaglini	c417b76bad	cpu: Never use an empty byteEnable The byteEnable variable is used for masking bytes in a memory request. The default behaviour is to provide from the ExecContext to the CPU (and then to the LSQ) an empty vector, which is the same as providing a vector where every element is true. Such vectors basically mean: do not mask any byte in the memory request. This behaviour adds more complexity to the downstream LSQs, which now have to distinguish between an empty and non-empty byteEnable. This patch simplifies things by transforming an empty vector into an all-true one, making sure the CPUs always receive a non-empty byteEnable. JIRA: https://gem5.atlassian.net/browse/GEM5-196 Change-Id: I1d1cecd86ed64c53a314ed700f28810d76c195c3 Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/23285 Reviewed-by: Daniel Carvalho <odanrc@yahoo.com.br> Tested-by: kokoro <noreply+kokoro@google.com>	2020-09-30 14:16:31 +00:00
Giacomo Travaglini	81a3637260	arch-x86: Add byteEnable mask in x86 memhelpers The next patch will make the byteEnable mandatory in the ExecContext interface, so we need to amend the existing helpers to make them generate the boolean vector. JIRA: https://gem5.atlassian.net/browse/GEM5-196 Change-Id: Ib24550aa1e22049487ef4ec2748b786be456d342 Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/23529 Tested-by: kokoro <noreply+kokoro@google.com> Maintainer: Anthony Gutierrez <anthony.gutierrez@amd.com> Reviewed-by: Anthony Gutierrez <anthony.gutierrez@amd.com>	2020-09-30 08:50:39 +00:00
Giacomo Travaglini	e04ee364de	arch-arm: Using new "raw" memhelpers JIRA: https://gem5.atlassian.net/browse/GEM5-196 Change-Id: Ie5ea0fc845a8f6d77a5723bacaff25ba04562f9c Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/23528 Reviewed-by: Nikos Nikoleris <nikos.nikoleris@arm.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-09-30 08:50:39 +00:00
Giacomo Travaglini	0eab4bf2af	arch: Add raw read/writeMem helpers With some exceptions (in arm/x86) the standard memory read/write interface for instructions relies upon the helper functions in src/arch/generic/memhelpers.hh which wrap the ExecContext interface. (readMem, writeMem...) Those helpers rely on the source/destination data to be provided (as expected) but not on the size of the transaction. The latter gets evaluated via the host size of the source/destination data (sizeof(MemT)). For this reason some instructions, which are instead using an incompatible MemT data (as an example, a SIMD operation loading data in an array of integers), make direct use of the ExecContext interface, which is simply requesting for a pointer and a number of bytes. Some other instructions are using the ExecContext interface since the helpers do not accept a byteEnable argument. This patch is adding some helpers to address these issues. The idea is to deprecate direct usage of the ExecContext APIs. These new wrappers do not work with the type detection mechanism to evaluate the number of bytes we are accessing. JIRA: https://gem5.atlassian.net/browse/GEM5-196 Change-Id: I5b822d278bdf325a68a01aa1861b6487c6628245 Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/23527 Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Gabe Black <gabeblack@google.com> Reviewed-by: Daniel Carvalho <odanrc@yahoo.com.br> Maintainer: Gabe Black <gabeblack@google.com>	2020-09-30 08:50:39 +00:00
Giacomo Travaglini	d29349ead6	arch: Do value-initialization for MemOperand With this patch we are properly initializing the MemOperand variable, with value-initialization. Prior to this patch, the variable was simply default-initialized. For a native type, this means the variable is undefined. With value initialization we are sure the variable is not undefined and the compiler doesn't complain about it. JIRA: https://gem5.atlassian.net/browse/GEM5-196 Change-Id: I55a5b8f047b8e691529807b61d38f0d47fcfe61e Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/23526 Tested-by: kokoro <noreply+kokoro@google.com> Tested-by: Gem5 Cloud Project GCB service account <345032938727@cloudbuild.gserviceaccount.com> Reviewed-by: Gabe Black <gabeblack@google.com> Maintainer: Gabe Black <gabeblack@google.com>	2020-09-30 08:50:39 +00:00
Gabe Black	ba197c1163	arch: Wrap a docstring in isa_parser.py. This brings the ISA parser in line with the style guide. Note that the docstring needs to be a single string literal for python to consider it a docstring, and the parser itself needs each line of the docstring to be a rule in its CFG. We can accomplish both by taking advantage of the fact that two directly adjacent quoted strings are treated as a single string literal by python, and by escaping the newline so that they're actually considered adjacent. Change-Id: I7f4d252998877808425aafb0159600ba4c3bf9ad Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/35276 Reviewed-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Maintainer: Giacomo Travaglini <giacomo.travaglini@arm.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-09-29 22:03:14 +00:00
Gabe Black	7e8bc77353	base: Expose the ObjectFile class to python. This will make it possible to inspect a binary and determine, for example, what architecture or operating system it was compiled for. Change-Id: Ib40f1e1c02448dc5bf084bb0dd98d3767f463fef Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/33899 Reviewed-by: Andreas Sandberg <andreas.sandberg@arm.com> Maintainer: Andreas Sandberg <andreas.sandberg@arm.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-09-29 21:56:50 +00:00
Adrian Herrera	a21573cef0	dev-arm: SMMUv3, default CMDQ entries to 128 From Linux 587e6c10a7ce89a5924fdbeff2ec524fbd6a124b, SMMUv3 implementations in 64-bit platforms must report a minimum of 128 CMDQ entries via SMMU_IDR1. Otherwise, the SMMUv3 Linux driver returns -ENXIO. Change-Id: I304aac1b734515b3077003e8d67cc19730afc67f Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/35297 Reviewed-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Maintainer: Giacomo Travaglini <giacomo.travaglini@arm.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-09-29 18:38:48 +00:00
Bobby R. Bruce	b715c2d513	python: Flush the simulation stdout/stderr buffers Occasionally gem5's stdout/stderr, when run within the TestLib framework, will be shuffled. This is resolved by flushing the stdout/stderr buffer before and after simulation. In addition to this, the verifier.py has been improved to remove boilerplate gem5 code from the stdout comparison. Change-Id: I04c8f9cee4475b8eab2f1ba9bb76bfa3cfcca6ec Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/34995 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Andreas Sandberg <andreas.sandberg@arm.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Andreas Sandberg <andreas.sandberg@arm.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-09-29 17:52:15 +00:00
Gabe Black	87baeab20f	x86: Use the common pseudoInst dispatch function. Instead of hand invoking each individual pseudo inst. New instructions added in the future will automatically become available without a lot of extra hand implementation. It also simplifies the x86 ISA description. Change-Id: Ibb671dc2656e61679b7ed016c51a6c879e12910a Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/27789 Reviewed-by: Gabe Black <gabeblack@google.com> Maintainer: Gabe Black <gabeblack@google.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-09-29 11:10:36 +00:00
Timothy Hayes	d9d4203e04	arch-arm: Instantiate a single HTM checkpoint at ISA::startup Change-Id: I48cc71dce607233f025387379507bcd485943dde Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/35016 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Giacomo Travaglini <giacomo.travaglini@arm.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-09-29 09:16:28 +00:00
Timothy Hayes	5c83d8f74c	cpu: Allow storing an invalid HTM checkpoint Commits `02745afd` and `f9b4e32` introduced a mechanism for creating checkpoint objects for hardware transactional memory (HTM) and Arm TME. Because the checkpoint object also contains the local UID of a transaction, it is needed before any architectural checkpointing takes place. This caused segfaults when running HTM codes. This commit allows ISAs to allocate a checkpoint once at the beginning of simulation. In order to do that we need to remove the validity check assertion; the cpt will become valid only after a first successful transaction start. Change-Id: I233d01805f8ab655131ed8cd6404950a2bf6fbc7 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/35015 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Giacomo Travaglini <giacomo.travaglini@arm.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-09-29 09:16:28 +00:00
Nikos Nikoleris	55cbc64d1e	mem: Fix some reference use in range loops This change fixes two cases of range loops, one where we can't use lvalue reference, and one more where we have to use an lvalue reference as we can't create a copy. In both cases clang would warn. Change-Id: I760aa094af66be32a150bad37acc21d6fd512a65 Signed-off-by: Nikos Nikoleris <nikos.nikoleris@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/34776 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-09-28 22:38:11 +00:00
Gabe Black	b877efa6d4	misc: Update attribute syntax, and reorganize compiler.hh. This change replaces the __attribute__ syntax with the now standard [[]] syntax. It also reorganizes compiler.hh so that all special macros have some explanatory text saying what they do, and each attribute which has a standard version can use that if available and what version of c++ it's standard in is put in a comment. Also, the requirements as far as where you put [[]] style attributes are a little more strict than the old school __attribute__ style. The use of the attribute macros was updated to fit these new, more strict requirements. Change-Id: Iace44306a534111f1c38b9856dc9e88cd9b49d2a Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/35219 Reviewed-by: Daniel Carvalho <odanrc@yahoo.com.br> Maintainer: Gabe Black <gabeblack@google.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-09-28 21:52:59 +00:00
Gabe Black	3c31a214b6	base,mem: Use the standard [[deprecated]] attribute. The [[deprecated]] attribute is now standard, and so we don't need to wrap it in our own macro any more. Change-Id: I363df9a9c6b820dee8c21b1716335c0d15fbc62d Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/35216 Maintainer: Gabe Black <gabeblack@google.com> Reviewed-by: Nikos Nikoleris <nikos.nikoleris@arm.com> Reviewed-by: Daniel Carvalho <odanrc@yahoo.com.br> Tested-by: kokoro <noreply+kokoro@google.com>	2020-09-28 21:52:41 +00:00
Ciro Santilli	325a143d6f	cpu: make ExecSymbol show the symbol in addition to address Before this commit, ExecSymbol would show only the symbol and no address: 0: system.cpu: A0 T0 : @_kernel_flags_le_lo32+6 : mrs x0, currentel After this commit, it shows the symbol in addition to the address: 0: system.cpu: A0 T0 : 0x10 @_kernel_flags_le_lo32+6 : mrs x0, currentel Change-Id: I665802f50ce9aeac6bb9e174b5dd06196e757c60 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/35077 Reviewed-by: Richard Cooper <richard.cooper@arm.com> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-09-28 10:47:38 +00:00
Gabe Black	2c6901120f	base,dev: Use the standard attribute [[noreturn]]. The [[noreturn]] attribute has been standard since c++11, and so we don't (and haven't for a while) need to wrap it in a macro. Change-Id: Ifba62c87c19224bb366e93ebba685a063cc750ce Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/35218 Reviewed-by: Gabe Black <gabeblack@google.com> Maintainer: Gabe Black <gabeblack@google.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-09-28 05:41:45 +00:00
Gabe Black	62aa07c915	arch,base,cpu,dev: Get rid of the M5_DUMMY_RETURN macro. This macro probably would have been defined to "return" in some cases, to be put after a call to a function that doesn't return so that the compiler wouldn't think control would reach the end of a non-void function. It was only ever defined to expand to nothing, and now that [[noreturn]] is a standard attribute, it should never be needed going forward. Change-Id: I37625eab72deeaede77f9347116b9fddd75febf7 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/35217 Reviewed-by: Daniel Carvalho <odanrc@yahoo.com.br> Maintainer: Gabe Black <gabeblack@google.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-09-28 05:41:26 +00:00
Gabe Black	50a0b85367	arm,base,gpu: Use std::make_unique instead of m5::make_unique. Now that we're using c++14, we can just assume that std::make_unique exists. We no longer have to conditionally inject our own version. Change-Id: I5d851afb02dd05c7af93864ffec3b3184f3d4ec8 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/35215 Reviewed-by: Daniel Carvalho <odanrc@yahoo.com.br> Maintainer: Gabe Black <gabeblack@google.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-09-28 05:41:08 +00:00
Earl Ou	ff6a3a6171	base,sim: implement a faster mutex for single thread case This change applies an atomic variable to check if we really need to obtain a mutex, and uses a condition variable to notify. See about 5% improvement in the simulation speed. Change-Id: I7e165987dcb587b27fae90978b9b3fde6f5563ef Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/34915 Reviewed-by: Andreas Sandberg <andreas.sandberg@arm.com> Reviewed-by: Richard Cooper <richard.cooper@arm.com> Reviewed-by: Daniel Carvalho <odanrc@yahoo.com.br> Reviewed-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Maintainer: Gabe Black <gabeblack@google.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-09-28 02:30:41 +00:00
Gabe Black	4b63d5e7a8	mem: When loading an image directly in memory, use the right CL size. Some code was added fairly recently which would load a memory image into a memory directly in order to make it easier to set up ROMs. Unfortunately, that code accidentally used the image size instead of the cache line size when setting up the port proxy which would actually write the data. This happens to work when the image size is a power of two since that's all the proxy checks for, but there's no guarantee that every image will be sized that way. This change instead looks into the system object, retrieves the cache line size from it, and uses that to set up the port proxy. Change-Id: I227ac475b855d9516e1feb881769e12ec4e7d598 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/35155 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-09-26 01:16:40 +00:00
Gabe Black	9de667af6e	sim: Remove check whether the System port is connected. The port will report an error if something tries to use it and it's not connected. If it isn't needed, there's no reason to force users to hook something up to it just to satisfy the check. Change-Id: I0668b8a86c8cb323aba51670fb7914d35acc5198 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/34815 Maintainer: Gabe Black <gabeblack@google.com> Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Nikos Nikoleris <nikos.nikoleris@arm.com> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Andreas Sandberg <andreas.sandberg@arm.com>	2020-09-25 23:51:49 +00:00
Gabe Black	dd130fec92	base: When creating an ELF file memory image, ignore empty segments. Sometimes ELF files have segments in them which are marked as loadable, but which actually have zero size in memory. When setting up a memory image we should drop those to avoid confusing other code which tries to find the footprint of a memory image. No part of these segments, including their starting address or ending address, need to actually land on top of memory since they don't actually contain any data. Change-Id: If8b61d10db139e0f688b6ceabcb8e6a898557469 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/35156 Reviewed-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Gabe Black <gabeblack@google.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-09-25 23:50:59 +00:00
Bobby R. Bruce	63e9699256	misc: Merge branch 'release-staging-v20.1.0.0' into develop Change-Id: I656a2d9512b1822a7e8d82606da7a0a5504d6820	2020-09-24 22:28:11 -07:00
Gabe Black	d872fe2f17	base: Minor cleanup of the ChunkGenerator. Minor style fixes, switched to Addr for some types so they'll definitely be large enough. Change-Id: I985004116c48ce6fb236c04e04fe54ed49a68277 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/34177 Reviewed-by: Steve Reinhardt <stever@gmail.com> Maintainer: Gabe Black <gabeblack@google.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-09-25 04:50:00 +00:00
Gabe Black	bcc797a2cb	fastmodel: Update the IRIS ThreadContext base class. The syscall() method has been removed, and HTM related methods have been added. Change-Id: I796c1a554bfd4b1ee01a62c9c7ad403dd699cc0f Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/35038 Reviewed-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Maintainer: Giacomo Travaglini <giacomo.travaglini@arm.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-09-24 20:08:13 +00:00
Gabe Black	76bceca2e1	arm,fastmodel: Update the VExpressFastModel to use ArmInterruptPins. The HDLCD device now uses an ArmInterruptPin instead of a GIC and interrupt number parameter. Change-Id: I31122e66a1c18f61592f3dca214ee057baad8f88 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/35039 Reviewed-by: Gabe Black <gabeblack@google.com> Maintainer: Gabe Black <gabeblack@google.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-09-24 20:07:59 +00:00
Gabe Black	281afe2be0	fastmodel: Update for the isa_traits.hh changes. arch/arm/isa_traits.hh no longer has using namespace ArmISA, and also no longer directly or indirectly provides interrupt number related constants. Change-Id: Ieda31d1db4f85632a555b2f72ee8bff0aa159eee Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/35037 Reviewed-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Maintainer: Giacomo Travaglini <giacomo.travaglini@arm.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-09-24 20:07:48 +00:00
Kyle Roarty	45f57ff2c2	gpu-compute: set exec_mask for permute,bpermute instructions This change sets gpuDynInst->exec_mask for permute and bpermute instructions, fixing a bug where they would never write their data. permute and bpermute instructions are load instructions that write to a VGPR. Because of that, they use gpuDynInst->exec_mask when checking what lanes should write to the VGPR. gpuDynInst->exec_mask gets set to wf->execMask() as that is what other load instructions that write to VGPRs do. Change-Id: Ie443283488cbd2ab9c17fc255e7cc44418353419 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/35036 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Reviewed-by: Anthony Gutierrez <anthony.gutierrez@amd.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Anthony Gutierrez <anthony.gutierrez@amd.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-09-24 17:56:04 +00:00
Kyle Roarty	347d7644eb	gpu-compute: replace uint32_t* casts with bits API calls The uint32_t* casts made it hard to understand at a glance what was being done. Replaced with calls to various bits functions, which are functionally equivalent and much clearer. This also fixes a segfault in GPUInitAbi DPRINTFs from a mis-typed uint32_t* cast. Change-Id: Id5d1863942848dd7a9e5e17e8180c33adbc72f15 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/34677 Reviewed-by: Anthony Gutierrez <anthony.gutierrez@amd.com> Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Anthony Gutierrez <anthony.gutierrez@amd.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-09-24 14:53:16 +00:00
Adrian Herrera	87328c2fc1	dev-arm: FVPBasePwrCtrl, fix vector resizing (1) ThreadContexts are registered into System in BaseCPU::init. (2) FVPBasePwrCtrl state is resized based on registered ThreadContexts in FVPBasePwrCtrl::init. FVPBasePwrCtrl::init may be called before BaseCPU::init based on the model names alphabetical order, leading to segmentation faults. To fix this, (2) is now carried out in FVPBasePwrCtrl::startup. Change-Id: Ica6c5b7448da556d61aee53f8777a709fcad2212 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/35075 Reviewed-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Maintainer: Giacomo Travaglini <giacomo.travaglini@arm.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-09-24 14:01:09 +00:00
Gabe Black	2e5ffdb7f3	cpu: Use cprintf and C++ type magic to get rid of a THE_ISA. It should be fine to let operator overloading take care of figuring out how to print the ExtMachInst type for a given ISA. Change-Id: I173fd9f49013d92191118775d20344219a69337e Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/34822 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Maintainer: Gabe Black <gabeblack@google.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-09-23 12:40:42 +00:00
Earl Ou	b86461ce94	systemc: avoid mutex lock in non async cases Avoid acquiring a mutex lock in case there is no async update in the scheduler. This helps increase simulation speed by about 4%. Change-Id: I971c7bf1a1eeb46208eeee6e5da6385c907092b3 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/34695 Reviewed-by: Earl Ou <shunhsingou@google.com> Maintainer: Earl Ou <shunhsingou@google.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-09-23 04:05:03 +00:00
Giacomo Travaglini	2035ebfbba	dev-arm: Implement GICv4.1 GICD_TYPER2 as RES0 If GICv4.1 is not implemented (our case) the register should be treated as RES0 Change-Id: Ia60f6dce9741c34bf167805f60c3fc8bf0897510 Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Reviewed-by: Ciro Santilli <ciro.santilli@arm.com> Reviewed-by: Andreas Sandberg <andreas.sandberg@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/34875 Maintainer: Andreas Sandberg <andreas.sandberg@arm.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-09-22 13:19:43 +00:00
Giacomo Travaglini	47aa52ed17	arch-arm: TLBI ALLE2IS should broadcast to the IS domain This was implemented as a normal ALLE2 hence affecting the current PE only Change-Id: Ib369dd5a4b738daf96a01b5535d7481a97bb3730 Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Reviewed-by: Nikos Nikoleris <nikos.nikoleris@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/34795 Tested-by: kokoro <noreply+kokoro@google.com>	2020-09-22 08:17:28 +00:00
Bobby R. Bruce	92e8a871f3	misc: Merge branch 'release-staging-v20.1.0.0' into develop Change-Id: I1b33eeda67e7641ab71935e140fd24d4735be596	2020-09-21 18:48:12 -07:00
Bobby R. Bruce	b45bbef206	tests,base: Fixed unittests for .fast unittests.fast, unittests.prof, and unittests.perf had failing tests due to the stripping of asserts via compiler optimization. This patch alters the unittests to skip these tests when TRACING_ON == 0. Change-Id: I2d4ab795ecfc2c4556b5eb1877635409d0836ec6 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/34898 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-09-22 01:46:22 +00:00
Gabe Black	0ad5d1edc5	arch,cpu,sim: Route system calls through the workload. System calls should now be requested from the workload directly and not routed through ExecContext or ThreadContext interfaces. That removes a major special case for SE mode from those interfaces. For now, when the SE workload gets a request for a system call, it dispatches it to the appropriate Process object. In the future, the ISA specific Workload subclasses will be responsible for handling system calls and not the Process classes. For simplicity, the Workload syscall() method is defined in the base class but will panic everywhere except when SEWorkload overrides it. In the future, this mechanism will turn into a way to request generic services from the workload which are not necessarily system calls. For instance, it could be a way to request handling of a page fault without having to have another PseudoInst just for that purpose. Change-Id: I18d36d64c54adf4f4f17a62e7e006ff2fc0b22f1 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/33282 Reviewed-by: Matthew Poremba <matthew.poremba@amd.com> Maintainer: Gabe Black <gabeblack@google.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-09-20 07:26:42 +00:00
Gabe Black	3293926413	sim: Create a Workload object for SE mode. The workload object is still optional for the sake of compatibility, even though it probably shouldn't be in the long term. If a simulation is just a collection of components with nothing in particular running on it, for instance driven by a traffic generator, should it even have a System object in the first place? Change-Id: I8bcda72bdfa3730248226fb62f0bba9a83243d95 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/33278 Reviewed-by: Matthew Poremba <matthew.poremba@amd.com> Maintainer: Gabe Black <gabeblack@google.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-09-20 07:26:22 +00:00
Gabe Black	adb13e4fc7	dev: Stop using the OS page size in the IDE controller. This size was used to break up DMA transactions so that a single transaction would not cross a page boundary. This was because on Alpha, there was an actual page table which translated between PCI and DMA address spaces. On all currently implemented systems, the mapping is simply to add a scalar offset, so it's not possible for a legal region of memory to be contiguous in one space but not in the other. Additionally, if it was possible for there to be a mismatch, it was only coincidence that Alpha used a page table which had the same sized pages as it normally used. There is no requirement that there even would be fixed sized pages in the first place. To avoid this artificial dependency between the IDE controller and the ISA, this change simply changes the chunk size for DMA accesses to 4K. That's the page size at least on x86 and probably other architectures, and will be a pretty close approximation of the previous behavior. It's possible that even having this chunking in the first place is unnecessary and functionally useless, but there are some checks which happen between chunks, and changing how big they are would change the frequency of those checks. For instance, the controller/disk may not notice in the same amount of time if a DMA was cancelled somehow. Change-Id: I1ec840d1f158c3faa31ba0184458b69bf654c252 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/34178 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Gabe Black <gabeblack@google.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-09-20 07:26:04 +00:00
Gabe Black	c8cde1fefa	base: Use M5_UNLIKELY with conditional DPRINTF family functions. Most DPRINTFs will be skipped over most of the time, and when they aren't they'll already have overhead from string handling, output to the console and/or a file, etc, which will drown out the behavior of a branch. Change-Id: I5475d7b5add63b44f60c0a1d46b4b14e6bf30fd3 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/34818 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-09-20 01:03:39 +00:00
Gabe Black	89ffa84de6	base: Use M5_UNLIKELY for conditional panic, etc., macros. panic_if and fail_if should happen at most once in any given simulation, and warn_if, etc., should still not happen most of the time. Change-Id: Iaa6cb03c11b86d84f51cc4738efb8f203de4201c Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/34817 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-09-20 01:03:31 +00:00
Gabe Black	2f6a770ccf	base: Add M5_LIKELY and M5_UNLIKELY macros to compiler.hh. The clang/gcc implementation uses the nonstandard __builtin_expect(). In C++20, new standard attributes can be used instead. We can't use those yet though. Change-Id: Idd2541a7eca0d97ac6c643abbf2910cbc343d7e5 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/34816 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-09-20 01:03:24 +00:00
Gabe Black	8d88d84d95	cpu: Clear out some unnecessary ISA dependence in thread_context.hh. The ISA version of the ISA class isn't used any more. Neither is TheISA::MachInst. Change-Id: I9085ad2b51ba19bf6e5bb17769dd048ac6384fec Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/34821 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-09-20 01:03:00 +00:00
Gabe Black	24e87cb1c5	gpu: Stop using TheISA in the GPU TLB. This class is defined inside the X86ISA namespace, so there's no point in pretending it's generic. Remove TheISA and let the code access what it needs from X86ISA naturally since it's there already. Change-Id: I21b5d2d2b9af6aa0c10ddbb5b3ddca1692188dcc Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/34173 Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Matthew Poremba <matthew.poremba@amd.com> Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com>	2020-09-18 13:48:45 +00:00
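
Note on commit c417b76bad (cpu: Never use an empty byteEnable): the message says an empty byteEnable vector means the same as one where every element is true, and that the patch turns the empty form into the all-true form so downstream LSQs see only one representation. A minimal sketch of that normalization follows. The function name normalizeByteEnable and its signature are hypothetical and chosen for illustration; this is not the code from the patch.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical helper, not gem5's actual implementation.
// An empty mask is treated as "do not mask any byte", so it is expanded
// into an explicit all-true mask of the request size.
std::vector<bool>
normalizeByteEnable(const std::vector<bool> &byte_enable, std::size_t size)
{
    if (byte_enable.empty())
        return std::vector<bool>(size, true);

    // Assumption for this sketch: a non-empty mask covers the request exactly.
    assert(byte_enable.size() == size);
    return byte_enable;
}
```

With this in place, a consumer can index the mask for every byte of the request without first checking whether the mask is empty.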

11010 Commits
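
Note on commit adb13e4fc7 (dev: Stop using the OS page size in the IDE controller): the message describes breaking DMA transactions into chunks so that no single transaction crosses a boundary, with the chunk size now fixed at 4K. The sketch below shows one way such splitting can work. The function splitAtBoundaries, the Addr alias as written here, and the returned (address, length) pairs are assumptions made for illustration; gem5's own ChunkGenerator class has a different interface.

```cpp
#include <algorithm>
#include <cassert>
#include <cstdint>
#include <utility>
#include <vector>

using Addr = std::uint64_t;

// Hypothetical helper, not gem5's ChunkGenerator.
// Splits [start, start + size) into (address, length) pieces such that no
// piece crosses a chunk_size-aligned boundary.
std::vector<std::pair<Addr, Addr>>
splitAtBoundaries(Addr start, Addr size, Addr chunk_size = 4096)
{
    assert(chunk_size > 0);
    std::vector<std::pair<Addr, Addr>> chunks;
    Addr addr = start;
    Addr remaining = size;
    while (remaining > 0) {
        // Bytes left before the next chunk_size-aligned boundary.
        Addr to_boundary = chunk_size - (addr % chunk_size);
        Addr len = std::min(remaining, to_boundary);
        chunks.emplace_back(addr, len);
        addr += len;
        remaining -= len;
    }
    return chunks;
}
```

For example, a 5000-byte transfer starting at address 4000 is split into pieces of 96, 4096, and 808 bytes, each confined to a single 4K-aligned region.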