derek/gem5 - gem5 - Gitea: Git with a cup of tea

derek/gem5

Author	SHA1	Message	Date
Adrian Herrera	a21573cef0	dev-arm: SMMUv3, default CMDQ entries to 128 From Linux 587e6c10a7ce89a5924fdbeff2ec524fbd6a124b, SMMUv3 implementations in 64-bit platforms must report a minimum of 128 CMDQ entries via SMMU_IDR1. Otherwise, the SMMUv3 Linux driver returns -ENXIO. Change-Id: I304aac1b734515b3077003e8d67cc19730afc67f Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/35297 Reviewed-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Maintainer: Giacomo Travaglini <giacomo.travaglini@arm.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-09-29 18:38:48 +00:00
Gabe Black	87baeab20f	x86: Use the common pseudoInst dispatch function. Instead of hand invoking each individual pseudo inst. New instructions added in the future will automatically become available without a lot of extra hand implementation. It also simplifies the x86 ISA description. Change-Id: Ibb671dc2656e61679b7ed016c51a6c879e12910a Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/27789 Reviewed-by: Gabe Black <gabeblack@google.com> Maintainer: Gabe Black <gabeblack@google.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-09-29 11:10:36 +00:00
Giacomo Travaglini	e7f36d30c1	ext: Add timing indications to every TestCase The log_call helper is now accepting a time parameter (dictionary). If the param is not None, the function will fill the timing indications (user and system time) for the TestCase. There are some TestCases whose user time is not of our interest; for example we don't really care about the cpu time of a stdout diff (MatchStdout tests). In those cases the resulting cpu time in the generated JUnit file (results.xml) will be 0. JIRA: https://gem5.atlassian.net/browse/GEM5-548 Change-Id: I53c1b59f8ad93900aeac06197e39189c00a9053c Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/32653 Tested-by: kokoro <noreply+kokoro@google.com>	2020-09-29 09:16:07 +00:00
Gabe Black	b877efa6d4	misc: Update attribute syntax, and reorganize compiler.hh. This change replaces the __attribute__ syntax with the now standard [[]] syntax. It also reorganizes compiler.hh so that all special macros have some explanatory text saying what they do, and each attribute which has a standard version can use that if available and what version of c++ it's standard in is put in a comment. Also, the requirements as far as where you put [[]] style attributes are a little more strict than the old school __attribute__ style. The use of the attribute macros was updated to fit these new, more strict requirements. Change-Id: Iace44306a534111f1c38b9856dc9e88cd9b49d2a Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/35219 Reviewed-by: Daniel Carvalho <odanrc@yahoo.com.br> Maintainer: Gabe Black <gabeblack@google.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-09-28 21:52:59 +00:00
Gabe Black	3c31a214b6	base,mem: Use the standard [[deprecated]] attribute. The [[deprecated]] attribute is now standard, and so we don't need to wrap it in our own macro any more. Change-Id: I363df9a9c6b820dee8c21b1716335c0d15fbc62d Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/35216 Maintainer: Gabe Black <gabeblack@google.com> Reviewed-by: Nikos Nikoleris <nikos.nikoleris@arm.com> Reviewed-by: Daniel Carvalho <odanrc@yahoo.com.br> Tested-by: kokoro <noreply+kokoro@google.com>	2020-09-28 21:52:41 +00:00
Ciro Santilli	325a143d6f	cpu: make ExecSymbol show the symbol in addition to address Before this commit, ExecSymbol would show only the symbol and no address: 0: system.cpu: A0 T0 : @_kernel_flags_le_lo32+6 : mrs x0, currentel After this commit, it shows the symbol in addition to the address: 0: system.cpu: A0 T0 : 0x10 @_kernel_flags_le_lo32+6 : mrs x0, currentel Change-Id: I665802f50ce9aeac6bb9e174b5dd06196e757c60 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/35077 Reviewed-by: Richard Cooper <richard.cooper@arm.com> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-09-28 10:47:38 +00:00
Gabe Black	2c6901120f	base,dev: Use the standard attribute [[noreturn]]. The [[noreturn]] attribute has been standard since c++11, and so we don't (and haven't for a while) need to wrap it in a macro. Change-Id: Ifba62c87c19224bb366e93ebba685a063cc750ce Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/35218 Reviewed-by: Gabe Black <gabeblack@google.com> Maintainer: Gabe Black <gabeblack@google.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-09-28 05:41:45 +00:00
Gabe Black	62aa07c915	arch,base,cpu,dev: Get rid of the M5_DUMMY_RETURN macro. This macro probably would have been defined to "return" in some cases, to be put after a call to a function that doesn't return so that the compiler wouldn't think control would reach the end of a non-void function. It was only ever defined to expand to nothing, and now that [[noreturn]] is a standard attribute, it should never be needed going forward. Change-Id: I37625eab72deeaede77f9347116b9fddd75febf7 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/35217 Reviewed-by: Daniel Carvalho <odanrc@yahoo.com.br> Maintainer: Gabe Black <gabeblack@google.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-09-28 05:41:26 +00:00
Gabe Black	50a0b85367	arm,base,gpu: Use std::make_unique instead of m5::make_unique. Now that we're using c++14, we can just assume that std::make_unique exists. We no longer have to conditionally inject our own version. Change-Id: I5d851afb02dd05c7af93864ffec3b3184f3d4ec8 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/35215 Reviewed-by: Daniel Carvalho <odanrc@yahoo.com.br> Maintainer: Gabe Black <gabeblack@google.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-09-28 05:41:08 +00:00
Earl Ou	ff6a3a6171	base,sim: implement a faster mutex for single thread case This change applies an atomic variable to check if we really need to obtain a mutex, and uses a condition variable to notify. See about 5% improvement in the simulation speed. Change-Id: I7e165987dcb587b27fae90978b9b3fde6f5563ef Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/34915 Reviewed-by: Andreas Sandberg <andreas.sandberg@arm.com> Reviewed-by: Richard Cooper <richard.cooper@arm.com> Reviewed-by: Daniel Carvalho <odanrc@yahoo.com.br> Reviewed-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Maintainer: Gabe Black <gabeblack@google.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-09-28 02:30:41 +00:00
Gabe Black	9de667af6e	sim: Remove check whether the System port is connected. The port will report an error if something tries to use it and it's not connected. If it isn't needed, there's no reason to force users to hook something up to it just to satisfy the check. Change-Id: I0668b8a86c8cb323aba51670fb7914d35acc5198 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/34815 Maintainer: Gabe Black <gabeblack@google.com> Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Nikos Nikoleris <nikos.nikoleris@arm.com> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Andreas Sandberg <andreas.sandberg@arm.com>	2020-09-25 23:51:49 +00:00
Gabe Black	dd130fec92	base: When creating an ELF file memory image, ignore empty segments. Sometimes ELF files have segments in them which are marked as loadable, but which actually have zero size in memory. When setting up a memory image we should drop those to avoid confusing other code which tries to find the footprint of a memory image. No part of these segments, including their starting address or ending address, need to actually land on top of memory since they don't actually contain any data. Change-Id: If8b61d10db139e0f688b6ceabcb8e6a898557469 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/35156 Reviewed-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Gabe Black <gabeblack@google.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-09-25 23:50:59 +00:00
Richard Cooper	fe2817270e	ext: Monkeypatch os.waitpid to extract CPU time from subprocess Added utility class `TimedWaitPID` which monkey-patches os.waitpid() with a functor that has the same signature, but calls os.wait4() instead. This allows the process's user and system CPU time to be obtained from the OS when using APIs (such as subprocess) which use os.waitpid() internally. The process CPU time is stored within the functor and can be read back later by calling TimedWaitPID.get_time_for_pid(). JIRA: https://gem5.atlassian.net/browse/GEM5-548 Change-Id: I9ebe9ca1241a4f28c90ad31f672f32ac52786664 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/32652 Maintainer: Bobby R. Bruce <bbruce@ucdavis.edu> Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Hoa Nguyen <hoanguyen@ucdavis.edu>	2020-09-25 14:12:09 +00:00
Bobby R. Bruce	63e9699256	misc: Merge branch 'release-staging-v20.1.0.0' into develop Change-Id: I656a2d9512b1822a7e8d82606da7a0a5504d6820	2020-09-24 22:28:11 -07:00
Gabe Black	d872fe2f17	base: Minor cleanup of the ChunkGenerator. Minor style fixes, switched to Addr for some types so they'll definitely be large enough. Change-Id: I985004116c48ce6fb236c04e04fe54ed49a68277 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/34177 Reviewed-by: Steve Reinhardt <stever@gmail.com> Maintainer: Gabe Black <gabeblack@google.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-09-25 04:50:00 +00:00
Gabe Black	bcc797a2cb	fastmodel: Update the IRIS ThreadContext base class. The syscall() method has been removed, and HTM related methods have been added. Change-Id: I796c1a554bfd4b1ee01a62c9c7ad403dd699cc0f Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/35038 Reviewed-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Maintainer: Giacomo Travaglini <giacomo.travaglini@arm.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-09-24 20:08:13 +00:00
Gabe Black	76bceca2e1	arm,fastmodel: Update the VExpressFastModel to use ArmInterruptPins. The HDLCD device now uses an ArmInterruptPin instead of a GIC and interrupt number parameter. Change-Id: I31122e66a1c18f61592f3dca214ee057baad8f88 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/35039 Reviewed-by: Gabe Black <gabeblack@google.com> Maintainer: Gabe Black <gabeblack@google.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-09-24 20:07:59 +00:00
Gabe Black	281afe2be0	fastmodel: Update for the isa_traits.hh changes. arch/arm/isa_traits.hh no longer has using namespace ArmISA, and also no longer directly or indirectly provides interrupt number related constants. Change-Id: Ieda31d1db4f85632a555b2f72ee8bff0aa159eee Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/35037 Reviewed-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Maintainer: Giacomo Travaglini <giacomo.travaglini@arm.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-09-24 20:07:48 +00:00
Kyle Roarty	45f57ff2c2	gpu-compute: set exec_mask for permute,bpermute instructions This change sets gpuDynInst->exec_mask for permute and bpermute instructions, fixing a bug where they would never write their data. permute and bpermute instructions are load instructions that write to a VGPR. Because of that, they use gpuDynInst->exec_mask when checking what lanes should write to the VGPR. gpuDynInst->exec_mask gets set to wf->execMask() as that is what other load instructions that write to VGPRs do. Change-Id: Ie443283488cbd2ab9c17fc255e7cc44418353419 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/35036 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Reviewed-by: Anthony Gutierrez <anthony.gutierrez@amd.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Anthony Gutierrez <anthony.gutierrez@amd.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-09-24 17:56:04 +00:00
Kyle Roarty	347d7644eb	gpu-compute: replace uint32_t* casts with bits API calls The uint32_t* casting was challenging to fully understand what was being done at a glance. Replaced with calls to various bits functions as it's functionally equivalent and much more clear. This also fixes a segfault in GPUInitAbi DPRINTFs from a mis-typed uint32_t* cast. Change-Id: Id5d1863942848dd7a9e5e17e8180c33adbc72f15 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/34677 Reviewed-by: Anthony Gutierrez <anthony.gutierrez@amd.com> Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Anthony Gutierrez <anthony.gutierrez@amd.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-09-24 14:53:16 +00:00
Adrian Herrera	87328c2fc1	dev-arm: FVPBasePwrCtrl, fix vector resizing (1) ThreadContexts are registered into System in BaseCPU::init. (2) FVPBasePwrCtrl state is resized based on registered ThreadContexts in FVPBasePwrCtrl::init. FVPBasePwrCtrl::init may be called before BaseCPU::init based on the model names alphabetical order, leading to segmentation faults. To fix this, (2) is now carried out in FVPBasePwrCtrl::startup. Change-Id: Ica6c5b7448da556d61aee53f8777a709fcad2212 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/35075 Reviewed-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Maintainer: Giacomo Travaglini <giacomo.travaglini@arm.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-09-24 14:01:09 +00:00
Gabe Black	2e5ffdb7f3	cpu: Use cprintf and C++ type magic to get rid of a THE_ISA. It should be fine to let operator overloading take care of figuring out how to print the ExtMachInst type for a given ISA. Change-Id: I173fd9f49013d92191118775d20344219a69337e Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/34822 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Maintainer: Gabe Black <gabeblack@google.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-09-23 12:40:42 +00:00
Gabe Black	dcffee005e	scons: Adjust the version of C++ to C++14. Change-Id: I318d337fc61bca0ae40413c23ee36d59d45a79bc Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/34820 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Daniel Carvalho <odanrc@yahoo.com.br> Reviewed-by: Andreas Sandberg <andreas.sandberg@arm.com> Reviewed-by: Richard Cooper <richard.cooper@arm.com> Reviewed-by: Bobby R. Bruce <bbruce@ucdavis.edu> Reviewed-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-09-23 08:35:26 +00:00
Earl Ou	b86461ce94	systemc: avoid mutex lock in non async cases Avoid acquiring a mutex lock in case there is no async update in the scheduler. This helps increasing simulation speed by about 4%. Change-Id: I971c7bf1a1eeb46208eeee6e5da6385c907092b3 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/34695 Reviewed-by: Earl Ou <shunhsingou@google.com> Maintainer: Earl Ou <shunhsingou@google.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-09-23 04:05:03 +00:00
Giacomo Travaglini	2035ebfbba	dev-arm: Implement GICv4.1 GICD_TYPER2 as RES0 If GICv4.1 is not implemented (our case) the register should be treated as RES0 Change-Id: Ia60f6dce9741c34bf167805f60c3fc8bf0897510 Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Reviewed-by: Ciro Santilli <ciro.santilli@arm.com> Reviewed-by: Andreas Sandberg <andreas.sandberg@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/34875 Maintainer: Andreas Sandberg <andreas.sandberg@arm.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-09-22 13:19:43 +00:00
Giacomo Travaglini	47aa52ed17	arch-arm: TLBI ALLE2IS should broadcast to the IS domain This was implemented as a normal ALLE2 hence affecting the current PE only Change-Id: Ib369dd5a4b738daf96a01b5535d7481a97bb3730 Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Reviewed-by: Nikos Nikoleris <nikos.nikoleris@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/34795 Tested-by: kokoro <noreply+kokoro@google.com>	2020-09-22 08:17:28 +00:00
Ciro Santilli	b3719766f5	util: add pkg-config to ubuntu all-dependencies Dockerfiles Without this, HDF5 is not built, e.g. a run such as http://jenkins.gem5.org/job/Nightly/68/console contains: Checking for hdf5-serial using pkg-config... pkg-config not found Checking for hdf5 using pkg-config... pkg-config not found Checking for H5Fcreate("", 0, 0, 0) in C library hdf5... (cached) no Warning: Couldn't find any HDF5 C++ libraries. Disabling HDF5 support. This is done to increase coverage a bit, and serve as dependency documentation to users. Change-Id: Ibf820a3aa76c29eeee1201646924ee181615a162 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/34777 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-09-22 06:53:22 +00:00
Bobby R. Bruce	92e8a871f3	misc: Merge branch 'release-staging-v20.1.0.0' into develop Change-Id: I1b33eeda67e7641ab71935e140fd24d4735be596	2020-09-21 18:48:12 -07:00
Bobby R. Bruce	b45bbef206	tests,base: Fixed unittests for .fast unittests.fast, unittests.prof, and unittests.perf had failing tests due to the stripping of asserts via compiler optimization. This patch alters the unittests to skip these tests when TRACING_ON == 0. Change-Id: I2d4ab795ecfc2c4556b5eb1877635409d0836ec6 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/34898 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-09-22 01:46:22 +00:00
Gabe Black	0ad5d1edc5	arch,cpu,sim: Route system calls through the workload. System calls should now be requested from the workload directly and not routed through ExecContext or ThreadContext interfaces. That removes a major special case for SE mode from those interfaces. For now, when the SE workload gets a request for a system call, it dispatches it to the appropriate Process object. In the future, the ISA specific Workload subclasses will be responsible for handling system calls and not the Process classes. For simplicity, the Workload syscall() method is defined in the base class but will panic everywhere except when SEWorkload overrides it. In the future, this mechanism will turn into a way to request generic services from the workload which are not necessarily system calls. For instance, it could be a way to request handling of a page fault without having to have another PseudoInst just for that purpose. Change-Id: I18d36d64c54adf4f4f17a62e7e006ff2fc0b22f1 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/33282 Reviewed-by: Matthew Poremba <matthew.poremba@amd.com> Maintainer: Gabe Black <gabeblack@google.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-09-20 07:26:42 +00:00
Gabe Black	3293926413	sim: Create a Workload object for SE mode. The workload object is still optional for the sake of compatibility, even though it probably shouldn't be in the long term. If a simulation is just a collection of components with nothing in particular running on it, for instance driven by a traffic generator, should it even have a System object in the first place? Change-Id: I8bcda72bdfa3730248226fb62f0bba9a83243d95 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/33278 Reviewed-by: Matthew Poremba <matthew.poremba@amd.com> Maintainer: Gabe Black <gabeblack@google.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-09-20 07:26:22 +00:00
Gabe Black	adb13e4fc7	dev: Stop using the OS page size in the IDE controller. This size was used to break up DMA transactions so that a single transaction would not cross a page boundary. This was because on Alpha, there was an actual page table which translated between PCI and DMA address spaces. On all currently implemented systems, the mapping is simply to add a scalar offset, so it's not possible for a legal region of memory to be contiguous in one space but not in the other. Additionally, if it was possible for there to be a mismatch, it was only coincidence that Alpha used a page table which had the same sized pages as it normally used. There is no requirement that there even would be fixed sized pages in the first place. To avoid this artificial dependency between the IDE controller and the ISA, this change simply changes the chunk size for DMA accesses to 4K. That's the page size at least on x86 and probably other architectures, and will be a pretty close approximation of the previous behavior. It's possible that even having this chunking in the first place is unnecessary and functionally useless, but there are some checks which happen between chunks, and changing how big they are would change the frequency of those checks. For instance, the controller/disk may not notice in the same amount of time if a DMA was cancelled somehow. Change-Id: I1ec840d1f158c3faa31ba0184458b69bf654c252 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/34178 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Gabe Black <gabeblack@google.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-09-20 07:26:04 +00:00
Gabe Black	a83316ec00	scons: Increase the minimum clang version to 3.9. This matches what's documented elsewhere. We need version 3.4 to support c++14, but we support only as far back as 3.9. Also, the argument to set c++14 as the standard is different in 3.4 and earlier (-std=c++1y), so it makes life slightly easier to move past it to 3.9. Change-Id: I66fa578dd3222c62907496a888f8068ed0918c7b Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/34819 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-09-20 01:03:46 +00:00
Gabe Black	c8cde1fefa	base: Use M5_UNLIKELY with conditional DPRINTF family functions. Most DPRINTFs will be skipped over most of the time, and when they aren't they'll already have overhead from string handling, output to the console and/or a file, etc, which will drown out the behavior of a branch. Change-Id: I5475d7b5add63b44f60c0a1d46b4b14e6bf30fd3 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/34818 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-09-20 01:03:39 +00:00
Gabe Black	89ffa84de6	base: Use M5_UNLIKELY for conditional panic, etc., macros. panic_if and fail_if should happen at most once in any given simulation, and warn_if, etc., should still not happen most of the time. Change-Id: Iaa6cb03c11b86d84f51cc4738efb8f203de4201c Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/34817 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-09-20 01:03:31 +00:00
Gabe Black	2f6a770ccf	base: Add M5_LIKELY and M5_UNLIKELY macros to compiler.hh. The clang/gcc implementation uses the nonstandard __builtin_expect(). In C++20, new standard attributes can be used instead. We can't use those yet though. Change-Id: Idd2541a7eca0d97ac6c643abbf2910cbc343d7e5 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/34816 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-09-20 01:03:24 +00:00
Gabe Black	8d88d84d95	cpu: Clear out some unnecessary ISA dependence in thread_context.hh. The ISA version of the ISA class isn't used any more. Neither is TheISA::MachInst. Change-Id: I9085ad2b51ba19bf6e5bb17769dd048ac6384fec Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/34821 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-09-20 01:03:00 +00:00
Gabe Black	24e87cb1c5	gpu: Stop using TheISA in the GPU TLB. This class is defined inside the X86ISA namespace, so there's no point in pretending it's generic. Remove TheISA and let the code access what it needs from X86ISA naturally since it's there already. Change-Id: I21b5d2d2b9af6aa0c10ddbb5b3ddca1692188dcc Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/34173 Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Matthew Poremba <matthew.poremba@amd.com> Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com>	2020-09-18 13:48:45 +00:00
Ciro Santilli	6bc2111c96	tests: cleanup all SE tests previously moved to gem5-resources The move was done at: https://gem5-review.googlesource.com/c/public/gem5-resources/+/32074 All files keep exact same name, or are obvious renames like underscore to -. threads/ is the only non obvious and remaps to src/simple/std_thread.cpp Only m5-exit is left because it does squashfs generation which wasn't yet moved. Change-Id: I72ad104c9311c2f81af49458bdd44e24a6bafc0a Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/34476 Reviewed-by: Bobby R. Bruce <bbruce@ucdavis.edu> Maintainer: Bobby R. Bruce <bbruce@ucdavis.edu> Tested-by: kokoro <noreply+kokoro@google.com>	2020-09-18 09:09:16 +00:00
Gabe Black	15d60a0e7d	systemc: Add a missing override. A recent change accidentally left off the override, upsetting gcc. Change-Id: I78cf1969aa6ac462539a2793a8a91dea32002f3a Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/34756 Maintainer: Gabe Black <gabeblack@google.com> Reviewed-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-09-18 07:25:28 +00:00
Kyle Roarty	be3bcd1629	gpu-compute: Fix deadlock in fetch_unit after branch instruction The following deadlock was occuring in fetch_unit w/timingSim: 1. exec() is called, a wave is ready to fetch, so it sets pendingFetch 2. A packet is sent to ITLB to fetch for that wave 3. The wave executes a branch, causing the fetch buffer to be cleared 4. The packet is handled, and fetch() is called. However, because the fetch buffer was cleared, it returns doing nothing. 5. exec() gets called again, but the wave will never be scheduled to fetch, as pendingFetch is still set to true. This patch clears pendingFetch (and dropFetch) before returning in fetch() when the fetch buffer has been cleared. dropFetch needed to be cleared otherwise gem5 would crash. Change-Id: Iccbac7defc4849c19e8b17aa2492da641defb772 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/34555 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Reviewed-by: Anthony Gutierrez <anthony.gutierrez@amd.com> Maintainer: Anthony Gutierrez <anthony.gutierrez@amd.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-09-17 21:24:19 +00:00
Bobby R. Bruce	5a20525e75	util: Removed libelf-dev dep from Dockerfiles The libelf-dev dependency is no longer required in our Dockerfiles. This reverts commit `0cf67fb362`, https://gem5-review.googlesource.com/c/public/gem5/+/33596. The libelf-dev dependency has been kept for the "all_dependencies" Dockerfiles. The corresponding Docker images have been built and uploaded to: https://gcr.io/gem5-test. Change-Id: Iacbd8240f69d476ad3a649baaccb6b85fec2487c Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/34676 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-09-17 03:13:21 +00:00
Gabe Black	64a7bcd2c4	util: Add an unmap_m5_mem() function to the m5 util's m5_mmap.*. This cleans up the mmap-ing. This is primarily used for testing since the tests may end up mmap-ing the backing file many times, and we don't want all those earlier mappings lying around. This change also makes the original mmap-ing function close the file it opens, since the man page for mmap explicitly says you can do that and not lose the mapping. That means we don't have to keep track of the file descriptor which corresponds to the mmap-ed file when we do the unmapping, and it's slightly cleaner in general. Change-Id: I90e3e755cebf3d03e2bf644adf8ef3e157236172 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/27750 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Pouya Fotouhi <pfotouhi@ucdavis.edu> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-09-17 02:30:26 +00:00
Gabe Black	bd13e8e206	util: Add a "semi" call type unit test to the m5 utility. This is largely similar to the "inst" call type test since it's also another form of illegal instruction, but there's more checking to do since the way arguments are passed is more complex. Change-Id: Ie61bb4da8befab579c3044fd2ddee753926de174 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/27749 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-09-17 02:29:56 +00:00
Earl Ou	ae37bdd4f4	systemc: self-manage TimeSlot in Scheduler TimeSlot is new and deleted frequently. Having a recycling memory manager can help saving the time spent new and delete. Tested and see about 4% improvement in simulation speed. Change-Id: I0ab173168336a883b85f768d7fdf07a936a14d69 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/34615 Reviewed-by: Gabe Black <gabeblack@google.com> Maintainer: Gabe Black <gabeblack@google.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-09-17 02:26:34 +00:00
Earl Ou	1bca30c95b	systemc: use list instead of map in scheduler The queue in systemC scheduler is implemented as a std::map. This provides the best big-O solution. However, most of simulation usecases has very small number of pending events. This is expected as we usually only trigger a few new events after some events are processed. In such scenario, we should optimize for insert/erase instead of search. This change use std::list instead of std::map. As a proof, we can find that gem5's original event_queue is also implemented as a list instead of tree. We see 5% speed improvement with the example provided by Matthias Jung: https://gist.github.com/myzinsky/557200aa04556de44a317e0a10f51840 Change-Id: I75c30df9134e94df42fd778115cf923488ff5886 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/34515 Reviewed-by: Gabe Black <gabeblack@google.com> Maintainer: Gabe Black <gabeblack@google.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-09-17 02:26:05 +00:00
Bobby R. Bruce	9a39ac876e	misc: Merge branch 'release-staging-v20.1.0.0' into develop Change-Id: I8c3277af7903f0b055b26e497139455a03678524	2020-09-16 17:16:17 -07:00
Jason Lowe-Power	f2d7005f60	misc: Add Matt Poremba as GPU maintainer Change-Id: I90494955b6db628695ef8a42111977decba27618 Signed-off-by: Jason Lowe-Power <jason@lowepower.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/34655 Maintainer: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Matthew Poremba <matthew.poremba@amd.com> Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Reviewed-by: Matthew Poremba <matthew.poremba@amd.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-09-16 23:16:25 +00:00
Jason Lowe-Power	914d6af8a8	configs: Add special case in MemConfig SimpleMemory doesn't implement a full MemCtrl interface. Thus, like the NVM and HMC memories, we need to add a special case to MemConfig.py. The --mem-type command line option now works for SimpleMemory and all of the DRAM interfaces (it does not work for the NVM interfaces, though). Issue-on: https://gem5.atlassian.net/browse/GEM5-777 Change-Id: I6d60649215be324bdd2a104b1976752f936c960e Signed-off-by: Jason Lowe-Power <jason@lowepower.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/34595 Reviewed-by: Daniel Carvalho <odanrc@yahoo.com.br> Reviewed-by: Nikos Nikoleris <nikos.nikoleris@arm.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-09-16 21:07:19 +00:00
Gabe Black	a10c573755	arch,cpu: Get rid of the IsMemRef StaticInst flag. A comment at the top of StaticInstFlags.py says that if IsMemRef is set, exactly one of IsStore or IsLoad will be set. That's not strictly true since IsAtomic may be set as well, in which case neither IsStore or IsLoad will be set (in one example I found). The isMemRef accessor still exists, and now just ors the IsStore, IsLoad, and IsAtomic flags. Change-Id: Ic5ff104da68978273977a6eff2abab5dd0ae7fda Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/33744 Reviewed-by: Gabe Black <gabeblack@google.com> Maintainer: Gabe Black <gabeblack@google.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-09-16 12:13:08 +00:00

1 2 3 4 5 ...

16073 Commits