derek/gem5

Author	SHA1	Message	Date
Ciro Santilli	6ecf110b06	arch-arm: inform bootloader of kernel position with a register Before the commit, the bootloader had a hardcoded entry point that it would jump to. However, the Linux kernel arm64 v5.8 forced us to change the kernel entry point because the required memory alignment has changed at: https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/commit/?h=v5.8&id=cfa7ede20f133cc81cef01dc3a516dda3a9721ee Therefore the only way to have a single bootloader that boots both pre-v5.8 and post-v5.8 kernels is to pass that information from gem5 to the bootloader, which we do in this patch via registers. This approach was already used by the 32-bit bootloader, which passed that value via r3, and we try to use the same register x3 in 64-bit. Since we are now passing this information, this patch also removes the hardcoding of DTB and cpu-release-addr, and also passes those values via registers. We store the cpu-release-addr in x5 as that value appears to have a function similar to flags_addr, which is used only in 32-bit arm and gets stored in r5. This commit renames atags_addr to dtb_addr, since both are mutually exclusive, and serve a similar purpose, DTB being the newer recommended approach. Similarly, flags_addr is renamed to cpu_release_addr, and it is moved from ArmSystem into ArmFsWorkload, since it is not an intrinsic system property, and should be together with dtb_addr instead. Before this commit, flags_addr was being set from FSConfig.py and configs/example/arm/devices.py to self.realview.realview_io.pio_addr + 0x30. This commit moves that logic into RealView.py instead, and sets the flags address 8 bytes before the start of the DTB address. 
JIRA: https://gem5.atlassian.net/browse/GEM5-787 Change-Id: If70bea9690be04b84e6040e256a9b03e46710e10 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/35076 Reviewed-by: Andreas Sandberg <andreas.sandberg@arm.com> Maintainer: Andreas Sandberg <andreas.sandberg@arm.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-01-13 11:32:19 +00:00
Daniel R. Carvalho	eb4382af0e	base: Add documentation to flags.hh Add documentation to the Flags class. Use this opportunity to rename some arguments to make their intention clearer. Finally, the constructors have been merged using a default value of 0. Change-Id: I924b1d5c20a3e2066be64ab124ae1a5d96d4b3bf Signed-off-by: Daniel R. Carvalho <odanrc@yahoo.com.br> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/38735 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Bobby R. Bruce <bbruce@ucdavis.edu> Maintainer: Bobby R. Bruce <bbruce@ucdavis.edu> Tested-by: kokoro <noreply+kokoro@google.com>	2021-01-13 11:15:18 +00:00
Daniel R. Carvalho	d7b09c8fce	base: Remove Flags<U> assignment Currently unused and broken. Since these are templated classes, and _flags is private, the assignment is a compilation error. Furthermore, assignment of flags of different types is likely undefined behavior. Change-Id: I8430045c42c003efc74e343cc5b4a4350bc2ad92 Signed-off-by: Daniel R. Carvalho <odanrc@yahoo.com.br> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/38713 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Andreas Sandberg <andreas.sandberg@arm.com> Reviewed-by: Bobby R. Bruce <bbruce@ucdavis.edu> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-01-13 11:15:18 +00:00
Daniel R. Carvalho	4a503306a7	base: Assert Flags' type is unsigned Operations rely on the use of unsigned integers. Change-Id: I825a88f81b54577585976d6558b1409870897721 Signed-off-by: Daniel R. Carvalho <odanrc@yahoo.com.br> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/38712 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Bobby R. Bruce <bbruce@ucdavis.edu> Tested-by: kokoro <noreply+kokoro@google.com>	2021-01-13 11:15:18 +00:00
Daniel R. Carvalho	0c3dcfd314	base: Remove flag from allFlags on destruction When a flag is destroyed it must be removed from the list containing all flags. Use this opportunity to remove "using namespace std" since it is barely used. Change-Id: I201371a770c56e11b92532e146d577c6ecb29d34 Signed-off-by: Daniel R. Carvalho <odanrc@yahoo.com.br> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/38709 Reviewed-by: Andreas Sandberg <andreas.sandberg@arm.com> Reviewed-by: Gabe Black <gabe.black@gmail.com> Maintainer: Bobby R. Bruce <bbruce@ucdavis.edu> Tested-by: kokoro <noreply+kokoro@google.com>	2021-01-13 11:15:18 +00:00
Daniel R. Carvalho	23f8efc73b	base: Remove negation operator in Flag There is already a bool conversion operator, so there is no need to provide a negation operator. Change-Id: If5f99f8a0bb1707c115d139417aedd47bd162963 Signed-off-by: Daniel R. Carvalho <odanrc@yahoo.com.br> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/38708 Reviewed-by: Gabe Black <gabe.black@gmail.com> Maintainer: Bobby R. Bruce <bbruce@ucdavis.edu> Tested-by: kokoro <noreply+kokoro@google.com>	2021-01-13 11:15:18 +00:00
Daniel R. Carvalho	ba284a13dd	base: Fix uninitialized variable in Flag This was uninitialized, and was breaking expected values under certain situations. Change-Id: If51ab6ae038c7c397bc83de1c73af348c1db4ef8 Signed-off-by: Daniel R. Carvalho <odanrc@yahoo.com.br> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/38707 Reviewed-by: Bobby R. Bruce <bbruce@ucdavis.edu> Reviewed-by: Andreas Sandberg <andreas.sandberg@arm.com> Maintainer: Bobby R. Bruce <bbruce@ucdavis.edu> Tested-by: kokoro <noreply+kokoro@google.com>	2021-01-13 11:15:18 +00:00
Daniel R. Carvalho	3b03eaab9c	mem-cache: Fix update of useful prefetches The probe notification must be parsed on every hit, even if the prefetcher is set not to generate prefetches on accesses. This fixes the calculation of useful prefetches. Change-Id: Iff298f7bea11013f411f4ba39dba705fd81a0cd4 Signed-off-by: Daniel R. Carvalho <odanrc@yahoo.com.br> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/38177 Reviewed-by: Bobby R. Bruce <bbruce@ucdavis.edu> Maintainer: Bobby R. Bruce <bbruce@ucdavis.edu> Tested-by: kokoro <noreply+kokoro@google.com>	2021-01-13 11:12:23 +00:00
Gabe Black	c6933a27da	misc: Fix missing includes. Change-Id: I545ff03041e8fe66dc489c6aa95c009e54df0970 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/38995 Reviewed-by: Gabe Black <gabe.black@gmail.com> Maintainer: Gabe Black <gabe.black@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-01-13 08:55:59 +00:00
Gabe Black	a7f3c5aad2	base: Remove begin() and end() from CircleBuf. These functions return iterators which are inconsistent with the usage model for this type. It should be accessed using the peek, push, and pop methods and not iterators. If you need a class with iterators which is oriented around accessing individual elements at a time, the CircularQueue type is likely a better choice. Change-Id: I9f37eab12e490b63d870d378a91f601dad353f25 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/38998 Reviewed-by: Daniel Carvalho <odanrc@yahoo.com.br> Reviewed-by: Andreas Sandberg <andreas.sandberg@arm.com> Maintainer: Gabe Black <gabe.black@gmail.com> Maintainer: Andreas Sandberg <andreas.sandberg@arm.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-01-13 02:33:58 +00:00
Gabe Black	43114ad1dd	dev: Use regular atomic accesses for DMA in bypass mode. These are now accelerated with backdoor accesses and should be at least as fast as functional accesses. This removes a dependency on port proxies, and also stops the HDLCD from using functional accesses. Change-Id: I5e959288eb533d09cffa7b79938aa2f61e4aff7d Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/38720 Reviewed-by: Andreas Sandberg <andreas.sandberg@arm.com> Maintainer: Andreas Sandberg <andreas.sandberg@arm.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-01-13 01:29:32 +00:00
Gabe Black	c5da197679	dev: Teach the DmaPort to use atomic memory backdoors. This is implemented similarly to the NonCachingSimpleCPU, except that both the normal atomic and noncaching atomic behaviors are implemented by the same class. The sendDma function now dispatches to a method which implements one or the other behavior since that function was getting too big and complex. Change-Id: I7972971ef41d1373424e587cf67c8444d50de748 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/38719 Reviewed-by: Andreas Sandberg <andreas.sandberg@arm.com> Maintainer: Andreas Sandberg <andreas.sandberg@arm.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-01-13 01:29:10 +00:00
Gabe Black	93c0fdfb12	dev: Generate DMA packets as needed. Instead of generating all of the DMA packets when a request is initiated, keep track of the necessary properties and generate them as needed. The primary benefit of this approach is that if we don't actually need packets, for instance if we can satisfy the request using a memory backdoor, we can just skip them. Otherwise we'll have wasted time creating them, and then have to clean them up. Change-Id: I04d399fb7bce1ff9a44979c311be356baf2db555 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/38717 Reviewed-by: Andreas Sandberg <andreas.sandberg@arm.com> Maintainer: Andreas Sandberg <andreas.sandberg@arm.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-01-13 01:28:46 +00:00
Gabe Black	65828b2735	base: Re-implement CircleBuf without using CircularQueue. CircularQueue provides iterators which make it easier for users to interact with it and helps abstract its internal state, but at the same time it prevents standard algorithms like std::copy from recognizing opportunities to use bulk copies to speed up execution. It also hides the seams when wrapping around the buffer happens which std::copy wouldn't know how to handle. CircleBuf seems to be intended as a simpler type which doesn't hold complex entries like the CircularQueue does, and instead just acts as a wrap around buffer, like the name suggests. This change reimplements it to not use CircularQueue, and to instead use an underlying vector. The way internal indexing is handled in CircularQueue was simplified recently, and using the same scheme here means that this code is actually not much more verbose than it was before. It also intrinsically handles indexing and bulk accesses, and uses std::copy_n in a way that lets it recognize and take advantage of contiguous storage and bulk copies. Change-Id: I78e7cfe174c52f60c95c81e5cd3d71c6052d4d41 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/38896 Reviewed-by: Daniel Carvalho <odanrc@yahoo.com.br> Maintainer: Gabe Black <gabe.black@gmail.com> Maintainer: Andreas Sandberg <andreas.sandberg@arm.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-01-13 01:28:09 +00:00
Gabe Black	28f8a39726	base: Add a setNext method to the ChunkGenerator. This method lets you stretch the current chunk, if you want to skip over some of the upcoming chunks since you've already handled their bytes. For instance, if you were iterating over pages in a range of virtual addresses, you might be able to handle multiple page sized chunks at once if they were represented by a single large sized page table entry. This mechanism would let you move past all the pages you had just handled without having to walk through them all one by one. Change-Id: I7d962f548947b77f0aa1b725036dbcc9e1b28659 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/38718 Reviewed-by: Gabe Black <gabe.black@gmail.com> Maintainer: Gabe Black <gabe.black@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-01-09 13:18:48 +00:00
Kyle Roarty	f6ec145fc0	gpu-compute: Fix FLAT insts decrementing lgkm count early FLAT instructions used to decrement lgkm count on execute, while the GCN3 ISA specifies that lgkm count should be decremented on data being returned or data being written. This patch changes it so that lgkm is decremented after initiateAcc (for stores) and after completeAcc (for loads) to better reflect the ISA definition. This fixes a bug where waitcnts would be satisfied even though the memory access wasn't completed, which led to instructions using the wrong data. Change-Id: I596cb031af9cda8d47a1b5e146e4a4ffd793d36c Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/38696 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Reviewed-by: Matthew Poremba <matthew.poremba@amd.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-01-07 17:12:31 +00:00
Adrian Herrera	00dd0d7b3d	dev-arm: SMMUv3, enable interrupt interface Users can set "irq_interface_enable" to allow software to program SMMU_IRQ_CTRL and SMMU_IRQ_CTRLACK. This is required to boot Linux v5.4+ in a reasonable time. Notice the model does not implement architectural interrupt sources, so no assertions will happen. Change-Id: Ie138befdf5a204fe8fce961081c575c2166e22b9 Signed-off-by: Adrian Herrera <adrian.herrera@arm.com> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/38555 Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Andreas Sandberg <andreas.sandberg@arm.com> Maintainer: Andreas Sandberg <andreas.sandberg@arm.com>	2021-01-07 09:07:09 +00:00
Gabe Black	131ae4a2eb	dev: Remove the return type from DmaPort::dmaAction. This function had a comment claiming that returning an arbitrary request from the call was necessary for page table walker statistics, but looking at the actual code, the return type was never used. Also returning whatever the last request happens to be seems arbitrary, and a bad boundary for modularization. The page table walker should not depend on the internal implementation of the DMA port. Change-Id: I00281fbaf6aeb85b15baf54f3d4a23ca1ac72b8b Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/38716 Reviewed-by: Andreas Sandberg <andreas.sandberg@arm.com> Maintainer: Andreas Sandberg <andreas.sandberg@arm.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-01-06 06:33:50 +00:00
Gabe Black	e5c8f03b21	dev: Fix style in the pixel pump base class. Change-Id: I8aa25911b367d36d6862780b39781f13724e79dc Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/38715 Reviewed-by: Daniel Carvalho <odanrc@yahoo.com.br> Reviewed-by: Andreas Sandberg <andreas.sandberg@arm.com> Maintainer: Andreas Sandberg <andreas.sandberg@arm.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-01-06 04:26:33 +00:00
Gabe Black	2bf116e859	dev: Style fixes in the ARM HDLCD device. Change-Id: I230e0e0db879a56bc23c3ed439b9722e76cdd8e4 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/38484 Reviewed-by: Andreas Sandberg <andreas.sandberg@arm.com> Reviewed-by: Daniel Carvalho <odanrc@yahoo.com.br> Maintainer: Andreas Sandberg <andreas.sandberg@arm.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-01-06 04:26:10 +00:00
Cui Jin	8c3658939d	arch-riscv: fix the wrong cause register setting The most significant bit should be set based on interrupt or exception. I assume in the current RV64 implementation the bit should be the 63rd, rather than the 31st. This causes the interrupt handler to get an invalid cause code. A second, minor bug is that mpie is supposed to be set to the value of the old mie. The fix is verified in FS. Jira Issue: https://gem5.atlassian.net/browse/GEM5-858 Change-Id: I1cc166c254b35f5c1acb3f5774c43149c61cc37a Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/38755 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-01-06 01:18:15 +00:00
Gabe Black	3059c6df5c	arch: Add a mechanism to pad the src or dest reg index arrays. ARM reaches in and pads out the source register index list behind the parser's back to force dest regs to also be sources in case an instruction fails predication and needs to forward the original register values. It shouldn't be hacking up these values in that way, but since it is, this will let it continue to do so while still fitting in the new system where each instruction allocates its src/dest reg index arrays to size. Change-Id: Ia296be9f63123f18f6cdc0d3bb1314d33e759b3a Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/38380 Reviewed-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Maintainer: Giacomo Travaglini <giacomo.travaglini@arm.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-01-05 07:33:01 +00:00
Gabe Black	27a41d6ef6	dev: Cache the cacheLineSize in the DMA read FIFO. This is a minor simplification which decouples the FIFO from the system object at run time, although it does need to read the cache line size out at construction time. Change-Id: I57d96a676b9604663b6c9ed7c662640f507c5305 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/38482 Reviewed-by: Andreas Sandberg <andreas.sandberg@arm.com> Reviewed-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Reviewed-by: Daniel Carvalho <odanrc@yahoo.com.br> Maintainer: Andreas Sandberg <andreas.sandberg@arm.com> Tested-by: kokoro <noreply+kokoro@google.com>	2021-01-05 02:09:25 +00:00
Gabe Black	9e97dbe8c8	dev: Make DMA devices use their own ports for functional accesses. DMA devices already have ports they use for non-functional accesses. We can just attach a port proxy to that instead of getting one from the system object. Change-Id: I5e9adee43c7fe07b4c90978dbb7ec71468caadbb Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/38481 Maintainer: Gabe Black <gabe.black@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Alexandru Duțu <alexandru.dutu@amd.com>	2020-12-31 10:19:12 +00:00
Gabe Black	f16bfed9ea	dev: Style fixes in src/dev/dma_device.(cc\|hh). Change-Id: Ie72f30d95e7f889f9a440d0fed57a5940747b40d Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/38480 Reviewed-by: Daniel Carvalho <odanrc@yahoo.com.br> Maintainer: Gabe Black <gabe.black@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-12-31 10:18:55 +00:00
Gabe Black	b3dd0d4f99	base: Style fixes in the CircleBuf and Fifo classes. Change-Id: Ia08548027973e2b18e09bc3f05a6498855bdd7f7 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/38479 Maintainer: Gabe Black <gabe.black@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Daniel Carvalho <odanrc@yahoo.com.br>	2020-12-31 10:18:46 +00:00
Gabe Black	fa36e7d560	base,cpu: Simplify the CircularQueue class significantly. This class had been trying to keep all indices within the modulus of the queue size, and to use all elements in the underlying storage by making the empty and full conditions alias, differentiated by a bool. To keep track of the difference between a storage location on one trip around the queue vs other times around, ie an alias once the indices had wrapped, it also kept track of a "round" value in both the queue itself, and any iterators it created. All this bookkeeping significantly complicated the data structure. Instead, this change modifies it to keep track of a monotonically increasing index which is wrapped at the time it's used. Only the head index and current size need to be tracked in the queue itself, and only a pointer to the queue and an index need to be tracked in the iterators. Theoretically, it's possible that this value could overflow eventually since it increases forever, unlike before where the index wrapped and was never larger than the queue's capacity. In practice, the type of the index was changed from a uint32_t to a size_t, probably a 64 bit value in modern systems, which will hold much larger values. Also, the round counter and the index values together acted like a smaller than 64 bit value anyway, since the round counter would overflow after 2^32 times around a less than 2^32 entry queue. One minor interface difference is that the head() and tail() values returned by the queue are no longer pre-wrapped to be modulo the queue's capacity. As long as consumers don't try to be overly clever and feed in fixed values, do their own bounds checking, etc., something that would be cumbersome considering the wrapping nature of the structure, this shouldn't be an issue. 
Also, since external consumers no longer need to worry about wrapping, since only one of them was used in only one place, and because they weren't even marked as part of the interface, the modulo helper functions have been eliminated from the queue. If other code wants to perform modulo arithmetic for some reason (which the queue no longer requires) they can accomplish basically the same thing in basically the same amount of code using normal math. Also, rather than inherit from std::vector, this change makes the vector internal to the queue. That prevents methods of the vector that aren't aware of the circular nature of the structure from leaking out if they're not overridden or otherwise proactively blocked. On top of simplifying the implementation, this also makes it perform slightly better. To measure that, I ran the following command: $ time build/ARM/base/circular_queue.test.opt --gtest_repeat=100000 > /dev/null and found a few percent improvement in total run time. While this difference was small and not measured against realistic usage of the data structure, it was still measurable, and at minimum doesn't seem to have hurt performance. Change-Id: Ic2baa28de135be7086fa94579bbec451d69b3b15 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/38478 Reviewed-by: Daniel Carvalho <odanrc@yahoo.com.br> Maintainer: Gabe Black <gabe.black@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-12-31 10:18:36 +00:00
Gabe Black	be7043f079	base: Fix style issues in the circular queue. Change-Id: I61da587d760019a338522f098745f375a5ce429e Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/38477 Reviewed-by: Daniel Carvalho <odanrc@yahoo.com.br> Maintainer: Gabe Black <gabe.black@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-12-31 10:18:18 +00:00
Cui Jin	acfb233685	arch-riscv: fix MIE csr register setting bugs Any change to the xIE bits should trigger an update of the CSR register. The old condition is wrongly reversed. The fix is verified in FS. Jira Issue: https://gem5.atlassian.net/browse/GEM5-855 Change-Id: Ia2c6d3fbfd24d7f9d23f7cfa6f25f893544f4157 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/38578 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Ayaz Akram <yazakram@ucdavis.edu> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-12-31 03:17:09 +00:00
Daniel R. Carvalho	4dc09a9543	mem-cache: Generate error on compression misconfiguration Compressed caches must use the compressed tags, otherwise a seg fault will be generated. Besides, if no compressor is assigned; yet compressed tags are used, data is not compressed. Generate an error for the first case, and a warning for the second. Change-Id: Iac5474ed919163ce38a8c4e8efd9727e5b3d8417 Signed-off-by: Daniel R. Carvalho <odanrc@yahoo.com.br> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/38635 Reviewed-by: Nikos Nikoleris <nikos.nikoleris@arm.com> Maintainer: Nikos Nikoleris <nikos.nikoleris@arm.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-12-22 16:03:34 +00:00
Hoa Nguyen	0badfdb207	mem-ruby: Update stats style for SimpleNetwork Change-Id: I7d54ed02d01a3811b41dce794e308b8b77576c92 Signed-off-by: Hoa Nguyen <hoanguyen@ucdavis.edu> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/38055 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-12-22 09:52:36 +00:00
Hoa Nguyen	fa81ca4988	sim: Update stats style of System and Process Change-Id: I3af072a61a18f4fbba3f7d4b632c58501e7b7ae8 Signed-off-by: Hoa Nguyen <hoanguyen@ucdavis.edu> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/37995 Tested-by: kokoro <noreply+kokoro@google.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com>	2020-12-22 09:52:36 +00:00
Hoa Nguyen	6a80d9f0aa	mem-ruby: Update stats of AbstractController and derived classes This commit moves stats of AbstractController and its derived classes to a Stats::Group struct. Also, one of the controllers needs access to the ruby system profiler stats, and Profiler's stats is now made public as a result. Change-Id: Ibe04e33a6cf09b453564592d29293b354d0d33c9 Signed-off-by: Hoa Nguyen <hoanguyen@ucdavis.edu> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/38075 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-12-22 09:52:36 +00:00
Hoa Nguyen	4c42811ff3	mem-ruby: Move CacheMemory stats used in SLICC to a Stats group This change moves some stats that are used in SLICC to a separate Stats::Group. In order to use stats in SLICC, new functions are added in CacheMemory: - profileDemandHit() - profileDemandMiss() The functions increase the corresponding stat by 1. Change-Id: I52b6fefdf6579a49f626f2fca400641f90800017 Signed-off-by: Hoa Nguyen <hoanguyen@ucdavis.edu> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/37815 Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Tiago Mück <tiago.muck@arm.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com>	2020-12-22 09:52:36 +00:00
Hoa Nguyen	78270ede7b	mem-ruby: Update stats style This commit moves stats from several classes in mem/ruby to corresponding Stats::Group's. For ruby's Profiler, additional changes are made: there are stats that are profiled for each of RequestType, for each of MachineType, and for each of combinations of RequestType and MachineType. The current naming scheme is ...<stat_name>.<request_type_name>.<machine_type_name>. To make it easier for stats parser to know whether the stat is of RequestType, or is of MachineType, or is of (RequestType, MachineType), a prefix is added as follows, ...<meta>.<stat_name>.<request_type_name>.<machine_type_name> where <meta> is one of {RequestType, MachineType, RequestTypeMachineType}. Another point of using this naming scheme is that the parser doesn't need to know all of RequestType and MachineType. Change-Id: I8b8bdd771c7798954f984d416f521e8eb42d01ed Signed-off-by: Hoa Nguyen <hoanguyen@ucdavis.edu> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/36478 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-12-22 09:52:36 +00:00
Hoa Nguyen	350278b84d	python: Improve SimObject's warning when parent specified twice SimObject outputs a warning when its parent is specified more than once. The cause is most likely that there is unexpected param specified in the constructor called in the Python interface. This commit adds a note about this probable cause of this potential error to the warning message. Change-Id: I9b6bf5d5fb0c77bfdad5fde42e88f814e8a4b72b Signed-off-by: Hoa Nguyen <hoanguyen@ucdavis.edu> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/38359 Maintainer: Bobby R. Bruce <bbruce@ucdavis.edu> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-12-22 09:52:23 +00:00
Gabe Black	40deabcc48	scons,fastmodel: Change how ARM license slots are throttled. To limit the number of license slots used by SCons when building fast model components, the fastmodel SConscript set up a group of nodes which are attached to each simgen run using the SCons SideEffect method using one of the library files it generates. To create each unique node, the SCons Value() method was used, passing it the counter for the loop. In at least version 4 of SCons, what this ended up doing was setting that library file as a source for each of the Value() nodes it corresponds to. That doesn't seem like a problem, but then when creating config include files, files which expose SCons configuration values to C++, they also create Value() nodes using the value of the config variable. In cases where that variable is boolean, the value might be 0 or 1. The result was that the config header depended on Value(0) (for instance), and then Value(0) depended on a collection of static library files. When scons tried to determine whether the config file was up to date, it tried to check if its sources had changed. It would check Value(0), and then Value(0) would try to compute a checksum for its own source. To do that, it seems to assume that the value can be interpreted as a string and tries to decode it as utf8. Since the library is a binary file, that would fail and break the build with a cryptic message from within the guts of SCons. To address this, this change replaces the loop index with a call to object(). Each instance created in that way will be different from every other, and there will be no way (purposefully or otherwise) to create a collision with it when creating Value() nodes for some other purpose. 
Change-Id: I56bc842ae66b8cb36d3dcbc25796b708254d6982 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/38617 Reviewed-by: Yu-hsin Wang <yuhsingw@google.com> Reviewed-by: Ahbong Chang <cwahbong@google.com> Maintainer: Gabe Black <gabe.black@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-12-21 21:53:40 +00:00
Gabe Black	150fef8453	dev: Use BitUnions and a RegisterBank in the Uart8250. Change-Id: I139db4f08f9e6addfed4906ea6c49ee67439d30e Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/36818 Reviewed-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Maintainer: Giacomo Travaglini <giacomo.travaglini@arm.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-12-18 00:30:55 +00:00
Gabe Black	3eb63b688f	x86: Fix some comments in x86 KVM process initialization. These comments did not reflect what the code was actually doing. Change-Id: I2bcd23bd68c870e364bdfd0b9b0eb5dcb560e713 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/38537 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-12-17 03:30:20 +00:00
Gabe Black	aad4ee30c4	x86: Change some CR0 settings when setting up kvm x86 processes. These values were (seemingly) arbitrarily changed from the original, non-KVM settings, and no longer matched the comments which were also copied over. These two bits enable alignment checking on memory accesses (not normally used on x86), and whether kernel code can write to read only pages. Change-Id: I48e560e448e4849607f12e9336d1ab0458ad9407 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/38536 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-12-17 03:30:01 +00:00
Gabe Black	cc0d4a8fd6	arm: Fix style in the ISA templates. Change-Id: I3014d26c8649efaf6227f2e3a798cc6c4183a0c5 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/38379 Reviewed-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Maintainer: Giacomo Travaglini <giacomo.travaglini@arm.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-12-16 13:19:21 +00:00
Daniel R. Carvalho	491b9874d2	mem-cache: Implement a frequency-sampling compressor Implementation of a generic frequency-based sampling compressor. The compressor goes through a sampling stage, where no compression is done, and the values are simply sampled for their frequencies. Then, after enough samples have been taken, the compressor starts generating compressed data. Compression works by comparing chunks to the table of most frequent values. In theory, a chunk that is present in the frequency table has its value replaced by the index of its respective entry in the table. In practice, the value itself is stored because there is no straightforward way to acquire an index from an entry. Finally, the index can be encoded so that the values with highest frequency have smaller codeword representation. Its Huffman coupling can be used similar to the approach taken in "SC2: A Statistical Compression Cache Scheme". Change-Id: Iae0ebda08e8c08f3b62930fd0fb7e818fd0d141f Signed-off-by: Daniel R. Carvalho <odanrc@yahoo.com.br> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/37335 Reviewed-by: Nikos Nikoleris <nikos.nikoleris@arm.com> Maintainer: Nikos Nikoleris <nikos.nikoleris@arm.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-12-16 12:13:05 +00:00
Daniel R. Carvalho	32bce3301d	mem-cache: Add a data-update probe to cache This probe is responsible for notifying any changes to the data contents of a block. This includes fills, overwrites, and invalidations/evictions. Jira: https://gem5.atlassian.net/browse/GEM5-814 Change-Id: I1ff3c09c63d5402765c2125c4d76d95b614877d6 Signed-off-by: Daniel R. Carvalho <odanrc@yahoo.com.br> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/37096 Reviewed-by: Nikos Nikoleris <nikos.nikoleris@arm.com> Maintainer: Nikos Nikoleris <nikos.nikoleris@arm.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-12-16 12:13:05 +00:00
Daniel R. Carvalho	9388ec18a4	sim: Add a listener checker to probes Add a function to check if a probe has listeners. This can be used to avoid performing costly tasks when no one is listening. Change-Id: I8996a0ea298cb7cf97ac8aa9e627331a22bea26e Signed-off-by: Daniel R. Carvalho <odanrc@yahoo.com.br> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/38175 Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Reviewed-by: Nikos Nikoleris <nikos.nikoleris@arm.com>	2020-12-16 12:13:05 +00:00
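A rough sketch of why such a check is useful: the notifier can skip building an expensive payload when nobody is subscribed. The `ProbePoint` class, `hasListeners()`, and `notifyIfListening()` below are simplified, hypothetical stand-ins and should not be read as gem5's actual probe API.

```cpp
#include <cassert>
#include <functional>
#include <utility>
#include <vector>

// Simplified, hypothetical probe point; not gem5's ProbePoint class.
template <typename Arg>
class ProbePoint
{
  public:
    typedef std::function<void(const Arg &)> Listener;

    void
    addListener(Listener l)
    {
        assert(l != nullptr);
        listeners.push_back(std::move(l));
    }

    // The check the commit describes: is anyone listening?
    bool hasListeners() const { return !listeners.empty(); }

    void
    notify(const Arg &arg) const
    {
        for (const Listener &l : listeners)
            l(arg);
    }

  private:
    std::vector<Listener> listeners;
};

// Build the (possibly costly) payload only when there is a listener.
// Returns true if the payload was built and delivered.
template <typename Arg, typename MakePayload>
bool
notifyIfListening(const ProbePoint<Arg> &probe, MakePayload make_payload)
{
    if (!probe.hasListeners())
        return false;
    const Arg payload = make_payload();
    probe.notify(payload);
    return true;
}
```

Passing the payload as a callable keeps the expensive work out of the common no-listener path.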
Gabe Black	c29457b7a0	x86: Use the right register type when initializing x86 kvm processes. Functionally this doesn't matter, since no bitfields are used in the type and it devolves into just being a uint64_t, but we should use CR8 and not CR4 when initializing the CR8 register. Change-Id: Ifc7dc9072d552f7010afce9115427c8ed624ebb9 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/38535 Maintainer: Gabe Black <gabe.black@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com> Reviewed-by: Matthew Poremba <matthew.poremba@amd.com>	2020-12-16 09:03:48 +00:00
Gabe Black	2a0867f51c	x86: Set the effective base of the TSS when initializing a process. For some segments, there are two base registers. One is the architecturally visible base, and the other is the effective base used when actually referencing memory relative to that segment. The process initialization code was setting the architecturally visible base, presumably because that's the value used by KVM, but was setting the effective base to zero. Change-Id: I06e079f24fa63f0051268437bf00c14578f62612 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/38488 Reviewed-by: Gabe Black <gabe.black@gmail.com> Maintainer: Gabe Black <gabe.black@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-12-16 06:44:36 +00:00
Gabe Black	4110ee2088	x86: Some small style fixes in arch/x86/process.hh. Moved two single line functions to be all on one line, and added some consts. Change-Id: Iecfa3a9c2bde69ce2f26e9531864a7cb92b0a1df Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/38489 Reviewed-by: Jason Lowe-Power <power.jg@gmail.com> Maintainer: Jason Lowe-Power <power.jg@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-12-16 00:47:58 +00:00
Gabe Black	09982dcbe9	x86,sim: Remove special handling for KVM in the clone syscall. When a gem5 op is triggered using a KVM MMIO exit event, the PC has already been advanced beyond the offending instruction. Normally when a system call or gem5 op is triggered, the PC has not advanced because the instruction hasn't actually finished executing. This means that if a gem5 op, and by extension a system call in SE mode, want to advance the PC to the instruction after the gem5 op, they have to check whether they were triggered from KVM. To avoid having to special case these sorts of situations (currently only in the clone system call), we can have the code which dispatches to gem5 ops from KVM adjust the next PC so that it points to what the current PC is. That way the PC can be advanced unconditionally, and will point to the instruction after the one that triggered the call. To be fully consistent, we would also need to adjust the current PC. That would be non-trivial since we'd have to figure out where the current instruction started, and that may not even be possible to unambiguously determine given x86's instruction structure. Then we would also need to restore the original PC to avoid confusing KVM. Change-Id: I9ef90b2df8e27334dedc25c59eb45757f7220eea Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/38486 Reviewed-by: Andreas Sandberg <andreas.sandberg@arm.com> Maintainer: Andreas Sandberg <andreas.sandberg@arm.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-12-15 01:36:39 +00:00
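The next-PC adjustment described above can be modelled with a toy program-counter state. This is a simplified sketch under stated assumptions: `PCState`, `prepareForKvmOp()`, and `finishOp()` are hypothetical names, and advancing is reduced to "PC becomes next PC", which is less than a real ISA's PC state does.

```cpp
#include <cassert>
#include <cstdint>

// Toy model of a PC state; hypothetical, not gem5's PCState.
struct PCState
{
    uint64_t pc;   // current PC
    uint64_t npc;  // next PC

    // Simplified advance: move to the next instruction.
    void advance() { pc = npc; }
};

// KVM dispatch path: the PC has already moved past the triggering
// instruction, so point the next PC at the current PC.
void
prepareForKvmOp(PCState &state)
{
    state.npc = state.pc;
}

// Shared handler epilogue: advance unconditionally, with no check for
// whether the op was triggered from KVM.
void
finishOp(PCState &state)
{
    state.advance();
}
```

In both paths the PC ends up at the instruction after the one that triggered the op, so the handler needs no KVM special case.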
Gabe Black	4db903a59f	sim: Remove full system checks from some pseudo insts. These pseudo insts are less useful outside of full system, but they should all still work. Removing this check makes it possible to, for instance, test them in syscall emulation mode, and removes another difference between the two styles of simulation. Change-Id: Ia7d29bfc6f7c5c236045d151930fc171a6966799 Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/38485 Reviewed-by: Giacomo Travaglini <giacomo.travaglini@arm.com> Maintainer: Giacomo Travaglini <giacomo.travaglini@arm.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-12-15 01:35:52 +00:00
Daniel R. Carvalho	9c235d19b0	mem-ruby: Fix const copy of addr range in AbstractController Clang 10 throws the following error: loop variable 'addr_range' of type 'const AddrRange' creates a copy from type 'const AddrRange' [-Werror,-Wrange-loop-construct] note: use reference type 'const AddrRange &' to prevent copying Issue introduced by `c7fabb979c`. Change-Id: I43e8d613eb5069d5ce9cb12ddec18cba0a3847f6 Signed-off-by: Daniel R. Carvalho <odanrc@yahoo.com.br> Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/38495 Reviewed-by: Matt Sinclair <mattdsinclair@gmail.com> Maintainer: Matt Sinclair <mattdsinclair@gmail.com> Tested-by: kokoro <noreply+kokoro@google.com>	2020-12-14 11:50:26 +00:00
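The fix Clang suggests is to bind the loop variable by const reference. The sketch below uses a hypothetical `CountedRange` type (not gem5's `AddrRange`) whose copy constructor increments a counter, so the absence of copies can be observed. The by-value form that triggers the warning is shown only in a comment.

```cpp
#include <cassert>
#include <vector>

// Global counter of copy-constructor calls, for illustration only.
int copy_count = 0;

// Hypothetical stand-in for an address range type.
struct CountedRange
{
    int start;
    int end;

    CountedRange(int s, int e) : start(s), end(e) { assert(s <= e); }

    CountedRange(const CountedRange &other)
        : start(other.start), end(other.end)
    {
        ++copy_count;
    }
};

// Sum the sizes of all ranges without copying any element.
int
totalSize(const std::vector<CountedRange> &ranges)
{
    int total = 0;
    // Flagged form: for (const CountedRange r : ranges)  // copies each r
    for (const CountedRange &r : ranges)
        total += r.end - r.start;
    return total;
}
```

With the reference form, `copy_count` stays unchanged across the loop, while the by-value form would add one copy per element.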

11326 Commits