derek/gem5 - gem5 - Gitea: Git with a cup of tea

derek/gem5

Author	SHA1	Message	Date
Hoa Nguyen	ac5280fedc	mem,sim: Change the type of cache_line_size to Addr Change-Id: Id39e8249fef89c0d59bb39f8104650257ff00245 Signed-off-by: Hoa Nguyen <hn@hnpl.org>	2023-09-20 14:00:45 -07:00
Hoa Nguyen	9057eeabec	cpu: Explicitly define cache_line_size -> 64-bit unsigned int While it makes sense to define the cache_line_size as a 32-bit unsigned int, the use of cache_line_size is way out of its original scope. cache_line_size has been used to produce an address mask, which masking out the offset bits from an address. For example, [1], [2], [3], and [4]. However, since the cache_line_size is an "unsigned int", the type of the value is not guaranteed to be 64-bit long. Subsequently, the bit twiddling hacks in [1], [2], [3], and [4] produce 32-bit mask, i.e., 0x00000000FFFFFFC0. This behavior at least caused a problem in LLSC in RISC-V [5], where the load reservation (LR) relies on the mask to produce the cache block address. Two distinct 64-bit addresses can be mapped to the same cache block using the above mask. This patch explicitly defines cache_line_size as a 64-bit unsigned int so the cache block mask can be produced correctly for 64-bit addresses. [1] `3bdcfd6f7a/src/cpu/simple/atomic.hh (L147)` [2] `3bdcfd6f7a/src/cpu/simple/timing.hh (L224)` [3] `3bdcfd6f7a/src/cpu/o3/lsq_unit.cc (L241)` [4] `3bdcfd6f7a/src/cpu/minor/lsq.cc (L1425)` [5] `3bdcfd6f7a/src/arch/riscv/isa.cc (L787)` Change-Id: I29abc7aaab266a37326846bbf7a82219071c4ffe Signed-off-by: Hoa Nguyen <hn@hnpl.org>	2023-09-19 13:32:09 -07:00
Bobby R. Bruce	3bdcfd6f7a	mem-ruby: patch fixes a protocol error in MOESI_CMP_Directory (#316 ) When there is race between FwdGetX and PUTX on owner. Owner in this case hands off ownership to GetX requestor and PUTX still goes through. But since owner has changed, state should go back to M and PUTX is essentially trashed. An Unblock to the Directory in this case will give an undefined transition. I have added transitions which indicate that when an Unblock is served to the Directory, it means that some kind of ownership transfer has happened while a PUTX/PUTO was in progress.	2023-09-15 13:25:51 -07:00
Bobby R. Bruce	a101b1aba3	stdlib: Add 'to_path' arg to obtain_resource This allows for a user to specify the exact path they want a resource to be downloaded to. This differs from 'resource_direcctory' in that a user may specify the file/directory name of the resource (using just the 'resource_directory' will have the resource as its ID in that directory. Change-Id: I887be6216c7607c22e49cf38226a5e4600f39057	2023-09-14 15:33:17 -07:00
Bobby R. Bruce	b12f28af96	stdlib: Add 'quiet' option to obtain_resource func Change-Id: I15d3be959ba7ab8af328fc6ec2912a8151941a1e	2023-09-14 15:33:17 -07:00
Bobby R. Bruce	59a96c8c2f	mem-cache: Fix bug in classic cache while clflush (#274 ) This change, https://github.com/gem5/gem5/pull/205, mistakenly allocates write buffer for clflush instruction when there's a cache miss. However, clflush in gem5 is not a write instruction. Thus, the cache should allocate miss buffer in this case.	2023-09-14 01:14:39 -07:00
Gautham Pathak	178db9e270	mem-ruby: patch fixes a protocol error in MOESI_CMP_Directory When there is race between FwdGetX and PUTX on owner. Owner in this case hands off ownership to GetX requestor and PUTX still goes through. But since owner has changed, state should go back to M and PUTX is essentially trashed. An Unblock to the Directory in this case will give an undefined transition. I have added transitions which indicate that when an Unblock is served to the Directory, it means that some kind of ownership transfer has happened while a PUTX/PUTO was in progress. Change-Id: I37439b5a363417096030a0875a51c605bd34c127	2023-09-13 19:09:13 -04:00
Bobby R. Bruce	d38c029195	mem-ruby: This commit patches an error in AbstractController.cc (#294 ) Links to #293 After calling m5_dump_reset_stats(0,0) in a test program, some statistics like l1_controllers.L1Dcache.m_demand_hits, l1_controllers.L1Dcache.m_demand_misses, l1_controllers.L1Dcache.m_demand_accesses were not getting reset in the newer stat dumps. This one line patch fixes that. Changes were tested with calling two m5_dump_reset_stats(0,0) in a row for a system with 1 core, tested on both SE and FS. Credits: @MeatBoy106	2023-09-13 15:48:46 -07:00
Bobby R. Bruce	673d4b2ac2	arch-x86: initialize and correct bitwidth for FPU tag word (#304 ) The x87 FPU tag word (FTW) was not explicitly initialized in {X86_64,i386}Process::initState(), resulting in holding an initial value of zero, resulting in an invalid x87 FPU state. This commit initializes FTW to 0xFFFF, indicating the FPU is empty at program start during syscall emulation. The 16-bit FTW register was also incorrectly masked down to 8-bits in X86ISA::ISA::setMiscRegNoEffect(), leading to an invalid X87 FPU state that later caused crashes in the X86KvmCPU. This commit corrects the bitwidth of the mask to 16. GitHub issue: https://github.com/gem5/gem5/issues/303	2023-09-13 15:47:50 -07:00
Gautham Pathak	87db6df8f6	mem-ruby: This commit patches an error in AbstractController.cc After calling m5_dump_reset_stats(0,0) in a test program, some statistics like l1_controllers.L1Dcache.m_demand_hits, l1_controllers.L1Dcache.m_demand_misses, l1_controllers.L1Dcache.m_demand_accesses were not getting reset in the newer stat dumps. This one line patch fixes that. Changes were tested with calling two m5_dump_reset_stats(0,0) in a row for a system with 1 core, tested on both SE and FS. Credits to Gabriel Busnot for finding the fix. Change-Id: I19d75996fa53d31ef20f7b206024fd38dbeac643	2023-09-13 14:07:16 -04:00
Bobby R. Bruce	5fd901ffbb	cpu, configs: Fix TraceCPU after multi-ISA addition (#302 ) This PR fixes #301	2023-09-12 17:26:27 -07:00
Nicholas Mosier	2178e26bf2	arch-x86: initialize and correct bitwidth for FPU tag word The x87 FPU tag word (FTW) was not explicitly initialized in {X86_64,i386}Process::initState(), resulting in holding an initial value of zero, resulting in an invalid x87 FPU state. This commit initializes FTW to 0xFFFF, indicating the FPU is empty at program start during syscall emulation. The 16-bit FTW register was also incorrectly masked down to 8-bits in X86ISA::ISA::setMiscRegNoEffect(), leading to an invalid X87 FPU state that later caused crashes in the X86KvmCPU. This commit corrects the bitwidth of the mask to 16. GitHub issue: https://github.com/gem5/gem5/issues/303 Change-Id: I97892d707998a87c1ff8546e08c15fede7eed66f	2023-09-12 15:39:29 +00:00
Bobby R. Bruce	1bebf6a3cc	sim-se: Use tgt_stat64 instead of tgt_stat in newfstatatFunc (#283 ) The syscall emulation of newfstatat incorrectly treated the output stat buffer to be of type `OS::tgt_stat`, not `OS::tgt_stat64`, causing the invalid output stat buffer in the application to hold invalid data. This patch fixes the bug by simply substituting the type `OS::tgt_stat` with `OS::tgt_stat64` in `newstatatFunc()`. GitHub issue: https://github.com/gem5/gem5/issues/281	2023-09-12 08:33:42 -07:00
Bobby R. Bruce	94e5a0cccf	sim-se: Fix tgkill logic bug in handling signal argument (#286 ) The syscall emulation of tgkill contained a simple logic bug (a `\|\|` instead of a `&&`), causing the signal argument to always be considered invalid. This patch fixes the bug by simply changing the `\|\|` to a `&&`. GitHub issue: https://github.com/gem5/gem5/issues/284	2023-09-12 08:32:56 -07:00
Bobby R. Bruce	d67a6603c1	cpu-kvm: properly set x86 xsave header on gem5->KVM transition (#298 ) If the XSAVE KVM capability is available (KVM_CAP_XSAVE), the X86KvmCPU will try to set the x87 FPU + SSE state using KVM_SET_XSAVE, which expects a buffer (struct kvm_xsave) in XSAVE area format (Vol. 1, Sec. 13.4 of Intel x86 SDM). The original implementation of `X86KvmCPU::updateKvmStateFPUXSave()`, however, improperly sets the xsave header, which contains a bitmap of state components present in the xsave area. This patch defines `XSaveHeader` structure to model the xsave header, which is expected directly following the legacy FPU region (defined in the `FXSave` structure) in the xsave area. It then sets two bist in the xsave header to indicate the presence of x86 FPU and SSE state components. GitHub issue: https://github.com/gem5/gem5/issues/296	2023-09-12 08:32:20 -07:00
Giacomo Travaglini	a0a799f474	cpu: Disable CPU switching functionality with TraceCPU Now that the TraceCPU is no longer a BaseCPU we disable CPU switching functionality. AFAICS from the code, it seems like using m5.switchCpus was never really working. The takeOverFrom was described as being used when checkpointing (which is not really the case). Moreover the icache/dcache event loops were not checking if the CPU was switched out so the trace was always been consumed regardless of the BaseCPU state. Note: IMHO the only case where you might want to switch between an execution-driven CPU to the TraceCPU is when you want to warm your caches before the ROI. All other cases don't really make sense as with the TraceCPU there is no architectural state being maintained/updated. Change-Id: I0611359d2b833e1bc0762be72642df24a7c92b1e Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com>	2023-09-12 15:50:05 +01:00
Giacomo Travaglini	9a5d900770	cpu: Stop treating TraceCPU as a BaseCPU This is fixing a recently reported issue [1] where it is not possible to use the TraceCPU to replay elastic traces It requires some architectural data structures (like ArchMMU, ArchDecoder...) which are no longer defined in the BaseCPU class at compilation time. Which Arch version should be used for a class (TraceCPU) that is supposed to be ISA agnostic ? Does it really make sense to define them for the TraceCPU? Those classes are not used anyway during trace replay and their sole purpose would just be to comply to the BaseCPU interface. As there is no elegant way to make things work, this patch stops treating the TraceCPU as a BaseCPU. While it philosophically makes sense to treat the TraceCPU as a common CPU (it sort of replays pre-executed instructions), a case can be made for considering it more like a traffic generator. [1]: https://github.com/gem5/gem5/issues/301 Change-Id: I7438169e8cc7fb6272731efb336ed2cf271c0844 Signed-off-by: Giacomo Travaglini <giacomo.travaglini@arm.com>	2023-09-12 15:49:29 +01:00
Roger Chang	def89745bc	arch-riscv: Allow Minor and O3 CPU execute RVV Change-Id: I4780b42c25d349806254b5053fb0da3b6993ca2f	2023-09-12 13:56:22 +08:00
Roger Chang	0f54cb0593	arch-riscv: Remove check vconf done implementation Change-Id: If633cef209390d0500c4c2c5741d56158ef26c00	2023-09-12 13:56:22 +08:00
Roger Chang	31b95987da	arch-riscv: Change the instruction family to jump like The method that get the vl, vtype from PCState in the next changes Change-Id: I022b47b7a96572f6434eed30dd9f7caa79854c31	2023-09-12 13:56:22 +08:00
Roger Chang	282765234b	arch-riscv: Implement the branchTarget for vsetvl Change-Id: I10bf6be736ce2b99323ace410bff1d8e1e2a4123	2023-09-12 13:56:22 +08:00
Roger Chang	a3aaad2ecd	arch-riscv: Refactor the execution part of vsetvl Change-Id: Ie0d9671242481a85bb0fe5728748b16c3ef62592	2023-09-12 13:56:21 +08:00
Roger Chang	1bde42760f	arch-riscv: Get vl, vtype and vlenb from PCState Change-Id: I0ded57a3dc2db6fcc7121f147bcaf6d8a8873f6a	2023-09-12 13:56:21 +08:00
Roger Chang	8918302239	arch-riscv: Change the implementation of vsetvl The changes includes: 1. Add VL, Vtype and VlenbBits operands 2. Change R/W methods of VL, Vtype and VlenbBits from PCState Change-Id: I0531ddc14344f2cca94d0e750a3b4291e0227d54	2023-09-12 13:56:21 +08:00
Roger Chang	7b5d8b4e5b	arch-riscv: Add vlenb, vtype and vl in PCState Change-Id: I7c2aed7dda34a1a449253671d7b86aa615c28464	2023-09-12 13:56:21 +08:00
Roger Chang	f94658098d	arch-riscv: Remove checked_type in StaticInst Constructor We should not try to check vtype when decoding the instruction. It should be checked in vset{i}vl{i} since the register can be modified via vset{i}vl{i} Change-Id: I403e5c4579bc5b8e6af10f93eac20c14662e4d2d	2023-09-12 13:56:21 +08:00
Roger Chang	3f0475321a	arch-riscv: Change VTYPE to BitUnion64 Change-Id: I7620ad1ef3ee0cc045bcd02b3c9a2d83f93bf3fe	2023-09-12 13:56:21 +08:00
Roger Chang	dfc725838e	arch-riscv: Refactor PCState class Change-Id: I1d25350ba2a3c7c366f42340c20b4488c33cde6f	2023-09-12 13:56:21 +08:00
Nicholas Mosier	2b9d558cef	cpu-kvm: properly set x86 xsave header on gem5->KVM transition If the XSAVE KVM capability is available (KVM_CAP_XSAVE), the X86KvmCPU will try to set the x87 FPU + SSE state using KVM_SET_XSAVE, which expects a buffer (struct kvm_xsave) in XSAVE area format (Vol. 1, Sec. 13.4 of Intel x86 SDM). The original implementation of `X86KvmCPU::updateKvmStateFPUXSave()`, however, improperly sets the xsave header, which contains a bitmap of state components present in the xsave area. This patch defines `XSaveHeader` structure to model the xsave header, which is expected directly following the legacy FPU region (defined in the `FXSave` structure) in the xsave area. It then sets two bist in the xsave header to indicate the presence of x86 FPU and SSE state components. GitHub issue: https://github.com/gem5/gem5/issues/296 Change-Id: I5c5c7925fa7f78a7b5e2adc209187deff53ac039	2023-09-10 15:16:50 +00:00
Nicholas Mosier	8740385f9e	sim-se: Fix tgkill logic bug in handling signal argument The syscall emulation of tgkill contained a simple logic bug (a `\|\|` instead of a `&&`), causing the signal argument to always be considered invalid. This patch fixes the bug by simply changing the `\|\|` to a `&&`. GitHub issue: https://github.com/gem5/gem5/issues/284 Change-Id: I3b02c618c369ef56d32a0b04e0b13eacc9fb4977	2023-09-09 08:51:41 -07:00
Jason Lowe-Power	ebde1133c0	redirect_path patch for restoring cpt (#221 ) Modify the FDArray::unserialize function to perform a checkPathRedirect if a Process pointer is passed in. Currently when restoring a checkpoint, it doesn't perform checkPathRedirect for files that were opened during checkpointing. This patch adds a checkPathRedirect in the FDArray::unserialize to redirect app path for restoring checkpoints.	2023-09-08 15:30:53 -07:00
Nicholas Mosier	259a5d6272	sim-se: Use tgt_stat64 instead of tgt_stat in newfstatatFunc The syscall emulation of newfstatat incorrectly treated the output stat buffer to be of type `OS::tgt_stat`, not `OS::tgt_stat64`, causing the invalid output stat buffer in the application to hold invalid data. This patch fixes the bug by simply substituting the type `OS::tgt_stat` with `OS::tgt_stat64` in `newstatatFunc()`. GitHub issue: https://github.com/gem5/gem5/issues/281 Change-Id: Ice97c1fc4cccbfb6824e313ebecde00f134ebf9c	2023-09-08 11:28:54 -07:00
Hoa Nguyen	91d1a5deb5	mem-cache: Fix bug in classic cache while clflush This change, https://github.com/gem5/gem5/pull/205, mistakenly allocates write buffer for clflush instruction when there's a cache miss. However, clflush in gem5 is not a write instruction. Thus, the cache should allocate miss buffer in this case. Change-Id: I9c1c9b841159c4420567e9c929e71e4aa27d5c28 Signed-off-by: Hoa Nguyen <hn@hnpl.org>	2023-09-08 18:16:10 +00:00
Nicholas Mosier	e0498cb490	Merge branch 'develop' into bugfix-chdir	2023-09-07 14:13:00 -07:00
Bobby R. Bruce	eb5ae35341	resources,stdlib: Add workload to resource specialization and deprecate workload.py (#212 )	2023-09-07 12:45:45 -07:00
Nicholas Mosier	62e81930d6	Merge branch 'develop' into bugfix-chdir	2023-09-07 09:54:35 -07:00
studyztp	e206b16f73	sim:fixed some style issues Change-Id: I0832a8b68e802e9671b755d3a71fd9c8f17e1648	2023-09-07 08:52:24 -07:00
studyztp	377c875733	sim: check redirect path when unserialize for cpt sim/fd_array.hh: Add "class Process;" to forward declare Process for unserialize function to pass in a Process object pointer. Fix the styling issue with include files. sim/fd_array.cc" Add comments. Change-Id: Ifb21eb1c7bad119028b8fd8e610a125100fde696	2023-09-07 08:52:24 -07:00
studyztp	2a4f3f206b	sim: modifed the type of path Change-Id: I56be3b62b1804371b9b9e0f84ee1ec49cbedf553	2023-09-07 08:52:24 -07:00
studyztp	0dab27f24a	sim: check redirect path when unserialize for cpt Change-Id: I55b8ce1770b0580d52b8dfa782572d492c1bf727	2023-09-07 08:52:24 -07:00
Johnny	105839ae2b	sim: add bypass_on_change to the set() of a signal When reset a port, we don't want to trigger a onChange(). Offer an option to bypass it and update state only. Change-Id: Ia53b7a76d2a320ea67101096cdbfe2eafaf440d2	2023-09-07 11:54:56 +08:00
Nicholas Mosier	6cdaa2c16a	sim-se: Fix crash in chdirFunc() on nonexistent directory This commit fixes a crash in the syscall emulation of the chdir(2) syscall, implemented by chdirFunc() in src/sim/syscall_emul.cc, when passed a nonexistent directory. The buggy code did not check the return value of realpath(). This patch adds code to check the return value of realpath(), and if it is NULL (i.e., there was an error with the requested directory to change to), propagates the error in `errno` to the application. GitHub issue: https://github.com/gem5/gem5/issues/276 Change-Id: I8a576f60fe3687f320d0cfc28e9d3a6b477d7054	2023-09-07 03:18:58 +00:00
Nicholas Mosier	3dfdd48211	misc: Fix buggy special path comparisons This patch fixes the buggy special path comparisons in src/kern/linux/linux.cc Linux::openSpecialFile(), which only checked for equality of path prefixes, but not equality of the paths themselves. This patch replaces those buggy comparisons with regular std::string::operator== string equality comparisons. GitHub issue: https://github.com/gem5/gem5/issues/269 Change-Id: I216ff8019b9a6a3e87e364c2e197d9b991959ec1	2023-09-05 13:44:10 -07:00
Matthew Poremba	2da54d5a4f	mem-ruby: Reorder SLC atomic and response actions Currently the MOESI_AMD_Base-directory transition for system level atomics sends the response message before the atomic is performed. This was likely done because atomics are supposed to return the value of the data before the atomic is performed and by simply ordering the actions this way that was taken care of. With the new atomic log feature, the atomic values are pulled from the log by the coalescer on the return path. Therefore, these actions can be reordered. However, it is now necessary that the atomics be performed before sending the response so that the log is populated and copied by the response action. This should fix #253 . Change-Id: Ie7e178f93990975367de2cc3e89e5ef9c9069241	2023-09-01 10:36:54 -05:00
Bobby R. Bruce	8d47cda8b6	arch-x86: Fix wrong x86 assembly (#251 ) The RM field of ModRM was printed as Reg field for several instructions. For reference, this change fixes typos introduced by [1]. [1] https://gem5-review.googlesource.com/c/public/gem5/+/40339	2023-09-01 00:26:00 -07:00
Hoa Nguyen	4ff1f160ec	arch-x86: Fix wrong x86 assembly The RM field of ModRM was printed as Reg field for several instructions. For reference, this change fixes typos introduced by [1]. [1] https://gem5-review.googlesource.com/c/public/gem5/+/40339 Change-Id: I41eb58e6a70845c4ddd6774ccba81b8069888be5 Signed-off-by: Hoa Nguyen <hn@hnpl.org>	2023-09-01 00:26:51 +00:00
Harshil Patel	7042d74ac2	stdlib, resources: Moved parsing params. - moved parsing from WorkloadResource __init__ to obtain_resource Change-Id: I9ed2aebb23af7f359bc1e5fff8ebe616a0da1374	2023-08-31 09:19:24 -07:00
Matthew Poremba	cfa833a97d	gpu-compute: Set LDS/scratch aperture base register Starting with gfx900 (Vega) the LDS and scratch apertures can be queried using a new s_getreg_b32 instruction. If the instruction is called with the SH_MEM_BASES argument it returns the upper 16 bits of a 64 bit address for the LDS and scratch apertures. The current addresses cannot be encoded in this register, so that addresses are changed to have the lower 48 bits be all zeros in addition to writing the bases register. Change-Id: If20f262b2685d248afe31aa3ebb274e4f0fc0772	2023-08-31 11:01:32 -05:00
Bobby R. Bruce	0e323bc409	mem: Atomic ops to same address (#200 ) Augmenting the DataBlock class with a change log structure to record the effects of atomic operations on a data block and service these changes if the atomic operations require return values. Although the operations are atomic, the coalescer need not send unique memory requests for each operation. Atomic operations within a wavefront to the same address are now coalesced into a single memory request. The response of this request carries all the necessary information to provide the requesting lanes unique values as a result of their individual atomic operations. This helps reduce contention for request and response queues in simulation. Previously, only the final value of the datablock after all atomic ops to the same address was visible to the requesting waves. This change corrects this behavior by allowing each wave to see the effect of this individual atomic op is a return value is necessary.	2023-08-30 23:53:35 -07:00
Bobby R. Bruce	c156df620d	resources, stdlib: Add support for local files in obtain_resource (#204 ) This patch allows a local JSON file to specify a local path in the JSON object of a Resource, through the "url" field. Local paths can be entered with the prefix "file:" in the "url" field. If the local path exists, then the Resource from there is copied into the resource directory defined in the function earlier. This behavior is the same as using specific Resource classes (ex. BinaryResource) and passing a local_path into the function. But, the above class does not allow simultaneous creation of local Resources and Workloads of those local Resources. With this patch, someone can use a local JSON, specify the location of local Resources and create a Workload from those Resources and test both together.	2023-08-29 20:35:40 -07:00

1 2 3 4 5 ...

14406 Commits