llvm-project

Author	SHA1	Message	Date
Nikita Popov	22fdc57140	[Hexagon] Avoid implicit truncation in getConstant() Use getSignedConstant() or change variable type as appropriate. This will avoid assertion failures when implicit truncation is disabled.	2024-11-22 15:07:31 +01:00
Kazu Hirata	7e8bc5cf77	[Hexagon] Remove unused includes (NFC) (#116529 ) Identified with misc-include-cleaner.	2024-11-17 08:38:53 -08:00
Sergei Barannikov	baf59be89b	[SelectionDAG] Fix return types of TC_RETURN for several targets (#116504 ) TC_RETURN nodes do not have a glue result.	2024-11-17 02:14:05 +03:00
Matin Raayai	bb3f5e1fed	Overhaul the TargetMachine and LLVMTargetMachine Classes (#111234 ) Following discussions in #110443, and the following earlier discussions in https://lists.llvm.org/pipermail/llvm-dev/2017-October/117907.html, https://reviews.llvm.org/D38482, https://reviews.llvm.org/D38489, this PR attempts to overhaul the `TargetMachine` and `LLVMTargetMachine` interface classes. More specifically: 1. Makes `TargetMachine` the only class implemented under `TargetMachine.h` in the `Target` library. 2. `TargetMachine` contains target-specific interface functions that relate to IR/CodeGen/MC constructs, whereas before (at least on paper) it was supposed to have only IR/MC constructs. Any Target that doesn't want to use the independent code generator simply does not implement them, and returns either `false` or `nullptr`. 3. Renames `LLVMTargetMachine` to `CodeGenCommonTMImpl`. This renaming aims to make the purpose of `LLVMTargetMachine` clearer. Its interface was moved under the CodeGen library, to further emphasize its usage in Targets that use CodeGen directly. 4. Makes `TargetMachine` the only interface used across LLVM and its projects. With these changes, `CodeGenCommonTMImpl` is simply a set of shared function implementations of `TargetMachine`, and CodeGen users don't need to static cast to `LLVMTargetMachine` every time they need a CodeGen-specific feature of the `TargetMachine`. 5. More importantly, does not change any requirements regarding library linking. cc @arsenm @aeubanks	2024-11-14 13:30:05 -08:00
Kazu Hirata	4048c64306	[llvm] Remove redundant control flow statements (NFC) (#115831 ) Identified with readability-redundant-control-flow.	2024-11-12 10:09:42 -08:00
Sergei Barannikov	eeb987f6f3	[MC] Make generated `MCInstPrinter::getMnemonic` const (NFC) (#114682 ) The value returned from the function depends only on the instruction opcode. As a drive-by, change the type of the argument to const-reference.	2024-11-03 20:37:26 +03:00
Fangrui Song	facdae62b7	[MCInstPrinter] Make printRegName non-const Similar to printInst. printRegName may change states (e.g. #113834).	2024-10-29 19:14:54 -07:00
Alex Rønne Petersen	ad4a582fd9	[llvm] Consistently respect `naked` fn attribute in `TargetFrameLowering::hasFP()` (#106014 ) Some targets (e.g. PPC and Hexagon) already did this. I think it's best to do this consistently so that frontend authors don't run into inconsistent results when they emit `naked` functions. For example, in Zig, we had to change our emit code to also set `frame-pointer=none` to get reliable results across targets. Note: I don't have commit access.	2024-10-18 09:35:42 +04:00
Jay Foad	85c17e4092	[LLVM] Make more use of IRBuilder::CreateIntrinsic. NFC. (#112706 ) Convert many instances of: Fn = Intrinsic::getOrInsertDeclaration(...); CreateCall(Fn, ...) to the equivalent CreateIntrinsic call.	2024-10-17 16:20:43 +01:00
Nikita Popov	255a99c29f	[APInt] Fix APInt constructions where value does not fit bitwidth (NFCI) (#80309 ) This fixes all the places that hit the new assertion added in https://github.com/llvm/llvm-project/pull/106524 in tests. That is, cases where the value passed to the APInt constructor is not an N-bit signed/unsigned integer, where N is the bit width and signedness is determined by the isSigned flag. The fixes either set the correct value for isSigned, set the implicitTrunc flag, or perform more calculations inside APInt. Note that the assertion is currently still disabled by default, so this patch is mostly NFC.	2024-10-17 08:48:08 +02:00
pkarveti	81fee740d0	[Hexagon] Mark instructions as part of the frame setup to fix test sugared-constants.ll (#111795 ) Added .setMIFlag(MachineInstr::FrameSetup) to all BuildMI calls in HexagonFrameLowering::insertAllocframe. This change ensures that the test sugared-constants.ll passes upstream by correctly marking instructions as part of the frame setup.	2024-10-14 09:27:55 -05:00
Rahul Joshi	fa789dffb1	[NFC] Rename `Intrinsic::getDeclaration` to `getOrInsertDeclaration` (#111752 ) Rename the function to reflect its correct behavior and to be consistent with `Module::getOrInsertFunction`. This is also in preparation of adding a new `Intrinsic::getDeclaration` that will have behavior similar to `Module::getFunction` (i.e, just lookup, no creation).	2024-10-11 05:26:03 -07:00
Craig Topper	6b8135762c	[Hexagon] Use MCRegister. NFC	2024-09-28 11:40:26 -07:00
Philip Reames	d288574363	[TTI][RISCV] Model cost of loading constants arms of selects and compares (#109824 ) This follows in the spirit of 7d82c99403f615f6236334e698720bf979959704, and extends the costing API for compares and selects to provide information about the operands passed in an analogous manner. This allows us to model the cost of materializing the vector constant, as some select-of-constants are significantly more expensive than others when you account for the cost of materializing the constants involved. This is a stepping stone towards fixing https://github.com/llvm/llvm-project/issues/109466. A separate SLP patch will be required to utilize the new API.	2024-09-25 07:25:57 -07:00
Matt Arsenault	b30b9eb7a8	LiveInterval: Make verify functions return bool (#109672 ) This will allow the MachineVerifier to check these properties instead of just asserting	2024-09-24 08:32:47 +04:00
Kazu Hirata	e4e3ff5adc	[llvm] Use std::optional::value_or (NFC) (#109568 )	2024-09-22 01:00:24 -07:00
Jay Foad	e03f427196	[LLVM] Use {} instead of std::nullopt to initialize empty ArrayRef (#109133 ) It is almost always simpler to use {} instead of std::nullopt to initialize an empty ArrayRef. This patch changes all occurrences I could find in LLVM itself. In future the ArrayRef(std::nullopt_t) constructor could be deprecated or removed.	2024-09-19 16:16:38 +01:00
Lei Huang	4b524088a8	[NFC] Update function names in MCTargetAsmParser.h (#108643 ) Update function names to adhere to LLVM coding standard.	2024-09-18 11:43:49 -04:00
Abinaya Saravanan	c010b72e9b	[HEXAGON] AddrModeOpt support for HVX and optimize adds (#106368 ) This patch does the following: 1. Adds support for optimizing the address mode of HVX load/store instructions. 2. Reduces the value of add instruction immediates by replacing them with the difference from other addi instructions that share a common base. For example, given the sequence: r1 = add(r2,# 1024) ... r3 = add(r2,# 1152) ... r4 = add(r2,# 1280) where the register r2 has the same reaching definition, the instructions are modified to: r1 = add(r2,# 1024) ... r3 = add(r1,# 128) ... r4 = add(r1,# 256) 3. Fixes a bug in the pass where the addi instructions were modified based on a predicated register definition, leading to incorrect output. E.g.: INST-1: if (p0) r2 = add(r13,# 128) INST-2: r1 = add(r2,# 1024) INST-3: r3 = add(r2,# 1152) INST-4: r5 = add(r2,# 1280) In the above case, since r2's definition is predicated, we do not want to modify the uses of r2 in INST-3/INST-4 with add(r1,#128/256). 4. Fixes a corner case: it looks like we never checked whether the offset register is actually live (not clobbered) at the optimization site. Adds a check that it is live at MBB entrance. The rest should have already been verified. 5. Fixes bad codegen: for whatever reason, the transformation was done without checking whether the value in the register actually reaches the user. This is the second identical fix for this pass. Co-authored-by: Anirudh Sundar <quic_sanirudh@quicinc.com> Co-authored-by: Sergei Larin <slarin@quicinc.com>	2024-09-13 18:48:34 -05:00
Kazu Hirata	f5b7c10923	[Hexagon] Avoid repeated hash lookups (NFC) (#107760 )	2024-09-08 10:03:05 -07:00
Piyou Chen	b01c006f73	[TII][RISCV] Add renamable bit to copyPhysReg (#91179 ) The renamable flag is useful during MachineCopyPropagation, but it is dropped after lowerCopy in some cases. This patch introduces extra arguments to pass the renamable flag to copyPhysReg.	2024-08-27 10:08:43 +08:00
Craig Topper	c503758ab6	[CodeGen] Use std::pair<MCRegister, Register> to match return from MRI.liveins(). NFC MachineRegisterInfo::liveins returns std::pair<MCRegister, Register>. Don't convert to std::pair<unsigned, unsigned>.	2024-08-25 15:28:08 -07:00
Kazu Hirata	a5d89d5048	[Target] Use llvm::replace (NFC) (#105942 )	2024-08-24 10:02:01 -07:00
Daniil Fukalov	0da2ba811a	[NFC] Cleanup in ADT and Analysis headers. (#104484 ) Remove unused direct includes and forward declarations in ADT and Analysis headers.	2024-08-17 13:11:18 +02:00
Kazu Hirata	66878ff692	[Hexagon] Use range-based for loops (NFC) (#104538 )	2024-08-16 08:35:59 -07:00
Abinaya Saravanan	86ef9ee600	[HEXAGON] Enable Utilize Mask Instruction Pass only if the Arch version is greater than v66 (#102880 ) No support for mask instruction before arch version v66	2024-08-13 13:30:20 +05:30
Alexis Engelke	4c23c1b93d	[CodeGen] Use SmallVector for MBB preds/succs (#101948 ) Avoid extra heap allocations for typical predecessor/successor counts.	2024-08-06 10:25:03 +02:00
Abinaya Saravanan	c04857cb2c	[HEXAGON] Utilize new mask instruction (#92365 ) This pass utilizes the new Hexagon Mask Instruction. Authored by : Harsha Jagasia, Krzysztof Parzyszek Co-authored-by: Harsha Jagasia <harsha.jagasia@gmail.com> Co-authored-by: Krzysztof Parzyszek <Krzysztof.Parzyszek@amd.com>	2024-08-05 13:31:12 +05:30
Kazu Hirata	7df9da7d78	[llvm] Construct SmallVector with ArrayRef (NFC) (#101872 )	2024-08-04 08:54:23 -07:00
Santanu Das	2771ea4ea4	[Hexagon] Fix concat lowering for HVX for 64B vector length (#98318 ) When concatenation of vector instructions is formed, as a part of it vector rotation is performed. The direction of the shift was not correctly calculated. This fixes the rotation factor.	2024-08-01 11:12:47 -05:00
yandalur	68df06a0b2	[Hexagon] Do not optimize address of another function's block (#101209 ) When the constant extender optimization pass encounters an instruction that uses an extended address pointing to another function's block, avoid adding the instruction to the extender list for the current machine function. Fixes https://github.com/llvm/llvm-project/issues/99714	2024-08-01 11:07:23 -05:00
Nikita Popov	07d2709a17	Revert "[MC] Compute fragment offsets eagerly" This reverts commit be5a845e4c29aadb513ae6e5e2879dccf37efdbb. This causes large code size regressions, which were not present in the initial version of this change.	2024-07-31 09:06:43 +02:00
Fangrui Song	be5a845e4c	[MC] Compute fragment offsets eagerly This builds on top of commit 9d0754ada5dbbc0c009bcc2f7824488419cc5530 ("[MC] Relax fragments eagerly") and relaxes fragments eagerly to eliminate MCSection::HasLayout and `getFragmentOffset` overhead. The approach is slightly different from 1a47f3f3db66589c11f8ddacfeaecc03fb80c510 and has less performance benefit. The new layout algorithm also addresses the following problems: * Size change of MCFillFragment/MCOrgFragment did not influence the fixed-point iteration, which could be problematic for contrived cases. * The `invalid number of bytes` error was reported too early, since `.zero A-B` might have temporary negative values in the first few iterations. * X86AsmBackend::finishLayout performed only one iteration, which might not converge. In addition, the removed `#ifndef NDEBUG` code (disabled by default) in X86AsmBackend::finishLayout was problematic, as !NDEBUG and NDEBUG builds evaluated fragment offsets at different times before this patch. * The computed layout for relax-recompute-align.s is optimal now. Builds with many text sections (e.g. full LTO) shall observe a decrease in compile time while the new algorithm could be slightly slower for some -O0 -g projects. Aligned bundling from the deprecated PNaCl placed constraints on how we can perform iteration.	2024-07-30 18:38:03 -07:00
Fangrui Song	4eb5450f63	Revert "[MC] Compute fragment offsets eagerly" This reverts commit 1a47f3f3db66589c11f8ddacfeaecc03fb80c510. Fix #100283 This commit is actually a trigger of other preexisting problems: * Size change of fill fragments does not influence the fixed-point iteration. * The `invalid number of bytes` error is reported too early. Since `.zero A-B` might have temporary negative values in the first few iterations. However, the problems appeared at least "benign" (did not affect the Linux kernel builds) before this commit.	2024-07-30 14:52:29 -07:00
Sergei Barannikov	25bea3eb03	[MC] Forward declare ELFObjectWriter (#100989 )	2024-07-30 10:40:40 +03:00
Pengcheng Wang	ed4e75d5e5	[CodeGen] Remove AA parameter of isSafeToMove (#100691 ) This `AA` parameter is not used and for most uses they just pass a nullptr. The use of `AA` was removed since 8d0383e.	2024-07-26 15:47:47 +08:00
James Y Knight	dfeb3991fb	Remove the `x86_mmx` IR type. (#98505 ) It is now translated to `<1 x i64>`, which allows the removal of a bunch of special casing. This _incompatibly_ changes the ABI of any LLVM IR function with `x86_mmx` arguments or returns: instead of passing in mmx registers, they will now be passed via integer registers. However, the real-world incompatibility caused by this is expected to be minimal, because Clang never uses the x86_mmx type -- it lowers `__m64` to either `<1 x i64>` or `double`, depending on ABI. This change does _not_ eliminate the SelectionDAG `MVT::x86mmx` type. That type simply no longer corresponds to an IR type, and is used only by MMX intrinsics and inline-asm operands. Because SelectionDAGBuilder only knows how to generate the operands/results of intrinsics based on the IR type, it thus now generates the intrinsics with the type MVT::v1i64, instead of MVT::x86mmx. We need to fix this before the DAG LegalizeTypes, and thus have the X86 backend fix them up in DAGCombine. (This may be a short-lived hack, if all the MMX intrinsics can be removed in upcoming changes.) Works towards issue #98272.	2024-07-25 09:19:22 -04:00
Wesley Wiser	ca076f7a63	[LLVM] [MC] Update frame layout & CFI generation to handle frames larger than 2gb (#99263 ) Rebase of #84114. I've only included the core changes to frame layout calculation & CFI generation which sidesteps the regressions found after merging #84114. Since these changes are a necessary precursor to the overall fix and are themselves slightly beneficial as CFI is now generated correctly, I think it is reasonable to merge this first step. --- For very large stack frames, the offset from the stack pointer to a local can be more than 2^31 which overflows various `int` offsets in the frame lowering code. This patch updates the frame lowering code to calculate the offsets as 64-bit values and fixes CFI to use the corrected sizes. After this patch, additional work is needed to fix offset truncations in each target's codegen.	2024-07-23 09:43:30 -07:00
Fangrui Song	c473e75ade	MCAssembler: Move ELFHeaderEFlags to ELFObjectWriter Now that MCELFStreamer can access ELFObjectWriter (commit 70c52b62c5669993e341664a63bfbe5245e32884), we can move ELFHeaderEFlags there.	2024-07-22 18:20:18 -07:00
AtariDreams	56ad7cc012	[IR] Remove non-canonical matchings (#96763 )	2024-07-22 09:47:37 +02:00
Fangrui Song	1a47f3f3db	[MC] Compute fragment offsets eagerly This builds on top of commit 9d0754ada5dbbc0c009bcc2f7824488419cc5530 ("[MC] Relax fragments eagerly") and relaxes fragments eagerly to eliminate MCSection::HasLayout and `getFragmentOffset` overhead. Note: The removed `#ifndef NDEBUG` code (disabled by default) in X86AsmBackend::finishLayout was problematic, as (a) !NDEBUG and NDEBUG builds evaluated fragment offsets at different times before this patch (b) one iteration might not be sufficient to converge. There might be some edge cases that it did not handle. Anyhow, this patch probably makes it work for more cases.	2024-07-21 15:42:27 -07:00
Fangrui Song	7f017f0ab4	[MC] Drop unnecessary MCSymbol::setExternal calls for ELF Similar to e4c360a897fe062914519d331e8f1e28b2b1fbfd (2020).	2024-07-21 10:49:25 -07:00
Fangrui Song	8f14e39e59	[MC] Remove unnecessary isVerboseAsm from Target::AsmTargetStreamerCtorTy The parameter is confusing as it duplicates MCStreamer::isVerboseAsm (initialized from MCTargetOptions::AsmVerbose). After 233cca169237b91d16092c82bd55ee6a283afe98, no in-tree target uses the parameter.	2024-07-21 10:19:17 -07:00
Fangrui Song	233cca1692	[ARM,Hexagon] Ignore IsVerboseAsm parameter in favor of MCStreamer::isVerboseAsm() ... to improve consistency. Most targets don't use VerboseAsm. When they do (X86, SystemZ), they use MCStreamer::isVerboseAsm().	2024-07-21 10:02:47 -07:00
Joseph Huber	615b7eeaa9	Reapply "[LLVM][LTO] Factor out RTLib calls and allow them to be dropped (#98512 )" This reverts commit 740161a9b98c9920dedf1852b5f1c94d0a683af5. I moved the `ISD` dependencies into the CodeGen portion of the handling, it's a little awkward but it's the easiest solution I can think of for now.	2024-07-20 09:29:31 -05:00
NAKAMURA Takumi	5893b1e297	Reformat	2024-07-20 12:36:57 +09:00
NAKAMURA Takumi	740161a9b9	Revert "[LLVM][LTO] Factor out RTLib calls and allow them to be dropped (#98512 )" This reverts commit c05126bdfc3b02daa37d11056fa43db1a6cdef69. (llvmorg-19-init-17714-gc05126bdfc3b) See #99610	2024-07-20 12:36:57 +09:00
Matt Arsenault	0f0cfcff2c	CodeGen: Avoid some references to MachineFunction's getMMI (#99652 ) MachineFunction's probably should not include a backreference to the owning MachineModuleInfo. Most of these references were used just to query the MCContext, which MachineFunction already directly stores. Other contexts are using it to query the LLVMContext, which can already be accessed through the IR function reference.	2024-07-19 22:09:05 +04:00
Kazu Hirata	3e47f6ba4a	Reapply "[Target] Use range-based for loops (NFC) (#98844 )" This iteration drops hunks where the loop body adds more elements.	2024-07-17 19:39:04 -07:00
Amara Emerson	f270a4dd66	[AArch64] Don't tail call memset if it would convert to a bzero. (#98969 ) Well, not quite that simple. We can tc memset since it returns the first argument but bzero doesn't do that and therefore we can end up miscompiling. This patch also refactors the logic out of isInTailCallPosition() into the callers. As a result memcpy and memmove are also modified to do the same thing for consistency. rdar://131419786	2024-07-17 01:31:52 -07:00


3458 Commits