llvm-project

Author	SHA1	Message	Date
Phoebe Wang	c72a751dab	[X86][AMX] Support AMX-TRANSPOSE (#113532 ) Ref.: https://cdrdv2.intel.com/v1/dl/getContent/671368	2024-11-01 16:45:03 +08:00
Yingwei Zheng	cf9d1c1486	[SDAG] Simplify `SDNodeFlags` with bitwise logic (#114061 ) This patch allows using enumeration values directly and simplifies the implementation with bitwise logic. It addresses the comment in https://github.com/llvm/llvm-project/pull/113808#discussion_r1819923625.	2024-10-31 08:10:07 +08:00
David Green	83ae171722	[AArch64] Add ComputeNumSignBits for VASHR. (#113957 ) As with a normal ISD::SRA node, they take the number of sign bits of the incoming value and increase it by the shifted amount.	2024-10-29 21:02:32 +00:00
Alex Rønne Petersen	ad4a582fd9	[llvm] Consistently respect `naked` fn attribute in `TargetFrameLowering::hasFP()` (#106014 ) Some targets (e.g. PPC and Hexagon) already did this. I think it's best to do this consistently so that frontend authors don't run into inconsistent results when they emit `naked` functions. For example, in Zig, we had to change our emit code to also set `frame-pointer=none` to get reliable results across targets. Note: I don't have commit access.	2024-10-18 09:35:42 +04:00
Simon Pilgrim	49fa91edf7	[DAG] SDPatternMatch - add missing ROTL/ROTR matchers	2024-10-16 11:57:18 +01:00
Simon Pilgrim	d3d2d72549	[DAG] SDPatternMatch - add missing BSWAP/CTPOP/CTTZ matchers	2024-10-16 11:52:58 +01:00
c8ef	854ded9b24	Reapply "[DAG] Enhance SDPatternMatch to match integer minimum and maximum patterns in addition to the existing ISD nodes." (#112203 ) This patch adds icmp+select patterns for integer min/max matchers in SDPatternMatch, similar to those in IR PatternMatch. Reapply #111774. Closes #108218.	2024-10-15 21:07:06 +08:00
David Green	04546a0dd6	[GlobalISel] Support vector G_UNMERGE_VALUES in computeKnownBits. (#112172 ) This adds computeKnownBits support for vector->vector G_UNMERGE_VALUES, grabbing the known bits with an adjusted DemandedElts mask.	2024-10-15 08:23:05 +01:00
c8ef	a3b0c31ebc	Revert "[DAG] Enhance SDPatternMatch to match integer minimum and maximum patterns in addition to the existing ISD nodes." (#112200 ) Reverts llvm/llvm-project#111774 This appears to be causing some tests to fail.	2024-10-14 21:43:49 +08:00
c8ef	11f625cb87	[DAG] Enhance SDPatternMatch to match integer minimum and maximum patterns in addition to the existing ISD nodes. (#111774 ) Closes #108218. This patch adds icmp+select patterns for integer min/max matchers in SDPatternMatch, similar to those in IR PatternMatch.	2024-10-14 21:19:34 +08:00
Jay Foad	eb6e7e8f89	[unittests] Use {} instead of std::nullopt to initialize empty ArrayRef (#109388 ) Follow up to #109133.	2024-09-21 10:59:50 +01:00
Michael Maitland	ee2add0683	[GISEL] Fix bugs and clarify spec of G_EXTRACT_SUBVECTOR (#108848 ) The implementation was missing the fact that `G_EXTRACT_SUBVECTOR` destination and source vector can be different types. Also fix a bug in the MIR builder for `G_EXTRACT_SUBVECTOR` to generate the correct opcode. Clarify the G_EXTRACT_SUBVECTOR specification.	2024-09-17 10:08:39 -04:00
Robert Dazi	8837898b8d	[DAGCombine] Count leading ones: refine post DAG/Type Legalisation if promotion (#102877 ) This PR is related to #99591. In this PR, instead of modifying how the legalisation occurs depending on surrounding instructions, we refine after legalisation. This PR has two parts: * `SDPatternMatch/MatchContext`: Modify a little bit the code to match Operands (used by `m_Node(...)`) and Unary/Binary/Ternary Patterns to make it compatible with `VPMatchContext`, instead of only `m_Opc` supported. Some tests were added to ensure no regressions. * `DAGCombiner`: Add a `foldSubCtlzNot` which detect and rewrite the patterns using matching context. Remaining Tasks: - [ ] GlobalISel - [ ] Currently the pattern matching will occur even before legalisation. Should I restrict it to specific stages instead ? - [ ] Style: Add a visitVP_SUB ?? Move `foldSubCtlzNot` in another location for style consistency purpose ? @topperc --------- Co-authored-by: v01dxyz <v01dxyz@v01d.xyz>	2024-09-15 15:48:36 +04:00
JOE1994	387bee91f0	[llvm][unittests] Strip unneeded uses of raw_string_ostream::str() (NFC) Avoid excess layer of indirection.	2024-09-13 09:42:32 -04:00
Kyungwoo Lee	38c3855c9f	[NFC] Remove unused argument (FuncName) for parseMIR (#106144 ) While working on a MIR unittest, I noticed that parseMIR includes an unused argument that sets a function name. This is not only redundant but also irrelevant, as parseMIR is designed to parse entire module, not specific functions, even though most unittests contain a single function per module. To streamline the API, I have removed this unnecessary argument from parseMIR. However, if this argument was originally included to enhance readability or for any other purpose, please let me know.	2024-08-26 19:19:02 -07:00
Noah Goldstein	70f3863b5f	[DAG][PatternMatch] Add support for matchers with flags; NFC Add support for matching with `SDNodeFlags` i.e `add` with `nuw`. This patch adds helpers for `or disjoint` or `zext nneg` with the same names as we have in IR/PatternMatch api. Closes #103060	2024-08-18 15:37:56 -07:00
v01dXYZ	fc1b019638	[DAG] SD Pattern Match: Operands patterns with VP Context (#103308 ) Currently, when using a VP match context with `sd_context_match`, only Opcode matching is possible (`m_Opc(Opcode)`). This PR suggest a way to make patterns with Operands (eg `m_Node`, `m_Add`, ...) works with a VP context. This PR blocks another PR https://github.com/llvm/llvm-project/pull/102877. Co-authored-by: v01dxyz <v01dxyz@v01d.xyz>	2024-08-16 09:46:20 +01:00
Jorge Botto	05dfac23f1	[DAG] Adding m_FPToUI and m_FPToSI to SDPatternMatch.h (#104044 ) Adds m_FPToUI/m_FPToSI matchers for ISD::FP_TO_UINT/ISD::FP_TO_SINT in SDPatternMatch.h with suitable test coverage. Fixes https://github.com/llvm/llvm-project/issues/103872	2024-08-15 09:49:40 +01:00
Sergei Barannikov	6cf3e7d067	[DataLayout] Use member initialization (NFC) (#103712 ) This also adds a default constructor and a few uses of it.	2024-08-14 15:02:47 +03:00
Rahul Joshi	1753008bbb	[NFC] Eliminate top-level "using namespace" from some headers. (#102751 ) - Eliminate top-level "using namespace" from some headers.	2024-08-11 13:10:48 -07:00
Tobias Stadler	d2336fd75c	[RFC][GlobalISel] InstructionSelect: Allow arbitrary instruction erasure (#97670 ) See https://discourse.llvm.org/t/rfc-globalisel-instructionselect-allow-arbitrary-instruction-erasure	2024-08-11 17:26:43 +02:00
Michael Maitland	0dd1128d63	[DAG] Add SDPatternMatch::m_VSelect (#100758 ) As per the comment in https://github.com/llvm/llvm-project/pull/100686#pullrequestreview-2201991135	2024-07-29 13:19:43 -04:00
Michael Maitland	ad778889cf	[DAG] Add SDPatternMatch for VScale nodes	2024-07-29 06:50:26 -07:00
Michael Maitland	862d837e48	[DAG] Add SDPatternMatch::m_Select (#100686 ) This will enable us to use SDPatternMatch with ISD::SELECT SDNodes in the future.	2024-07-26 10:43:06 -04:00
Matt Arsenault	63e1647827	CodeGen: Remove MachineModuleInfo reference from MachineFunction (#100357 ) This avoids another unserializable field. Move the DbgInfoAvailable field into the AsmPrinter, which is only really a cache/convenience bit for checking a direct IR module metadata check.	2024-07-26 13:10:08 +04:00
Vitaly Buka	455990d18f	Reland "SelectionDAG: Avoid using MachineFunction::getMMI" (#99779 ) Reverts llvm/llvm-project#99777 Co-authored-by: Matt Arsenault <Matthew.Arsenault@amd.com>	2024-07-24 10:38:53 +04:00
Vitaly Buka	98c0e55d9d	Revert "SelectionDAG: Avoid using MachineFunction::getMMI" (#99777 ) Reverts llvm/llvm-project#99696 https://lab.llvm.org/buildbot/#/builders/164/builds/1262	2024-07-20 12:20:50 -07:00
Matt Arsenault	c2019a37bd	SelectionDAG: Avoid using MachineFunction::getMMI (#99696 )	2024-07-20 10:53:41 +04:00
Jeremy Morse	676efd0ffb	Reapply 078198f310d5 "Index DebugVariables and some DILocations" Now revised to actually make the unit test compile, which I'd been ignoring. No actual functional change, it's a type difference. Original commit message follows. [DebugInfo][InstrRef] Index DebugVariables and some DILocations (#99318) A lot of time in LiveDebugValues is spent computing DenseMap keys for DebugVariables, and they're made up of three pointers, so are large. This patch installs an index for them: for the SSA and value-to-location mapping parts of InstrRefBasedLDV we don't need to access things like the variable declaration or the inlining site, so just use a uint32_t identifier for each variable fragment that's tracked. The compile-time performance improvements are substantial (almost 0.4% on the tracker). About 80% of this patch is just replacing DebugVariable references with DebugVariableIDs instead, however there are some larger consequences. We spend lots of time fetching DILocations when emitting DBG_VALUE instructions, so index those with the DebugVariables: this means all DILocations on all new DBG_VALUE instructions will normalise to the first-seen DILocation for the variable (which should be fine). We also used to keep an ordering of when each variable was seen first in a DBG_* instruction, in the AllVarsNumbering collection, so that we can emit new DBG_* instructions in a stable order. We can hang this off the DebugVariable index instead, so AllVarsNumbering is deleted. Finally, rather than ordering by AllVarsNumbering just before DBG_* instructions are linked into the output MIR, store instructions along with their DebugVariableID, so that they can be sorted by that instead.	2024-07-18 15:55:06 +01:00
Jeremy Morse	50b657c8f6	Revert "[DebugInfo][InstrRef] Index DebugVariables and some DILocations (#99318 )" This reverts commit 078198f310d55925ccd9e1aa5b6ff4af3b36bbc7. Buildbots unhappy, I must have fluffed it	2024-07-18 15:05:57 +01:00
Jeremy Morse	078198f310	[DebugInfo][InstrRef] Index DebugVariables and some DILocations (#99318 ) A lot of time in LiveDebugValues is spent computing DenseMap keys for DebugVariables, and they're made up of three pointers, so are large. This patch installs an index for them: for the SSA and value-to-location mapping parts of InstrRefBasedLDV we don't need to access things like the variable declaration or the inlining site, so just use a uint32_t identifier for each variable fragment that's tracked. The compile-time performance improvements are substantial (almost 0.4% on the tracker). About 80% of this patch is just replacing DebugVariable references with DebugVariableIDs instead, however there are some larger consequences. We spend lots of time fetching DILocations when emitting DBG_VALUE instructions, so index those with the DebugVariables: this means all DILocations on all new DBG_VALUE instructions will normalise to the first-seen DILocation for the variable (which should be fine). We also used to keep an ordering of when each variable was seen first in a DBG_* instruction, in the AllVarsNumbering collection, so that we can emit new DBG_* instructions in a stable order. We can hang this off the DebugVariable index instead, so AllVarsNumbering is deleted. Finally, rather than ordering by AllVarsNumbering just before DBG_* instructions are linked into the output MIR, store instructions along with their DebugVariableID, so that they can be sorted by that instead.	2024-07-18 15:04:02 +01:00
Simon Pilgrim	61a4e1e70f	[DAG] Add SDPatternMatch::m_SetCC and update some combines to use it (#98646 ) The plan is to add more TernaryOp in the future (SELECT/VSELECT and FMA in particular)	2024-07-14 17:18:43 +01:00
Manish Kausik H	69192e0193	[LegalizeDAG] Optimize CodeGen for `ISD::CTLZ_ZERO_UNDEF` (#83039 ) Previously we had the same instructions being generated for `ISD::CTLZ` and `ISD::CTLZ_ZERO_UNDEF` which did not take advantage of the fact that zero is an invalid input for `ISD::CTLZ_ZERO_UNDEF`. This commit separates codegen for the two cases to allow for the optimization for the latter case. The details of the optimization are outlined in #82075 Fixes #82075 Co-authored-by: Manish Kausik H <hmamishkausik@gmail.com>	2024-07-08 14:01:32 +01:00
Kazu Hirata	75bc20ff89	[llvm] Remove redundant calls to std::unique_ptr<T>::get (NFC) (#97914 )	2024-07-07 08:23:41 +09:00
Alexis Engelke	80ffec7884	[AsmPrinter] Remove timers (#97046 ) Timers are an out-of-line function call and a global variable access, here twice per emitted instruction. At this granularity, not only the time results become skewed, but the timers also add a performance overhead when profiling is disabled. Also outside of the innermost loop, timers add a measurable overhead. As this is quite expensive for a mostly unused profiling facility, remove the timers. Fixes #39650.	2024-07-01 16:20:54 +02:00
Alexis Engelke	117b53ae38	[AsmPrinter] Reduce AsmPrinterHandlers virt. fn calls (#96785 ) Currently, an AsmPrinterHandler has several methods that allow to dynamically hook in unwind or debug info emission, e.g. at begin/end of every function or instruction. The class hierarchy and the actually overridden functions are as follows: (SymSz=setSymbolSize, mFE=markFunctionEnd, BBS=BasicBlockSection, FL=Funclet; b=beginX, e=endX) SymSz Mod Fn mFE BBS FL Inst AsmPrinterHandler - - - - - - - ` PseudoProbeHandler - - - - - - - ` WinCFGuard - e e - - - - ` EHStreamer - - - - - - - ` DwarfCFIException - e be - be - - ` ARMException - - be e - - - ` AIXException - - e - - - - ` WinException - e be e - be - ` WasmException - e be - - - - ` DebugHandlerBase - b be - be - be ` BTFDebug - e - - - - b ` CodeViewDebug - be - - - - b ` DWARFDebug yes be - - - - b Doing virtual function calls per instruction is costly and useless when the called function does nothing. This commit performs the following clean-up/improvements: - PseudoProbeHandler is no longer an AsmPrinterHandler -- it used nothing of its functionality to hook in at the possible points. This avoids virtual function calls when a pseudo probe printer is present. - DebugHandlerBase is no longer an AsmPrinterHandler, but a separate base class. DebugHandlerBase is the only remaining "hook" for begin/end instruction and setSymbolSize (only used by DWARFDebug). begin/end for function and basic block sections are never overriden and therefore are no longer virtual. (Originally I intended there to be only one debug handler, but BPF as the only target supports two at the same time: DWARF and BTF.) - AsmPrinterHandler no longer has begin/end instruction and setSymbolSize hooks -- these were only used by DebugHandlerBase. This avoid iterating over handlers in every instruction. AsmPrinterHandler Mod Fn mFE BBS FL ` WinCFGuard e e - - - ` EHStreamer - - - - - ` DwarfCFIException e be - be - ` ARMException - be e - - ` AIXException - e - - - ` WinException e be e - be ` WasmException e be - - - SymSz Mod Fn BBS Inst DebugHandlerBase - b be be be ` BTFDebug - e b ` CodeViewDebug - be b ` DWARFDebug yes be b PseudoProbeHandler (no shared methods) To continue allowing external users (e.g., Julia) to hook in at every instruction, a new method addDebugHandler is exposed. This results in a performance improvement, especially in the -O0 -g0 case with unwind information (e.g., JIT baseline).	2024-07-01 13:55:58 +02:00
Nikita Popov	74deadf196	[IRBuilder] Don't include Module.h (NFC) (#97159 ) This used to be necessary to fetch the DataLayout, but isn't anymore.	2024-06-29 15:05:04 +02:00
paperchalice	bf52884981	[DomTreeUpdater] Fix use after free in unittests (#97133 ) In #96851, the unit test contains use after free, which triggers sanitizer error. Fix https://lab.llvm.org/buildbot/#/builders/169/builds/490	2024-06-29 09:33:39 +08:00
Nikita Popov	4169338e75	[IR] Don't include Module.h in Analysis.h (NFC) (#97023 ) Replace it with a forward declaration instead. Analysis.h is pulled in by all passes, but not all passes need to access the module.	2024-06-28 14:30:47 +02:00
paperchalice	c931ac5994	Reapply "[CodeGen] Introduce `MachineDomTreeUpdater`" (#96846 ) (#96851 ) This reverts commit 0f8849349ae3d3f2f537ad6ab233a586fb39d375. Resolve conflict in `MachinePostDominators.h` There is a conflict after merging #96378, resolved in #96852. Both PRs modified `MachinePostDominators.h` and triggered build failure.	2024-06-28 14:48:09 +08:00
darkbuck	1ff05876fb	[GlobalISel] Add unit tests for call lowering on byref support Reviewers: tschuett, spaits, aemerson, arsenm Reviewed By: spaits, arsenm Pull Request: https://github.com/llvm/llvm-project/pull/96805	2024-06-27 13:19:44 -04:00
paperchalice	0f8849349a	Revert "[CodeGen] Introduce `MachineDomTreeUpdater`" (#96846 ) Reverts llvm/llvm-project#95369 Many build bots failed	2024-06-27 12:31:24 +08:00
paperchalice	6ca387cbcb	[CodeGen] Introduce `MachineDomTreeUpdater` (#95369 ) This commit converts most of `DomTreeUpdater` into `GenericDomTreeUpdater` class template, so IR and MIR can reuse some codes. There are some differences between interfaces of `BasicBlock` and `MachineBasicBlock`, so subclasses still need to implement some functions, like `forceFlushDeletedBB`.	2024-06-27 12:25:18 +08:00
Jay Foad	d6f906eadb	[SlotIndexes] Use simple_ilist instead of ilist. NFC. (#96747 ) simple_ilist does not take ownership of its nodes, which is fine for SlotIndexes because the IndexListEntry nodes are allocated with a BumpPtrAllocator and do not need to be freed.	2024-06-26 12:58:59 +01:00
Serge Pavlov	f9795f34a6	[GlobalISel] Add build methods for FP environment intrinsics (#96607 ) This change adds methods like buildGetFPEnv and similar for opcodes that represent manipulation on floating-point state.	2024-06-25 16:13:52 +07:00
c8ef	4f54b91842	[SDPatternMatch] Only match ISD::SIGN_EXTEND in m_SExt (#95415 ) Context: https://github.com/llvm/llvm-project/pull/95365#discussion_r1638236603 The current implementation of `m_SExt` matches both `ISD::SIGN_EXTEND` and `ISD::SIGN_EXTEND_INREG`. However, in cases where we specifically need to match _only_ `ISD::SIGN_EXTEND`, such as in the SelectionDAG graph below, this can lead to issues and unintended combinations. ``` SelectionDAG has 13 nodes: t0: ch,glue = EntryToken t2: v2i32,ch = CopyFromReg t0, Register:v2i32 %0 t21: v2i32 = sign_extend_inreg t2, ValueType:ch:v2i8 t4: v2i32,ch = CopyFromReg t0, Register:v2i32 %1 t22: v2i32 = sign_extend_inreg t4, ValueType:ch:v2i8 t23: v2i32 = avgfloors t21, t22 t24: v2i32 = sign_extend_inreg t23, ValueType:ch:v2i8 t15: ch,glue = CopyToReg t0, Register:v2i32 $d0, t24 t16: ch = AArch64ISD::RET_GLUE t15, Register:v2i32 $d0, t15:1 ```	2024-06-14 10:44:29 +01:00
Pierre van Houtryve	ed299b3efd	[GlobalISel] Optimize ULEB128 usage (#90565 ) - Remove some cases where ULEB128 isn't needed - Add a fastDecodeULEB128 tailored for GlobalISel which does unchecked decoding optimized for the common case, which is 1 byte values. We rarely have >1 byte Inst IDs, OpIdx, etc. and those are the most common ULEB users by far. This specific LEB128 decode function generates almost 2x less instructions than the generic one.	2024-05-03 10:26:54 +02:00
Min-Yih Hsu	0638e222f3	[SDPatternMatch] Add m_CondCode, m_NoneOf, and some SExt improvements (#90762 ) - Add m_CondCode to match the ISD::CondCode value from CondCodeSDNode - Add m_NoneOf combinator - m_SExt now recognizes sext_inreg	2024-05-02 08:56:42 -07:00
paperchalice	6ea0c0a283	[NewPM][CodeGen] Add `MachineFunctionAnalysis` (#88610 ) In new pass system, `MachineFunction` could be an analysis result again, machine module pass can now fetch them from analysis manager. `MachineModuleInfo` no longer owns them. Remove `FreeMachineFunctionPass`, replaced by `InvalidateAnalysisPass<MachineFunctionAnalysis>`. Now `FreeMachineFunction` is replaced by `InvalidateAnalysisPass<MachineFunctionAnalysis>`, the workaround in `MachineFunctionPassManager` is no longer needed, there is no difference between `unittests/MIR/PassBuilderCallbacksTest.cpp` and `unittests/IR/PassBuilderCallbacksTest.cpp`.	2024-04-30 09:54:48 +08:00
chuongg3	bf57d2e57c	[AArch64][GlobalISel] Enable computeNumSignBits for G_XOR, G_AND, G_OR (#89896 )	2024-04-29 10:53:30 +01:00

1 2 3 4 5 ...

666 Commits