llvm-project

Author	SHA1	Message	Date
Ben Shi	2a528760bf	[AVR] Fix an issue of writing 16-bit ports For 16-bit ports, the normal devices reqiure writing high byte first and then low byte. But the XMEGA devices require the reverse order. Fixes https://github.com/llvm/llvm-project/issues/58395 Reviewed By: aykevl, jacquesguan Differential Revision: https://reviews.llvm.org/D141752	2023-04-17 15:35:33 +08:00
NAKAMURA Takumi	7d5d987e93	[CMake] Reorder and reformat deps	2023-04-17 00:32:16 +09:00
Ben Shi	811759b100	[AVR] Disable post increment load from program memory space We temporarily only allow post increment load/store from/to data memory, and disable post increment load from program space. Updates https://github.com/llvm/llvm-project/issues/59914 Reviewed By: mzh Differential Revision: https://reviews.llvm.org/D147761	2023-04-12 11:52:55 +08:00
Ben Shi	6e57f68e41	[AVR] Reject invalid LDD instruction with explicit error We should reject "ldd Rn, X" with explicit error message rather than "llvm_unreachable" in llvm's release build. Fixes https://github.com/llvm/llvm-project/issues/62012 Reviewed By: Miss_Grape Differential Revision: https://reviews.llvm.org/D147877	2023-04-10 10:34:45 +08:00
Ben Shi	acb4d143bd	[AVR] Fix incorrect expansion of pseudo instructions LPMWRdZ/ELPMWRdZ The 'ELPM' instruction has three forms: -------------------------- \| form \| feature \| \| ----------- \| -------- \| \| ELPM \| hasELPM \| \| ELPM Rd, Z \| hasELPMX \| \| ELPM Rd, Z+ \| hasELPMX \| -------------------------- The second form is always used in the expansion of pseudo instructions LPMWRdZ/ELPMWRdZ. But for devices without ELPMX and with only ELPM, only the first form can be used. Reviewed By: aykevl, Miss_Grape Differential Revision: https://reviews.llvm.org/D141264	2023-04-06 15:27:04 +08:00
Craig Topper	219ff07f72	[Targets] Rename Flag->Glue. NFC Long long ago Glue was called Flag, and it was never completely renamed.	2023-04-02 19:28:51 -07:00
Ben Shi	2a6e39dbf8	[AVR] Do not emit 'LPM Rd, Z' on devices without FeatureLPMX The 'LPM' instruction has three forms: ------------------------ \| form \| feature \| \| ---------- \| --------\| \| LPM \| hasLPM \| \| LPM Rd, Z \| hasLPMX \| \| LPM Rd, Z+ \| hasLPMX \| ------------------------ The second form is always selected in ISelDAGToDAG, even on devices without FeatureLPMX. This patch emits "LPM + MOV" on devices with only FeatureLPM. Reviewed By: jacquesguan Differential Revision: https://reviews.llvm.org/D141246	2023-03-24 17:47:24 +08:00
Kazu Hirata	7bb6d1b32e	[llvm] Skip getAPIntValue (NFC) ConstantSDNode provides some convenience functions like isZero, getZExtValue, and isMinSignedValue that are named identically to those provided by APInt, so we can "skip" getAPIntValue.	2023-03-22 22:10:25 -07:00
Ben Shi	4fa9dc9482	[AVR] Fix incorrect expansion of the pseudo 'ELPMBRdZ' instruction The 'ELPM' instruction has three forms: -------------------------- \| form \| feature \| \| ----------- \| -------- \| \| ELPM \| hasELPM \| \| ELPM Rd, Z \| hasELPMX \| \| ELPM Rd, Z+ \| hasELPMX \| -------------------------- The second form is always used in the expansion of the pseudo instruction 'ELPMBRdZ'. But for devices without ELPMX but only with ELPM, only the first form can be emitted. Reviewed By: jacquesguan Differential Revision: https://reviews.llvm.org/D141221	2023-03-21 11:33:56 +08:00
Ben Shi	30d8f4e843	[AVR] Fix incorrect flags of livein registers when spilling them In AVRFrameLowering::spillCalleeSavedRegisters(), when a 16-bit livein register is spilled, two PUSH instructions are generated for the higher and lower 8-bit registers. But these two 8-bit registers are marked as killed in the two PUSH instructions, so any future use of them will cause a crash. This patch fixes the above issue by adding the two sub 8-bit registers to the livein list. Fixes https://github.com/llvm/llvm-project/issues/56423 Reviewed By: jacquesguan Differential Revision: https://reviews.llvm.org/D144720	2023-02-28 11:08:54 +08:00
Ben Shi	366d34b39e	[AVR][MC] Add ELF flag 'EF_AVR_LINKRELAX_PREPARED' to OBJ files This is in accordance with avr-gcc, even '-mno-relax' is specified to avr-gcc, this flag will also be added to the output relocatables. With this flag set, the GNU ld will perform long call -> short call optimization for AVR, otherwise not. Fixes https://github.com/llvm/llvm-project/issues/54508 Reviewed By: MaskRay, jacquesguan, aykevl Differential Revision: https://reviews.llvm.org/D144617	2023-02-24 11:16:42 +08:00
Fangrui Song	432caca39a	Simplify with hasFeature. NFC	2023-02-17 18:22:24 -08:00
Kazu Hirata	7e6e636fb6	Use llvm::has_single_bit<uint32_t> (NFC) This patch replaces isPowerOf2_32 with llvm::has_single_bit<uint32_t> where the argument is wider than uint32_t.	2023-02-15 22:17:27 -08:00
Ben Shi	697a162fa6	[AVR] Fix inaccurate offsets in PC relative branch instructions In avr-gcc, the destination of "rjmp label + offset" is address 'label + offset', while destination of "rjmp . + offset" is 'address_of_rjmp + offset + 2'. Clang is in accordance with avr-gcc for "rjmp label + offset", but emits incorrect destination of "rjmp . + offset" to 'address_of_rjmp + offset', in which the expected offset 2 is missing. This patch fixes the above issue. Fixes https://github.com/llvm/llvm-project/issues/60019 Reviewed By: jacquesguan, aykevl Differential Revision: https://reviews.llvm.org/D143901	2023-02-14 16:22:14 +08:00
Ben Shi	f37d7c9381	[AVR] Optimize 16-bit comparison with a constant Fixes https://github.com/llvm/llvm-project/issues/30923 Reviewed By: jacquesguan, aykevl Differential Revision: https://reviews.llvm.org/D142281	2023-02-09 15:13:01 +08:00
Archibald Elliott	62c7f035b4	[NFC][TargetParser] Remove llvm/ADT/Triple.h I also ran `git clang-format` to get the headers in the right order for the new location, which has changed the order of other headers in two files.	2023-02-07 12:39:46 +00:00
Bjorn Pettersson	2dd221fe48	Remove no longer needed includes of LegacyPassManager.h Most of the removed includes should probably have been removed already when we removed TargetMachine::adjustPassManager.	2023-02-06 13:38:57 +01:00
Ben Shi	5f1c6d4444	[AVR][NFC] Refactor 'AVRAsmPrinter::PrintAsmOperand' Reviewed By: Chenbing.Zheng, aykevl Differential Revision: https://reviews.llvm.org/D142170	2023-01-31 11:24:34 +08:00
Ben Shi	cadeb8720b	[AVR][NFC] Remove redundant target feature PROGMEM The functionality of FeaturePROGMEM is all equivalant to FeatureLPM. Reviewed By: Chenbing.Zheng, aykevl Differential Revision: https://reviews.llvm.org/D141242	2023-01-30 11:29:46 +08:00
Ben Shi	4d9969ebe3	[AVR] Fix incorrect decoding of conditional branch instructions This patch fixes the inaccurate decoding of the offset operand of the conditional branch instructions. Reviewed By: aykevl Differential Revision: https://reviews.llvm.org/D140816	2023-01-29 16:28:42 +08:00
Ayke van Laethem	8202a3da3c	[AVR] Support most address space casts All hardware address spaces on AVR can be freely cast between (they keep the same bit pattern). They just aren't dereferenceable when they're in a different address space as they really do point to a separate address space. This is supported in avr-gcc: https://godbolt.org/z/9Gfvhnhv9 avr-gcc also supports the `__memx` address space which is 24 bits. We don't support this address space yet but I've added a safeguard just in case. Differential Revison: https://reviews.llvm.org/D142107	2023-01-24 18:41:14 +01:00
Jay Foad	768aed1378	[MC] Make more use of MCInstrDesc::operands. NFC. Change MCInstrDesc::operands to return an ArrayRef so we can easily use it everywhere instead of the (IMHO ugly) opInfo_begin and opInfo_end. A future patch will remove opInfo_begin and opInfo_end. Also use it instead of raw access to the OpInfo pointer. A future patch will remove this pointer. Differential Revision: https://reviews.llvm.org/D142213	2023-01-23 11:31:41 +00:00
Ben Shi	029f669db3	[AVR] Emit 'eicall' for devices with large program memory Fixes https://github.com/llvm/llvm-project/issues/58856 Reviewed By: aykevl Differential Revision: https://reviews.llvm.org/D142298	2023-01-23 09:06:10 +08:00
Ben Shi	c919ea5b48	[AVR] Fix incorrectly printed global symbol operands in inline-asm Fixes https://github.com/llvm/llvm-project/issues/58879 Reviewed By: aykevl Differential Revision: https://reviews.llvm.org/D142096	2023-01-20 09:45:00 +08:00
Sergei Barannikov	6ae84d668f	[MC] Use MCRegister instead of unsigned in MCInstPrinter (NFC) Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D140654	2023-01-17 22:39:39 +03:00
Roman Lebedev	cc39c3b17f	[Codegen][LegalizeIntegerTypes] New legalization strategy for scalar shifts: shift through stack https://reviews.llvm.org/D140493 is going to teach SROA how to promote allocas that have variably-indexed loads. That does bring up questions of cost model, since that requires creating wide shifts. Indeed, our legalization for them is not optimal. We either split it into parts, or lower it into a libcall. But if the shift amount is by a multiple of CHAR_BIT, we can also legalize it throught stack. The basic idea is very simple: 1. Get a stack slot 2x the width of the shift type 2. store the value we are shifting into one half of the slot 3. pad the other half of the slot. for logical shifts, with zero, for arithmetic shift with signbit 4. index into the slot (starting from the base half into which we spilled, either upwards or downwards) 5. load 6. split loaded integer This works for both little-endian and big-endian machines: https://alive2.llvm.org/ce/z/YNVwd5 And better yet, if the original shift amount was not a multiple of CHAR_BIT, we can just shift by that remainder afterwards: https://alive2.llvm.org/ce/z/pz5G-K I think, if we are going perform shift->shift-by-parts expansion more than once, we should instead go through stack, which is what this patch does. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D140638	2023-01-14 19:12:18 +03:00
Ben Shi	159e2a804d	[AVR] Fix a bug in AsmPrinter when printing inline-asm operands Fixes https://github.com/llvm/llvm-project/issues/58878 Reviewed By: aykevl, Miss_Grape Differential Revision: https://reviews.llvm.org/D141589	2023-01-13 14:23:41 +08:00
Vitaly Buka	c0b475bd5e	[NFC][AVR] Use inline field initializers	2023-01-10 17:48:37 -08:00
Vitaly Buka	c866dc9dd2	[AVR] Init member added by D140830	2023-01-10 17:44:31 -08:00
Ben Shi	4c6d6fc176	[AVR] Do not select unimplemented pseudo instructions Reviewed By: aykevl, Miss_Grape Differential Revision: https://reviews.llvm.org/D141210	2023-01-10 16:51:46 +08:00
Fangrui Song	8f7e674771	[AVR] Support .reloc directive Reviewed By: benshi001 Differential Revision: https://reviews.llvm.org/D141176	2023-01-08 23:27:21 +00:00
Ayke van Laethem	9592920890	[AVR] Optimize 32-bit shifts: optimize REG_SEQUENCE This pseudo-instruction stores two small (8-bit) registers into one wide (16-bit) register. But apparently the order matters a lot to the register allocator. This patch changes the order of inserting the registers to optimize for the best register allocation in the tests of shift32.ll. It might be detrimental in other cases, but keeping the registers in the same physical register seems like it would be a common case. Differential Revision: https://reviews.llvm.org/D140573	2023-01-08 20:05:31 +01:00
Ayke van Laethem	fad5e0cf50	[AVR] Optimize 32-bit shifts: reverse shift + move This optimization turns shifts of almost a multiple of 8 into a shift into the opposite direction. Unfortunately it doesn't compose well with the other optimizations (I've tried) so it's separate from them. Differential Revision: https://reviews.llvm.org/D140572	2023-01-08 20:05:31 +01:00
Ayke van Laethem	81f5f22f27	[AVR] Optimize 32-bit shifts: shift by 4 bits This uses a complicated shift sequence that avr-gcc also uses, but extended to work over any number of bytes and in both directions (logical shift left and logical shift right). Unfortunately it can't be used for an arithmetic shift right: I've tried to come up with a sequence but couldn't. Differential Revision: https://reviews.llvm.org/D140571	2023-01-08 20:05:31 +01:00
Ayke van Laethem	8f8afabd32	[AVR] Optimize 32-bit shift: move bytes around This patch optimizes 32-bit constant shifts by renaming registers. This is very effective as the compiler would otherwise need to do a lot of single bit shift instructions. Instead, the registers are renamed at the SSA level which means the register allocator will insert the necessary mov instructions. Unfortunately, the register allocator will insert some unnecessary movs with the current code. This will be fixed in a later patch. Differential Revision: https://reviews.llvm.org/D140570	2023-01-08 20:05:31 +01:00
Ayke van Laethem	840d10a1d2	[AVR] Custom lower 32-bit shift instructions 32-bit shift instructions were previously expanded using the default SelectionDAG expander, which meant it used 16-bit constant shifts and ORed them together. This works, but is far from optimal. I've optimized 32-bit shifts on AVR using a custom inserter. This is done using three new pseudo-instructions that take the upper and lower bits of the value in two separate 16-bit registers and outputs two 16-bit registers. This is the first commit in a series. When completed, shift instructions will take around 31% less instructions on average for constant 32-bit shifts, and is in all cases equal or better than the old behavior. It also tends to match or outperform avr-gcc: the only cases where avr-gcc does better is when it uses a loop to shift, or when the LLVM register allocator inserts some unnecessary movs. But it even outperforms avr-gcc in some cases where avr-gcc does not use a loop. As a side effect, non-constant 32-bit shifts also become more efficient. For some real-world differences: the build of compiler-rt I use in TinyGo becomes 2.7% smaller and the build of picolibc I use becomes 0.9% smaller. I think picolibc is a better representation of real-world code, but even a ~1% reduction in code size is really significant. The current patch just lays the groundwork. The result is actually a regression in code size. Later patches will use this as a basis to optimize these shift instructions. Differential Revision: https://reviews.llvm.org/D140569	2023-01-08 20:05:31 +01:00
Ayke van Laethem	167338de96	[AVR] correctly declare __do_copy_data and __do_clear_bss These two symbols are declared in object files to indicate whether .data needs to be copied from flash or .bss needs to be cleared. They are supported on avr-gcc and reduce firmware size a bit, which is especially important on very small chips. I checked the behavior of avr-gcc and matched it as well as possible. From my investigation, it seems to work as follows: __do_copy_data is set when the compiler finds a data symbol: * without a section name * with a section name starting with ".data" or ".gnu.linkonce.d" * with a section name starting with ".rodata" or ".gnu.linkonce.r" and flash and RAM are in the same address space __do_clear_bss is set when the compiler finds a data symbol: * without a section name * with a section name that starts with .bss Simply checking whether the calculated section name starts with ".data", ".rodata" or ".bss" should result in the same behavior. Fixes: https://github.com/llvm/llvm-project/issues/58857 Differential Revision: https://reviews.llvm.org/D140830	2023-01-08 18:56:06 +01:00
Ben Shi	6dc85bd3fd	[AVR] Fix incorrect decoding of RJMP and RCALL This patch fixes the inaccurate decoding of the offset operand of the RCALL & RJMP instructions. Reviewed By: aykevl, MaskRay Differential Revision: https://reviews.llvm.org/D140815	2023-01-07 23:26:40 +08:00
Ben Shi	7a45b13cd7	[AVR] Fix some ambiguous cases in AsmParser Some specific operands in specific instructions should be treated as variables/symbols/labels, other than registers. This patch fixes those ambiguous cases, such as "lds r25, r24", which means loading the value inside symbol 'r24' into register 'r25'. Fixes https://github.com/llvm/llvm-project/issues/58853 Reviewed by: aykevl Differential Revision: https://reviews.llvm.org/D140777	2023-01-07 10:42:07 +08:00
serge-sans-paille	38818b60c5	Move from llvm::makeArrayRef to ArrayRef deduction guides - llvm/ part Use deduction guides instead of helper functions. The only non-automatic changes have been: 1. ArrayRef(some_uint8_pointer, 0) needs to be changed into ArrayRef(some_uint8_pointer, (size_t)0) to avoid an ambiguous call with ArrayRef((uint8_t), (uint8_t)) 2. CVSymbol sym(makeArrayRef(symStorage)); needed to be rewritten as CVSymbol sym{ArrayRef(symStorage)}; otherwise the compiler is confused and thinks we have a (bad) function prototype. There was a few similar situation across the codebase. 3. ADL doesn't seem to work the same for deduction-guides and functions, so at some point the llvm namespace must be explicitly stated. 4. The "reference mode" of makeArrayRef(ArrayRef<T> &) that acts as no-op is not supported (a constructor cannot achieve that). Per reviewers' comment, some useless makeArrayRef have been removed in the process. This is a follow-up to https://reviews.llvm.org/D140896 that introduced the deduction guides. Differential Revision: https://reviews.llvm.org/D140955	2023-01-05 14:11:08 +01:00
Ben Shi	a59e96f1a1	[AVR] Select 16-bit LDS/STS for load/store on AVRTiny. The 32-bit LDS/STS are not available on AVRTiny, so we have to use their compact 16-bit form for memory access. Reviewed By: aykevl Differential Revision: https://reviews.llvm.org/D139687	2022-12-23 11:03:45 +08:00
Ben Shi	77249ea6bc	[AVR] Support 16-bit LDS/STS on AVRTiny. LDS/STS are 32-bit instructions on AVR, which can access up to 64KB data space. While they are 16-bit instructions on AVRTiny, which can only access 128B data space. Reviewed By: aykevl Differential Revision: https://reviews.llvm.org/D139621	2022-12-23 11:03:45 +08:00
Ben Shi	c41d425030	[AVR][MC] Fix illegal operand forms. These operands are illegal and rejected by avr-gcc. subi r24, -lo8(symobl+offset) sbci r25, -hi8(symobl+offset) And their correct form should be subi r24, lo8(-(symobl+offset)) sbci r25, hi8(-(symobl+offset)) Reviewed By: aykevl Differential Revision: https://reviews.llvm.org/D140473	2022-12-23 09:48:06 +08:00
Ben Shi	3730f13428	[AVR] Fix a bug in AsmPrinter when printing memory operands. Reviewed By: aykevl Differential Revision: https://reviews.llvm.org/D140383	2022-12-23 09:42:29 +08:00
Ayke van Laethem	d1d3005c9f	[AVR] Do not emit instructions invalid for attiny10 The attiny4/attiny5/attiny9/attiny10 have a slightly modified instruction set that drops a number of useful instructions. This patch makes sure to not emit them on these "reduced tiny" cores. The affected instructions are: * lds and sts (load/store directly from data) * ldd and std (load/store with displacement) * adiw and sbiw (add/sub register pairs) * various other instructions that were emitted without checking whether the chip actually supports them (movw, adiw, etc) There is a variant on lds and sts on these chips, but it can only address a limited portion of the address space and is mainly useful to load/store I/O registers (as an extension to the in and out instructions). I have not implemented it here, implementing it can be done in a separate patch. This patch is not optimal. I'm sure it can be improved a lot. For example, we could teach the instruction selector to not select lddw/stdw instructions so that the weird pointer adjustments are not necessary. But for now I've focused just on correctness, not on code quality. Updates: https://github.com/llvm/llvm-project/issues/53459 Differential Revision: https://reviews.llvm.org/D131867	2022-12-22 17:04:53 +01:00
Nick Desaulniers	19a004b468	[llvm][SelectionDAGISel] support -{start\|stop}-{before\|after}= for remaining targets Follow up to the series: 1. https://reviews.llvm.org/D140161 2. https://reviews.llvm.org/D140349 3. https://reviews.llvm.org/D140331 4. https://reviews.llvm.org/D140323 Completes the work from the previous two for remaining targets. This creates the following named passes that can be run via `llc -{start\|stop}-{before\|after}`: - arc-isel - arm-isel - avr-isel - bpf-isel - csky-isel - hexagon-isel - lanai-isel - loongarch-isel - m68k-isel - msp430-isel - mips-isel - nvptx-isel - ppc-codegen - riscv-isel - sparc-isel - systemz-isel - ve-isel - wasm-isel - xcore-isel A nice way to write tests for SelectionDAGISel might be to use a RUN: line like: llc -mtriple=<triple> -start-before=<arch>-isel -stop-after=finalize-isel -o - Fixes: https://github.com/llvm/llvm-project/issues/59538 Reviewed By: asb, zixuan-wu Differential Revision: https://reviews.llvm.org/D140364	2022-12-21 13:25:15 -08:00
Matt Arsenault	69e75ae695	CodeGen: Don't lazily construct MachineFunctionInfo This fixes what I consider to be an API flaw I've tripped over multiple times. The point this is constructed isn't well defined, so depending on where this is first called, you can conclude different information based on the MachineFunction. For example, the AMDGPU implementation inspected the MachineFrameInfo on construction for the stack objects and if the frame has calls. This kind of worked in SelectionDAG which visited all allocas up front, but broke in GlobalISel which hasn't visited any of the IR when arguments are lowered. I've run into similar problems before with the MIR parser and trying to make use of other MachineFunction fields, so I think it's best to just categorically disallow dependency on the MachineFunction state in the constructor and to always construct this at the same time as the MachineFunction itself. A missing feature I still could use is a way to access an custom analysis pass on the IR here.	2022-12-21 10:49:32 -05:00
Archibald Elliott	f09cf34d00	[Support] Move TargetParsers to new component This is a fairly large changeset, but it can be broken into a few pieces: - `llvm/Support/TargetParser` are all moved from the LLVM Support component into a new LLVM Component called "TargetParser". This potentially enables using tablegen to maintain this information, as is shown in https://reviews.llvm.org/D137517. This cannot currently be done, as llvm-tblgen relies on LLVM's Support component. - This also moves two files from Support which use and depend on information in the TargetParser: - `llvm/Support/Host.{h,cpp}` which contains functions for inspecting the current Host machine for info about it, primarily to support getting the host triple, but also for `-mcpu=native` support in e.g. Clang. This is fairly tightly intertwined with the information in `X86TargetParser.h`, so keeping them in the same component makes sense. - `llvm/ADT/Triple.h` and `llvm/Support/Triple.cpp`, which contains the target triple parser and representation. This is very intertwined with the Arm target parser, because the arm architecture version appears in canonical triples on arm platforms. - I moved the relevant unittests to their own directory. And so, we end up with a single component that has all the information about the following, which to me seems like a unified component: - Triples that LLVM Knows about - Architecture names and CPUs that LLVM knows about - CPU detection logic for LLVM Given this, I have also moved `RISCVISAInfo.h` into this component, as it seems to me to be part of that same set of functionality. If you get link errors in your components after this patch, you likely need to add TargetParser into LLVM_LINK_COMPONENTS in CMake. Differential Revision: https://reviews.llvm.org/D137838	2022-12-20 11:05:50 +00:00
Sergei Barannikov	4d48ccfc88	[MC] Use `MCRegister` instead of `unsigned` in `MCTargetAsmParser` Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D140273	2022-12-18 12:12:05 -08:00
Christudasan Devadasan	b5efec4b27	[CodeGen] Additional Register argument to storeRegToStackSlot/loadRegFromStackSlot With D134950, targets get notified when a virtual register is created and/or cloned. Targets can do the needful with the delegate callback. AMDGPU propagates the virtual register flags maintained in the target file itself. They are useful to identify a certain type of machine operands while inserting spill stores and reloads. Since RegAllocFast spills the physical register itself, there is no way its virtual register can be mapped back to retrieve the flags. It can be solved by passing the virtual register as an additional argument. This argument has no use when the spill interfaces are called during the greedy allocator or even the PrologEpilogInserter and can pass a null register in such cases. Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D138656	2022-12-17 11:55:34 +05:30

1 2 3 4 5 ...

502 Commits