llvm-project

Author	SHA1	Message	Date
Nikita Popov	a6a526ec54	[IR] Add AllocaInst::getAllocationSize() (NFC) When fetching allocation sizes, we almost always want to have the size in bytes, but we were only providing an InBits API. Also add the corresponding byte-based conjugate to save some *8 and /8 juggling everywhere.	2023-01-06 15:36:16 +01:00
Chuanqi Xu	65e3398869	[NFC] [Coroutines] Move collectFrameAlloca to decrease the times to iterate the function Previously, collectFrameAllocas iterated over every instruction in the Function, and the function was iterated again later. That is redundant.	2023-01-06 16:38:07 +08:00
Dawid Jurczak	7e6c7562cb	[NFC][Coroutines] Build DominatorTree only once before collecting frame allocas (PR58650) Assuming that collecting frame allocas doesn't modify CFG we can safely move DominatorTree construction outside loop and avoid expensive computations. Differential Revision: https://reviews.llvm.org/D140818	2023-01-05 10:32:28 +01:00
Matthias Braun	ae7bf2b80b	CoroFrame: Put escaped variables with multiple lifetimes on coroutine frame The llvm.lifetime.start intrinsic guarantees that the address for a given alloca is always the same. So variables with escaped addresses reaching a lifetime start/end block before and after a suspend must be placed onto the coroutine frame even if the variable itself is not alive across the suspend point. This computes a new `LoopKill` flag in the suspend crossing data flow analysis to catch the case where a lifetime marker can reach itself via a suspend-crossing path. This fixes https://llvm.org/PR52501 Differential Revision: https://reviews.llvm.org/D140231	2023-01-04 07:30:08 -08:00
Fangrui Song	a9fae58782	[Transforms/Coroutines] llvm::Optional => std::optional	2022-12-13 08:32:44 +00:00
Chuanqi Xu	7dccdd76b3	[Coroutines] Don't mark the parameter attribute of resume function as noalias Close https://github.com/llvm/llvm-project/issues/59221. The root cause of the problem is that we previously marked the parameter of the resume/destroy functions as noalias. But this is not true. See https://github.com/llvm/llvm-project/issues/59221 for the details. Long story short, for this C++ program (https://compiler-explorer.com/z/6qGcozG93), the optimized frame will be something like: ``` struct test_frame { void (__resume_)(), // a function pointer points to the `test.resume` function, which can be imagined as the test() function in the example. .... struct a_frame { ... void caller; // may point to test_frame at runtime. }; }; ``` And the function a and function test look like: ``` define i32 @a(ptr noalias %alloc_8) { %alloc_8_16 = getelementptr ptr, ptr %alloc_8, i64 16 store i32 42, ptr %alloc_8_16, align 8 %alloc_8_8 = getelementptr ptr, ptr %alloc_8, i64 8 %alloc = load ptr, ptr %alloc_8_8, align 8 %p = load ptr, ptr %alloc, align 8 %r = call i32 %p(ptr %alloc) ret i32 %r } define i32 @b(ptr %p) { entry: %alloc = alloca [128 x i8], align 8 %alloc_8 = getelementptr ptr, ptr %alloc, i64 8 %alloc_8_8 = getelementptr ptr, ptr %alloc_8, i64 8 store ptr %alloc, ptr %alloc_8_8, align 8 store ptr %p, ptr %alloc, align 8 %r = call i32 @a(ptr nonnull %alloc_8) ret i32 %r } ``` Here inside the function `a`, we can access the parameter `%alloc_8` through `%alloc`, and we pass `%alloc` to an unknown function. So it breaks the assumption of the `noalias` parameter. Note that although only the CoroElide optimization can put a frame inside another frame directly, the following case is not valid either: ``` struct test_frame { .... void a_frame; // may point to a_frame at runtime. }; struct a_frame { void *caller; // may point to test_frame at runtime. }; ``` Since the C++ language allows the programmer to get the address of coroutine frames, we can't assume the above case wouldn't happen in source code. So we can't set the parameter as noalias regardless of whether CoroElide applies. For other languages, it may be safe if they don't allow programmers to get the address of coroutine frames. Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D139295	2022-12-13 10:19:24 +08:00
Kazu Hirata	3ee6f7cf65	[Coroutines] Use std::optional in CoroFrame.cpp (NFC) This is part of an effort to migrate from llvm::Optional to std::optional: https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716	2022-12-10 08:27:36 -08:00
Krzysztof Parzyszek	f3b6dbfda8	Instructions: convert Optional to std::optional	2022-12-04 14:25:11 -06:00
Kazu Hirata	343de6856e	[Transforms] Use std::nullopt instead of None (NFC) This patch mechanically replaces None with std::nullopt where the compiler would warn if None were deprecated. The intent is to reduce the amount of manual work required in migrating from Optional to std::optional. This is part of an effort to migrate from llvm::Optional to std::optional: https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716	2022-12-02 21:11:37 -08:00
Vasileios Porpodas	bebca2b6d5	[NFC] Cleanup: Replaces BB->getInstList().splice() with BB->splice(). This is part of a series of cleanup patches towards making BasicBlock::getInstList() private. Differential Revision: https://reviews.llvm.org/D138979	2022-12-01 15:37:51 -08:00
Guillaume Chatelet	3e6001e91e	[Coroutines] createStructType takes alignment in bits but receives bytes This has been found while trying to remove the last few places relying on `unsigned` to convey alignment operations. This seems to be untested. Differential Revision: https://reviews.llvm.org/D138784	2022-11-29 16:05:06 +00:00
Kazu Hirata	8dd2416e44	[Coroutines] Use std::optional in CoroFrame.cpp (NFC) This is part of an effort to migrate from llvm::Optional to std::optional: https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716	2022-11-25 23:18:17 -08:00
Kazu Hirata	d7fdb5d87b	[Coroutines] Use std::optional in CoroElide.cpp (NFC) This is part of an effort to migrate from llvm::Optional to std::optional: https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716	2022-11-25 23:15:51 -08:00
Chuanqi Xu	2c61848c9d	[Coroutines] Handle the writes to promise alloca prior to llvm.coro.begin Previously we took care of the writes to allocas prior to llvm.coro.begin. However, because the promise alloca is special, we never handled it. For a long time, since programmers can't access the promise_type due to the C++ language specification, we failed to recognize the problem until a recent report: https://github.com/llvm/llvm-project/issues/57861 We've tested many codes and the problem goes away once we handle the writes to the promise alloca prior to @llvm.coro.begin() properly. Closes https://github.com/llvm/llvm-project/issues/57861	2022-11-18 15:39:39 +08:00
Sebastian Neubauer	ce879a03c9	[Coroutines] Do not add allocas for retcon coroutines Same as for async-style lowering, if there are no resume points in a function, the coroutine frame pointer will be replaced by an undef, making all accesses to the frame undefined behavior. Fix this by not adding allocas to the coroutine frame if there are no resume points. Differential Revision: https://reviews.llvm.org/D137866	2022-11-14 10:46:46 +01:00
Arthur Eubanks	f59e1bcc22	[PrintPipeline] Handle CoroConditionalWrapper and add more verification Add a check (can be disabled via a flag) that the pipeline we generate is actually parsable. Can be disabled because we don't expect to handle every pass in -print-pipeline-passes. Fixes #58280. Reviewed By: ChuanqiXu Differential Revision: https://reviews.llvm.org/D135703	2022-10-12 09:36:45 -07:00
Nikita Popov	972840aa3b	[IR] Add Instruction::getInsertionPointAfterDef() Transforms occasionally want to insert an instruction directly after the definition point of a value. This involves quite a few different edge cases, e.g. for phi nodes the next insertion point is not the next instruction, and for invokes and callbrs its not even in the same block. Additionally, the insertion point may not exist at all if catchswitch is involved. This adds a general Instruction::getInsertionPointAfterDef() API to implement the necessary logic. For now it is used in two places where this should be mostly NFC. I will follow up with additional uses where this fixes specific bugs in the existing implementations. Differential Revision: https://reviews.llvm.org/D129660	2022-08-31 10:50:10 +02:00
Kazu Hirata	56ea4f9bd3	[Transforms] Qualify auto in range-based for loops (NFC) Identified with readability-qualified-auto.	2022-08-27 21:21:02 -07:00
Adrian Vogelsgesang	5af06ba7dc	[Coro][Debuginfo] Add debug info to `__NoopCoro_ResumeDestroy` function With this commit, we now attach a `DISubprogram` to the LLVM-generated `_NoopCoro_ResumeDestroy` function. Thereby, lldb can show a `std::coroutine_handle` to a `std::noop_coroutine` as ``` continuation = coro frame = 0x555555560d98 { resume = 0x0000555555555c50 (a.out`__NoopCoro_ResumeDestroy) destroy = 0x0000555555555c50 (a.out`__NoopCoro_ResumeDestroy) } ``` instead of ``` continuation = coro frame = 0x555555560d98 { resume = 0x0000555555555c50 (a.out`___lldb_unnamed_symbol211) destroy = 0x0000555555555c50 (a.out`___lldb_unnamed_symbol211) } ``` I renamed the function from `NoopCoro.ResumeDestroy` to `_NoopCoro_ResumeDestroy` because: * the leading `_` makes sure this is a reserved name and should not clash with any user-provided names * the `.` was replaced by a `_`, so the name is now a valid identifier in C, which allows me to type its name in the debugger Differential Revision: https://reviews.llvm.org/D132580	2022-08-26 05:49:52 -07:00
Chuanqi Xu	17631ac676	[Coroutines] Store the index for final suspend point if there is unwind coro end Closing https://github.com/llvm/llvm-project/issues/57339 The root cause of this issue is a premature optimization that eliminates the index for the final suspend point, on the assumption that we can tell whether a coroutine is suspended at the final suspend by checking whether resume_fn_addr is null. However, this is not true if the coroutine exits via an exception in promise.unhandled_exception(). According to [dcl.fct.def.coroutine]p14: > If the evaluation of the expression promise.unhandled_exception() > exits via an exception, the coroutine is considered suspended at the > final suspend point. But from the perspective of the implementation, we can't set the coro index to the final suspend point directly since it breaks the states. To fix the issue, we block the optimization if we find there is any unwind coro end, which indicates that it is possible that the coroutine exits via an exception from promise.unhandled_exception(). Test Plan: folly	2022-08-26 14:05:46 +08:00
Ting Wang	d2d77e050b	[PowerPC][Coroutines] Add tail-call check with call information for coroutines Fixes #56679. Reviewed By: ChuanqiXu, shchenz Differential Revision: https://reviews.llvm.org/D131953	2022-08-21 22:20:40 -04:00
Chuanqi Xu	e190b7cc90	[Coroutines] Maintain the position of final suspend Closing https://github.com/llvm/llvm-project/issues/56329 The problem happens when we try to simplify the suspend points. We might break the assumption that the final suspend lives in the last slot of Shape.CoroSuspends. This patch maintains the assumption and fixes the problem.	2022-08-12 13:05:08 +08:00
Kazu Hirata	e20d210eef	[llvm] Qualify auto (NFC) Identified with readability-qualified-auto.	2022-08-07 23:55:27 -07:00
Kazu Hirata	0e37ef0186	[Transforms] Fix comment typos (NFC)	2022-08-07 23:55:24 -07:00
Fangrui Song	fa66789d06	[llvm] LLVM_NODISCARD => [[nodiscard]]. NFC With C++17 there is no Clang pedantic warning.	2022-08-07 00:26:33 +00:00
Chuanqi Xu	230d6f93aa	[Coroutines] Remove lifetime intrinsics for spilled allocas in coroutine frames Closing https://github.com/llvm/llvm-project/issues/56919 It is meaningless to preserve the lifetime markers for the spilled allocas in the coroutine frames, and doing so would also block some optimizations.	2022-08-05 14:50:43 +08:00
Arnold Schwaighofer	bc4870f09e	[coro async] Add missing llvm.coro.id.async intrinsic to declaresCoroCleanupIntrinsics rdar://97214593 Differential Revision: https://reviews.llvm.org/D130038	2022-07-19 07:25:04 -07:00
Arnold Schwaighofer	28ebd13d63	[coro async] Fix code to run coro.async.end cleanup like the legacy pass did The code executed for the Switch ABI does not change. rdar://97074714 Differential Revision: https://reviews.llvm.org/D129865	2022-07-18 10:41:29 -07:00
Kazu Hirata	3112987d5c	Remove unused forward declarations (NFC)	2022-07-17 15:37:48 -07:00
Chuanqi Xu	66e15d4c01	[NFC] [Coroutines] Update the comments for lowering coro.save The original comment is not right. We don't store 0 all the time.	2022-07-07 14:57:41 +08:00
Chuanqi Xu	e3b4452e07	[Debug] [Coroutines] Get rid of DW_ATE_address Closing https://github.com/llvm/llvm-project/issues/55916 This patch tries to get rid of DW_ATE_address and enhance the test coverage. Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D127625	2022-07-07 10:47:09 +08:00
Chuanqi Xu	7137ebc4ce	[Debug] [Coroutine] Adjust the scope and name for coroutine frame Previously the scope of the debug type of __coro_frame was limited to the current function. That looked good at first sight, but it prevents us from printing the type in split functions and other functions. Also, the debug type is different for different coroutine functions, so it makes sense to rename the debug type to relate it to the function name. After this patch, we can access the coroutine frame type in a function by `function_name.coro_frame_ty`. Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D127623	2022-07-07 10:35:32 +08:00
Chuanqi Xu	0b5ead6590	[WebAssembly] Don't set musttail for coroutines when tail-call is not enabled C++20 coroutines couldn't be compiled to WebAssembly because an optimization named symmetric transfer requires support for musttail calls, which WebAssembly doesn't support yet. This patch tries to fix the problem by adding a supportsTailCalls method to TargetTransformImpl to skip the symmetric transfer when the tail-call feature is not supported. Reviewed By: tlively Differential Revision: https://reviews.llvm.org/D128794	2022-06-30 11:15:40 +08:00
Nikita Popov	5548e807b5	[IR] Remove support for extractvalue constant expression This removes the extractvalue constant expression, as part of https://discourse.llvm.org/t/rfc-remove-most-constant-expressions/63179. extractvalue is already not supported in bitcode, so we do not need to worry about bitcode auto-upgrade. Uses of ConstantExpr::getExtractValue() should be replaced with IRBuilder::CreateExtractValue() (if the fact that the result is constant is not important) or ConstantFoldExtractValueInstruction() (if it is). Though for this particular case, it is also possible and usually preferable to use getAggregateElement() instead. The C API function LLVMConstExtractValue() is removed, as the underlying constant expression no longer exists. Instead, LLVMBuildExtractValue() should be used (which will constant fold or create an instruction). Depending on the use-case, LLVMGetAggregateElement() may also be used instead. Differential Revision: https://reviews.llvm.org/D125795	2022-06-28 10:40:17 +02:00
Yuanfang Chen	e2e9e708e5	[Coroutine] Remove the '!func_sanitize' metadata for split functions There is no proper RTTI for these split functions. So just delete the metadata. Fixes https://github.com/llvm/llvm-project/issues/49689. Reviewed By: rjmccall Differential Revision: https://reviews.llvm.org/D116130	2022-06-27 12:09:13 -07:00
Chuanqi Xu	24e53b01d5	Revert "[Coroutines] Only do symmetric transfer if optimization is on" This reverts commit 7782e080e80a90f7bb32049beb3787e2118c2251. According to the discussion of WG21, symmetric transfer is a desired feature.	2022-06-27 10:54:56 +08:00
Kazu Hirata	7a47ee51a1	[llvm] Don't use Optional::getValue (NFC)	2022-06-20 22:45:45 -07:00
Kazu Hirata	e0e687a615	[llvm] Don't use Optional::hasValue (NFC)	2022-06-20 10:38:12 -07:00
Guillaume Chatelet	7296811910	[NFC] Simplify alignment code in CoroFrame	2022-06-20 15:15:52 +00:00
Chuanqi Xu	7782e080e8	[Coroutines] Only do symmetric transfer if optimization is on Symmetric transfer is not a part of the C++ standard, so vendors are not forced to implement it. Given that symmetric transfer is nowadays an optimization, it makes more sense to enable it only if optimization is enabled. It also helps compilation speed at O0.	2022-06-20 16:20:36 +08:00
Guillaume Chatelet	77bba68de6	[NFC][Alignment] Use Align in CoroFrame	2022-06-14 10:56:36 +00:00
Chuanqi Xu	735e6c40b5	[Coroutines] Convert coroutine.presplit to enum attr This is required by @nikic in https://reviews.llvm.org/D127383 to decrease the cost to check whether a function is a coroutine and this fixes a FIXME too. Reviewed By: rjmccall, ezhulenev Differential Revision: https://reviews.llvm.org/D127471	2022-06-14 14:23:46 +08:00
Chuanqi Xu	733d7cf964	[Debug] [Coroutines] Add deref operator for non complex expression Background: When we construct the coroutine frame, we insert a dbg.declare intrinsic for it: ``` %hdl = call void @llvm.coro.begin() ; would return coroutine handle call void @llvm.dbg.declare(metadata ptr %hdl, metadata ![[DEBUG_VARIABLE: __coro_frame]], metadata !DIExpression()) ``` And in the split coroutine, it looks like: ``` define void @coro_func.resume(ptr %hdl) { entry.resume: call void @llvm.dbg.declare(metadata ptr %hdl, metadata ![[DEBUG_VARIABLE: __coro_frame]], metadata !DIExpression()) } ``` And we salvage the debug info by inserting a new alloca here: ``` define void @coro_func.resume(ptr %hdl) { entry.resume: %frame.debug = alloca ptr call void @llvm.dbg.declare(metadata ptr %frame.debug, metadata ![[DEBUG_VARIABLE: __coro_frame]], metadata !DIExpression()) store ptr %hdl, %frame.debug } ``` But now a problem arises, since the `dbg.declare` refers to the address of that alloca instead of the actual coroutine handle. There is existing code to solve the problem, but it applies to complex expressions only. This patch relaxes the condition to make it work for `__coro_frame`. Reviewed By: jmorse Differential Revision: https://reviews.llvm.org/D126277	2022-06-08 10:53:51 +08:00
Kazu Hirata	3b9707dbc0	[llvm] Convert for_each to range-based for loops (NFC)	2022-06-05 12:07:14 -07:00
Arnold Schwaighofer	5c902af572	[coro async] Add code to support dynamic alignment of over-aligned types in async frames Async context frames are allocated with a maximum alignment. If a type requests an alignment bigger than that, dynamically align the address in the frame. Differential Revision: https://reviews.llvm.org/D126715	2022-06-03 07:06:14 -07:00
serge-sans-paille	fb67d683db	[iwyu] Handle regressions in libLLVM header include Running iwyu-diff on LLVM codebase since 7030654296a0416bd9402a0278 detected a few regressions, fixing them. Differential Revision: https://reviews.llvm.org/D126417	2022-05-26 08:12:34 +02:00
Chuanqi Xu	02d6845234	[NFC] [Coroutines] Remove EnableReuseStorageInFrame option The EnableReuseStorageInFrame option is designed for testing only. But it is better to use *_PASS_WITH_PARAMS macro to keep consistent with other passes.	2022-05-10 17:28:43 +08:00
Chuanqi Xu	beeed0994e	[Coroutines] Use PassManager instead of Legacy PassManager internally This is a follow-up cleanup for the previous work D123918. I missed several places which still use legacy pass managers. This patch tries to remove them.	2022-05-10 13:15:11 +08:00
Chuanqi Xu	2d037873a3	[Coroutines] Don't re-materialize for debug instructions Re-materializing for debug instructions would cause different code to be generated when `-g` is enabled. This is bad, so we disable re-materialization for debug instructions.	2022-05-06 13:52:19 +08:00
Chuanqi Xu	405bf90235	[NFC] [Pipelines] Hoist CoroCleanup as Module Pass This is similar to previous patch https://reviews.llvm.org/D123925. It could also reduce the time we call declaresCoroCleanupIntrinsics. And it is helpful for further changes. Reviewed By: aeubanks Differential Revision: https://reviews.llvm.org/D124362	2022-05-05 15:15:09 +08:00
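
Two entries in the log above (a6a526ec54 and 3e6001e91e) concern bits-versus-bytes mix-ups for sizes and alignments. A minimal sketch of one way to turn that class of mistake into a compile error is to carry the unit in the type. The names `Bytes`, `Bits`, `toBits`, `toBytes`, and `describeAlignInBits` are hypothetical illustrations, not LLVM APIs, and this is not how either commit is implemented.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical unit-carrying wrappers (not part of LLVM).
struct Bytes { std::uint64_t value; };
struct Bits  { std::uint64_t value; };

// Explicit conversions keep the *8 and /8 in exactly one place.
constexpr Bits toBits(Bytes b) { return Bits{b.value * 8}; }

// Assumes the bit count is a multiple of 8; any remainder is truncated.
constexpr Bytes toBytes(Bits b) { return Bytes{b.value / 8}; }

// Hypothetical consumer that expects an alignment in bits.
// Passing a Bytes value here does not compile, because there is no
// implicit conversion between the two wrapper types.
constexpr std::uint64_t describeAlignInBits(Bits align) { return align.value; }

// A 16-byte alignment must be converted explicitly before use.
static_assert(describeAlignInBits(toBits(Bytes{16})) == 128,
              "16 bytes is 128 bits");
static_assert(toBytes(Bits{64}).value == 8, "64 bits is 8 bytes");
```

With plain `unsigned` parameters, passing a byte count where bits are expected compiles silently; with distinct wrapper types, the caller has to write `toBits(...)` at the call site, which makes the unit visible in review.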


376 Commits