llvm-project

Author	SHA1	Message	Date
Gabriel Ravier	ea540bc210	[polly] Fixed a number of typos. NFC I went over the output of the following mess of a command: `(ulimit -m 2000000; ulimit -v 2000000; git ls-files -z \| parallel --xargs -0 cat \| aspell list --mode=none --ignore-case \| grep -E '^[A-Za-z][a-z]*$' \| sort \| uniq -c \| sort -n \| grep -vE '.{25}' \| aspell pipe -W3 \| grep : \| cut -d' ' -f2 \| less)` and proceeded to spend a few days looking at it to find probable typos and fixed a few hundred of them in all of the llvm project (note, the ones I found are not anywhere near all of them, but it seems like a good start). Reviewed By: inclyc Differential Revision: https://reviews.llvm.org/D131167	2022-08-07 22:56:07 +08:00
Michael Kruse	fe0e5b3e43	[Polly] Insert !dbg metadata for emitted CallInsts. The IR Verifier requires that every call instruction to an inlineable function (among other things, its implementation must be visible in the translation unit) must also have !dbg metadata attached to it. When parallelizing, Polly emits calls to OpenMP runtime function out of thin air, or at least not directly derived from a bounded list of previous instruction. While we could search for instructions in the SCoP that has some debug info attached to it, there is no guarantee that we find any. Our solution is to generate a new DILocation that points to line 0 to represent optimized code. The OpenMP function implementation is usually not available in the user's translation unit, but can become visible in an LTO build. For the bug to appear, libomp must also be built with debug symbols. IMHO, the IR verifier rule is too strict. Runtime functions can also be inserted by other optimization passes, such as LoopIdiomRecognize. When inserting a call to e.g. memset, it uses the DebugLoc from a StoreInst from the unoptimized code. It is not required to have !dbg metadata attached either. Fixes #56692	2022-07-26 19:43:53 -05:00
Michael Kruse	6fa65f8a98	[Polly][MatMul] Abandon dependence analysis. The copy statements inserted by the matrix-multiplication optimization introduce new dependencies between the copy statements and other statements. As a result, the DependenceInfo must be recomputed. Not recomputing them caused IslAstInfo to deduce that some loops are parallel but cause race conditions when accessing the packed arrays. As a result, matrix-matrix multiplication currently cannot be parallelized. Also see discussion at https://reviews.llvm.org/D125202	2022-06-29 17:20:05 -05:00
Nikita Popov	41d5033eb1	[IR] Enable opaque pointers by default This enabled opaque pointers by default in LLVM. The effect of this is twofold: * If IR that contains neither explicit ptr nor %T* types is passed to tools, we will now use opaque pointer mode, unless -opaque-pointers=0 has been explicitly passed. * Users of LLVM as a library will now default to opaque pointers. It is possible to opt-out by calling setOpaquePointers(false) on LLVMContext. A cmake option to toggle this default will not be provided. Frontends or other tools that want to (temporarily) keep using typed pointers should disable opaque pointers via LLVMContext. Differential Revision: https://reviews.llvm.org/D126689	2022-06-02 09:40:56 +02:00
Arthur Eubanks	caf6af2ed7	[polly] Remove last instances of -analyze As mentioned in D120782, the loop block order can be different depending on if LoopInfo is incrementally updated or freshly computed. Reviewed By: Meinersbur Differential Revision: https://reviews.llvm.org/D122195	2022-03-24 09:47:43 -07:00
Wael Yehia	c80198b3d3	Reland "Load pass plugins during option processing, so that plugin options are registered and live." Fix Polly failures. Reviewed By: mehdi_amini, Meinersbur Differential Revision: https://reviews.llvm.org/D121566	2022-03-18 03:27:53 +00:00
Michael Kruse	5c02808131	[polly] Introduce -polly-print-* passes to replace -analyze. The `opt -analyze` option only works with the legacy pass manager and might be removed in the future, as explained in llvm.org/PR53733. This patch introduced -polly-print-* passes that print what the pass would print with the `-analyze` option and replaces all uses of `-analyze` in the regression tests. There are two exceptions: `CodeGen\single_loop_param_less_equal.ll` and `CodeGen\loop_with_condition_nested.ll` use `-analyze on the `-loops` pass which is not part of Polly. Reviewed By: aeubanks Differential Revision: https://reviews.llvm.org/D120782	2022-03-14 10:27:15 -05:00
Michael Kruse	d7851685a3	[polly] Remove trailing whitespace from tests. NFC.	2022-02-22 15:41:13 -06:00
Florian Hahn	782c0dd1a1	[IRBuilder] Migrate and-folding to value-based FoldAnd. Similar to the migration of or-folding to FoldOr, there are a few cases where the fold in IRBuilder::CreateAnd triggered directly. Those have been updated. Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D117431	2022-01-20 10:22:21 +00:00
Michael Kruse	19db33c06e	[Polly] Remove support for code generated by gfortran+DragonEgg. DragonEgg is not maintained anymore, hence there is no need for this functionality. Fixes llvm.org/PR52173	2021-10-14 14:12:06 -05:00
Michael Kruse	9820dd970c	[Polly] Support for InlineAsm. Inline assembly was not handled at all and treated like a llvm::Value. In particular, it tried to create a pointer it which is not allowed. Fix by handling like a llvm::Constant such that it is just reused when required, instead of trying to marshall it in memory. Fixes llvm.org/PR51960	2021-09-26 03:26:43 -05:00
Michael Kruse	35f7020098	[Polly] Dissolve Isl test directory. NFC. All tests use ISL, integrate its subfolder into the components they belong to.	2021-09-22 17:45:07 -05:00
Nikita Popov	53720f74e4	[Polly] Partially fix scoped alias metadata This partially addresses the verifier failures caused by D110026. In particular, it does not fix the "second level" alias metadata.	2021-09-20 22:51:31 +02:00
Michael Kruse	b85c98b4c5	[Polly][Codegen] Emit access group metadata. Emit llvm.loop.parallel_accesses metadata instead of llvm.mem.parallel_loop_access. The latter is deprecated because it assumes that LoopIDs are persistent, which they are not. We also emit parallel access metadata for all surrounding parallel loops, not just the innermost parallel.	2021-03-04 03:58:03 -06:00
Tobias Grosser	e5340a8ce9	Move code generation test case to test/CodeGen/ llvm-svn: 327857	2018-03-19 15:05:30 +00:00
Tobias Grosser	b693f42b71	[Polly] Fix code generation of llvm.expect intrinsic At the time of code generation, an instruction with an llvm intrinsic is ignored in copyBB. However, if the value of the instruction is used later in the program, the value needs to be synthesized. However, this is causing some issues with the instructions being generated in a hoisted basic block. Removing llvm.expect from the list of ignored intrinsics fixes this bug. This resolves http://llvm.org/PR32324. Contributed-by: Annanay Agarwal <cs14btech11001@iith.ac.in> Tags: #polly Differential Revision: https://reviews.llvm.org/D32992 llvm-svn: 303006	2017-05-14 09:09:54 +00:00
Michael Kruse	e0b34f366f	Update to ISL 0.17. This release includes sevaral improvments compared to the previous version isl-0.16.1-145-g243bf7c (from the ISL 0.17 announcement): - optionally combine SCCs incrementally in scheduler - optionally maximize coincidence in scheduler - optionally avoid loop coalescing in scheduler - minor AST generator improvements - improve support for expansions in schedule trees llvm-svn: 268500	2016-05-04 14:41:36 +00:00
Johannes Doerfert	b3410db2b7	[FIX] Do not recompute SCEVs but pass them to subfunctions This reverts commit 2879c53e80e05497f408f21ce470d122e9f90f94. Additionally, it adds SDiv and SRem instructions to the set of values discovered by the findValues function even if we add the operands to be able to recompute the SCEVs. In subfunctions we do not want to recompute SDiv and SRem instructions but pass them instead as they might have been created through the IslExprBuilder and are more complicated than simple SDiv/SRem instructions in the code. llvm-svn: 265873	2016-04-09 14:30:11 +00:00
Sebastian Pop	b08a52898a	execute cloog specific testcases only with CLOOG_FOUND llvm-svn: 169159	2012-12-03 21:33:40 +00:00
Patrik Hägglund	b476cdfde5	Fix tests with broken datalayout strings. Buildbot failure at r168785. llvm-svn: 168791	2012-11-28 13:30:31 +00:00
Sebastian Pop	ee4baf3eec	do not execute the OpenMP tests when cloog is not found llvm-svn: 168724	2012-11-27 21:15:15 +00:00
Tobias Grosser	3344f733fd	test: LLVM supports now vectors of arbitrary pointers This allows Polly to vectorize more code. Fix the relevant test cases. llvm-svn: 167923	2012-11-14 08:25:52 +00:00
Tobias Grosser	38ea9cd721	Tests: Pipe test files into 'opt' Use 'opt < %s' instead of just 'opt %s' to ensure that no temporary files are created. llvm-svn: 167372	2012-11-04 16:56:20 +00:00
Tobias Grosser	dcebf1e9da	Tests: remove ModuleID lines llvm-svn: 167284	2012-11-02 06:09:20 +00:00
Tobias Grosser	41b20a62c9	Tests: move content of .c files in .ll llvm-svn: 167283	2012-11-02 06:08:39 +00:00
Tobias Grosser	3eb851f370	Remove runtime tests from polly test suite Similar to LLVM we now follow the policy of only having LLVM-IR level tests in the Polly test suite. Testing for miscompilation of larger programs should be done with the llvm test suite. llvm-svn: 167255	2012-11-01 21:44:59 +00:00
Tobias Grosser	ebe8c8cea2	Codegen: Selectively copy in array addresses for OpenMP code The detection of values that need to be copied in to the generated OpenMP subfunction also detects the array base addresses needed in the SCoP. Hence, it is not necessary to unconditionally copy all the base addresses to the generated function. Test cases are modified to reflect this change. Arrays which are global variables do not occur in the struct passed to the subfunction anymore. A test case for base address copy-in is added in copy_in_array.{c,ll}. Committed with slight modifications Contributed by: Armin Groesslinger <armin.groesslinger@uni-passau.de> llvm-svn: 167215	2012-11-01 05:34:55 +00:00
Tobias Grosser	177982c478	CodeGen: Add scop-parameters to the OpenMP context In addition to the arrays and clast variables a SCoP statement may also refer to values defined before the SCoP or to function arguments. Detect these values and add them to the set of values passed to the function generated for OpenMP parallel execution of a clast. Committed with additional test cases and some refactoring. Contributed by: Armin Groesslinger <armin.groesslinger@uni-passau.de> llvm-svn: 167214	2012-11-01 05:34:48 +00:00
Tobias Grosser	a17f666f99	Codegen: Copy and restore the ValueMap and ClastVars explicitly When generating OpenMP or GPGPU code the original ValueMap and ClastVars must be kept. We already recovered the original ClastVars by reverting the changes, but we did not keep the content of the ValueMap. This patch keeps now an explicit copy of both maps and restores them after generating OpenMP or GPGPU code. This is an adapted version of a patch contributed by: Armin Groesslinger <armin.groesslinger@uni-passau.de> llvm-svn: 167213	2012-11-01 05:34:35 +00:00
Tobias Grosser	6217e18a7d	Add preliminary implementation for GPGPU code generation. Translate the selected parallel loop body into a ptx string and run it with the cuda driver API. We limit this preliminary implementation to target the following special test cases: - Support only 2-dimensional parallel loops with or without only one innermost non-parallel loop. - Support write memory access to only one array in a SCoP. The patch was committed with smaller changes to the build system: There is now a flag to enable gpu code generation explictly. This was required as we need the llvm.codegen() patch applied on the llvm sources, to compile this feature correctly. Also, enabling gpu code generation does not require cuda. This requirement was removed to allow 'make polly-test' runs, even without an installed cuda runtime. Contributed by: Yabin Hu <yabin.hwu@gmail.com> llvm-svn: 161239	2012-08-03 12:50:07 +00:00
Tobias Grosser	6cc23b07e6	Revert "Add preliminary implementation for GPGPU code generation." I did not take into account, that this patch fails to compile without the llvm.codegen patch applied. This breaks buildbots. I revert this until we found a solution to commit this without buildbots complaining. This reverts commit cb43ab80e94434e780a66be3b9a6ad466822fe33. llvm-svn: 160165	2012-07-13 07:44:56 +00:00
Tobias Grosser	b299d28181	Add preliminary implementation for GPGPU code generation. Translate the selected parallel loop body into a ptx string and run it with cuda driver API. We limit this preliminary implementation to target the following special test cases: - Support only 2-dimensional parallel loops with or without only one innermost non-parallel loop. - Support write memory access to only one array in a SCoP. Contributed by: Yabin Hu <yabin.hwu@gmail.com> llvm-svn: 160164	2012-07-13 07:21:00 +00:00
Hongbin Zheng	6417255283	Regression tests: Adapt the vectorize option change. llvm-svn: 156255	2012-05-06 10:22:43 +00:00
Tobias Grosser	e71c6ab54c	SCEV based code generation This is an incomplete implementation of the SCEV based code generation. When finished it will remove the need for -indvars -enable-iv-rewrite. For the moment it is still disabled. Even though it passes 'make polly-test', there are still loose ends especially in respect of OpenMP code generation. llvm-svn: 155717	2012-04-27 16:36:14 +00:00
Tobias Grosser	7c3061acdd	Make vector tests less sensible to codegen changes llvm-svn: 155438	2012-04-24 11:08:07 +00:00
Tobias Grosser	4cb5461dae	CodeGen: Generate scalar code if vector instructions cannot be generated This fixes two crashes that appeared in case of: - A load of a non vectorizable type (e.g. float**) - An instruction that is not vectorizable (e.g. call) llvm-svn: 154586	2012-04-12 10:46:55 +00:00
Tobias Grosser	84ecc47e1c	CodeGen: Allow Polly to do 'grouped unrolling', but no vector generation. Grouped unrolling means that we unroll a loop such that the different instances of a certain statement are scheduled right after each other, but we do not generate any vector code. The idea here is that we can schedule the bb vectorizer right afterwards and use it heuristics to decide when vectorization should be performed. llvm-svn: 154251	2012-04-07 06:16:08 +00:00
Tobias Grosser	0905a23806	CodeGen: Recreate old ivs with the original type To avoid overflows we still use a larger type (i64) while calculating the value of the old ivs. However, we truncate the result to the type of the old iv when providing it to the new code. A corresponding test case is added to the polly test suite. Also, a failing test case is fixed. This fixes PR12311. Contributed by: Tsingray Liu <tsingrayliu@gmail.com> llvm-svn: 153952	2012-04-03 12:24:32 +00:00
Tobias Grosser	89339067b0	CodeGen: Allow function parameters to be rewritten in getNewValue() When deriving new values for the statements of a SCoP, we assumed that parameter values are constant within the SCoP and consquently do not need to be rewritten. For OpenMP code generation this assumption is wrong, as such values are not available in the OpenMP subfunction and consequently also may need to be rewritten. Committed with some changes. Contributed-By: Johannes Doerfert <s9jodoer@stud.uni-saarland.de> llvm-svn: 153838	2012-04-01 16:49:45 +00:00
Tobias Grosser	900893d2d8	CodeGeneration: Proberly build the dominator tree llvm-svn: 153645	2012-03-29 13:10:26 +00:00
Hongbin Zheng	0578aaf77c	Don't fail the lli testcases on 32bit platform. llvm-svn: 153440	2012-03-26 15:16:48 +00:00
Tobias Grosser	cf88d84d79	test: Remove memaccess prefix The prefix is not needed, as all test cases are already in a separate folder. llvm-svn: 153320	2012-03-23 08:24:04 +00:00
Tobias Grosser	d6adda3071	CodeGen: Full support for isl_pw expressions in modified access functions. This also adds support for modifiable write accesses (until now only read accesses where supported). We currently do not derive an exact type for the expression, but assume that i64 is good enough. This will be improved in future patches. Contributed by: Yabin Hu <yabin.hwu@gmail.com> llvm-svn: 153319	2012-03-23 08:21:22 +00:00
Tobias Grosser	3ec2abc5fb	Don't allow pointer types in affine expressions We currently do not support pointer types in affine expressions. Hence, we disallow in the SCoP detection. Later we may decide to add support for them. This fixes PR12277 Reported-By: Sebastian Pop <sebpop@gmail.com> llvm-svn: 152928	2012-03-16 16:36:47 +00:00
Tobias Grosser	df3823750e	CodeGen: Pass the scalar maps properly llvm-svn: 151916	2012-03-02 15:20:35 +00:00
Tobias Grosser	f6beec674e	CodeGen: Simplify the generation of a splat llvm-svn: 151912	2012-03-02 15:20:21 +00:00
Tobias Grosser	b61e6318ac	CodeGen: Name stmt bbs 'polly.stmt.' + OriginalName llvm-svn: 150575	2012-02-15 09:58:46 +00:00
Tobias Grosser	04eadc476e	tests: Replace . by %s llvm-svn: 150377	2012-02-13 12:29:43 +00:00
Tobias Grosser	8518bbe39f	CodeGen: Always name merge block llvm-svn: 150337	2012-02-12 12:09:46 +00:00
Tobias Grosser	0dbbdd7637	Codegen: Give split and merge basic blocks better names llvm-svn: 150335	2012-02-12 12:09:37 +00:00

1 2

66 Commits