llvm-project

Author	SHA1	Message	Date
Bill Wendling	49c4dfb534	Revert accidental commit. llvm-svn: 148065	2012-01-12 23:06:28 +00:00
Bill Wendling	ee5eaebc58	Fix the code that was WRONG. The registers are placed into the saved registers list in the reverse order, which is why the original loop was written to loop backwards. llvm-svn: 148064	2012-01-12 23:05:03 +00:00
Pete Cooper	99415fea87	Added FPOW, FEXP, FLOG to PromoteNode so that custom actions can be set to Promote for those operations. Sorry, no test case yet llvm-svn: 148050	2012-01-12 21:46:18 +00:00
Evan Cheng	5c03a6b8f5	When hoisting common code, watch out for uses which are marked "kill". If the killed registers are needed below the insertion point, then unset the kill marker. Sorry I'm not able to find a reduced test case. rdar://10660944 llvm-svn: 148043	2012-01-12 20:31:24 +00:00
Evan Cheng	09cc429cb1	Allow targets to select source order pre-RA scheduler. llvm-svn: 148033	2012-01-12 18:27:52 +00:00
Jakob Stoklund Olesen	994fed689f	Make SplitAnalysis::UseSlots private. llvm-svn: 148031	2012-01-12 17:53:44 +00:00
Jakob Stoklund Olesen	20f19eb9ab	Make data structures private. llvm-svn: 147979	2012-01-11 23:19:08 +00:00
Jakob Stoklund Olesen	73edbf1682	Sink spillInterferences into RABasic. This helper method is too simplistic for RAGreedy. llvm-svn: 147976	2012-01-11 22:52:14 +00:00
Jakob Stoklund Olesen	06ec420347	Cleanup. llvm-svn: 147975	2012-01-11 22:52:11 +00:00
Jakob Stoklund Olesen	a818d804a1	Move RegAllocBase into its own cpp file separate from RABasic. No functional change. llvm-svn: 147972	2012-01-11 22:28:30 +00:00
Nadav Rotem	b5ce6ee835	On AVX, we can load v8i32 at a time. The bug happens when two uneven loads are used. When we load the v12i32 type, the GenWidenVectorLoads method generates two loads: v8i32 and v4i32 and attempts to use CONCAT_VECTORS to join them. In this fix I concat undef values to widen the smaller value. The test "widen_load-2.ll" also exposes this bug on AVX. llvm-svn: 147964	2012-01-11 20:19:17 +00:00
Chandler Carruth	55b2cdee26	Teach the X86 instruction selection to do some heroic transforms to detect a pattern which can be implemented with a small 'shl' embedded in the addressing mode scale. This happens in real code as follows: unsigned x = my_accelerator_table[input >> 11]; Here we have some lookup table that we look into using the high bits of 'input'. Each entity in the table is 4-bytes, which means this implicitly gets turned into (once lowered out of a GEP): (unsigned)((char)my_accelerator_table + ((input >> 11) << 2)); The shift right followed by a shift left is canonicalized to a smaller shift right and masking off the low bits. That hides the shift right which x86 has an addressing mode designed to support. We now detect masks of this form, and produce the longer shift right followed by the proper addressing mode. In addition to saving a (rather large) instruction, this also reduces stalls in Intel chips on benchmarks I've measured. In order for all of this to work, one part of the DAG needs to be canonicalized still further* than it currently is. This involves removing pointless 'trunc' nodes between a zextload and a zext. Without that, we end up generating spurious masks and hiding the pattern. llvm-svn: 147936	2012-01-11 08:41:08 +00:00
Jakob Stoklund Olesen	8b1d023a4a	Detect when a value is undefined on an edge to a landing pad. Consider this code: int h() { int x; try { x = f(); g(); } catch (...) { return x+1; } return x; } The variable x is undefined on the first edge to the landing pad, but it has the f() return value on the second edge to the landing pad. SplitAnalysis::getLastSplitPoint() would assume that the return value from f() was live into the landing pad when f() throws, which is of course impossible. Detect these cases, and treat them as if the landing pad wasn't there. This allows spill code to be inserted after the function call to f(). <rdar://problem/10664933> llvm-svn: 147912	2012-01-11 02:07:05 +00:00
Jakob Stoklund Olesen	67aec12409	Exclusively use SplitAnalysis::getLastSplitPoint(). Delete the alternative implementation in LiveIntervalAnalysis. These functions computed the same thing, but SplitAnalysis caches the result. llvm-svn: 147911	2012-01-11 02:07:00 +00:00
Evan Cheng	d9725a38d6	Avoid CSE of instructions which define physical registers across MBBs unless the physical registers are not allocatable. llvm-svn: 147902	2012-01-11 00:38:11 +00:00
Evan Cheng	da46832e42	80 col violation. llvm-svn: 147884	2012-01-10 22:27:32 +00:00
Chandler Carruth	f3e8502cc1	Add 'llvm_unreachable' to passify GCC's understanding of the constraints of several newly un-defaulted switches. This also helps optimizers (including LLVM's) recognize that every case is covered, and we should assume as much. llvm-svn: 147861	2012-01-10 18:08:01 +00:00
David Blaikie	edbb58c577	Remove unnecessary default cases in switches that cover all enum values. llvm-svn: 147855	2012-01-10 16:47:17 +00:00
Nadav Rotem	61bdf79035	Fix a bug in the legalization of shuffle vectors. When we emulate shuffles using BUILD_VECTORS we may be using a BV of different type. Make sure to cast it back. llvm-svn: 147851	2012-01-10 14:28:46 +00:00
Evan Cheng	0be4144a68	Allow machine-cse to look across MBB boundary when cse'ing instructions that define physical registers. It's currently very restrictive, only catching cases where the CE is in an immediate (and only) predecessor. But it catches a surprising large number of cases. rdar://10660865 llvm-svn: 147827	2012-01-10 02:02:58 +00:00
Rafael Espindola	5cb98f1062	Remove the logging streamer. llvm-svn: 147820	2012-01-10 00:40:39 +00:00
Evan Cheng	520730ff23	Avoid eraseing copies from a reserved register unless the definition can be safely proven not to have been clobbered. No small test case possible. llvm-svn: 147751	2012-01-08 19:52:28 +00:00
Craig Topper	0515cd41e4	Replace some uses of hasNUsesOfValue(0, X) with !hasAnyUseOfValue(X) llvm-svn: 147733	2012-01-07 18:31:09 +00:00
Craig Topper	43a1bd6ac7	Add some DAG combines for SUBC/SUBE. If nothing uses the carry/borrow out of subc, turn it into a sub. Turn (subc x, x) into 0 with no borrow. Turn (subc x, 0) into x with no borrow. Turn (subc -1, x) into (xor x, -1) with no borrow. Turn sube with no borrow in into subc. llvm-svn: 147728	2012-01-07 09:06:39 +00:00
Jakob Stoklund Olesen	434fb37bb4	Optimize reserved register coalescing. Reserved registers don't have proper live ranges, their LiveInterval simply has a snippet of liveness for each def. Virtual registers with a single value that is a copy of a reserved register (typically %esp) can be coalesced with the reserved register if the live range doesn't overlap any reserved register defs. When coalescing with a reserved register, don't modify the reserved register live range. Just leave it as a bunch of dead defs. This eliminates quadratic coalescer behavior in i386 functions with many function calls. PR11699 llvm-svn: 147726	2012-01-07 07:39:50 +00:00
Jakob Stoklund Olesen	a8879087b5	Use the 'regalloc' debug tag for most register allocator tracing. llvm-svn: 147725	2012-01-07 07:39:47 +00:00
Evan Cheng	6cc8d49885	Revert part of r147716. Looks like x87 instructions kill markers are all messed up so branch folding pass can't use the scavenger. :-( This doesn't breaks anything currently. It just means targets which do not carefully update kill markers cannot run post-ra scheduler (not new, it has always been the case). We should fix this at some point since it's really hacky. llvm-svn: 147719	2012-01-07 03:35:48 +00:00
Evan Cheng	00b1a3cd7e	Added a late machine instruction copy propagation pass. This catches opportunities that only present themselves after late optimizations such as tail duplication .e.g. ## BB#1: movl %eax, %ecx movl %ecx, %eax ret The register allocator also leaves some of them around (due to false dep between copies from phi-elimination, etc.) This required some changes in codegen passes. Post-ra scheduler and the pseudo-instruction expansion passes have been moved after branch folding and tail merging. They were before branch folding before because it did not always update block livein's. That's fixed now. The pass change makes independently since we want to properly schedule instructions after branch folding / tail duplication. rdar://10428165 rdar://10640363 llvm-svn: 147716	2012-01-07 03:02:36 +00:00
Andrew Trick	ff4e2b7d23	Missing raw_ostream.h breaks MSVC build. llvm-svn: 147703	2012-01-07 00:54:28 +00:00
Chad Rosier	73a3fab480	Add comment. llvm-svn: 147696	2012-01-06 23:45:47 +00:00
Eric Christopher	8ea8e4fc76	Add a comment and ensure that anyone else looking at this code doesn't start to bleed from the eyes. llvm-svn: 147695	2012-01-06 23:03:37 +00:00
Eric Christopher	090fcc1a10	Use const vector references instead of a vector copy. Spotted by Devang. llvm-svn: 147694	2012-01-06 23:03:34 +00:00
Eric Christopher	5a28a6ee2f	Use -> instead of (*iter). llvm-svn: 147693	2012-01-06 23:03:27 +00:00
Andrew Trick	85460d0d32	Tracing to help investigate issues with SjLj spill code. llvm-svn: 147682	2012-01-06 21:16:27 +00:00
Eric Christopher	667a074be0	Fix a leak I noticed while reviewing the accelerator table changes. Passes lldb testsuite. rdar://10652330 llvm-svn: 147673	2012-01-06 19:35:04 +00:00
Eric Christopher	21bde87bf3	As part of the ongoing work in finalizing the accelerator tables, extend the debug type accelerator tables to contain the tag and a flag stating whether or not a compound type is a complete type. rdar://10652330 llvm-svn: 147651	2012-01-06 04:35:23 +00:00
Benjamin Kramer	69eab4e0af	Kill ObjectCodeEmitter and BinaryObject, they were unused and superseded by MC. llvm-svn: 147618	2012-01-05 22:31:37 +00:00
Rafael Espindola	afcf571ef9	Remove the old ELF writer. llvm-svn: 147615	2012-01-05 22:07:43 +00:00
Chandler Carruth	eab5029964	Remove an unused variable. llvm-svn: 147605	2012-01-05 11:25:47 +00:00
Chandler Carruth	e041a30bb9	Prevent a DAGCombine from firing where there are two uses of a combined-away node and the result of the combine isn't substantially smaller than the input, it's just canonicalized. This is the first part of a significant (7%) performance gain for Snappy's hot decompression loop. llvm-svn: 147604	2012-01-05 11:05:55 +00:00
Andrew Trick	100af0adf7	Minor postra scheduler cleanup. It could result in more precise antidependence latency on ARM in exceedingly rare cases. llvm-svn: 147594	2012-01-05 02:52:11 +00:00
Jakob Stoklund Olesen	d19d3cab09	Freeze reserved registers before starting register allocation. The register allocators don't currently support adding reserved registers while they are running. Extend the MRI API to keep track of the set of reserved registers when register allocation started. Target hooks like hasFP() and needsStackRealignment() can look at this set to avoid reserving more registers during register allocation. llvm-svn: 147577	2012-01-05 00:26:49 +00:00
Craig Topper	f726e15f44	Allow vector shuffle normalizing to use concat vector even if the sources are commuted in the shuffle mask. llvm-svn: 147527	2012-01-04 09:23:09 +00:00
Craig Topper	279c77b677	Implement VECTOR_SHUFFLE canonicalizations during DAG combine. llvm-svn: 147525	2012-01-04 08:07:43 +00:00
Chris Lattner	6b77a07f75	Turn a few more inline asm errors into "emitErrors" instead of fatal errors. Before we'd get: $ clang t.c fatal error: error in backend: Invalid operand for inline asm constraint 'i'! Now we get: $ clang t.c t.c:16:5: error: invalid operand for inline asm constraint 'i'! "movq (%4), %%mm0\n" ^ Which at least gets us the inline asm that is the problem. llvm-svn: 147502	2012-01-03 23:51:01 +00:00
Jakob Stoklund Olesen	4043d92872	Assert when reserved registers have been assigned. This can only happen if the set of reserved registers changes during register allocation. <rdar://problem/10625436> llvm-svn: 147486	2012-01-03 22:34:31 +00:00
Nadav Rotem	1e7dda13c8	Fix incorrect widening of the bitcast sdnode in case the incoming operand is integer-promoted. llvm-svn: 147484	2012-01-03 22:12:28 +00:00
Owen Anderson	fcc041eabf	Remove the restriction that target intrinsics can only involve legal types. Targets can perfects well support intrinsics on illegal types, as long as they are prepared to perform custom expansion during type legalization. For example, a target where i64 is illegal might still support the i64 intrinsic operation using pairs of i32's. ARM already does some expansions like this for non-intrinsic operations. llvm-svn: 147472	2012-01-03 20:09:02 +00:00
Lang Hames	c405ac4429	Clarified assert text. llvm-svn: 147471	2012-01-03 20:05:57 +00:00
Nick Lewycky	bc26b2d162	Fix typo in ruler. No functionality change. llvm-svn: 147454	2012-01-03 18:22:43 +00:00

1 2 3 4 5 ...

12876 Commits