llvm-project

mirror of https://github.com/llvm/llvm-project.git synced 2025-05-03 16:46:07 +00:00

Author	SHA1	Message	Date
Chandler Carruth	1071cfa4ae	Teach machine block placement to cope with unnatural loops. These don't get loop info structures associated with them, and so we need some way to make forward progress selecting and placing basic blocks. The technique used here is pretty brutal -- it just scans the list of blocks looking for the first unplaced candidate. It keeps placing blocks like this until the CFG becomes tractable. The cost is somewhat unfortunate, it requires allocating a vector of all basic block pointers eagerly. I have some ideas about how to simplify and optimize this, but I'm trying to get the logic correct first. Thanks to Benjamin Kramer for the reduced test case out of GCC. Sadly there are other bugs that GCC is tickling that I'm reducing and working on now. llvm-svn: 144516	2011-11-14 00:00:35 +00:00
Chandler Carruth	8d15078927	Rewrite #3 of machine block placement. This is based somewhat on the second algorithm, but only loosely. It is more heavily based on the last discussion I had with Andy. It continues to walk from the inner-most loop outward, but there is a key difference. With this algorithm we ensure that as we visit each loop, the entire loop is merged into a single chain. At the end, the entire function is treated as a "loop", and merged into a single chain. This chain forms the desired sequence of blocks within the function. Switching to a single algorithm removes my biggest problem with the previous approaches -- they had different behavior depending on which system triggered the layout. Now there is exactly one algorithm and one basis for the decision making. The other key difference is how the chain is formed. This is based heavily on the idea Andy mentioned of keeping a worklist of blocks that are viable layout successors based on the CFG. Having this set allows us to consistently select the best layout successor for each block. It is expensive though. The code here remains very rough. There is a lot that needs to be done to clean up the code, and to make the runtime cost of this pass much lower. Very much WIP, but this was a giant chunk of code and I'd rather folks see it sooner than later. Everything remains behind a flag of course. I've added a couple of tests to exercise the issues that this iteration was motivated by: loop structure preservation. I've also fixed one test that was exhibiting the broken behavior of the previous version. llvm-svn: 144495	2011-11-13 11:20:44 +00:00
Chad Rosier	1198d894d0	The order in which the predicate is added differs between Thumb and ARM mode. Fix predicate when in ARM mode and restore SelectIntrinsicCall. llvm-svn: 144494	2011-11-13 09:44:21 +00:00
Chad Rosier	a476e391f1	Temporarily disable SelectIntrinsicCall when in ARM mode. This is causing failures. llvm-svn: 144492	2011-11-13 05:14:43 +00:00
Chad Rosier	c8cfd3a8fb	Add support for emitting both signed- and zero-extend loads. Fix SimplifyAddress to handle either a 12-bit unsigned offset or the ARM +/-imm8 offsets (addressing mode 3). This enables a load followed by an integer extend to be folded into a single load. For example: ldrb r1, [r0] ldrb r1, [r0] uxtb r2, r1 => mov r3, r2 mov r3, r1 llvm-svn: 144488	2011-11-13 02:23:59 +00:00
Jakob Stoklund Olesen	6ddb767fb5	Remove the -color-ss-with-regs option. It was off by default. The new register allocators don't have the problems that made it necessary to reallocate registers during stack slot coloring. llvm-svn: 144481	2011-11-13 00:31:23 +00:00
Jakob Stoklund Olesen	7ef502f6d1	Delete the 'standard' spiller with used the old spilling framework. The current register allocators all use the inline spiller. llvm-svn: 144477	2011-11-12 23:29:02 +00:00
Jakob Stoklund Olesen	ce4ef9f8d5	Remove histogram tests. Counting the number of occurences of each opcode is not a useful test. llvm-svn: 144474	2011-11-12 22:39:40 +00:00
Jakob Stoklund Olesen	0eac531bc2	RAGreedy is better about hinting now. Or maybe we are just getting lucky. llvm-svn: 144473	2011-11-12 22:39:37 +00:00
Jakob Stoklund Olesen	8ec1a92afd	Linear scan is going away. llvm-svn: 144472	2011-11-12 22:39:34 +00:00
Jakob Stoklund Olesen	654d60888e	XFAIL test that depends on linear scan to remove dead code. Filed PR11364 to track the problem. Should the register allocator eliminate dead code? llvm-svn: 144471	2011-11-12 22:39:30 +00:00
Jakob Stoklund Olesen	fa3a8ee6e2	Remove obsolete test. This test was committed with a bugfix to RemoveCopyByCommutingDef, but that optimization is no longer triggered by this test. llvm-svn: 144470	2011-11-12 22:39:27 +00:00
Jakob Stoklund Olesen	80b3d299a9	Remove obsolete test. This test is for a very specific LocalRewriter bug. LocalRewriter is going away. llvm-svn: 144469	2011-11-12 22:39:24 +00:00
Jakob Stoklund Olesen	0c7d9d90ef	Remove obsolete test. I don't think this test does what is was supposed to do, and LocalRewriter is going away anyway. llvm-svn: 144463	2011-11-12 20:37:57 +00:00
Jakob Stoklund Olesen	126f9779c3	Eliminate more linear scan tests. llvm-svn: 144462	2011-11-12 20:35:26 +00:00
Jakob Stoklund Olesen	9d090daa33	Switch a couple -O0 tests to RABasic. llvm-svn: 144461	2011-11-12 20:11:04 +00:00
Jakob Stoklund Olesen	4deff7bc1d	Switch a few tests off linearscan. llvm-svn: 144460	2011-11-12 19:53:52 +00:00
Jakob Stoklund Olesen	6ac6aa782d	Delete old test of a VirtRegRewriter feature. This test doesn't expose the issue with RAGreedy. I filed PR11363 to track the missing InlineSpiller feature. llvm-svn: 144459	2011-11-12 19:53:48 +00:00
Jakob Stoklund Olesen	74d091b395	Remove old test that doesn't make sense. The test is checking that the output doesn't contains any 'mov ' strings. It does contain movl, though. llvm-svn: 144458	2011-11-12 19:53:45 +00:00
Craig Topper	3dc75f9e3b	Add more AVX2 shift lowering support. Move AVX2 variable shift to use patterns instead of custom lowering code. llvm-svn: 144457	2011-11-12 09:58:49 +00:00
Nick Lewycky	d48ab84556	Don't try to loop on iterators that are potentially invalidated inside the loop. Fixes PR11361! llvm-svn: 144454	2011-11-12 03:09:12 +00:00
Eli Friedman	ecb453805d	Make sure scalarrepl picks the correct alloca when it rewrites a bitcast. Fixes PR11353. llvm-svn: 144442	2011-11-12 02:07:50 +00:00
Rafael Espindola	e7cc8bff82	The dwarf standard says that the only differences between a out-of-line instance and a concrete inlined instance are the use of DW_TAG_subprogram instead of DW_TAG_inlined_subroutine and the who owns the tree. We were also omitting DW_AT_inline from the abstract roots. To fix this, make sure we mark abstract instance roots with DW_AT_inline even when we have only out-of-line instances referring to them with DW_AT_abstract_origin. FileCheck is not a very good tool for tests like this, maybe we should add a -verify mode to llvm-dwarfdump. llvm-svn: 144441	2011-11-12 01:57:54 +00:00
Eli Friedman	9d448e4a42	Don't try to form pre/post-indexed loads/stores until after LegalizeDAG runs. Fixes PR11029. llvm-svn: 144438	2011-11-12 00:35:34 +00:00
Jim Grosbach	609d113874	ARM optional size suffix for VLDR/VSTR syntax. llvm-svn: 144427	2011-11-11 23:34:43 +00:00
Chad Rosier	a7ebc5617d	Add support in fast-isel for selecting memset/memcpy/memmove intrinsics. llvm-svn: 144426	2011-11-11 23:31:03 +00:00
Chad Rosier	ab1a4a2301	Loosen test by using REs. Approved by Devang. llvm-svn: 144425	2011-11-11 23:25:38 +00:00
Andrew Trick	28c1d18434	Preserve MachineMemOperands in ARMLoadStoreOptimizer. Fixes PR8113. llvm-svn: 144409	2011-11-11 22:18:09 +00:00
Jim Grosbach	85a2343b01	ARM allow Q registers in vldm/vstm register lists. rdar://9672822 llvm-svn: 144407	2011-11-11 21:27:40 +00:00
Devang Patel	a90cb76489	Move X86 specific test in X86 directory. llvm-svn: 144395	2011-11-11 18:13:19 +00:00
Devang Patel	a39794b029	Move X86 specific test in X86 directory. llvm-svn: 144394	2011-11-11 18:10:38 +00:00
Dan Bailey	089cc53232	allow non-device function calls in PTX when natively handling device-side printf llvm-svn: 144388	2011-11-11 14:45:12 +00:00
Craig Topper	ea28a34c43	Add lowering for AVX2 shift instructions. llvm-svn: 144380	2011-11-11 07:39:23 +00:00
Chad Rosier	7ddd63ce4e	Add support for using immediates with select instructions. rdar://10412592 llvm-svn: 144376	2011-11-11 06:20:39 +00:00
Eli Friedman	c4a001478c	Make sure to expand SIGN_EXTEND_INREG for NEON vectors. PR11319, round 3. llvm-svn: 144361	2011-11-11 03:16:38 +00:00
Eli Friedman	0a309292c4	Get rid of an optimization in SCCP which appears to have many issues. Specifically, it doesn't handle many cases involving undef correctly, and it is missing other checks which lead to it trying to re-mark a value marked as a constant with a different value. It also appears to trigger very rarely. Fixes PR11357. llvm-svn: 144352	2011-11-11 01:16:15 +00:00
Chad Rosier	2a3503e061	Add support for using MVN to materialize negative constants. rdar://10412592 llvm-svn: 144348	2011-11-11 00:36:21 +00:00
Jim Grosbach	9bded9dc24	Thumb2 parsing for push/pop w/ hi registers in the reglist. rdar://10130228. llvm-svn: 144331	2011-11-10 23:17:11 +00:00
Rafael Espindola	79278365d3	Check in getOrCreateSubprogramDIE if a declaration exists and if so output it first. This is a more general fix to pr11300. llvm-svn: 144324	2011-11-10 22:34:29 +00:00
Jim Grosbach	5a5ce63742	Thumb MUL assembly parsing for 3-operand form. Get the source register that isn't tied to the destination register correct, even when the assembly source operand order is backwards. rdar://10428630 llvm-svn: 144322	2011-11-10 22:10:12 +00:00
Chad Rosier	d1762e00e2	When in ARM mode, LDRH/STRH require special handling of negative offsets. For correctness, disable this for now. rdar://10418009 llvm-svn: 144316	2011-11-10 21:09:49 +00:00
Jim Grosbach	c14871cc67	ARM assembly parsing for LSR/LSL/ROR(immediate). More of rdar://9704684 llvm-svn: 144301	2011-11-10 19:18:01 +00:00
Jim Grosbach	61db5a59f7	ARM assembly parsing for ASR(immediate). Start of rdar://9704684 llvm-svn: 144293	2011-11-10 16:44:55 +00:00
NAKAMURA Takumi	270354100a	test/CodeGen/X86/lsr-loop-exit-cond.ll: Try to appease linux and freebsd bots to specify explicit -mtriple=x86_64-darwin. I guess it expects -relocation-model=pic. llvm-svn: 144290	2011-11-10 14:18:59 +00:00
Evan Cheng	d33b2d6b7a	Use a bigger hammer to fix PR11314 by disabling the "forcing two-address instruction lower optimization" in the pre-RA scheduler. The optimization, rather the hack, was done before MI use-list was available. Now we should be able to implement it in a better way, perhaps in the two-address pass until a MI scheduler is available. Now that the scheduler has to backtrack to handle call sequences. Adding artificial scheduling constraints is just not safe. Furthermore, the hack is not taking all the other scheduling decisions into consideration so it's just as likely to pessimize code. So I view disabling this optimization goodness regardless of PR11314. llvm-svn: 144267	2011-11-10 07:43:16 +00:00
Chad Rosier	3fbd094ad9	For immediate encodings of icmp, zero or sign extend first. Then determine if the value is negative and flip the sign accordingly. rdar://10422026 llvm-svn: 144258	2011-11-10 01:30:39 +00:00
Jakob Stoklund Olesen	eef48b6938	Strip old implicit operands after foldMemoryOperand. The TII.foldMemoryOperand hook preserves implicit operands from the original instruction. This is not what we want when those implicit operands refer to the register being spilled. Implicit operands referring to other registers are preserved. This fixes PR11347. llvm-svn: 144247	2011-11-10 00:17:03 +00:00
Jim Grosbach	25bc090170	Thumb2 assembly parsing STMDB w/ optional .w suffix. rdar://10422955 llvm-svn: 144242	2011-11-09 23:44:23 +00:00
Eli Friedman	2d4055b683	Make sure we correctly unroll conversions between v2f64 and v2i32 on ARM. llvm-svn: 144241	2011-11-09 23:36:02 +00:00
Pete Cooper	856977cb15	DeadStoreElimination can now trim the size of a store if the end of the store is dead. Currently checks alignment and killing stores on a power of 2 boundary as this is likely to trim the size of the earlier store without breaking large vector stores into scalar ones. Fixes <rdar://problem/10140300> llvm-svn: 144239	2011-11-09 23:07:35 +00:00

1 2 3 4 5 ...

14955 Commits