llvm-project

mirror of https://github.com/llvm/llvm-project.git synced 2025-05-10 22:36:05 +00:00

Author	SHA1	Message	Date
Anton Korobeynikov	82c02b28f3	Make StripPointerCast a common function (should we mak it method of Value instead?) llvm-svn: 50775	2008-05-06 22:52:30 +00:00
Dan Gohman	6a2da37c0e	Make several variable declarations static. llvm-svn: 50696	2008-05-06 01:53:16 +00:00
Mon P Wang	3e58393c3d	Added addition atomic instrinsics and, or, xor, min, and max. llvm-svn: 50663	2008-05-05 19:05:59 +00:00
Dan Gohman	ea6357828b	Use push_back(...) instead of resize(1, ...), per review feedback. llvm-svn: 50561	2008-05-02 00:03:54 +00:00
Dan Gohman	752ce50b2d	Fix uninitialized uses of the FPC variable. llvm-svn: 50558	2008-05-01 23:40:44 +00:00
Chris Lattner	d4b2a67cf3	don't randomly miscompile seto/setuo just because we are in ffastmath mode. This fixes rdar://5902801, a miscompilation of gcc.dg/builtins-8.c. Bill, please pull this into Tak. llvm-svn: 50523	2008-05-01 07:26:11 +00:00
Arnold Schwaighofer	be0de34ede	Tail call optimization improvements: Move platform independent code (lowering of possibly overwritten arguments, check for tail call optimization eligibility) from target X86ISelectionLowering.cpp to TargetLowering.h and SelectionDAGISel.cpp. Initial PowerPC tail call implementation: Support ppc32 implemented and tested (passes my tests and test-suite llvm-test). Support ppc64 implemented and half tested (passes my tests). On ppc tail call optimization is performed if caller and callee are fastcc call is a tail call (in tail call position, call followed by ret) no variable argument lists or byval arguments option -tailcallopt is enabled Supported: * non pic tail calls on linux/darwin * module-local tail calls on linux(PIC/GOT)/darwin(PIC) * inter-module tail calls on darwin(PIC) If constraints are not met a normal call will be emitted. A test checking the argument lowering behaviour on x86-64 was added. llvm-svn: 50477	2008-04-30 09:16:33 +00:00
Chris Lattner	5c88f7b1ad	make the vector conversion magic handle multiple results. We now compile test2/test3 to: _test2: ## InlineAsm Start set %xmm0, %xmm1 ## InlineAsm End addps %xmm1, %xmm0 ret _test3: ## InlineAsm Start set %xmm0, %xmm1 ## InlineAsm End paddd %xmm1, %xmm0 ret as expected. llvm-svn: 50389	2008-04-29 04:48:56 +00:00
Chris Lattner	f9a49c4322	add support for multiple return values in inline asm. This is a step towards PR2094. It now compiles the attached .ll file to: _sad16_sse2: movslq %ecx, %rax ## InlineAsm Start %ecx %rdx %rax %rax %r8d %rdx %rsi ## InlineAsm End ## InlineAsm Start set %eax ## InlineAsm End ret which is pretty decent for a 3 output, 4 input asm. llvm-svn: 50386	2008-04-29 04:29:54 +00:00
Evan Cheng	b96782ecbd	Fix a bug in RegsForValue::getCopyToRegs() that causes cyclical scheduling units. If it's creating multiple CopyToReg nodes that are "flagged" together, it should not create a TokenFactor for it's chain outputs: c1, f1 = CopyToReg c2, f2 = CopyToReg c3 = TokenFactor c1, c2 ... = user c3, ..., f2 Now that the two CopyToReg's and the user are "flagged" together. They effectively forms a single scheduling unit. The TokenFactor is now both an operand and a successor of the Flagged nodes. llvm-svn: 50376	2008-04-28 22:07:13 +00:00
Dan Gohman	77ce6da378	Delete an unused constructor. llvm-svn: 50367	2008-04-28 18:28:49 +00:00
Dan Gohman	d961d30b7f	Add a comment to CreateRegForValue that clarifies the handling of aggregate types. llvm-svn: 50366	2008-04-28 18:19:43 +00:00
Dan Gohman	80c692d439	Rewrite the comments for RegsForValue and its members, and reorder some of the members for clarity. llvm-svn: 50365	2008-04-28 18:10:39 +00:00
Dan Gohman	14a05df97b	Don't call size() on each iteration of the loop. llvm-svn: 50361	2008-04-28 17:42:03 +00:00
Chris Lattner	c9e280c78a	Another collection of random cleanups. No functionality change. llvm-svn: 50341	2008-04-28 07:16:35 +00:00
Chris Lattner	52504e78fb	Remove the SmallVector ctor that converts from a SmallVectorImpl. This conversion open the door for many nasty implicit conversion issues, and can be easily solved by initializing with (V.begin(), V.end()) when needed. This patch includes many small cleanups for sdisel also. llvm-svn: 50340	2008-04-28 06:44:42 +00:00
Chris Lattner	8c7f5ad968	switch RegsForValue::Regs to be a SmallVector to avoid heap thrash on tiny (usually single-element) vectors. llvm-svn: 50335	2008-04-28 06:02:19 +00:00
Chris Lattner	d04b818a91	move static function out of anon namespace, no functionality change. llvm-svn: 50330	2008-04-27 23:48:12 +00:00
Chris Lattner	122721843b	Another step to getting multiple result inline asm to work. llvm-svn: 50329	2008-04-27 23:44:28 +00:00
Chris Lattner	2237973438	Implement a signficant optimization for inline asm: When choosing between constraints with multiple options, like "ir", test to see if we can use the 'i' constraint and go with that if possible. This produces more optimal ASM in all cases (sparing a register and an instruction to load it), and fixes inline asm like this: void test () { asm volatile (" %c0 %1 " : : "imr" (42), "imr"(14)); } Previously we would dump "42" into a memory location (which is ok for the 'm' constraint) which would cause a problem because the 'c' modifier is not valid on memory operands. Isn't it great how inline asm turns 'missed optimization' into 'compile failed'?? Incidentally, this was the todo in PowerPC/2007-04-24-InlineAsm-I-Modifier.ll Please do NOT pull this into Tak. llvm-svn: 50315	2008-04-27 00:37:18 +00:00
Chris Lattner	a937baeb9b	isa+cast -> dyn_cast llvm-svn: 50314	2008-04-27 00:16:18 +00:00
Chris Lattner	4793515a9c	Move a bunch of inline asm code out of line. llvm-svn: 50313	2008-04-27 00:09:47 +00:00
Dan Gohman	ca95a5f49f	Remove the code from CodeGenPrepare that moved getresult instructions to the block that defines their operands. This doesn't work in the case that the operand is an invoke, because invoke is a terminator and must be the last instruction in a block. Replace it with support in SelectionDAGISel for copying struct values into sequences of virtual registers. llvm-svn: 50279	2008-04-25 18:27:55 +00:00
Dan Gohman	e9e3891c09	Use isa instead of dyn_cast. llvm-svn: 50181	2008-04-23 20:25:16 +00:00
Dan Gohman	b418aafabf	Add support to codegen for getresult instructions with undef operands. llvm-svn: 50180	2008-04-23 20:21:29 +00:00
Nicolas Geoffray	7000c8f1aa	Change Divided flag to Split, as suggested by Evan llvm-svn: 49715	2008-04-15 08:08:50 +00:00
Nicolas Geoffray	db0ea1ff4e	Fix /test/CodeGen/PowerPC/big-endian-actual-args.ll for linux/ppc32 llvm-svn: 49652	2008-04-14 17:17:14 +00:00
Nicolas Geoffray	dcc2eda5fc	Add a divided flag for the first piece of an argument divided into mulitple parts. Fixes PR1643 llvm-svn: 49611	2008-04-13 13:40:22 +00:00
Dan Gohman	544ab2c50b	Drop ISD::MEMSET, ISD::MEMMOVE, and ISD::MEMCPY, which are not Legal on any current target and aren't optimized in DAGCombiner. Instead of using intermediate nodes, expand the operations, choosing between simple loads/stores, target-specific code, and library calls, immediately. Previously, the code to emit optimized code for these operations was only used at initial SelectionDAG construction time; now it is used at all times. This fixes some cases where rep;movs was being used for small copies where simple loads/stores would be better. This also cleans up code that checks for alignments less than 4; let the targets make that decision instead of doing it in target-independent code. This allows x86 to use rep;movs in low-alignment cases. Also, this fixes a bug that resulted in the use of rep;stos for memsets of 0 with non-constant memory size when the alignment was at least 4. It's better to use the library in this case, which can be significantly faster when the size is large. This also preserves more SourceValue information when memory intrinsics are lowered into simple loads/stores. llvm-svn: 49572	2008-04-12 04:36:06 +00:00
Dale Johannesen	0ce4a7cc44	Make sure both PendingLoads and PendingExports are flushed before an invoke. Failure to do this causes references in the landing pad to variables that were not set. Fixes g++.dg/eh/delayslot1.C g++.dg/eh/fp-regs.C g++.old-deja/g++.brendan/eh1.C llvm-svn: 49243	2008-04-04 23:48:31 +00:00
Dale Johannesen	fd967cf3fa	Recommitting EH patch; this should answer most of the review feedback. -enable-eh is still accepted but doesn't do anything. EH intrinsics use Dwarf EH if the target supports that, and are handled by LowerInvoke otherwise. The separation of the EH table and frame move data is, I think, logically figured out, but either one still causes full EH info to be generated (not sure how to split the metadata correctly). MachineModuleInfo::needsFrameInfo is no longer used and is removed. llvm-svn: 49064	2008-04-02 00:25:04 +00:00
Dale Johannesen	5e4e051c2a	Revert 49006 for the moment. llvm-svn: 49046	2008-04-01 20:00:57 +00:00
Dale Johannesen	7d02cf3c9c	Emit exception handling info for functions which are not marked nounwind, or for all functions when -enable-eh is set, provided the target supports Dwarf EH. llvm-gcc generates nounwind in the right places; other FEs will need to do so also. Given such a FE, -enable-eh should no longer be needed. llvm-svn: 49006	2008-03-31 23:40:23 +00:00
Chris Lattner	0f760dfe09	Fix "Control reaches the end of non-void function" warnings, patch by David Chisnall. llvm-svn: 48963	2008-03-30 18:22:13 +00:00
Dan Gohman	cad51cb671	Avoid creating chain dependencies from CopyToReg nodes to load and store nodes. This doesn't currently have much impact the generated code, but it does produce simpler-looking SelectionDAGs, and consequently simpler-looking ScheduleDAGs, because there are fewer spurious dependencies. In particular, CopyValueToVirtualRegister now uses the entry node as the input chain dependency for new CopyToReg nodes instead of calling getRoot and depending on the most recent memory reference. Also, rename UnorderedChains to PendingExports and pull it up from being a local variable in SelectionDAGISel::BuildSelectionDAG to being a member variable of SelectionDAGISel, so that it doesn't have to be passed around to all the places that need it. llvm-svn: 48893	2008-03-27 19:56:19 +00:00
Duncan Sands	d97eea372a	Introduce a new node for holding call argument flags. This is needed by the new legalize types infrastructure which wants to expand the 64 bit constants previously used to hold the flags on 32 bit machines. There are two functional changes: (1) in LowerArguments, if a parameter has the zext attribute set then that is marked in the flags; before it was being ignored; (2) PPC had some bogus code for handling two word arguments when using the ELF 32 ABI, which was hard to convert because of the bogusness. As suggested by the original author (Nicolas Geoffray), I've disabled it for the moment. Tested with "make check" and the Ada ACATS testsuite. llvm-svn: 48640	2008-03-21 09:14:45 +00:00
Duncan Sands	858e6385f7	Do not generate special entries in the dwarf eh table for nounwind calls. llvm-svn: 48373	2008-03-14 21:36:24 +00:00
Duncan Sands	87de65fc29	Don't try to extract an i32 from an f64. This getCopyToParts problem was noticed by the new LegalizeTypes infrastructure. In order to avoid this kind of thing in the future I've added a check that EXTRACT_ELEMENT is only used with integers. Once LegalizeTypes is up and running most likely BUILD_PAIR and EXTRACT_ELEMENT can be removed, in favour of using apints instead. llvm-svn: 48294	2008-03-12 20:30:08 +00:00
Dan Gohman	1351025a91	Initial codegen support for functions and calls with multiple return values. llvm-svn: 48244	2008-03-11 21:11:25 +00:00
Scott Michel	a6729e8666	Give TargetLowering::getSetCCResultType() a parameter so that ISD::SETCC's return ValueType can depend its operands' ValueType. This is a cosmetic change, no functionality impacted. llvm-svn: 48145	2008-03-10 15:42:14 +00:00
Dale Johannesen	4e622ec86d	Increase ISD::ParamFlags to 64 bits. Increase the ByValSize field to 32 bits, thus enabling correct handling of ByVal structs bigger than 0x1ffff. Abstract interface a bit. Fixes gcc.c-torture/execute/pr23135.c and gcc.c-torture/execute/pr28982b.c in gcc testsuite (were ICE'ing on ppc32, quietly producing wrong code on x86-32.) llvm-svn: 48122	2008-03-10 02:17:22 +00:00
Chris Lattner	4c4234b59c	remove an extraneous (and ugly) default argument, thanks Duncan. llvm-svn: 48117	2008-03-09 20:04:36 +00:00
Chris Lattner	ce5f841bb5	fp_round's produced by getCopyFromParts should always be exact, because they are produced by calls (which are known exact) and by cross block copies which are known to be produced by extends. This improves: define double @test2() { %tmp85 = call double asm sideeffect "fld0", "={st(0)}"() ret double %tmp85 } from: _test2: subl $20, %esp # InlineAsm Start fld0 # InlineAsm End fstpl 8(%esp) movsd 8(%esp), %xmm0 movsd %xmm0, (%esp) fldl (%esp) addl $20, %esp #FP_REG_KILL ret to: _test2: # InlineAsm Start fld0 # InlineAsm End #FP_REG_KILL ret by avoiding a f64 <-> f80 trip llvm-svn: 48108	2008-03-09 09:38:46 +00:00
Chris Lattner	83b3473dd8	extend fp values with FP_EXTEND not FP_ROUND. llvm-svn: 48097	2008-03-09 07:47:22 +00:00
Evan Cheng	95cf661534	Implement x86 support for @llvm.prefetch. It corresponds to prefetcht{0\|1\|2} and prefetchnta instructions. llvm-svn: 48042	2008-03-08 00:58:38 +00:00
Dan Gohman	ec6be4a782	Use the new APInt-enabled form of getConstant instead of converting an APInt into a uint64_t to call getConstant. llvm-svn: 47742	2008-02-29 01:41:59 +00:00
Evan Cheng	ccc0c996a4	Refactor inline asm constraint matching code out of SDIsel into TargetLowering. llvm-svn: 47587	2008-02-26 02:33:44 +00:00
Dan Gohman	1f372edd97	Convert MaskedValueIsZero and all its users to use APInt. Also add a SignBitIsZero function to simplify a common use case. llvm-svn: 47561	2008-02-25 21:11:39 +00:00
Dale Johannesen	eabc5f39af	Pass alignment on ByVal parameters, from FE, all the way through. It is now used for codegen. llvm-svn: 47484	2008-02-22 17:49:45 +00:00
Chris Lattner	3422b673d1	Make the clobber analysis a bit more smart: we only are careful about early clobbers if the clobber list contains a register not some thing like {memory}, {dirflag} etc. llvm-svn: 47457	2008-02-21 20:54:31 +00:00

1 2 3 4 5 ...

622 Commits