aaron/ispc - ispc - git.frat.tech

Author	SHA1	Message	Date
Matt Pharr	d943455e10	Issue error on overloaded "export"ed functions. Issue #270.	2012-05-25 10:35:34 -07:00
Matt Pharr	fd03ba7586	Export reference parameters as C++ references, not pointers.	2012-05-24 07:12:48 -07:00
Matt Pharr	2c5a57e386	Fix bugs related to varying pointers to functions that return void.	2012-05-23 14:29:17 -07:00
Matt Pharr	e8858150cb	Allow redundant semicolons at global scope. (Ingo Wald)	2012-05-23 14:20:20 -07:00
Matt Pharr	333f901187	Fix build with LLVM 3.2 dev top-of-tree	2012-05-23 14:19:50 -07:00
Matt Pharr	7dd4d6c75e	Update for LLVM 3.2dev API change	2012-05-22 15:53:14 -07:00
Matt Pharr	99f57cfda6	Issue more sensible error message for varying pointers in exported functions.	2012-05-18 12:00:11 -07:00
Matt Pharr	4d1eb94dfd	Fix bug in AddElementOffset() error checking.	2012-05-18 11:57:05 -07:00
Matt Pharr	22d584f302	Don't issue perf. warnings for various conversions with generic target.	2012-05-18 11:56:11 -07:00
Matt Pharr	72c41f104e	Fix various malformed program crashes.	2012-05-18 10:44:45 -07:00
Matt Pharr	8d3ac3ac1e	Fix build with LLVM ToT	2012-05-18 10:09:09 -07:00
Matt Pharr	299ae186f1	Expect support for half and transcendentals from all generic targets	2012-05-18 06:13:45 -07:00
Matt Pharr	f4df2fb176	Improvements to mask update code for generic targets. Rather than XOR'ing with a temporary 'all-on' vector, we call __not. Also, we call out to __and_not1 and __and_not2, for an AND where the first or second operand, respectively, has had NOT applied to it.	2012-05-16 13:52:51 -07:00
Matt Pharr	625fbef613	Fix Windows build	2012-05-15 12:19:10 -07:00
Matt Pharr	fbed0ac56b	Remove allOffMaskIsSafe from Target. The intent of this was to indicate whether it was safe to run code with an 'all off' mask on the given target (and then sometimes be more flexible about e.g. running both true and false blocks of if statements, etc.) The problem is that even if the architecture has full native mask support, it's still not safe to run 'uniform' memory operations with the mask all off. Even more tricky, we sometimes transform masked varying memory operations to uniform ones during optimization (e.g. gather->load and broadcast). This fixes a number of the tests/switch-* tests that were failing on the generic targets due to this issue.	2012-05-09 14:18:47 -07:00
Matt Pharr	dc120f3962	Fix regression in masked_store_blend for generic target. In `ee1fe3aa9f`, the LLVM_VERSION define was updated to never have the 'svn' suffix and the build was updated to handle LLVM 3.2. This file had a check for LLVM_3_1svn that was no longer hitting. This fixes some issues with unnecessary loads and stores in generated C++ code for the generic targets.	2012-05-09 14:18:47 -07:00
Matt Pharr	4f053e5b83	Pass OPT flags when linking	2012-05-08 13:25:09 -07:00
Matt Pharr	c6241581a0	Add an extra parameter to __smear functions to encode return type. Now, the __smear* functions in generated C++ code have an unused first parameter of the desired return type; this allows us to have headers that include variants of __smear for multiple target widths. (This approach is necessary since we can't overload by return type in C++.) Issue #256.	2012-05-08 09:54:23 -07:00
Nipunn Koorapati	041ade66d5	Placated compiler by initializing variable	2012-05-06 06:59:17 -07:00
Nipunn Koorapati	067a2949ba	Added syntax highlighting for 'uniform' and 'varying' types.	2012-05-06 06:58:53 -07:00
Matt Pharr	55c754750e	Remove a number of redundant/unneeded optimization passes. Performance and code quality of the performance suite are unchanged; compilation times are improved by another 20% or so for simple programs (e.g. rt.ispc). One very complex program compiles about 2.4x faster now.	2012-05-05 15:47:24 -07:00
Matt Pharr	72b6c12856	Notify LLVM pass mgr that the MakeInternalFuncsStaticPass doesn't change the CFG.	2012-05-05 15:47:24 -07:00
Matt Pharr	15ea0af687	Add -f option to run_tests.py. This allows providing additional command-line arguments to ispc, e.g. to force compilation with -O1, -g, etc.	2012-05-05 15:47:24 -07:00
Matt Pharr	ee7e367981	Do global dead code elimination early in optimization. This gives a 15-20% speedup in compilation time for simple programs (but only ~2% for the big 21k monster program).	2012-05-05 15:47:19 -07:00
Matt Pharr	8006589828	Use llvm::SmallVectors for struct member types and function types. Further reduction of dynamic memory allocation...	2012-05-04 13:55:38 -07:00
Matt Pharr	413264eaae	Make return values const &s to save copying.	2012-05-04 13:55:38 -07:00
Matt Pharr	7db8824da2	Reduce dynamic memory allocation in getting unif/varying variants of AtomicTypes	2012-05-04 13:55:38 -07:00
Matt Pharr	e1bc010bd1	More reduction of dynamic allocations in lDoTypeConv()	2012-05-04 13:55:38 -07:00
Matt Pharr	bff02017da	Cache const/non-const variants of Atomic and ReferenceTypes. More reduction of dynamic memory allocation.	2012-05-04 13:55:38 -07:00
Matt Pharr	c0019bd8e5	Cache type and lvalue type in IndexExpr and MemberExpr. This saves a bunch of redundant work and unnecessary duplicated memory allocations.	2012-05-04 13:55:38 -07:00
Matt Pharr	e495ef2c48	Reduce dynamic memory allocation by reusing scope maps in symbol table.	2012-05-04 13:55:38 -07:00
Matt Pharr	78d62705cc	Cache element types in StructType. Previously, GetElementType() would end up causing dynamic allocation to happen to compute the final element type (turning types with unbound variability into the same type with the struct's variability) each time it was called, which was wasteful and slow. Now we cache the result. Another 20% perf on compiling that problematic program.	2012-05-04 13:55:38 -07:00
Matt Pharr	2791bd0015	Improve performance of lCheckTypeEquality(). We don't need to explicitly create the non-const Types to do type comparison when ignoring const-ness in the check. We can also save some unnecessary dynamic memory allocation by keeping strings returned from GetStructName() as references to strings. This gives another 10% on front-end perf on that big program.	2012-05-04 13:55:38 -07:00
Matt Pharr	7cf66eb61f	Small optimizations to various AtomicType methods.	2012-05-04 13:55:38 -07:00
Matt Pharr	944c53bff1	Stop using dynamic_cast for Types. We now have a set of template functions CastType<AtomicType>, etc., that in turn use a new typeId field in each Type instance, allowing them to be inlined and to be quite efficient. This improves front-end performance for a particular large program by 28%.	2012-05-04 13:55:38 -07:00
Matt Pharr	c756c855ea	Compile with -O2 by default on Linux/OSX.	2012-05-04 13:55:37 -07:00
Matt Pharr	58bb2826b2	Perf: cache connection between const/non-const struct variants. In one very large program, we were spending quite a bit of time repeatedly getting const variants of StructTypes. This speeds up the front-end by about 40% for that test case. (This is something of a band-aid, pending uniquing types.)	2012-05-04 13:55:37 -07:00
Nipunn Koorapati	b7bef87a4d	Added README for vim syntax highlighting.	2012-05-03 14:23:33 -07:00
Matt Pharr	0c1b206185	Pass log/exp/pow transcendentals through to targets that support them. Currently, this is the generic targets.	2012-05-03 13:49:56 -07:00
Matt Pharr	7d7e99a92c	Update ISPC_MINOR_VERSION to 2 (This should have been done with the 1.2.0 release!)	2012-05-03 12:04:24 -07:00
Matt Pharr	1ba8d7ef74	Fix test that had undefined behavior.	2012-05-03 11:11:21 -07:00
Matt Pharr	d99bd279e8	Add generic-32 target.	2012-05-03 11:11:06 -07:00
Matt Pharr	ee1fe3aa9f	Update build to handle existence of LLVM 3.2 dev branch. We now compile with LLVM 3.0, 3.1, and 3.2svn.	2012-05-03 08:25:25 -07:00
Matt Pharr	c4b1d79c5c	When a function is defined, set its symbol's position to the code position. Before, if the function was declared before being defined, then the symbol's SourcePos would be left set to the position of the declaration. This ended up getting the debugging symbols mixed up in this case, which was undesirable.	2012-04-28 20:28:39 -07:00
Matt Pharr	a1a43cdfe0	Fix bug so that programIndex (et al.) are available in the debugger. It's now possible to successfully print out the value of programIndex, programCount, etc., in the debugger. The issue was that they were defined as having InternalLinkage, which meant that DCE removed them at the end of compilation. Now they're declared to have WeakODRLinkage, which ensures that one copy survives (but there aren't multiply-defined symbols when compiling multiple files.)	2012-04-28 17:12:57 -07:00
Matt Pharr	27b62781cc	Fix bug in lStripUnusedDebugInfo(). This was causing an assert to hit in llvm's DwarfDebug.cpp.	2012-04-28 13:06:29 -10:00
Matt Pharr	0c5d7ff8f2	Add rygorous's float->srgb8 conversion routine to the stdlib. Issue #230	2012-04-27 10:03:19 -10:00
Matt Pharr	0e2b315ded	Add FAQ about foreach code generation. (i.e. "why's there that extra stuff at the end and what can I do about it if it's not necessary?") Issue #231.	2012-04-27 09:35:37 -10:00
Matt Pharr	3e74d1c544	Fix documentation bug with typedef.	2012-04-25 17:15:20 -10:00
Matt Pharr	da690acce5	Fix build with LLVM 3.0	2012-04-25 14:27:33 -10:00
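
Commit 944c53bff1 above describes replacing dynamic_cast with CastType<AtomicType>-style template functions that check a typeId field stored in each Type instance. A minimal sketch of that general technique follows. Apart from the names CastType, AtomicType, StructType, Type, and typeId, which appear in the commit messages, everything here is an assumption made for illustration (the enum values, the kTypeId constants, the constructors) and is not taken from ispc's source.

```cpp
// Hypothetical tag values; ispc's real enum is not shown in the log.
enum TypeId { ATOMIC_TYPE, STRUCT_TYPE };

// Base class: every instance records which concrete class it is.
struct Type {
    explicit Type(TypeId id) : typeId(id) {}
    virtual ~Type() = default;
    const TypeId typeId;
};

// Each concrete class advertises its tag as a compile-time constant
// (kTypeId is an illustrative name, not ispc's).
struct AtomicType : Type {
    static constexpr TypeId kTypeId = ATOMIC_TYPE;
    AtomicType() : Type(kTypeId) {}
};

struct StructType : Type {
    static constexpr TypeId kTypeId = STRUCT_TYPE;
    StructType() : Type(kTypeId) {}
};

// Checked downcast: one integer comparison plus a static_cast, which the
// compiler can inline. Returns nullptr on a tag mismatch or a null input,
// so call sites can test the result the same way as with dynamic_cast.
template <typename T>
inline const T *CastType(const Type *type) {
    return (type != nullptr && type->typeId == T::kTypeId)
               ? static_cast<const T *>(type)
               : nullptr;
}
```

With this sketch, CastType<AtomicType>(t) yields a usable pointer only when t was constructed as an AtomicType. The trade-off of this simple version is that it matches exact tags only, so a deeper class hierarchy would need a range or ancestry check instead of a single equality test.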

844 Commits