Both uniform and varying function pointers are supported; when a function
is called through a varying function pointer, each unique function pointer
value across the running program instances is called once for the set of
active program instances that want to call it.
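As a rough sketch of how this can be used (the function names here are just illustrative):

    float square(float x) { return x * x; }
    float twice(float x)  { return 2 * x; }

    float apply(float x) {
        // varying function pointer: each running program instance may end up
        // holding a different function pointer value here
        float (*fptr)(float);
        if ((programIndex & 1) == 0)
            fptr = square;
        else
            fptr = twice;
        // each unique pointer value across the gang is called once, with the
        // instances that hold that value active
        return fptr(x);
    }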
Be better about tracking the full extent of expressions in the parser;
this leads to more intelligible error messages, since we can indicate
exactly where the error happened.
Previously, it was only in the GatherScatterFlattenOpt optimization pass that
we added the per-lane offsets when we were indexing into varying data.
(Specifically, the case of float foo[]; int index; foo[index], where foo
is an array of varying elements rather than uniform elements.) Now, this
is done in the front-end as we're first emitting code.
In addition to the basic ugliness of doing this in an optimization pass,
it was also error-prone to do it there, since by that point we no longer
have access to all of the type information that's available in the front-end.
No functionality or performance change.
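For reference, a small sketch of the case in question (names are illustrative):

    float lookup(float x) {
        float foo[4];
        for (uniform int i = 0; i < 4; ++i)
            foo[i] = x + i;            // foo's elements are varying floats
        int index = programIndex & 3;  // varying index; differs across instances
        return foo[index];             // each lane needs its own offset into foo
    }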
The Expr::TypeConv() method has been replaced with both a
CanConvertTypes() routine that indicates whether one type
can be converted to another and a TypeConvertExpr()
routine that provides the same functionality as
Expr::TypeConv() used to.
This code previously lived in FunctionCallExpr but is now part
of FunctionSymbolExpr. This change doesn't alter any current
functionality, but lays groundwork for function pointers in
the language, where we'll want to do function call overload
resolution at other times besides when a function call is
actually being made.
Specifically, we had been using the full mask for all gathers, rather than
using the internal mask when we were loading from locally-declared arrays.
Thus, given code like:
uniform float x[programCount] = { ... };
float xx = x[programIndex];
Previously, when this code was in a function where the mask wasn't known to be
all on, we weren't generating a plain vector load to initialize xx, even though
we should have been. Now we do.
It's not clear that these are actually all that helpful.
This also works around issue #89, wherein code like "int8 = 0" would
give a warning about conversion from int32 to int8.
Generalize the overload resolution code to be based on estimating a
cost for various overload options and picking the one with the
minimal cost.
Add a step that considers type conversions that are guaranteed not to
lose information in function overload resolution.
Print better diagnostics when we can't find an unambiguous match.
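A plausible example of the kind of call this affects (hypothetical overloads):

    void foo(int x)    { }
    void foo(double x) { }

    void bar(float f) {
        // float -> double doesn't lose information, while float -> int does,
        // so the double overload is the lower-cost (and preferred) match
        foo(f);
    }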
We now maintain the distinction between the value of the mask passed into a
function and the "internal" mask within the function that only accounts for
varying control flow within the function.
The full mask (the AND of the function mask and the internal mask) must be used
for assignments to static and global variables, and to reference function parameters.
Further, it is the appropriate mask to use for making decisions about varying
control flow. However, we can use the internal mask for assignments to variables
declared in the current function (including the return value and non-reference
parameters to the function). Doing so allows us to catch a few more cases where
the internal mask is all on, even if the mask coming into the function wasn't all
on, and thence use moves rather than blends for those assignments. (Which in
turn can allow additional optimizations to happen.)
Fixes issue #23.
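A sketch of where the internal mask helps (illustrative code):

    float scaleAndBias(float x, float scale, float bias) {
        // 'result' is declared in this function and there is no varying control
        // flow here, so the internal mask is all on; this assignment can be a
        // plain move rather than a blend, even if the mask coming into the
        // function wasn't all on.
        float result = x * scale + bias;
        return result;
    }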
Within each function that launches tasks, we can now easily track which
tasks that function launched, so that the sync at the end of the function
can just sync on the tasks launched by that function (not on all tasks
launched by all functions).
Implementing this led to a rework of the task system API that ispc generates
code to call; the example task systems in examples/tasksys.cpp have been
updated to conform to this API. (The updated API is also documented in
the ispc user's guide.)
As part of this, "launch[n]" syntax was added to launch a number of tasks
in a single launch statement, rather than requiring a loop over 'n' to
launch n tasks.
This commit thus fixes issue #84 (enhancement to launch multiple tasks from
a single launch statement) as well as issue #105 (recursive task launches
were broken).
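A sketch of the new syntax (the function and variable names here are illustrative,
not from the examples):

    task void doubleRow(uniform float image[], uniform int width) {
        // taskIndex identifies which of the launched tasks this one is
        for (uniform int i = 0; i < width; ++i)
            image[taskIndex * width + i] *= 2;
    }

    export void doubleImage(uniform float image[], uniform int width,
                            uniform int height) {
        // launch 'height' tasks with a single launch statement
        launch[height] doubleRow(image, width);
        // the sync at the end of this function only waits for the tasks
        // launched by this function
    }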
The intent is that the code in stdlib.ispc that is calling out to the built-ins
should match argument types exactly (using explicit casts as needed), just
for maximal clarity/safety.
Go back to running both sides of 'if' statements with masking and without
branching if we can determine that the code is relatively simple (as per
the simple cost model), and is safe to run even if the mask is 'all off'.
This gives a bit of a performance improvement for some of the examples
(most notably, the ray tracer), and is the code that one wants generated
in this case anyhow.
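For instance, in a simple case like this (illustrative), both branches can be
run under the mask without any branching:

    float absValue(float x) {
        float r;
        // both of these simple assignments are safe to execute with masking
        // even if no lanes are active, so they can run without branches
        if (x < 0)
            r = -x;
        else
            r = x;
        return r;
    }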
This is currently only used to decide whether it's worth doing an
"are all lanes running" check at the start of functions--for small
functions, it's not worth the overhead.
The cost is estimated relatively early in compilation (e.g. before
we know if an array access is a scatter/gather or not, before
constant folding, etc.), so there are many known shortcomings.
Given the change in 0c20483853, this is no longer necessary, since
we know that one instance will always be running if we're executing a
given block of code.
Fixes issue #73. Previously, if we had e.g. an int16 value that was being shifted
left by 1, the constant integer 1 would come in as an int32, so we'd convert
the int16 to an int32 and then do the shift. Now, for shifts, the type
of the expression is always the same as the type of the value being shifted.
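A small illustration of the affected case:

    int16 doubleIt(int16 x) {
        // the constant 1 comes in as an int32, but the shift's type now follows
        // the value being shifted, so this stays an int16 shift
        return x << 1;
    }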
This commit adds support for swizzles like "foo.zy" (if "foo" is,
for example, a float<3> type) as rvalues. (Still need support for
swizzles as lvalues.)
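For example (illustrative):

    float<2> lastTwo(float<3> p) {
        // rvalue swizzle: result is { p.z, p.y }
        return p.zy;
    }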
This way, we match C/C++ in that casting a bool to an int gives either the value
zero or the value one. There is a new stdlib function int sign_extend(bool)
that does sign extension for cases where that's desired.
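A sketch of the intended usage (illustrative):

    int boolToInt(bool b) {
        return (int)b;         // gives 0 or 1, matching C/C++
    }

    int boolToMask(bool b) {
        return sign_extend(b); // gives 0 or -1 when all-bits-set is wanted
    }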