aaron/ispc - ispc - git.frat.tech

aaron/ispc

Author	SHA1	Message	Date
Matt Pharr	f81acbfe80	Implement unbound varibility for struct types. Now, if a struct member has an explicit 'uniform' or 'varying' qualifier, then that member has that variability, regardless of the variability of the struct's variability. Members without 'uniform' or 'varying' have unbound variability, and in turn inherit the variability of the struct. As a result of this, now structs can properly be 'varying' by default, just like all the other types, while still having sensible semantics.	2012-02-21 10:28:31 -08:00
Matt Pharr	6d7ff7eba2	Update defaults for variability of pointed-to types. Now, if rate qualifiers aren't used to specify otherwise, varying pointers point to uniform types by default. As before, uniform pointers point to varying types by default. float foo; // varying pointer to uniform float float uniform foo; // uniform pointer to varying float These defaults seem to require the least amount of explicit uniform/varying qualifiers for most common cases, though TBD if it would be easier to have a single rule that e.g. the pointed-to type is always uniform by default.	2012-02-21 06:27:34 -08:00
Matt Pharr	73bf552cd6	Add support for coalescing memory accesses from gathers. There are two related optimizations that happen now. (These currently only apply for gathers where the mask is known to be all on, and to gathers that are accessing 32-bit sized elements, but both of these may be generalized in the future.) First, for any single gather, we are now more flexible in mapping it to individual memory operations. Previously, we would only either map it to a general gather (one scalar load per SIMD lane), or an unaligned vector load (if the program instances could be determined to be accessing a sequential set of locations in memory.) Now, we are able to break gathers into scalar, 2-wide (i.e. 64-bit), 4-wide, or 8-wide loads. Further, we now generate code that shuffles these loads around. Doing fewer, larger loads in this manner, when possible, can be more efficient. Second, we can coalesce memory accesses across multiple gathers. If we have a series of gathers without any memory writes in the middle, then we try to analyze their reads collectively and choose an efficient set of loads for them. Not only does this help if different gathers reuse values from the same location in memory, but it's specifically helpful when data with AOS layout is being accessed; in this case, we're often able to generate wide vector loads and appropriate shuffles automatically.	2012-02-10 13:10:39 -08:00
Matt Pharr	83c8650b36	Add support for "local" atomics. Also updated aobench example to use them, which in turn allows using foreach() and thence a much cleaner implementation. Issue #58.	2012-02-03 13:15:21 -08:00
Matt Pharr	89cb809922	Short-circuit evaluation of ? : operator for varying tests. ? : now short-circuits evaluation of the expressions following the boolean test for varying test types. (It already did this for uniform tests). Issue #169.	2012-02-01 11:03:58 -08:00
Matt Pharr	fdb4eaf437	Fix bug in &&/\|\| short-circuiting. Use full mask, not internal mask when checking "any lanes running" before evaluating expressions. Added some more tests to try to cover this case.	2012-02-01 08:17:25 -08:00
Matt Pharr	8d1631b714	Constant fold in SelectExpr::Optimize(). Resolves issue #170.	2012-01-31 12:22:11 -08:00
Matt Pharr	dac091552d	Fix errors in tests for scalar target. Issue #167.	2012-01-31 11:57:12 -08:00
Matt Pharr	e19f4931d1	Short-circuit evaluation of && and \|\| operators. We now follow C's approach of evaluating these: we don't evaluate the second expression in the operator if the value of the first one determines the overall result. Thus, these can now be used idiomatically like (index < limit && array[index] > 0) and such. For varying expressions, the mask is set appropriately when evaluating the second expression. (For expressions that can be determined to be both simple and safe to evaluate with the mask all off, we still evaluate both sides and compute the logical op result directly, which saves a number of branches and tests. However, the effect of this should never be visible to the programmer.) Issue #4.	2012-01-30 05:58:41 -08:00
Matt Pharr	0575b1f38d	Update run_tests and examples makefile for scalar target. Fixed a number of tests that didn't handle the programCount == 1 case correctly.	2012-01-29 16:22:25 -08:00
Matt Pharr	664dc3bdda	Add support for "new" and "delete" to the language. Issue #139.	2012-01-27 14:47:06 -08:00
Matt Pharr	1867b5b317	Use native float/half conversion instructions with the AVX2 target.	2012-01-24 15:33:38 -08:00
Matt Pharr	1bba9d4307	Improve atomic_swap_global() to take advantage of associativity. We now do a single atomic hardware swap and then effectively do swaps between the running program instances such that the result is the same as if they had happened to run a particular ordering of hardware swaps themselves. Also cleaned up __atomic_swap_uniform_* built-in implementations to not take the mask, which they weren't using anyway. Finishes Issue #56.	2012-01-20 10:37:33 -08:00
Matt Pharr	748b292e77	Improve code for uniform switches with a 'break' under varying control flow. Previously, when we had a switch statement with a uniform switch condition but a 'break' statement that was under varying control flow inside the switch, we'd promote the switch condition to be varying so that the break would work correctly. Now, we leave the condition as uniform and are thus able to use the more-efficient LLVM switch instruction in this case. Issue #156.	2012-01-19 08:41:19 -07:00
Matt Pharr	0740299860	Fix switch test	2012-01-12 09:45:31 -08:00
Matt Pharr	b67446d998	Add support for "switch" statements. Switches with both uniform and varying "switch" expressions are supported. Switch statements with varying expressions and very large numbers of labels may not perform well; some issues to be filed shortly will track opportunities for improving these.	2012-01-11 09:16:31 -08:00
Matt Pharr	8da9be1a09	Add support for 'k', 'M', and 'G' suffixes to integer constants. (Denoting units of 1024, 10241024, and 10241024*1024, respectively.) Issue #128.	2012-01-06 14:47:47 -08:00
Matt Pharr	11033e108e	Fix bug that prohibited assignments with pointer expressions on the LHS Previously, code like "*(ptr+1) = foo" would claim that the LHS was invalid for an assignment expression. Issue #138.	2012-01-06 14:21:03 -08:00
Matt Pharr	78c6d3c02f	Add initial support for 'goto' statements. ispc now supports goto, but only under uniform control flow--i.e. it must be possible for the compiler to statically determine that all program instances will follow the goto. An error is issued at compile time if a goto is used when this is not the case.	2012-01-05 12:22:36 -08:00
Matt Pharr	5d35349dc9	We were (unintentionally) only using structural equivalence to compare struct types. Now we require that the struct name match for two struct types to be the same. Added a test to check this. (Also removed a stale test, movmsk-opt.ispc)	2012-01-04 11:44:00 -08:00
Matt Pharr	6f26ae9801	Fix bugs with offsetting for varying values with gathers/scatters. Fixes issue #134.	2011-12-12 14:13:46 -08:00
Matt Pharr	198aa9620e	Fix bug with mask used for gather/scatter code generation. We should always use the full mask for this, never the internal mask. Added tests for this.	2011-12-06 15:51:56 -08:00
Matt Pharr	bd70182369	Add some additional tests	2011-12-06 14:26:52 -08:00
Matt Pharr	48a6c2a35b	Fix test for 16-wide case	2011-12-05 11:45:06 -08:00
Matt Pharr	0388f46a3b	Remove test that was failing (now recorded as issue #130 ).	2011-12-05 09:39:50 -08:00
Matt Pharr	186d0223d2	Fix AoS/SoA stdlib functions to match documentation (i.e. actually remove the old offset parameter stuff now that we can actually pass pointers.)	2011-12-03 22:44:16 -08:00
Matt Pharr	d492ba08e6	Fix bugs that broke typedefs in function definitions. Issue #118.	2011-12-03 15:35:44 -08:00
Matt Pharr	a1c0b4f95a	Allow 'continue' statements in 'foreach' loops.	2011-12-03 09:31:02 -08:00
Matt Pharr	8bc7367109	Add foreach and foreach_tiled looping constructs These make it easier to iterate over arbitrary amounts of data elements; specifically, they automatically handle the "ragged extra bits" that come up when the number of elements to be processed isn't evenly divided by programCount. TODO: documentation	2011-11-30 13:17:31 -08:00
Matt Pharr	7a2561c429	Add count_{leading,trailing}_zeros() functions to stdlib. (Documentation is still yet to be written.)	2011-11-30 10:12:16 -08:00
Matt Pharr	1703f2717c	Add some new tests One tricky pointer one currently hits an assertion (fix forthcoming).	2011-11-30 09:43:25 -08:00
Matt Pharr	a3641d7691	Convert arrays to pointers in expressions like (a+5) This was one instance of the C-style array/pointer duality that was missed the first time around.	2011-11-29 17:41:00 -08:00
Matt Pharr	11547cb950	stdlib updates to take advantage of pointers The packed_{load,store}_active now functions take a pointer to a location at which to start loading/storing, rather than an array base and a uniform index. Variants of the prefetch functions that take varying pointers are now available. There are now variants of the various atomic functions that take varying pointers (issue #112).	2011-11-29 15:41:38 -08:00
Matt Pharr	e52104ff55	Pointer fixes/improvements. Allow <, <=, >, >= comparisons of pointers Allow explicit type-casting of pointers to and from integers Fix bug in handling expressions of the form "int + ptr" ("ptr + int" was fine). Fix a bug in TypeCastExpr where varying -> uniform typecasts would be allowed (leading to a crash later)	2011-11-29 13:22:36 -08:00
Matt Pharr	867efc2bce	Multiple small fixes for better C conformance. Allow atomic types to be initialized with single-element expression lists: int x = { 5 }; Issue an error if a storage class is provided with a function parameter. Issue an error if two members of a struct have the same name. Issue an error on trying to assign to a struct with a const member, even if the struct itself isn't const. Issue an error if a function is redefined. Issue an error if a function overload is declared that differs only in return type from a previously-declared function. Issue an error if "inline" or "task" qualifiers are used outside of function declarations. Allow trailing ',' at the end of enumerator lists. Multiple tests for all of the above.	2011-11-27 13:09:59 -08:00
Matt Pharr	975db80ef6	Add support for pointers to the language. Pointers can be either uniform or varying, and behave correspondingly. e.g.: "uniform float * varying" is a varying pointer to uniform float data in memory, and "float * uniform" is a uniform pointer to varying data in memory. Like other types, pointers are varying by default. Pointer-based expressions, & and *, sizeof, ->, pointer arithmetic, and the array/pointer duality all bahave as in C. Array arguments to functions are converted to pointers, also like C. There is a built-in NULL for a null pointer value; conversion from compile-time constant 0 values to NULL still needs to be implemented. Other changes: - Syntax for references has been updated to be C++ style; a useful warning is now issued if the "reference" keyword is used. - It is now illegal to pass a varying lvalue as a reference parameter to a function; references are essentially uniform pointers. This case had previously been handled via special case call by value return code. That path has been removed, now that varying pointers are available to handle this use case (and much more). - Some stdlib routines have been updated to take pointers as arguments where appropriate (e.g. prefetch and the atomics). A number of others still need attention. - All of the examples have been updated - Many new tests TODO: documentation	2011-11-27 13:09:59 -08:00
Matt Pharr	d3e6879223	Improve error checking for unsized arrays. Added support for resolving dimensions of multi-dimensional unsized arrays from their initializer exprerssions (previously, only the first dimension would be resolved.) Added checks to make sure that no unsized array dimensions remain after doing this (except for the first dimensision of array parameters to functions.)	2011-11-21 10:41:23 -08:00
Matt Pharr	7290f7b16b	Generalize/improve parsing of pointer declarations. Substantial improvements and generalizations to the parsing and declaration handling code to properly parse declarations involving pointers. (No change to user-visible functionality, but this lays groundwork for supporting a more general pointer model.)	2011-11-14 08:45:55 -08:00
Matt Pharr	ba9bb3338f	Add tests for function pointers.	2011-11-03 16:14:15 -07:00
Matt Pharr	43a2d510bf	Incorporate per-lane offsets for varying data in the front-end. Previously, it was only in the GatherScatterFlattenOpt optimization pass that we added the per-lane offsets when we were indexing into varying data. (Specifically, the case of float foo[]; int index; foo[index], where foo is an array of varying elements rather than uniform elements.) Now, this is done in the front-end as we're first emitting code. In addition to the basic ugliness of doing this in an optimization pass, it was also error-prone to do it there, since we no longer have access to all of the type information that's around in the front-end. No functionality or performance change.	2011-11-03 13:15:07 -07:00
Matt Pharr	422b8268a9	Add assert() statement support. Issue #106 .	2011-10-15 13:50:05 -07:00
Matt Pharr	9f2aa8d92a	Handle ConstantExpressions when computing address+offset vectors for scatter/gather. In particular, this fixes issue #81, where a global variable access was leading to ConstantExpressions showing up in this code, which it wasn't previously expecting.	2011-10-14 11:20:08 -07:00
Matt Pharr	2460fa5c83	Improve gather/scatter optimization passes to handle loops better. Specifically, now we can work through phi nodes in the IR to detect cases where an index value is actually the same across lanes or is linear across the lanes. For example, this is a loop that used to require gathers but is now turned into vector loads: for (int i = programIndex; i < 16; i += programCount) sum += a[i]; Fixes issue #107.	2011-10-13 17:01:25 -07:00
Matt Pharr	88e317f1a9	These tests now pass with LLVM ToT	2011-10-11 16:17:50 -07:00
Matt Pharr	1198520029	Improve gather->vector load optimization to detect <linear sequence>-<uniform> case. Previously, we didn't handle subtraction ops when deciphering offsets in order to try to change gathers t evictor loads.	2011-10-11 13:24:40 -07:00
Matt Pharr	ecda4561bd	Move some tests that now pass with LLVM 3.0 from failing_tests to tests/	2011-10-10 11:51:47 -07:00
Matt Pharr	3cb0115dce	Add routines to standard library to do efficient AOS/SOA conversions. Currently, we just support 3 and 4-wide variants (i.e. xyzxyz.. and xyzwxyzw..), for int32 and float types.	2011-10-10 10:56:06 -07:00
Matt Pharr	cb7976bbf6	Added updated task launch implementation that now tracks task groups. Within each function that launches tasks, we now can easily track which tasks that function launched, so that the sync at the end of the function can just sync on the tasks launched by that function (not all tasks launched by all functions.) Implementing this led to a rework of the task system API that ispc generates code to call; the example task systems in examples/tasksys.cpp have been updated to conform to this API. (The updated API is also documented in the ispc user's guide.) As part of this, "launch[n]" syntax was added to launch a number of tasks in a single launch statement, rather than requiring a loop over 'n' to launch n tasks. This commit thus fixes issue #84 (enhancement to launch multiple tasks from a single launch statement) as well as issue #105 (recursive task launches were broken).	2011-09-30 11:20:53 -07:00
Matt Pharr	aad269fdf4	Added support for 'uniform' global atomics. Issue #93.	2011-09-28 16:06:07 -07:00
Matt Pharr	6734021520	Issue warning when compile-time constant out-of-bounds array index is used. Issue #98. Also fixes two examples that had bugs of this type that this warning uncovered!	2011-09-13 14:42:20 -07:00

1 2

78 Commits