aaron/ispc - ispc - git.frat.tech

aaron/ispc

Author	SHA1	Message	Date
Matt Pharr	ed13dd066b	Distinguish between 'regular' foreach and foreach_unique in FunctionEmitContext We need to do this since it's illegal to have nested foreach statements, but nested foreach_unique, or foreach_unique inside foreach, etc., are all fine.	2012-06-22 06:04:00 -07:00
Matt Pharr	2b4a3b22bf	Issue an error if the user has nested foreach statements. Partially addresses issue #280. (We should support them properly, but at least now we don't silently generate incorrect code.)	2012-06-21 16:53:27 -07:00
Matt Pharr	007a734595	Add support for 'unmasked' function qualifier.	2012-06-20 15:36:00 -07:00
Matt Pharr	89a2566e01	Add separate variants of memory built-ins for floats and doubles. Previously, we'd bitcast e.g. a vector of floats to a vector of i32s and then use the i32 variant of masked_load/masked_store/gather/scatter. Now, we have separate float/double variants of each of those.	2012-06-07 14:47:16 -07:00
Matt Pharr	1ac3e03171	Gather/scatter function improvements in builtins. More naming consistency: _i32 rather than i32, now. Also improved the m4 macros to generate these sequences to not require as many parameters.	2012-06-07 14:19:23 -07:00
Matt Pharr	b86d40091a	Improve naming of masked load/store instructions in builtins. Now, use _i32 suffixes, rather than _32, etc. Also cleaned up the m4 macro to generate these functions, using WIDTH to get the target width, etc.	2012-06-07 13:58:31 -07:00
Matt Pharr	7b6bd90903	Remove various equality checks between GetInternalMask() and LLVMMaskAllOn These were never kicking in, since GetInternalMask() always loads from the mask storage memory.	2012-06-06 11:08:42 -07:00
Matt Pharr	6118643232	Handle more error cases if the user tries to declare a method.	2012-06-04 09:07:13 -07:00
Matt Pharr	90db01d038	Represent MOVMSK'ed masks with int64s rather than int32s. This allows us to scale up to 64-wide execution.	2012-05-25 11:57:23 -07:00
Matt Pharr	64807dfb3b	Add AssertPos() macro that provides rough source location in error It can sometimes be useful to know the general place we were in the program when an assertion hit; when the position is available / applicable, this macro is now used. Issue #268.	2012-05-25 10:59:45 -07:00
Matt Pharr	2c5a57e386	Fix bugs related to varying pointers to functions that return void.	2012-05-23 14:29:17 -07:00
Matt Pharr	4d1eb94dfd	Fix bug in AddElementOffset() error checking.	2012-05-18 11:57:05 -07:00
Matt Pharr	72c41f104e	Fix various malformed program crashes.	2012-05-18 10:44:45 -07:00
Matt Pharr	944c53bff1	Stop using dynamic_cast for Types. We now have a set of template functions CastType<AtomicType>, etc., that in turn use a new typeId field in each Type instance, allowing them to be inlined and to be quite efficient. This improves front-end performance for a particular large program by 28%.	2012-05-04 13:55:38 -07:00
Matt Pharr	ee1fe3aa9f	Update build to handle existence of LLVM 3.2 dev branch. We now compile with LLVM 3.0, 3.1, and 3.2svn.	2012-05-03 08:25:25 -07:00
Matt Pharr	a1a43cdfe0	Fix bug so that programIndex (et al.) are available in the debugger. It's now possible to successfully print out the value of programIndex, programCount, etc., in the debugger. The issue was that they were defined as having InternalLinkage, which meant that DCE removed them at the end of compilation. Now they're declared to have WeakODRLinkage, which ensures that one copy survives (but there aren't multiply-defined symbols when compiling multiple files.)	2012-04-28 17:12:57 -07:00
Matt Pharr	da690acce5	Fix build with LLVM 3.0	2012-04-25 14:27:33 -10:00
Matt Pharr	0baa2b484d	Fix multiple bugs related to DIBuilder::createFunction() call. The DIType passed to this method should correspond to the FunctionType of the function, not its return type. The first parameter should be the DIScope for the compile unit, not the DIFile. We previously had the unmangled function name and the mangled function name interchanged. The argument corresponding to "first line number of the function" was missing, which in turn led to subsequent arguments being off, and thus providing bogus values vs. what was supposed to be passed. Rename FunctionEmitContext::diFunction to diSubprogram, to better reflect its type.	2012-04-25 08:43:11 -10:00
Matt Pharr	d5cc2ad643	Call Verify() methods of various debugging llvm::DI* types after creation.	2012-04-25 08:43:11 -10:00
Matt Pharr	7167442d6e	Debugging info: include parameter number for function params.	2012-04-25 08:43:11 -10:00
Nipunn Koorapati	040421942f	Goto statements with a bad label produces error message. Now it also produces a short list of suggestions based on string distance.	2012-04-20 14:42:14 -04:00
Matt Pharr	32815e628d	Improve naming of llvm Instructions created. We now try harder to keep the names of instructions related to the initial names of variables they're derived from and so forth. This is useful for making both LLVM IR as well as generated C++ code easier to correlate back to the original ispc source code. Issue #244.	2012-04-19 16:36:46 -07:00
Matt Pharr	99a27fe241	Add support for forward declarations of structures. Now a declaration like 'struct Foo;' can be used to establish the name of a struct type, without providing a definition. One can pass pointers to such types around the system, but can't do much else with them (as in C/C++). Issue #125.	2012-04-16 06:27:21 -07:00
Matt Pharr	fefa86e0cf	Remove LLVM_TYPE_CONST #define / usage. Now with LLVM 3.0 and beyond, types aren't const.	2012-04-15 20:11:27 -07:00
Matt Pharr	098c4910de	Remove support for building with LLVM 2.9. A forthcoming change uses some features of LLVM 3.0's new type system, and it's not worth back-porting this to also all work with LLVM 2.9.	2012-04-15 20:08:51 -07:00
Matt Pharr	7e954e4248	Don't issue gather/scatter warnigns in the 'extra' bits of foreach loops. With AOS data, we can often coalesce the accesses into gathers for the main part of foreach loops but only fail on the last bits where the mask is not all on (since the coalescing code doesn't handle mixed masks, yet.) Before, we'd report success with coalescing and then also report that gathers were needed for the same accesses that were coalesced, which was a) confusing, and b) didn't accurately represent what was going on for the majority of the loop iterations.	2012-03-19 15:08:35 -07:00
Matt Pharr	436c53037e	Fix assertion in FunctionEmitContext::storeUniformToSOA()	2012-03-19 11:29:14 -07:00
Matt Pharr	db5db5aefd	Add native support for (AO)SOA data layout. There's now a SOA variability class (in addition to uniform, varying, and unbound variability); the SOA factor must be a positive power of 2. When applied to a type, the leaf elements of the type (i.e. atomic types, pointer types, and enum types) are widened out into arrays of the given SOA factor. For example, given struct Point { float x, y, z; }; Then "soa<8> Point" has a memory layout of "float x[8], y[8], z[8]". Furthermore, array indexing syntax has been augmented so that when indexing into arrays of SOA-variability data, the two-stage indexing (first into the array of soa<> elements and then into the leaf arrays of SOA data) is performed automatically.	2012-03-05 09:58:10 -08:00
Matt Pharr	3082ea4765	Require Type::Equal() for all type equality comparisons. Previously, we uniqued AtomicTypes, so that they could be compared by pointer equality, but with forthcoming SOA variability changes, this would become too unwieldy (lacking a more general / ubiquitous type uniquing implementation.)	2012-03-05 09:58:09 -08:00
Matt Pharr	f81acbfe80	Implement unbound varibility for struct types. Now, if a struct member has an explicit 'uniform' or 'varying' qualifier, then that member has that variability, regardless of the variability of the struct's variability. Members without 'uniform' or 'varying' have unbound variability, and in turn inherit the variability of the struct. As a result of this, now structs can properly be 'varying' by default, just like all the other types, while still having sensible semantics.	2012-02-21 10:28:31 -08:00
Matt Pharr	c63d139482	Add FunctionEmitContext::MemcpyInst()	2012-02-14 13:43:59 -08:00
Matt Pharr	420d373d89	Move assert so that an error is issued for "break" outside of loops.	2012-02-06 15:35:43 -08:00
Matt Pharr	0432f97555	Fix build with LLVM 3.1 TOT	2012-01-31 14:10:07 -08:00
Matt Pharr	664dc3bdda	Add support for "new" and "delete" to the language. Issue #139.	2012-01-27 14:47:06 -08:00
Matt Pharr	748b292e77	Improve code for uniform switches with a 'break' under varying control flow. Previously, when we had a switch statement with a uniform switch condition but a 'break' statement that was under varying control flow inside the switch, we'd promote the switch condition to be varying so that the break would work correctly. Now, we leave the condition as uniform and are thus able to use the more-efficient LLVM switch instruction in this case. Issue #156.	2012-01-19 08:41:19 -07:00
Matt Pharr	7045b76f84	Improvements to code generation for "foreach" Specialize the code for the innermost loop to not do any masking computations for the innermost dimension for the iterations where we are certainly working on a full vector's worth of data. This fix improves performance/code quality of "foreach" such that it's essentially the same as the equivalent "for" loop. Fixes issue #151.	2012-01-17 11:34:00 -08:00
Matt Pharr	b67446d998	Add support for "switch" statements. Switches with both uniform and varying "switch" expressions are supported. Switch statements with varying expressions and very large numbers of labels may not perform well; some issues to be filed shortly will track opportunities for improving these.	2012-01-11 09:16:31 -08:00
Matt Pharr	78c6d3c02f	Add initial support for 'goto' statements. ispc now supports goto, but only under uniform control flow--i.e. it must be possible for the compiler to statically determine that all program instances will follow the goto. An error is issued at compile time if a goto is used when this is not the case.	2012-01-05 12:22:36 -08:00
Matt Pharr	4151778f5e	Modify SizeOf() and StructOffset() to not compute value based on target for generic targets. Specifically, we want to be able to late-bind on whether the mask is i32s or i1s, so if there's any chance of ambiguity, we emit code that does the "GEP from a NULL base pointer" trick to compute the value later in compilation.	2012-01-04 12:59:03 -08:00
Matt Pharr	848a432640	Fix various small things that were broken with single-bit-per-lane masks. Also small cleanups to declarations, "no captures" added, etc.	2012-01-04 12:59:03 -08:00
Matt Pharr	1d9201fe3d	Add "generic" 4, 8, and 16-wide targets. When used, these targets end up with calls to undefined functions for all of the various special vector stuff ispc needs to compile ispc programs (masked store, gather, min/max, sqrt, etc.). These targets are not yet useful for anything, but are a step toward having an option to C++ code with calls out to intrinsics. Reorganized the directory structure a bit and put the LLVM bitcode used to define target-specific stuff (as well as some generic built-ins stuff) into a builtins/ directory. Note that for building on Windows, it's now necessary to set a LLVM_VERSION environment variable (with values like LLVM_2_9, LLVM_3_0, LLVM_3_1svn, etc.)	2011-12-19 13:46:50 -08:00
Matt Pharr	8d1b77b235	Have assertion macro and FATAL() text ask user to file a bug, provide URL to do so. Switch to Assert() from assert() to make it clear it's not the C stdlib one we're using any more.	2011-12-15 11:11:16 -08:00
Matt Pharr	10ebe88abf	Directly emit code for the mask checks at the start of complex functions. Previously, we used an IfStmt to wrap complex functions with the equivalent of a "cif" to check to see if the mask was all on, all off, or mixed at the start of executing non-trivial functions. This had the unintended side effect of suggesting to other parts of the compiler that the entire function was under varying control flow (which in turn led to some small code quality issues.) Now, we emit the equivalent code directly.	2011-12-15 06:00:41 -08:00
Matt Pharr	9920b30318	Fix bug that led to incorrect code with return statements. The conceptual error was the assumption that not being under varying control flow implied that the mask was all on; this is not the case if some of the instances have executed a return earlier in the function's execution. The error in practice would be that the mask would be assumed to be all-on for things like memory writes, so there would be unintended side-effects for the instances that had returned.	2011-12-15 06:00:31 -08:00
Matt Pharr	6f26ae9801	Fix bugs with offsetting for varying values with gathers/scatters. Fixes issue #134.	2011-12-12 14:13:46 -08:00
Matt Pharr	5b48354d9a	Fix crashes from malformed programs.	2011-12-12 13:47:46 -08:00
Matt Pharr	46bfef3fce	Add option to turn off codegen improvements when mask 'all on' is statically known.	2011-12-11 16:16:36 -08:00
Matt Pharr	f6605ee465	Small cleanup: allocate storage for the full mask in the FunctionEmitContext constructor	2011-12-10 13:33:28 -08:00
Matt Pharr	198aa9620e	Fix bug with mask used for gather/scatter code generation. We should always use the full mask for this, never the internal mask. Added tests for this.	2011-12-06 15:51:56 -08:00
Matt Pharr	f95504fb5e	Symbol table now properly handles scopes for function declarations. Previously, they all went into one big pile that was never cleaned up; this was the wrong thing to do in a world where one might have a function declaration inside another functions, say.	2011-12-04 17:37:13 -08:00

1 2 3 4

186 Commits