The main issue is that they end up generating a number of smaller
vector ops (e.g. 4-wide and 8-wide on the 16-wide generic target),
which the examples/intrinsics implementations don't currently
support.
This fixes a number of failing tests for now; it may be worth
generalizing the implementations in examples/intrinsics at some point,
since the coalescing optimizations are still desirable as a general
principle (e.g. when generating LLVM IR output).
Issue #175.
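A rough sketch of what generalizing examples/intrinsics might involve:
a 4-wide masked load in the style of the generic target's C++ intrinsics
headers. All names here (__vec4_i32, __vec4_i1, __masked_load_i32_4) are
illustrative assumptions, not the headers' actual API.

    #include <cstdint>

    // Hypothetical 4-wide types in the style of examples/intrinsics/.
    struct __vec4_i32 { int32_t v[4]; };
    struct __vec4_i1  { bool v[4]; };

    static inline __vec4_i32 __masked_load_i32_4(const int32_t *ptr,
                                                 __vec4_i1 mask) {
        __vec4_i32 result = {};
        for (int i = 0; i < 4; ++i)
            if (mask.v[i])              // touch memory only for active lanes
                result.v[i] = ptr[i];
        return result;
    }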
There are two related optimizations that happen now. (These
currently apply only to gathers where the mask is known to be
all on and that access 32-bit elements, but both restrictions
may be relaxed in the future.)
First, for any single gather, we are now more flexible in mapping it
to individual memory operations. Previously, we would only map it
either to a general gather (one scalar load per SIMD lane) or to an
unaligned vector load (if the program instances could be determined
to be accessing a sequential set of locations in memory).
Now, we are able to break gathers into scalar, 2-wide (i.e. 64-bit),
4-wide, or 8-wide loads, and we generate code that shuffles the loaded
values into the requested lanes. Doing fewer, larger loads in this
manner, when possible, can be more efficient.
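As an illustration of this flexibility (a C++ stand-in for the LLVM IR
the compiler actually emits; the index pattern is made up for the
example): an 8-wide gather whose indices are known to cover two
4-element runs can be done as two 4-wide loads plus a lane shuffle,
rather than eight scalar loads.

    #include <cstdint>

    // Suppose the gather's indices are known to be {3,2,1,0, 11,10,9,8}:
    // two reversed 4-element runs.
    void gather8AsTwoVec4Loads(const int32_t *base, int32_t out[8]) {
        int32_t all[8];
        // Two 4-wide (128-bit) loads instead of eight scalar loads.
        for (int i = 0; i < 4; ++i) all[i]     = base[i];      // elements 0..3
        for (int i = 0; i < 4; ++i) all[4 + i] = base[8 + i];  // elements 8..11
        // Shuffle the loaded lanes into the order the gather requested.
        static const int perm[8] = { 3, 2, 1, 0, 7, 6, 5, 4 };
        for (int i = 0; i < 8; ++i)
            out[i] = all[perm[i]];
    }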
Second, we can coalesce memory accesses across multiple gathers. If
we have a series of gathers without any memory writes in the middle,
then we try to analyze their reads collectively and choose an efficient
set of loads for them. Not only does this help if different gathers
reuse values from the same location in memory, but it's specifically
helpful when data with AOS layout is being accessed; in this case,
we're often able to generate wide vector loads and appropriate shuffles
automatically.
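A C++ stand-in for the AOS case (again, the compiler emits LLVM IR;
this just models the generated memory operations): three gathers
reading p[i].x, p[i].y, and p[i].z for i = 0..3 touch twelve
consecutive floats, so three 4-wide loads plus shuffles replace twelve
scalar loads.

    struct Point { float x, y, z; };

    void loadXYZCoalesced(const Point *p, float x[4], float y[4], float z[4]) {
        const float *f = &p[0].x;           // twelve consecutive floats
        float v0[4], v1[4], v2[4];
        for (int i = 0; i < 4; ++i) v0[i] = f[i];       // x0 y0 z0 x1
        for (int i = 0; i < 4; ++i) v1[i] = f[4 + i];   // y1 z1 x2 y2
        for (int i = 0; i < 4; ++i) v2[i] = f[8 + i];   // z2 x3 y3 z3
        // Shuffle the loaded lanes back into SOA order.
        x[0] = v0[0]; x[1] = v0[3]; x[2] = v1[2]; x[3] = v2[1];
        y[0] = v0[1]; y[1] = v1[0]; y[2] = v1[3]; y[3] = v2[2];
        z[0] = v0[2]; z[1] = v1[1]; z[2] = v2[0]; z[3] = v2[3];
    }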
Specifically, if both of the expressions are compile-time constants
and the condition is a varying compile-time constant (even if not
all true or all false), then we can assemble a compile-time constant
result.
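A rough model of the folding (the real code works on the compiler's
constant-expression representation, not on arrays; the function name is
made up):

    // Each output lane picks the corresponding lane of one of the two
    // constant operands, so the result is itself fully constant.
    static void foldSelect(int n, const bool *cond, const int *a,
                           const int *b, int *out) {
        for (int i = 0; i < n; ++i)
            out[i] = cond[i] ? a[i] : b[i];
    }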
Add a number of additional error cases in the grammar.
Enable bison's extended error reporting, to get better messages about the
context of errors and the expected (but not found) tokens at errors.
Improve the printing of these by providing an implementation of yytnamerr
that rewrites things like "TOKEN_MUL_ASSIGN" to "*=" in error messages.
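A minimal sketch of such a yytnamerr(), assuming bison's usual contract
(return the needed length when yyres is NULL, otherwise copy the name
and return its length); the remap table is abbreviated, and entries
beyond the first are illustrative:

    #include <cstring>

    static size_t yytnamerr(char *yyres, const char *yystr) {
        static const struct { const char *token, *pretty; } remap[] = {
            { "TOKEN_MUL_ASSIGN", "'*='" },
            { "TOKEN_ADD_ASSIGN", "'+='" },   // illustrative entry
        };
        const char *name = yystr;
        for (const auto &r : remap)
            if (!strcmp(yystr, r.token)) { name = r.pretty; break; }
        if (yyres != NULL)
            strcpy(yyres, name);
        return strlen(name);   // length needed / length written
    }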
Print the source location (using Error()) when yyerror() is called; wiring
this up seems to require no longer building a 'pure parser' but having
yylloc as a global, which in turn meant updating all of the uses of
it (which previously accessed it as a pointer).
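A sketch of the wiring, with stand-ins for ispc's SourcePos and Error(),
whose exact shapes are assumed here:

    #include <cstdarg>
    #include <cstdio>

    // Stand-in for ispc's SourcePos; the real fields are assumed.
    struct SourcePos { const char *file; int line, column; };

    // Stand-in for ispc's Error(); assumed to take a position plus
    // printf-style arguments.
    static void Error(SourcePos pos, const char *fmt, ...) {
        fprintf(stderr, "%s:%d:%d: Error: ", pos.file, pos.line, pos.column);
        va_list args;
        va_start(args, fmt);
        vfprintf(stderr, fmt, args);
        va_end(args);
        fputc('\n', stderr);
    }

    SourcePos yylloc;   // a global now that the parser is no longer "pure"

    void yyerror(const char *msg) {
        Error(yylloc, "%s", msg);   // attach the current source location
    }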
Updated a number of tests_errors for resulting changesin error text.
When the --fuzz-test command-line option is given, the input program
will be randomly perturbed by the lexer in an effort to trigger
assertions or crashes in the compiler (neither of which should ever
happen, even for malformed programs).
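A minimal sketch of what lexer-level perturbation can look like; the
flag variable, token table, and helper below are hypothetical, not the
compiler's actual names:

    #include <cstdlib>

    static bool gFuzzTestEnabled = false;   // set when --fuzz-test is given
    static const int kTokenTable[] = { ';', '+', '(' /* ..., every token */ };
    static const int kNumTokens = sizeof(kTokenTable) / sizeof(kTokenTable[0]);

    static int maybePerturbToken(int token) {
        if (gFuzzTestEnabled && (rand() % 100) == 0)   // mutate ~1% of tokens
            return kTokenTable[rand() % kNumTokens];   // hand back a random one
        return token;
    }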
Don't let the preprocessor remove comments anymore, so that the rules
in lex.ll can handle them. Fix lCComment() to update the source
position as it eats characters in comments.
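A sketch of the position-tracking fix, with getNextChar() standing in
for however the scanner reads its input, and a simplified line/column
pair standing in for the real source position:

    #include <cstdio>

    struct SourcePos { int line, column; };

    extern int getNextChar();   // hypothetical input helper

    static void lCComment(SourcePos *pos) {
        int c, prev = 0;
        while ((c = getNextChar()) != EOF) {
            if (c == '\n') {        // keep the reported position accurate
                ++pos->line;
                pos->column = 0;
            } else
                ++pos->column;
            if (prev == '*' && c == '/')
                return;             // found the end of the comment
            prev = c;
        }
        // hit EOF inside the comment; the caller reports the error
    }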
? : now short-circuits evaluation of the expressions following
the boolean test for varying test types. (It already did this
for uniform tests).
Issue #169.
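A C++ model of the per-lane rule (names illustrative): each expression
is evaluated only for the lanes that select it, so an arm that no lane
takes is never evaluated.

    static void selectShortCircuit(int n, const bool *test,
                                   int (*expr1)(int), int (*expr2)(int),
                                   int *out) {
        for (int i = 0; i < n; ++i)
            if (test[i])
                out[i] = expr1(i);   // runs only for lanes where test is true
        for (int i = 0; i < n; ++i)
            if (!test[i])
                out[i] = expr2(i);   // skipped entirely if all lanes are true
    }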
This gets the deferred example closer to working with the scalar target,
but there are still some issues (at least partly in gamma correction /
final clamping, it seems).
This fix causes a ~0.5% performance degradation with e.g. the AVX target,
though it's not clear that avoiding such a small loss would be worth
a separate code path.
(Partially addresses issue #167)