aaron/ispc - ispc - git.frat.tech

aaron/ispc

Author	SHA1	Message	Date
Matt Pharr	f52d227d80	Remove extra newline in error message	2012-07-06 11:31:29 -07:00
Matt Pharr	78cb45fb25	Improve error message with ambiguous function overloads. Issue #316.	2012-07-06 11:25:57 -07:00
Matt Pharr	2d8026625b	Always check the execution mask after break/continue/return. When "break", "continue", or "return" is used under varying control flow, we now always check the execution mask to see if all of the program instances are executing it. (Previously, this was only done with "cbreak", "ccontinue", and "creturn", which are now deprecated.) An important effect of this change is that it fixes a family of cases where we could end up running with an "all off" execution mask, which isn't supposed to happen, as it leads to all sorts of invalid behavior. This change does cause the volume rendering example to run 9% slower, but doesn't affect the other examples. Issue #257.	2012-07-06 11:09:11 -07:00
Matt Pharr	73afab464f	Provide mask at block entry for switch statements. This fixes a crash if 'cbreak' was used in a 'switch'. Renamed FunctionEmitContext::SetLoopMask() to SetBlockEntryMask(), and similarly the loopMask member variable.	2012-07-06 11:08:05 -07:00
Matt Pharr	8aa139b6be	For C++ output, store constant vector values in local arrays. When we have a constant vector of primitive types, we now generate a definition of a static const array of the individual values. This in turn allows us to emit a simple aligned vector load to get the constant vector value, rather than inefficiently inserting the values into a vector. Issue #318.	2012-07-06 08:57:09 -07:00
Matt Pharr	e5fe0eabdc	Update __load() builtins to take const pointers.	2012-07-06 08:47:47 -07:00
Matt Pharr	0d3993fa25	More varied support for constant vectors from C++ backend. If we have a vector of all zeros, a __setzero_* function call is emitted, permitting calling specialized intrinsics for this. Undefined values are reflected with an __undef_* call, which similarly allows passing that information along. This change also includes a cleanup to the signature of the __smear_* functions; since they already have different names depending on the scalar value type, we don't need to use the trick of passing an undefined value of the return vector type as the first parameter as an indirect way to overload by return value. Issue #317.	2012-07-05 20:19:11 -07:00
Jean-Luc Duprat	ac421f68e2	Ongoing support for int64 for KNC: Fixes to __load and __store. Added __add, __mul, __equal, __not_equal, __extract_elements, __smear_i64, __cast_sext, __cast_zext, and __scatter_base_offsets32_float. __rcp_varying_float now has a fast-math and full-precision implementation.	2012-07-05 17:05:42 -07:00
Jean-Luc Duprat	b9d1f0db18	Ongoing support for int64 for KNC: Fixes to __load and __store. Added __add, __mul, __equal, __not_equal, __extract_elements, __smear_i64, __cast_sext, __cast_zext, and __scatter_base_offsets32_float. __rcp_varying_float now has a fast-math and full-precision implementation.	2012-07-05 16:56:13 -07:00
Matt Pharr	6aad4c7a39	Bump version number to 1.3.1dev	2012-07-05 13:35:34 -07:00
Matt Pharr	4186ef204d	Fix build with LLVM top of tree.	2012-07-05 13:35:01 -07:00
Matt Pharr	ae7a094ee0	Merge pull request #315 from NicolasT/master Fix build on Fedora 17	2012-07-04 08:21:03 -07:00
Nicolas Trangez	3a007f939a	Build: Include unistd.h where required Some modules require an include of unistd.h (e.g. for getcwd and isatty definitions). These changes were required to build successfully on a Fedora 17 system, using GCC 4.7.0 & glibc-headers 2.15.	2012-07-04 14:49:00 +02:00
Matt Pharr	b8503b9255	News and doxygen version number bump for 1.3.0 v1.3.0	2012-06-29 08:38:38 -07:00
Matt Pharr	b7bc76d3cc	Documentation updates for 1.3.0.	2012-06-29 08:35:29 -07:00
Matt Pharr	27d6c12972	Bump ISPC_MINOR_VERSION to 3	2012-06-28 16:15:46 -07:00
Matt Pharr	b69d783e09	Bump version to 1.3.0	2012-06-28 15:35:52 -07:00
Matt Pharr	3b2ff6301c	Use fputs() rather than puts() for printing final result from print(). puts() sillily adds an undesired newline.	2012-06-28 12:29:40 -07:00
Matt Pharr	6c7043916e	Silence bogus compiler warning	2012-06-28 12:11:56 -07:00
Matt Pharr	96a6e75b71	Fix issues with LLVM 3.0 and 3.1 build in cbackend.cpp Should fix issue #312.	2012-06-28 12:11:27 -07:00
Matt Pharr	a91e4e7981	Fix missing ;s from `66d4c2ddd9`	2012-06-28 12:04:58 -07:00
Jean-Luc Duprat	95d8f76ec3	Added prelimary support for Intel's Xeon Phi KNC processor. float, int32 and double support is included; int8, int16 and int64 not supported yet. This is work in progress and not considered stable yet.	2012-06-28 12:00:55 -07:00
Jean-Luc Duprat	66d4c2ddd9	When the --emit-c++ option is used, the state of the --opt=fast-math option is passed into the generated C++ code. If --opt=fast-math is used then the generated code contains: #define ISPC_FAST_MATH 1 Otherwise it contains: #undef ISPC_FAST_MATH This allows the generic headers to support the user's request.	2012-06-28 11:17:11 -07:00
Jean-Luc Duprat	8115ca739a	Added prelimary support for Intel's Xeon Phi KNC processor. float, int32 and double support is included; int8, int16 and int64 not supported yet. This is work in progress and not considered stable yet.	2012-06-28 10:54:09 -07:00
Jean-Luc Duprat	ec4021bbf4	When the --emit-c++ option is used, the state of the --opt=fast-math option is passed into the generated C++ code. If --opt=fast-math is used then the generated code contains: #define ISPC_FAST_MATH 1 Otherwise it contains: #undef ISPC_FAST_MATH This allows the generic headers to support the user's request.	2012-06-28 10:42:29 -07:00
Jean-Luc Duprat	e431b07e04	Changed the C API to use templates to indicate memory alignment to the C compiler This should help with performance of the generated code. Updated the relevant header files (sse4.h, generic-16.h, generic-32.h, generic-64.h) Updated generic-32.h and generic-64.h to the new memory API	2012-06-28 09:29:15 -07:00
Matt Pharr	d34a87404d	Provide (undocumented for now) __pause() call to emit PAUSE inst.	2012-06-28 09:28:25 -07:00
Matt Pharr	f38770bf2a	Fix build with LLVM ToT	2012-06-28 07:36:10 -07:00
Jean-Luc Duprat	dc9998ccaf	Missed a few minor fixes to generic-64.h in previous commit	2012-06-27 17:14:03 -07:00
Jean-Luc Duprat	f1b3703389	Changed the C API to use templates to indicate memory alignment to the C compiler This should help with performance of the generated code. Updated the relevant header files (sse4.h, generic-16.h, generic-32.h, generic-64.h) Updated generic-32.h and generic-64.h to the new memory API	2012-06-27 16:59:26 -07:00
Jean-Luc Duprat	b6a8d0ee7f	Merge branch 'master' of git://github.com/ispc/ispc	2012-06-27 10:15:24 -07:00
Jean-Luc Duprat	2a4dff38d0	cbackend.cpp now makes explicit use of the llvm namespace (Rather than implicitly with a using declaration.) This will allow for some further changes to ISPC's C backend, without collision with ISPC's namespace. This change aims to have no effect on the code generated by the compiler, it should be a big no-op; except for its side-effects on maintainability.	2012-06-27 08:30:30 -07:00
Jean-Luc Duprat	665c564dcf	cbackend.cpp now makes explicit use of the llvm namespace, rather than implicitly with a using declaration. This will allow for some further changes to ISPC's C backend, without collision with ISPC's namespace. This change aims to have no effect on the code generated by the compiler, it should be a big no-op; except for its side-effects on maintainability.	2012-06-26 22:15:31 -07:00
Jean-Luc Duprat	ed71413e04	Merge branch 'master' of git://github.com/ispc/ispc	2012-06-26 14:32:27 -07:00
Jean-Luc Duprat	4b5e49b00b	Merge branch 'master' of github.com:jduprat/ispc	2012-06-26 14:32:01 -07:00
Matt Pharr	f558ee788e	Fix bug with generating implicit zero initializer values. Issue #300.	2012-06-26 11:58:16 -07:00
Matt Pharr	ceb8ca680c	Fix crash in codegen for assert() with malformed program. Issue #302.	2012-06-26 11:54:55 -07:00
Matt Pharr	79ebcbec4b	Fix crash in SwitchStmt::TypeCheck() with malformed programs.	2012-06-26 11:21:33 -07:00
Matt Pharr	2c7b650240	Add FAQ to explain how to launch per-instance tasks with foreach_active and unmasked. Issue #227.	2012-06-22 14:32:05 -07:00
Matt Pharr	54459255d4	Add unmasked { } statement. This reestablishes an "all on" execution mask for the gang, which can be useful for nested parallelism..	2012-06-22 14:30:58 -07:00
Matt Pharr	b4a078e2f6	Add foreach_active iteration statement. Issue #298.	2012-06-22 10:35:43 -07:00
Matt Pharr	ed13dd066b	Distinguish between 'regular' foreach and foreach_unique in FunctionEmitContext We need to do this since it's illegal to have nested foreach statements, but nested foreach_unique, or foreach_unique inside foreach, etc., are all fine.	2012-06-22 06:04:00 -07:00
Matt Pharr	2b4a3b22bf	Issue an error if the user has nested foreach statements. Partially addresses issue #280. (We should support them properly, but at least now we don't silently generate incorrect code.)	2012-06-21 16:53:27 -07:00
Matt Pharr	8b891da628	Allow referring to the struct type being defined in its members. It's now legal to write: struct Foo { Foo *next; }; previously, a predeclaration "struct Foo;" was required. This fixes issue #287. This change also fixes a bug where multiple forward declarations "struct Foo; struct Foo;" would incorrectly issue an error on the second one.	2012-06-21 16:44:04 -07:00
Matt Pharr	5a2c8342eb	Allow structs with no members. Issue #289.	2012-06-21 16:07:31 -07:00
Matt Pharr	50eb4bf53a	Change print() implementation to accumulate string locally before printing. The string to be printed is accumulated into a local buffer before being sent to puts(). This ensure that if multiple threads are running and printing at the same time, their output won't be interleaved (across individual print statements-- it still may be interleaved across different print statements, just like in C). Issue #293.	2012-06-21 14:41:53 -07:00
Matt Pharr	3c10ddd46a	Fix declaration of size_t. It should be an unsigned integer type.	2012-06-21 14:40:24 -07:00
Matt Pharr	0b7f9acc70	Align <16 x i1> vectors to just 16 bits for generic targets. Partially addresses issue #259.	2012-06-21 10:25:33 -07:00
Matt Pharr	10fbaec247	Fix C++ output for unordered fp compares. Fixes a bug introduced in `46716aada3`.	2012-06-21 09:57:19 -07:00
Matt Pharr	007a734595	Add support for 'unmasked' function qualifier.	2012-06-20 15:36:00 -07:00

... 16 17 18 19 20 ...

1818 Commits