aaron/ispc - ispc - git.frat.tech

aaron/ispc

Author	SHA1	Message	Date
Matt Pharr	080241b7d1	Fix bugs with handling types of integer constants. We now follow the rule that the type of an integer constant is the first of int32, uint32, int64, uint64, that can hold the value. (Unless 'u' or 'l' suffixes have been provided.) Fixes issue #299.	2012-07-08 08:43:03 -07:00
Matt Pharr	0d534720bb	Fix bug with constant folding of select expressions. We would sometimes pass an int32_t * to the ConstExpr constructor but claim the underlying type was uint32, which made it grumpy.	2012-07-08 08:36:51 -07:00
Matt Pharr	1dc4424a30	Only override module datalayout for generic targets. Doing it for all targets was causing a number of tests to fail. (Actual root cause not determined.)	2012-07-07 15:12:50 -07:00
Matt Pharr	57f0cf30c0	Fix small typos in documentation.	2012-07-07 11:19:57 -07:00
Matt Pharr	8ef6bc1636	Add peephole optimization to eliminate some mask AND operations. On KNC, the various vector comparison instructions can optionally be masked; if a mask is provided, the result is effectively that the value returned is the AND of the mask with the result of the comparison. This change adds an optimization pass to the C++ backend that looks for vector ANDs where one operand is a comparison and rewrites them--e.g. "__and(__equal_float(a, b), c)" is changed to "__equal_float_and_mask(a, b, c)", saving an instruction in the end. Issue #319.	2012-07-07 08:35:38 -07:00
Matt Pharr	974b40c8af	Add type suffix to comparison ops in C++ output. e.g. "__equal()" -> "__equal_float()", etc. No functional change; this is necessary groundwork for a forthcoming peephole optimization that eliminates ANDs of masks in some cases.	2012-07-07 07:50:59 -07:00
Matt Pharr	45e9e0be0b	Map comparison predicates to strings for C++ output in a stand-alone function.	2012-07-06 16:00:09 -07:00
Matt Pharr	ec0918045d	Issue error if compiling for multiple targets and program is coming from stdin. We currently don't support this, so at least now we issue an intelligible error message in this case. Issue #269.	2012-07-06 13:21:53 -07:00
Matt Pharr	38bcecd2f3	Print a useful error if llvm-config isn't found when building. Previously, there was a ton of unintelligible error spew. Issue #273.	2012-07-06 13:18:11 -07:00
Matt Pharr	aabbdba068	Switch a few remaining fprintf() calls to use Warning()/Error().	2012-07-06 12:56:45 -07:00
Matt Pharr	84c183da1f	Issue error if a non "generic" target is used with C++ emission. Issue #314.	2012-07-06 12:56:24 -07:00
Matt Pharr	b363b98211	Improve handling of datalayout for generic targets. Flag 32-bit vector types as only requiring 32-bit alignment (preemptive bug fix for 32xi1 vectors). Force module datalayouts to be the same before linking them to silence an LLVM warning. Finishes issue #309.	2012-07-06 12:51:17 -07:00
Matt Pharr	8defbeb248	Handle llvm.objectsize intrinsic in C++ backend. Partially addresses issue #309.	2012-07-06 12:29:23 -07:00
Matt Pharr	f52d227d80	Remove extra newline in error message	2012-07-06 11:31:29 -07:00
Matt Pharr	78cb45fb25	Improve error message with ambiguous function overloads. Issue #316.	2012-07-06 11:25:57 -07:00
Matt Pharr	2d8026625b	Always check the execution mask after break/continue/return. When "break", "continue", or "return" is used under varying control flow, we now always check the execution mask to see if all of the program instances are executing it. (Previously, this was only done with "cbreak", "ccontinue", and "creturn", which are now deprecated.) An important effect of this change is that it fixes a family of cases where we could end up running with an "all off" execution mask, which isn't supposed to happen, as it leads to all sorts of invalid behavior. This change does cause the volume rendering example to run 9% slower, but doesn't affect the other examples. Issue #257.	2012-07-06 11:09:11 -07:00
Matt Pharr	73afab464f	Provide mask at block entry for switch statements. This fixes a crash if 'cbreak' was used in a 'switch'. Renamed FunctionEmitContext::SetLoopMask() to SetBlockEntryMask(), and similarly the loopMask member variable.	2012-07-06 11:08:05 -07:00
Matt Pharr	8aa139b6be	For C++ output, store constant vector values in local arrays. When we have a constant vector of primitive types, we now generate a definition of a static const array of the individual values. This in turn allows us to emit a simple aligned vector load to get the constant vector value, rather than inefficiently inserting the values into a vector. Issue #318.	2012-07-06 08:57:09 -07:00
Matt Pharr	e5fe0eabdc	Update __load() builtins to take const pointers.	2012-07-06 08:47:47 -07:00
Matt Pharr	0d3993fa25	More varied support for constant vectors from C++ backend. If we have a vector of all zeros, a __setzero_* function call is emitted, permitting calling specialized intrinsics for this. Undefined values are reflected with an __undef_* call, which similarly allows passing that information along. This change also includes a cleanup to the signature of the __smear_* functions; since they already have different names depending on the scalar value type, we don't need to use the trick of passing an undefined value of the return vector type as the first parameter as an indirect way to overload by return value. Issue #317.	2012-07-05 20:19:11 -07:00
Jean-Luc Duprat	ac421f68e2	Ongoing support for int64 for KNC: Fixes to __load and __store. Added __add, __mul, __equal, __not_equal, __extract_elements, __smear_i64, __cast_sext, __cast_zext, and __scatter_base_offsets32_float. __rcp_varying_float now has a fast-math and full-precision implementation.	2012-07-05 17:05:42 -07:00
Jean-Luc Duprat	b9d1f0db18	Ongoing support for int64 for KNC: Fixes to __load and __store. Added __add, __mul, __equal, __not_equal, __extract_elements, __smear_i64, __cast_sext, __cast_zext, and __scatter_base_offsets32_float. __rcp_varying_float now has a fast-math and full-precision implementation.	2012-07-05 16:56:13 -07:00
Matt Pharr	6aad4c7a39	Bump version number to 1.3.1dev	2012-07-05 13:35:34 -07:00
Matt Pharr	4186ef204d	Fix build with LLVM top of tree.	2012-07-05 13:35:01 -07:00
Matt Pharr	ae7a094ee0	Merge pull request #315 from NicolasT/master Fix build on Fedora 17	2012-07-04 08:21:03 -07:00
Nicolas Trangez	3a007f939a	Build: Include unistd.h where required Some modules require an include of unistd.h (e.g. for getcwd and isatty definitions). These changes were required to build successfully on a Fedora 17 system, using GCC 4.7.0 & glibc-headers 2.15.	2012-07-04 14:49:00 +02:00
Matt Pharr	b8503b9255	News and doxygen version number bump for 1.3.0 v1.3.0	2012-06-29 08:38:38 -07:00
Matt Pharr	b7bc76d3cc	Documentation updates for 1.3.0.	2012-06-29 08:35:29 -07:00
Matt Pharr	27d6c12972	Bump ISPC_MINOR_VERSION to 3	2012-06-28 16:15:46 -07:00
Matt Pharr	b69d783e09	Bump version to 1.3.0	2012-06-28 15:35:52 -07:00
Matt Pharr	3b2ff6301c	Use fputs() rather than puts() for printing final result from print(). puts() sillily adds an undesired newline.	2012-06-28 12:29:40 -07:00
Matt Pharr	6c7043916e	Silence bogus compiler warning	2012-06-28 12:11:56 -07:00
Matt Pharr	96a6e75b71	Fix issues with LLVM 3.0 and 3.1 build in cbackend.cpp Should fix issue #312.	2012-06-28 12:11:27 -07:00
Matt Pharr	a91e4e7981	Fix missing ;s from `66d4c2ddd9`	2012-06-28 12:04:58 -07:00
Jean-Luc Duprat	95d8f76ec3	Added prelimary support for Intel's Xeon Phi KNC processor. float, int32 and double support is included; int8, int16 and int64 not supported yet. This is work in progress and not considered stable yet.	2012-06-28 12:00:55 -07:00
Jean-Luc Duprat	66d4c2ddd9	When the --emit-c++ option is used, the state of the --opt=fast-math option is passed into the generated C++ code. If --opt=fast-math is used then the generated code contains: #define ISPC_FAST_MATH 1 Otherwise it contains: #undef ISPC_FAST_MATH This allows the generic headers to support the user's request.	2012-06-28 11:17:11 -07:00
Jean-Luc Duprat	8115ca739a	Added prelimary support for Intel's Xeon Phi KNC processor. float, int32 and double support is included; int8, int16 and int64 not supported yet. This is work in progress and not considered stable yet.	2012-06-28 10:54:09 -07:00
Jean-Luc Duprat	ec4021bbf4	When the --emit-c++ option is used, the state of the --opt=fast-math option is passed into the generated C++ code. If --opt=fast-math is used then the generated code contains: #define ISPC_FAST_MATH 1 Otherwise it contains: #undef ISPC_FAST_MATH This allows the generic headers to support the user's request.	2012-06-28 10:42:29 -07:00
Jean-Luc Duprat	e431b07e04	Changed the C API to use templates to indicate memory alignment to the C compiler This should help with performance of the generated code. Updated the relevant header files (sse4.h, generic-16.h, generic-32.h, generic-64.h) Updated generic-32.h and generic-64.h to the new memory API	2012-06-28 09:29:15 -07:00
Matt Pharr	d34a87404d	Provide (undocumented for now) __pause() call to emit PAUSE inst.	2012-06-28 09:28:25 -07:00
Matt Pharr	f38770bf2a	Fix build with LLVM ToT	2012-06-28 07:36:10 -07:00
Jean-Luc Duprat	dc9998ccaf	Missed a few minor fixes to generic-64.h in previous commit	2012-06-27 17:14:03 -07:00
Jean-Luc Duprat	f1b3703389	Changed the C API to use templates to indicate memory alignment to the C compiler This should help with performance of the generated code. Updated the relevant header files (sse4.h, generic-16.h, generic-32.h, generic-64.h) Updated generic-32.h and generic-64.h to the new memory API	2012-06-27 16:59:26 -07:00
Jean-Luc Duprat	b6a8d0ee7f	Merge branch 'master' of git://github.com/ispc/ispc	2012-06-27 10:15:24 -07:00
Jean-Luc Duprat	2a4dff38d0	cbackend.cpp now makes explicit use of the llvm namespace (Rather than implicitly with a using declaration.) This will allow for some further changes to ISPC's C backend, without collision with ISPC's namespace. This change aims to have no effect on the code generated by the compiler, it should be a big no-op; except for its side-effects on maintainability.	2012-06-27 08:30:30 -07:00
Jean-Luc Duprat	665c564dcf	cbackend.cpp now makes explicit use of the llvm namespace, rather than implicitly with a using declaration. This will allow for some further changes to ISPC's C backend, without collision with ISPC's namespace. This change aims to have no effect on the code generated by the compiler, it should be a big no-op; except for its side-effects on maintainability.	2012-06-26 22:15:31 -07:00
Jean-Luc Duprat	ed71413e04	Merge branch 'master' of git://github.com/ispc/ispc	2012-06-26 14:32:27 -07:00
Jean-Luc Duprat	4b5e49b00b	Merge branch 'master' of github.com:jduprat/ispc	2012-06-26 14:32:01 -07:00
Matt Pharr	f558ee788e	Fix bug with generating implicit zero initializer values. Issue #300.	2012-06-26 11:58:16 -07:00
Matt Pharr	ceb8ca680c	Fix crash in codegen for assert() with malformed program. Issue #302.	2012-06-26 11:54:55 -07:00

... 2 3 4 5 6 ...

1131 Commits