aaron/ispc - ispc - git.frat.tech

Author	SHA1	Message	Date
Matt Pharr	974b40c8af	Add type suffix to comparison ops in C++ output. e.g. "__equal()" -> "__equal_float()", etc. No functional change; this is necessary groundwork for a forthcoming peephole optimization that eliminates ANDs of masks in some cases.	2012-07-07 07:50:59 -07:00
Matt Pharr	45e9e0be0b	Map comparison predicates to strings for C++ output in a stand-alone function.	2012-07-06 16:00:09 -07:00
Matt Pharr	8defbeb248	Handle llvm.objectsize intrinsic in C++ backend. Partially addresses issue #309.	2012-07-06 12:29:23 -07:00
Matt Pharr	96a6e75b71	Fix issues with LLVM 3.0 and 3.1 build in cbackend.cpp. Should fix issue #312.	2012-06-28 12:11:27 -07:00
Matt Pharr	a91e4e7981	Fix missing ;s from `66d4c2ddd9`	2012-06-28 12:04:58 -07:00
Jean-Luc Duprat	66d4c2ddd9	When the --emit-c++ option is used, the state of the --opt=fast-math option is passed into the generated C++ code. If --opt=fast-math is used then the generated code contains: #define ISPC_FAST_MATH 1 Otherwise it contains: #undef ISPC_FAST_MATH This allows the generic headers to support the user's request.	2012-06-28 11:17:11 -07:00
Jean-Luc Duprat	e431b07e04	Changed the C API to use templates to indicate memory alignment to the C compiler. This should help with performance of the generated code. Updated the relevant header files (sse4.h, generic-16.h, generic-32.h, generic-64.h). Updated generic-32.h and generic-64.h to the new memory API.	2012-06-28 09:29:15 -07:00
Jean-Luc Duprat	2a4dff38d0	cbackend.cpp now makes explicit use of the llvm namespace (rather than implicitly with a using declaration). This will allow for some further changes to ISPC's C backend, without collision with ISPC's namespace. This change aims to have no effect on the code generated by the compiler; it should be a big no-op, except for its side effects on maintainability.	2012-06-27 08:30:30 -07:00
Matt Pharr	10fbaec247	Fix C++ output for unordered fp compares. Fixes a bug introduced in `46716aada3`.	2012-06-21 09:57:19 -07:00
Matt Pharr	6f0a2686dc	Use %a format for printf() for float constants on non-Windows platforms.	2012-06-07 13:20:03 -07:00
Matt Pharr	3c869802fb	Always store multiply-used vector compares in temporary variables (C++ output).	2012-06-06 11:08:42 -07:00
Matt Pharr	96aaf6d53b	Fix build with LLVM top of tree.	2012-06-05 12:28:05 -07:00
Matt Pharr	8d3ac3ac1e	Fix build with LLVM ToT	2012-05-18 10:09:09 -07:00
Matt Pharr	f4df2fb176	Improvements to mask update code for generic targets. Rather than XOR'ing with a temporary 'all-on' vector, we call __not. Also, we call out to __and_not1 and __and_not2, for an AND where the first or second operand, respectively, has had NOT applied to it.	2012-05-16 13:52:51 -07:00
Matt Pharr	c6241581a0	Add an extra parameter to __smear functions to encode return type. Now, the __smear* functions in generated C++ code have an unused first parameter of the desired return type; this allows us to have headers that include variants of __smear for multiple target widths. (This approach is necessary since we can't overload by return type in C++.) Issue #256.	2012-05-08 09:54:23 -07:00
Matt Pharr	ee1fe3aa9f	Update build to handle existence of LLVM 3.2 dev branch. We now compile with LLVM 3.0, 3.1, and 3.2svn.	2012-05-03 08:25:25 -07:00
Nipunn Koorapati	db8b08131f	Fixed compile error which shows up on LLVM 3.0	2012-04-20 12:17:09 -04:00
Matt Pharr	32815e628d	Improve naming of llvm Instructions created. We now try harder to keep the names of instructions related to the initial names of variables they're derived from and so forth. This is useful for making both LLVM IR as well as generated C++ code easier to correlate back to the original ispc source code. Issue #244.	2012-04-19 16:36:46 -07:00
Matt Pharr	cb9f50ef63	C++ backend: mangle variable names less. This makes the generated code a little easier to connect with the original program.	2012-04-19 13:11:47 -07:00
Matt Pharr	12c754c92b	Improved handling of splatted constant vectors in C++ backend. Now, when we're printing out a constant vector value, we check to see if it's a splat and call out to one of the __splat_* functions in the generated code if so.	2012-04-19 13:11:15 -07:00
Matt Pharr	abf7c423bb	Fix build with LLVM 3.0	2012-04-18 06:14:55 -07:00
Matt Pharr	a0c9f7823b	C++ backend fixes: handle calls to llvm.trap(), declare functions before globals, and handle memset().	2012-04-17 15:09:42 -07:00
Matt Pharr	098c4910de	Remove support for building with LLVM 2.9. A forthcoming change uses some features of LLVM 3.0's new type system, and it's not worth back-porting this to also all work with LLVM 2.9.	2012-04-15 20:08:51 -07:00
Matt Pharr	95556811fa	Fix linux build	2012-04-05 20:39:39 -07:00
Matt Pharr	1dac05960a	Fix build with LLVM 3.1 ToT	2012-04-05 08:17:56 -07:00
Matt Pharr	05d1b06eeb	Fixes to get the C++ backend more working again.	2012-03-30 16:56:30 -07:00
Matt Pharr	e264d95019	LLVMVectorValuesAllEqual() improvements. Clean up the API, so the caller doesn't have to pass in a vector so the function can track PHI nodes (do that internally instead.) Handle casts in lValuesAreEqual().	2012-03-19 11:54:18 -07:00
Matt Pharr	9ec8e5a275	Fix compile warnings on Linux	2012-03-12 13:12:23 -07:00
Matt Pharr	a473046058	Once again fix for LLVM 3.1 TOT API changes	2012-03-11 15:04:26 -07:00
Matt Pharr	a69b7a5a01	Fix build with LLVM 3.1 TOT	2012-03-10 13:06:53 -08:00
Matt Pharr	f4adbbf90c	Merge a number of cbackend changes from the LLVM dev tree. This fixes a number of failing tests with LLVM 3.1svn when using the generic targets. Issue #175.	2012-02-13 16:52:38 -08:00
Matt Pharr	db72781d2a	Fix C++ backend to not assert with LLVM 3.1 svn builds.	2012-02-10 12:30:31 -08:00
Alex Reece	ea18427d29	Remove UnwindInst. Code no longer builds against head of LLVM branch after revision 149906 removed the unwind instruction.	2012-02-07 15:46:22 -08:00
Matt Pharr	12dc3f5c28	Fixes to C++ backend for new and delete. Don't include declarations of malloc/free in the generated code (get the standard ones from system headers instead). Add a cast to (uint8_t *) before calls to malloc, which C++ requires, since proper malloc returns a void *.	2012-01-27 16:49:09 -08:00
Matt Pharr	2fb59c90cf	Fix C++ backend bug introduced in `d14a2de168`. (This was causing a number of tests to fail with the generic targets.)	2012-01-19 11:35:02 -07:00
Matt Pharr	68f6ea8def	For << and >> with C++, detect when all instances are shifting by the same amount. In this case, we now emit calls to potentially-specialized functions for the left/right shifts that take a single integer value for the shift amount. These in turn can be matched to the corresponding intrinsics for the SSE target. Issue #145.	2012-01-19 10:04:32 -07:00
Matt Pharr	6451c3d99d	Fix bug with code for initializers for static arrays in generated C++ code. (This was preventing aobench from compiling successfully with the generic target.)	2012-01-18 16:55:09 -07:00
Matt Pharr	5134de71c0	Fix Windows build (inttypes.h not available)	2012-01-09 09:05:20 -08:00
Pierre-Antoine Lacaze	54e8e8022b	suppress warnings about long long arguments	2012-01-09 10:18:39 +01:00
Matt Pharr	78c6d3c02f	Add initial support for 'goto' statements. ispc now supports goto, but only under uniform control flow--i.e. it must be possible for the compiler to statically determine that all program instances will follow the goto. An error is issued at compile time if a goto is used when this is not the case.	2012-01-05 12:22:36 -08:00
Matt Pharr	48e9d4af39	Emit code for #includes in emitted C++ code all at the start of the file.	2012-01-05 12:22:35 -08:00
Matt Pharr	8938e14442	Add support for emitting ~generic vectorized C++ code. The compiler now supports an --emit-c++ option, which generates generic vector C++ code. To actually compile this code, the user must provide C++ code that implements a variety of types and operations (e.g. adding two floating-point vector values together, comparing them, etc). There are two examples of this required code in examples/intrinsics: generic-16.h is a "generic" 16-wide implementation that does all required operations with scalar math; it's useful for demonstrating the requirements of the implementation. Then, sse4.h shows a simple implementation of a SSE4 target that maps the emitted function calls to SSE intrinsics. When using these example implementations with the ispc test suite, all but one or two tests pass with gcc and clang on Linux and OSX. There are currently ~10 failures with icc on Linux, and ~50 failures with MSVC 2010. (To be fixed in coming days.) Performance varies: when running the examples through the sse4.h target, some have the same performance as when compiled with --target=sse4 from ispc directly (options), while noise is 12% slower, rt is 26% slower, and aobench is 2.2x slower. The details of this haven't yet been carefully investigated, but will be in coming days as well. Issue #92.	2012-01-04 12:59:03 -08:00

42 Commits
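
Commit c6241581a0 above describes giving the `__smear*` functions an unused first parameter of the desired return type, because C++ cannot overload on return type alone. The following is a minimal sketch of that idea only. The type names (`VecF`, `vec4f`, `vec8f`) and the function name `smear_float` are hypothetical stand-ins chosen for illustration; they are not the declarations from generic-16.h or sse4.h, and the leading double underscore is dropped because such names are reserved for the implementation in user code.

```cpp
// Hypothetical fixed-width float vector; NOT the type from generic-16.h.
template <int N>
struct VecF {
    float v[N];
};

typedef VecF<4> vec4f;  // assumed 4-wide target
typedef VecF<8> vec8f;  // assumed 8-wide target

// Shared helper: broadcast one scalar into every lane.
template <int N>
static inline VecF<N> fill_lanes(float x) {
    VecF<N> r;
    for (int i = 0; i < N; ++i)
        r.v[i] = x;
    return r;
}

// Two overloads that would differ only in return type, which C++ rejects.
// The unused first parameter makes the signatures distinct, so a single
// header can provide variants for several target widths.
static inline vec4f smear_float(vec4f /*type tag, unused*/, float x) {
    return fill_lanes<4>(x);
}

static inline vec8f smear_float(vec8f /*type tag, unused*/, float x) {
    return fill_lanes<8>(x);
}
```

A caller selects the width by passing a value-initialized vector of the wanted type, for example `vec8f b = smear_float(vec8f(), 2.5f);`. The tag argument carries no data; it exists purely so overload resolution can pick the right variant.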