aaron/ispc - ispc - git.frat.tech

Author	SHA1	Message	Date
Matt Pharr	5a53a43ed0	Finish support for 64-bit types in stdlib. Fixes issue #14. Add much more support for doubles and int64 types in the standard library, basically supporting everything for them that is supported for floats and int32s. (The notable exceptions are the approximate rcp() and rsqrt() functions, which don't really have sensible analogs for doubles, or at least no built-in instructions.)	2011-07-07 13:25:55 +01:00
Matt Pharr	6d3e44ead7	Add missing 'internal' qualifiers to two atomic function implementations.	2011-07-06 07:20:46 +01:00
Matt Pharr	5bcc611409	Implement global atomics and a memory barrier in the standard library. This checkin provides the standard set of atomic operations and a memory barrier in the ispc standard library. Both signed and unsigned 32- and 64-bit integer types are supported.	2011-07-04 17:20:42 +01:00
Matt Pharr	c14c3ceba6	Provide both signed and unsigned int variants of bitcode-based builtins. When creating function Symbols for functions that were defined in LLVM bitcode for the standard library, if any of the function parameters are integer types, create two ispc-side Symbols: one where the integer types are all signed and the other where they are all unsigned. This allows us to provide, for example, both store_to_int16(reference int a[], uniform int offset, int val) and store_to_int16(reference unsigned int a[], uniform int offset, unsigned int val). Added some additional tests to exercise the new variants of these. Also fixed some cases where the __{load,store}_int{8,16} builtins would read from/write to memory even if the mask was all off (which could cause crashes in some cases).	2011-07-04 12:10:26 +01:00
Matt Pharr	fe7717ab67	Added shuffle() variant to the standard library that takes two varying values and a permutation index that spans the concatenation of the two of them (along the lines of SHUFPS...)	2011-07-02 08:43:35 +01:00
Matt Pharr	2709c354d7	Add support for broadcast(), rotate(), and shuffle() stdlib routines	2011-06-27 17:31:44 -07:00
Matt Pharr	b84167dddd	Fixed a number of issues related to memory alignment; a number of places were expecting vector-width-aligned pointers where, in fact, there is no general guarantee that they would be aligned. Removed the aligned memory allocation routines from some of the examples; they're no longer needed. No perf. difference on Core2/Core i5 CPUs; older CPUs may see some regressions. Still need to update the documentation for this change and finish reviewing alignment issues in Load/Store instructions generated by .cpp files.	2011-06-23 18:18:33 -07:00
Matt Pharr	18af5226ba	Initial commit.	2011-06-21 12:48:50 -07:00

8 Commits