When replacing an 'all on' masked store with a regular store, set the
alignment to the vector element alignment, not the alignment of a whole
vector (i.e., 4- or 8-byte alignment, not 32 or 64).
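As a rough illustration (a sketch against the modern LLVM C++ API, not
ispc's actual code; the function name here is hypothetical):

    // Sketch only: lower a masked store whose mask is known all-on to a
    // plain store.  The pointer is only guaranteed to be aligned to the
    // element size, so the store gets element alignment (4 bytes for
    // float, 8 for double), not 32/64-byte whole-vector alignment.
    #include "llvm/IR/IRBuilder.h"
    #include "llvm/Support/Alignment.h"

    static llvm::StoreInst *lowerAllOnMaskedStore(llvm::IRBuilder<> &builder,
                                                  llvm::Value *value,
                                                  llvm::Value *ptr) {
        llvm::Type *eltTy =
            llvm::cast<llvm::VectorType>(value->getType())->getElementType();
        unsigned eltBytes = eltTy->getScalarSizeInBits() / 8;
        return builder.CreateAlignedStore(value, ptr, llvm::Align(eltBytes));
    }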
Emit calls to masked_store, not masked_store_blend, when handling
masked stores emitted by the frontend.
Fix a bug in the binary8to16 macro in builtins.m4.
Fix a bug in the 16-wide version of __reduce_add_float.
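For reference, the shape of the 16-wide reduction (a scalar sketch of
what the generated code computes, not the actual builtin, which is LLVM
IR produced from builtins.m4):

    // Sketch: the 16-wide reduction adds the high 8-wide half into the
    // low half, then reduces the remaining 8 lanes; the typical bug
    // class in such code is dropping lanes (e.g., reducing only one
    // half).
    static float reduceAddFloat16(const float v[16]) {
        float half[8];
        for (int i = 0; i < 8; ++i)
            half[i] = v[i] + v[i + 8];   // fold high half into low half
        float sum = 0.f;
        for (int i = 0; i < 8; ++i)
            sum += half[i];              // reduce the remaining 8 lanes
        return sum;
    }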
Remove the blend-based implementations of masked_store_blend for AVX;
just forward them to the corresponding real masked store functions.
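The shape of the change, as a C-level sketch (the vector/mask typedefs
and function names here are illustrative; the real functions live in
generated LLVM IR, and the typedefs use GCC/Clang vector extensions):

    // Sketch: masked_store_blend no longer has its own
    // load/blend/store implementation; it simply forwards to the real
    // masked store.
    typedef float Vec8f __attribute__((vector_size(32)));
    typedef int   Mask8 __attribute__((vector_size(32)));

    void masked_store_float8(float *ptr, Vec8f value, Mask8 mask);  // assumed

    void masked_store_blend_float8(float *ptr, Vec8f value, Mask8 mask) {
        masked_store_float8(ptr, value, mask);  // just forward
    }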
Add optimization patterns to detect and simplify masked loads and stores
where the mask is all on or all off.
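Roughly (a hypothetical sketch against the LLVM C++ API; maskedStore
stands for the call instruction being examined):

    // Sketch: if the mask is a compile-time constant, the masked store
    // can be deleted (all off) or turned into a plain store (all on,
    // with element alignment as above).
    #include "llvm/IR/Constants.h"
    #include "llvm/IR/Instructions.h"

    static void simplifyMaskedStore(llvm::CallInst *maskedStore,
                                    llvm::Value *mask) {
        if (auto *c = llvm::dyn_cast<llvm::Constant>(mask)) {
            if (c->isNullValue()) {
                // All lanes off: the store writes nothing; delete it.
                maskedStore->eraseFromParent();
            } else if (c->isAllOnesValue()) {
                // All lanes on: replace with a regular element-aligned
                // store, e.g. via lowerAllOnMaskedStore() above.
            }
        }
    }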
Enable AVX for LLVM 3.0 builds (this still generally hits bugs and
unimplemented functionality on the LLVM side, but it's getting there).
Fix places that were expecting vector-width-aligned pointers when, in
point of fact, there's no guarantee that they would have been aligned in
general.
Remove the aligned memory allocation routines from some of the examples;
they're no longer needed.
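For example (illustrative only), where an example previously used an
aligned allocation, ordinary allocation now suffices:

    // Sketch: vector-width alignment is no longer required for data
    // that the generated code loads from or stores to.
    #include <cstddef>

    float *allocateBuffer(std::size_t n) {
        // Before, the examples used aligned allocation, e.g.:
        //     float *buf;
        //     posix_memalign((void **)&buf, 32, n * sizeof(float));
        // With element-aligned loads/stores, this is enough:
        return new float[n];
    }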
No performance difference on Core 2 / Core i5 CPUs; older CPUs may see
some regressions.
Still need to update the documentation for this change and finish reviewing
alignment issues in Load/Store instructions generated by .cpp files.