aaron/ispc - ispc - git.frat.tech

Author	SHA1	Message	Date
Matt Pharr	f868a63064	Add support for scan operations across program instances (add, and, or).	2011-08-13 20:11:41 +01:00
Matt Pharr	c74116aa24	Fix crasher with malformed program	2011-08-12 07:47:17 +01:00
Matt Pharr	8c534d4d74	Add reduce_equal() function to standard library.	2011-08-10 15:55:55 -07:00
Matt Pharr	d821a11c7c	Fix min/max for integer types with AVX.	2011-08-04 06:24:20 -07:00
Matt Pharr	8a138eeb5a	vim syntax highlighting for ispc from <andreas.wendleder@googlemail.com>	2011-08-04 05:49:28 -07:00
Matt Pharr	137ea7bde6	Rename semaphore filename to be more generic	2011-08-04 05:28:00 -07:00
Matt Pharr	e05b3981d9	Add stencil example	2011-08-03 13:49:02 -07:00
Matt Pharr	a5a133ccce	Do more iterations of RNG test to let result converge to bounds.	2011-08-03 13:44:49 -07:00
Matt Pharr	0ac4f7b620	Add various prefetch functions to the standard library.	2011-08-03 13:31:45 -07:00
Matt Pharr	467f1e71d7	Add fast versions of the float<-->half conversion routines in the stdlib. These get slightly wrong results for zero and the denorms and also don't handle the Inf/NaN stuff correctly, but are much more efficient than the full versions of these routines.	2011-08-03 15:58:42 +01:00
Matt Pharr	a2996ed5d9	More efficient implementation of frandom() in stdlib	2011-08-03 14:28:06 +01:00
Matt Pharr	7d7dd2b204	Merge branch 'master' of github.com:ispc/ispc	2011-08-01 12:16:33 +01:00
Matt Pharr	172794ba5f	Release notes and doxygen update for 1.0.5 release v1.0.5	2011-08-01 12:15:42 +01:00
Matt Pharr	9ee6f86c73	Fix Windows build of ispc_test	2011-08-01 04:05:37 -07:00
Matt Pharr	a4bb6b5520	Add new example with implementation of Perlin Noise. ~4.2x speedup versus serial on OSX / gcc. ~2.9x speedup versus serial on Windows / MSVC.	2011-08-01 10:33:18 +01:00
Matt Pharr	a552927a6a	Cleanup implementation of target builtins code. - Renamed stdlib-sse.ll to builtins-sse.ll (etc.) in an attempt to better indicate the fact that the stuff in those files has a role beyond implementing stuff for the standard library. - Moved declarations of the various __pseudo_* functions from being done with LLVM API calls in builtins.cpp to just straight up declarations in LLVM assembly language in builtins.m4. (Much less code to do it this way, and more clear what's going on.)	2011-08-01 05:58:43 +01:00
Matt Pharr	2d52c732f1	Doc updates for recent new swizzle support	2011-07-31 19:03:55 +02:00
Matt Pharr	25676d5643	When --debug is specified, only print the entire module bitcode twice. Fixes issue #77. Previously, it dumped out the entire module every time a new function was defined, which got to be quite a lot of output by the time the stdlib functions were all added!	2011-07-29 07:26:37 +02:00
Matt Pharr	158bd6ef9e	Fix bug with initializer expression lists for global/static array-typed variables.	2011-07-28 11:38:56 +01:00
Matt Pharr	7f662de6e3	Emit debug declaration of variables before the instructions for their initializers.	2011-07-28 11:05:02 +01:00
Matt Pharr	80ca02af58	Add missing #include, fix Linux build. Fixes issue #75.	2011-07-27 10:51:13 +01:00
Matt Pharr	8aea4a836d	Fix crash when trying to generate debug info with program source from stdin	2011-07-27 07:42:47 +01:00
Matt Pharr	922dbdec06	Fixes to build with LLVM top-of-tree	2011-07-26 10:57:49 +01:00
Matt Pharr	e230d2c9ca	Make the target argument work in the run_tests.sh script	2011-07-26 10:57:39 +01:00
Matt Pharr	d0674b1706	When doing << or >> operators, don't convert the return type to the type of the shift amount. Fixes issue #73. Previously, if we had e.g. an int16 type that was being shifted left by 1, then the constant integer 1 would come in as an int32, we'd convert the int16 to an int32, and then we'd do the shift. Now, for shifts, the type of the expression is always the same as the type of the value being shifted.	2011-07-25 23:36:05 +01:00
Matt Pharr	16be1d313e	AVX updates / improvements. Add optimization patterns to detect and simplify masked loads and stores with the mask all on / all off. Enable AVX for LLVM 3.0 builds (still generally hits bugs / unimplemented stuff on the LLVM side, but it's getting there).	2011-07-25 07:41:37 +01:00
Matt Pharr	0932dcd98b	Fix build with llvm top-of-tree	2011-07-23 08:35:45 +01:00
Matt Pharr	43a619669f	Fix memory bug where we were accessing memory that had been freed. (The string for which c_str() was called was just a temporary, so its destructor ran after funcName was initialized, leading funcName to point at freed memory.)	2011-07-22 13:15:50 +01:00
Pete Couperus	59036cdf5b	Add support for multi-element vector swizzles. Issue #17. This commit adds support for swizzles like "foo.zy" (if "foo" is, for example, a float<3> type) as rvalues. (Still need support for swizzles as lvalues.)	2011-07-22 13:10:14 +01:00
Matt Pharr	98a2d69e72	Add code to check signatures of LLVM intrinsic declarations in stdlib*.ll files. Fix a case where we were using the wrong signature for stmxcsr and ldmxcsr.	2011-07-22 12:53:17 +01:00
Matt Pharr	da0fd93315	AVX fixes: add missing 8/16-bit gathers and scatters, set features string appropriately when AVX is enabled.	2011-07-22 12:36:44 +01:00
Matt Pharr	165f90357f	Tiny cleanups, doc update re int8/16 performance	2011-07-21 16:04:16 +01:00
Matt Pharr	8ef3df57c5	Add support for in-memory half float data. Fixes issue #10	2011-07-21 15:55:45 +01:00
Matt Pharr	96d40327d0	Fix issue #72: 64 gathers/scatters led to undefined symbols	2011-07-21 14:44:55 +01:00
Matt Pharr	bba7211654	Add support for int8/int16 types. Addresses issues #9 and #42.	2011-07-21 06:57:40 +01:00
Matt Pharr	2d573acd17	Another LLVM dev tree API change fix	2011-07-18 17:28:49 +01:00
Matt Pharr	654cfb4b4b	Many fixes for recent LLVM dev tree API changes	2011-07-18 15:54:39 +01:00
Matt Pharr	65a29ec316	Only create ispc-callable functions for bitcode functions that start with "__". Don't create ispc-callable symbols for other functions that we find in the LLVM bitcode files that are loaded up and linked into the module so that they can be called from ispc stdlib functions. This fixes an issue where we had a clash between the declared versions of double sin(double) and the corresponding ispc stdlib routines for uniform doubles, which in turn led to bogus code being generated for calls to those ispc stdlib functions. v1.0.4	2011-07-18 13:03:50 +01:00
Matt Pharr	6b0a6c0124	Fix issue #67: don't crash ungracefully if target ISA not supported on system. - In the ispc-generated header files, a #define now indicates which compilation target was used. - The examples use utility routines from the new file examples/cpuid.h to check the system's CPU's capabilities to see if it supports the ISA that was used for compiling the example code and print error messages if things aren't going to work...	2011-07-18 12:29:43 +01:00
Matt Pharr	213c3a9666	Release notes, bump doxygen version # for next release. Add more .gitignore stuff.	2011-07-17 16:52:36 +02:00
Matt Pharr	f0f876c3ec	Add support for enums.	2011-07-17 16:43:05 +02:00
Matt Pharr	17e5c8b7c2	Fix LLVM 2.9 build.	2011-07-13 09:24:02 +01:00
Andreas Wendleder	646db5aacb	Reflect changes in LLVM's type system.	2011-07-13 06:44:44 +01:00
Matt Pharr	a535aa586b	Fix issue #2: use zero extend to convert bool->int, not sign extend. This way, we match C/C++ in that casting a bool to an int gives either the value zero or the value one. There is a new stdlib function int sign_extend(bool) that does sign extension for cases where that's desired.	2011-07-12 13:30:05 +01:00
Matt Pharr	6e8af5038b	Fix issue #62: emit stdlib code as char array, not a string. MSVC 2010 issues an error if given a string larger than 64k characters long. To work around this, the pre-processed stdlib.ispc code is now stored as an array of characters terminated with a NUL (i.e. the same thing in the end); MSVC is fine with arrays larger than 64k characters.	2011-07-08 09:14:52 -07:00
Andreas Wendleder	7058ca1aaf	Script fixes. Indent correctly, don't nag about nonexisting files and add a carriage return.	2011-07-08 16:45:03 +01:00
Andreas Wendleder	ae6ee3ea46	Cmake based LLVM builds don't have svn in their identification.	2011-07-08 16:44:22 +01:00
Matt Pharr	e156651190	Fix __load_masked_{32,64} to properly obey the mask. Fixes issue #28. Fixed the implementations of these builtin functions for targets that don't have native masked load instructions so that they do no loads if the vector mask is all off, and only do an (unaligned) vector load if both the first and last element of the mask are on. Otherwise they serialize and do scalar loads for only the active lanes. This fixes a number of potential sources of crashes due to accessing invalid memory.	2011-07-08 11:21:11 +01:00
Matt Pharr	092d288aef	Merge pull request #61 from danschubert/master Fixed VC2010 warnings caused by implicit conversion from 'long' to 'char'.	2011-07-07 08:45:40 -07:00
Daniel Schubert	409bdc0dba	Fixed VC2010 warnings: lex.ll(397): warning C4244: '=' : conversion from 'long' to 'char', possible loss of data lex.ll(402): warning C4244: '=' : conversion from 'long' to 'char', possible loss of data by explicit cast to 'char'. See: http://msdn.microsoft.com/en-us/library/w4z2wdyc(v=vs.80).aspx long strtol( const char *nptr, char **endptr, int base );	2011-07-07 08:17:03 -07:00
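
The masked-load strategy described in commit e156651190 can be illustrated with a scalar C++ sketch. This is not the ispc implementation (the real builtins are written in LLVM assembly); the function name `load_masked_32`, the 4-wide vector, and the `Vec`/`Mask` types are assumptions made up for illustration. The reading that "first and last lane on" makes the whole span safe to touch is likewise an inference from the commit message, not something it states.

```cpp
#include <array>
#include <cstdint>
#include <cstring>

// Hypothetical 4-wide "vector" types; the real targets use SIMD registers.
constexpr int kWidth = 4;
using Vec = std::array<int32_t, kWidth>;
using Mask = std::array<bool, kWidth>;

// Sketch of a masked load for a target without a native masked-load
// instruction. Only the values in active lanes are meaningful; callers
// should ignore the contents of inactive lanes.
Vec load_masked_32(const int32_t *ptr, const Mask &mask) {
    Vec result{};  // all lanes start at zero

    bool anyOn = false;
    for (bool m : mask) anyOn = anyOn || m;
    if (!anyOn) return result;  // mask all off: touch no memory at all

    if (mask[0] && mask[kWidth - 1]) {
        // First and last lanes are active, so the whole span is assumed
        // valid: do a single (unaligned) load of every element.
        std::memcpy(result.data(), ptr, sizeof(int32_t) * kWidth);
        return result;
    }

    // Otherwise serialize: scalar loads for the active lanes only.
    for (int i = 0; i < kWidth; ++i)
        if (mask[i]) result[i] = ptr[i];
    return result;
}
```

With an all-off mask the pointer is never dereferenced, so even an invalid address is harmless; with a partial mask only the active elements are read, which is the property the commit credits with removing crashes from invalid memory accesses.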

136 Commits