No functional change; just preparation for having a path that doesn't
factor the offsets into constant and varying parts, which will be better
for AVX2 and KNC.
Add peephole optimization to eliminate some mask AND operations.
On KNC, the various vector comparison instructions can optionally be
masked; if a mask is provided, the value returned is effectively the
AND of the mask with the result of the comparison.
This change adds an optimization pass to the C++ backend that looks
for vector ANDs where one operand is a comparison and rewrites
them--e.g. "__and(__equal_float(a, b), c)" is changed to
"__equal_float_and_mask(a, b, c)", saving an instruction in the end.
Issue #319.
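As an illustrative sketch of what the combined function buys (simplified to
raw __m512/__mmask16 types; the real examples/intrinsics/knc.h wraps these
in its own vector structs), it can boil down to a single masked compare:

    #include <immintrin.h>   // KNC/MIC vector intrinsics

    // (a == b) & m in one instruction: the mask argument of the masked
    // compare does the AND that previously needed a separate operation.
    static inline __mmask16 equal_float_and_mask_sketch(__m512 a, __m512 b,
                                                        __mmask16 m) {
        return _mm512_mask_cmpeq_ps_mask(m, a, b);
    }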
Merge commit '8ef6bc16364d4c08aa5972141748110160613087'
Conflicts:
examples/intrinsics/knc.h
examples/intrinsics/sse4.h
All of these pass the current mask to FunctionEmitContext::SetBlockEntryMask()
so that when a break/continue/return is encountered, it can test to see if all
lanes have followed that path and then return; this in turn ensures that we never
run statements with an all-off execution mask.
These functions were passing the function internal mask, not the full mask, and
thus could end up executing code with the mask all off if some lanes were
disabled by an outer function. (The new tests cover this case.)
Fixes include adding "_float" and "_double" suffixes where appropriate,
as well as providing a number of missing implementations.
This resolves several failures in the half* tests.
1. For some time now, we have provided the version without the 'svn'
suffix.
2. We should be testing "not LLVM 3.0" in these cases, since they apply
to LLVM 3.2 and beyond as well.
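A minimal sketch of the guard style point 2 argues for (the LLVM_3_x macro
names here are assumptions about ispc's build-time defines):

    #if !defined(LLVM_3_0)
        // Path for LLVM 3.1, 3.2, and later, rather than enumerating each
        // newer version (LLVM_3_1, LLVM_3_1svn, ...) individually.
    #endif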
We now follow the rule that the type of an integer constant is the
first of int32, uint32, int64, or uint64 that can hold the value.
(Unless 'u' or 'l' suffixes have been provided.)
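For illustration (these examples are not from the commit itself), the rule
assigns types to unsuffixed constants like this:

    100                   -> int32
    3000000000            -> uint32   (too big for int32)
    5000000000            -> int64    (too big for uint32)
    10000000000000000000  -> uint64   (too big for int64)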
Fixes issue #299.
e.g. "__equal()" -> "__equal_float()", etc.
No functional change; this is necessary groundwork for a forthcoming
peephole optimization that eliminates ANDs of masks in some cases.
Flag 32-bit vector types as only requiring 32-bit alignment (preemptive
bug fix for 32xi1 vectors).
Force module datalayouts to be the same before linking them to silence
an LLVM warning.
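A minimal sketch of the idea, assuming the LLVM 3.x C++ API (the actual
ispc code may differ):

    #include <llvm/Module.h>
    #include <llvm/Linker.h>

    // Make the source module's datalayout string match the destination's
    // before linking, so LLVM doesn't warn about a mismatch.
    static bool linkWithMatchingLayout(llvm::Module *dst, llvm::Module *src,
                                       std::string *err) {
        src->setDataLayout(dst->getDataLayout());
        return llvm::Linker::LinkModules(dst, src, llvm::Linker::DestroySource,
                                         err);
    }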
Finishes issue #309.
When "break", "continue", or "return" is used under varying control flow,
we now always check the execution mask to see if all of the program
instances are executing it. (Previously, this was only done with "cbreak",
"ccontinue", and "creturn", which are now deprecated.)
An important effect of this change is that it fixes a family of cases
where we could end up running with an "all off" execution mask, which isn't
supposed to happen, as it leads to all sorts of invalid behavior.
This change does cause the volume rendering example to run 9% slower, but
doesn't affect the other examples.
Issue #257.
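A minimal sketch of the "are all program instances executing?" test this
relies on (hypothetical helper, not ispc's actual internals):

    #include <stdint.h>

    // True when every lane bit of the execution mask is set. Under varying
    // control flow, break/continue/return now emit this kind of check so
    // that when all running lanes take the edge, control actually leaves
    // the block rather than continuing with an all-off mask.
    static inline bool allLanesOn(uint64_t mask, int vectorWidth) {
        return mask == ((1ull << vectorWidth) - 1);
    }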
This fixes a crash when 'cbreak' was used in a 'switch'. Also renamed
FunctionEmitContext::SetLoopMask() to SetBlockEntryMask(), and renamed
the loopMask member variable to match.
When we have a constant vector of primitive types, we now generate
a definition of a static const array of the individual values. This
in turn allows us to emit a simple aligned vector load to get the
constant vector value, rather than inefficiently inserting the values
into a vector.
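A hedged sketch of the generated C++ described above (the array name and
the __load signature are assumptions, and the snippet presumes one of the
examples/intrinsics/*.h headers is included):

    // The constant's element values become a static, aligned table...
    static const float __const_vec_data[16] __attribute__((aligned(64))) =
        { 0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15 };
    // ...and the vector value comes from one aligned load of that table,
    // instead of sixteen per-element insert operations.
    __vec16_f v = __load<64>((const __vec16_f *)__const_vec_data);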
Issue #318.
If we have a vector of all zeros, a __setzero_* function call is emitted,
permitting specialized intrinsics to be called for this case. Undefined
values are reflected with an __undef_* call, which similarly allows
passing that information along.
This change also includes a cleanup of the __smear_* function signatures;
since they already have different names depending on the scalar value
type, we no longer need the trick of passing an undefined value of the
return vector type as the first parameter in order to overload by return
value.
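Illustrative declarations of the cleanup (hedged; the real declarations
live in examples/intrinsics/*.h and use those headers' vector types, with
the _float variants shown here as representatives):

    // Old style: dummy first argument, used only to "overload" by return type.
    __vec16_f __smear_float(__vec16_f retval_type_hint, float v);
    // New style: the name already encodes the element type.
    __vec16_f __smear_float(float v);
    // All-zero and undefined values now get their own entry points.
    __vec16_f __setzero_float();
    __vec16_f __undef_float();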
Issue #317.
Fixes to __load and __store.
Added __add, __mul, __equal, __not_equal, __extract_elements, __smear_i64, __cast_sext, __cast_zext,
and __scatter_base_offsets32_float.
__rcp_varying_float now has both a fast-math and a full-precision
implementation.
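A hedged sketch of the two __rcp_varying_float flavors, assuming this
refers to the KNC intrinsics header and simplifying to raw __m512 (knc.h
wraps this in its own vector type, and the exact refinement may differ):

    #include <immintrin.h>   // KNC/MIC intrinsics

    // Fast-math: just the hardware's ~23-bit reciprocal approximation.
    static inline __m512 rcpFast(__m512 v) {
        return _mm512_rcp23_ps(v);
    }

    // Full precision: one Newton-Raphson refinement step, r' = 2r - v*r*r.
    static inline __m512 rcpFull(__m512 v) {
        __m512 r = _mm512_rcp23_ps(v);
        return _mm512_sub_ps(_mm512_add_ps(r, r),
                             _mm512_mul_ps(v, _mm512_mul_ps(r, r)));
    }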
Some modules require an include of unistd.h (e.g. for getcwd and isatty
definitions).
These changes were required to build successfully on a Fedora 17 system,
using GCC 4.7.0 & glibc-headers 2.15.
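For example, a representative fix (not necessarily the exact hunk):

    #include <unistd.h>   // for getcwd() and isatty(); needed explicitly
                          // with GCC 4.7.0 / glibc-headers 2.15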