Matt Pharr 8938e14442 Add support for emitting ~generic vectorized C++ code.
The compiler now supports an --emit-c++ option, which generates generic
vector C++ code.  To actually compile this code, the user must provide
C++ code that implements a variety of types and operations (e.g. adding
two floating-point vector values together, comparing them, etc).

There are two examples of this required code in examples/intrinsics:
generic-16.h is a "generic" 16-wide implementation that does all required
with scalar math; it's useful for demonstrating the requirements of the
implementation.  Then, sse4.h shows a simple implementation of a SSE4
target that maps the emitted function calls to SSE intrinsics.

When using these example implementations with the ispc test suite,
all but one or two tests pass with gcc and clang on Linux and OSX.
There are currently ~10 failures with icc on Linux, and ~50 failures with
MSVC 2010.  (To be fixed in coming days.)

Performance varies: when running the examples through the sse4.h
target, some have the same performance as when compiled with --target=sse4
from ispc directly (options), while noise is 12% slower, rt is 26%
slower, and aobench is 2.2x slower.  The details of this haven't yet been
carefully investigated, but will be in coming days as well.

Issue #92.
2012-01-04 12:59:03 -08:00
2011-06-21 12:48:50 -07:00
2012-01-04 12:59:03 -08:00
2011-06-21 12:48:50 -07:00
2011-06-21 12:48:50 -07:00

==============================
Intel(r) SPMD Program Compiler
==============================

Welcome to the Intel(r) SPMD Program Compiler (ispc)!  

ispc is a new compiler for "single program, multiple data" (SPMD)
programs. Under the SPMD model, the programmer writes a program that mostly
appears to be a regular serial program, though the execution model is
actually that a number of program instances execute in parallel on the
hardware. ispc compiles a C-based SPMD programming language to run on the
SIMD units of CPUs; it frequently provides a a 3x or more speedup on CPUs
with 4-wide SSE units, without any of the difficulty of writing intrinsics
code.

ispc is an open source compiler under the BSD license; see the file
LICENSE.txt.  ispc supports Windows, Mac, and Linux, with both x86 and
x86-64 targets.  It currently supports the SSE2, SSE4, and AVX instruction
sets.

For more information and examples, as well as a wiki and the bug database,
see the ispc distribution site, http://ispc.github.com.
Description
No description provided
Readme 34 MiB
Languages
C++ 63.5%
LLVM 19.1%
M4 11.6%
Python 4.5%
Makefile 0.5%
Other 0.6%