aaron/ispc - ispc - git.frat.tech

aaron/ispc

Go to file

Matt Pharr 6dbb15027a Take advantage of x86's free "scale by 2, 4, or 8" in addressing calculations

When loading from an address that's computed by adding two registers
together, x86 can scale one of them by 2, 4, or 8, for free as part
of the addressing calculation.  This change makes the code generated
for gather and scatter use this.

For the cases where gather/scatter is based on a base pointer and
an integer offset vector, the GSImprovementsPass looks to see if the
integer offsets are being computed as 2/4/8 times some other value.
If so, it extracts the 2x/4x/8x part and leaves the rest there as
the the offsets.  The {gather,scatter}_base_offsets_* functions take
an i32 scale factor, which is passed to them, and then they carefully
generate IR so that it hits LLVM's pattern matching for these scales.

This is particular win on AVX, since it saves us two 4-wide integer
multiplies.

Noise runs 14% faster with this.
Issue #132.

2011-12-16 15:55:44 -08:00

contrib

vim syntax highlighting for ispc from <andreas.wendleder@googlemail.com>

2011-08-04 05:49:28 -07:00

docs

Release notes and doxygen bump for 1.1.1

2011-12-15 13:17:08 -08:00

examples

Fix mandelbrot_tasks example

2011-12-11 15:21:11 -08:00

tests

Fix bugs with offsetting for varying values with gathers/scatters.

2011-12-12 14:13:46 -08:00

tests_errors

Fix test runner script to not crash if one of the tests_errors didn't return the expected result.

2011-12-15 12:38:41 -08:00

winstuff

Initial commit.

2011-06-21 12:48:50 -07:00

.gitignore

Release notes, bump doxygen version # for next release.

2011-07-17 16:52:36 +02:00

ast.cpp

Transition EstimateCost() AST traversal to WalkAST() as well.

2011-12-16 12:24:51 -08:00

ast.h

Transition EstimateCost() AST traversal to WalkAST() as well.

2011-12-16 12:24:51 -08:00

bitcode2cpp.py

More small Windows build fixes. Also switch to LLVM 3.0 libs

2011-09-26 16:07:23 -07:00

buildall.bat

Fix various warnings / build issues on Windows

2011-12-15 12:06:38 -08:00

builtins-avx-common.ll

Workaround change to linker behavior in LLVM 3.1

2011-11-05 16:57:26 -07:00

builtins-avx-x2.ll

Workaround change to linker behavior in LLVM 3.1

2011-11-05 16:57:26 -07:00

builtins-avx.ll

Workaround change to linker behavior in LLVM 3.1

2011-11-05 16:57:26 -07:00

builtins-c.c

Add support for function pointers.

2011-11-03 16:14:14 -07:00

builtins-dispatch.ll

Workaround change to linker behavior in LLVM 3.1

2011-11-05 16:57:26 -07:00

builtins-sse2-common.ll

Workaround change to linker behavior in LLVM 3.1

2011-11-05 16:57:26 -07:00

builtins-sse2-x2.ll

Add a number of symbol names to list to make internal after loading builtins.

2011-12-07 08:30:38 -08:00

builtins-sse2.ll

Add a number of symbol names to list to make internal after loading builtins.

2011-12-07 08:30:38 -08:00

builtins-sse4-common.ll

Workaround change to linker behavior in LLVM 3.1

2011-11-05 16:57:26 -07:00

builtins-sse4-x2.ll

Add a number of symbol names to list to make internal after loading builtins.

2011-12-07 08:30:38 -08:00

builtins-sse4.ll

Workaround change to linker behavior in LLVM 3.1

2011-11-05 16:57:26 -07:00

builtins.cpp

Have assertion macro and FATAL() text ask user to file a bug, provide URL to do so.

2011-12-15 11:11:16 -08:00

builtins.h

Add support for compiling to multiple targets.

2011-10-04 16:01:55 -07:00

builtins.m4

Take advantage of x86's free "scale by 2, 4, or 8" in addressing calculations

2011-12-16 15:55:44 -08:00

ctx.cpp

Have assertion macro and FATAL() text ask user to file a bug, provide URL to do so.

2011-12-15 11:11:16 -08:00

ctx.h

Small cleanup: allocate storage for the full mask in the FunctionEmitContext constructor

2011-12-10 13:33:28 -08:00

decl.cpp

Transition type checking to use WalkAST() infrastructure.

2011-12-16 12:24:51 -08:00

decl.h

Parse and then mostly ignore "signed" qualifier.

2011-11-29 21:41:04 -08:00

doxygen.cfg

Release notes and doxygen bump for 1.1.1

2011-12-15 13:17:08 -08:00

expr.cpp

Transition EstimateCost() AST traversal to WalkAST() as well.

2011-12-16 12:24:51 -08:00

expr.h

Print better error messages when function overload resolution fails.

2011-12-14 11:41:34 -08:00

func.cpp

Transition EstimateCost() AST traversal to WalkAST() as well.

2011-12-16 12:24:51 -08:00

func.h

Significantly reduce the tendrils of DeclSpecs/Declarator/Declaration code

2011-10-18 15:37:29 -07:00

ispc.cpp

Linux build fixes

2011-12-15 12:23:26 -08:00

ispc.h

Linux build fixes

2011-12-15 12:23:26 -08:00

ispc.sln

Update run_tests.py to work on Windows. Removed JIT-based testing path entirely.

2011-12-06 13:46:20 -08:00

ispc.vcxproj

Add "double-wide" sse2-x2 target.

2011-10-11 15:17:31 -07:00

lex.ll

Fix various warnings / build issues on Windows

2011-12-15 12:06:38 -08:00

LICENSE.txt

Add support for in-memory half float data. Fixes issue #10

2011-07-21 15:55:45 +01:00

llvmutil.cpp

Have assertion macro and FATAL() text ask user to file a bug, provide URL to do so.

2011-12-15 11:11:16 -08:00

llvmutil.h

Add support for pointers to the language.

2011-11-27 13:09:59 -08:00

main.cpp

Fix various warnings / build issues on Windows

2011-12-15 12:06:38 -08:00

Makefile

Update run_tests.py to work on Windows. Removed JIT-based testing path entirely.

2011-12-06 13:46:20 -08:00

module.cpp

Transition type checking to use WalkAST() infrastructure.

2011-12-16 12:24:51 -08:00

module.h

Generalize/improve parsing of pointer declarations.

2011-11-14 08:45:55 -08:00

opt.cpp

Take advantage of x86's free "scale by 2, 4, or 8" in addressing calculations

2011-12-16 15:55:44 -08:00

opt.h

Initial commit.

2011-06-21 12:48:50 -07:00

parse.yy

Transition type checking to use WalkAST() infrastructure.

2011-12-16 12:24:51 -08:00

README.txt

Release notes and doxygen bump for 1.0.9 release

2011-09-26 16:21:32 -07:00

run_tests.py

Fix test runner script to not crash if one of the tests_errors didn't return the expected result.

2011-12-15 12:38:41 -08:00

simple.vcxproj

Windows: fix some compiler warnings during build

2011-10-09 07:40:17 -07:00

stdlib2cpp.py

Fix issue #62 : emit stdlib code as char array, not a string

2011-07-08 09:14:52 -07:00

stdlib.ispc

Fix AoS/SoA stdlib functions to match documentation

2011-12-03 22:44:16 -08:00

stmt.cpp

Transition EstimateCost() AST traversal to WalkAST() as well.

2011-12-16 12:24:51 -08:00

stmt.h

Rewrite AST optimization infrastructure to be built on top of WalkAST().

2011-12-16 12:24:51 -08:00

sym.cpp

Have assertion macro and FATAL() text ask user to file a bug, provide URL to do so.

2011-12-15 11:11:16 -08:00

sym.h

Symbol table now properly handles scopes for function declarations.

2011-12-04 17:37:13 -08:00

test_static.cpp

Fix various warnings / build issues on Windows

2011-12-15 12:06:38 -08:00

type.cpp

Have assertion macro and FATAL() text ask user to file a bug, provide URL to do so.

2011-12-15 11:11:16 -08:00

type.h

Add support for pointers to the language.

2011-11-27 13:09:59 -08:00

util.cpp

Have assertion macro and FATAL() text ask user to file a bug, provide URL to do so.

2011-12-15 11:11:16 -08:00

util.h

Initial commit.

2011-06-21 12:48:50 -07:00

README.txt

==============================
Intel(r) SPMD Program Compiler
==============================

Welcome to the Intel(r) SPMD Program Compiler (ispc)!  

ispc is a new compiler for "single program, multiple data" (SPMD)
programs. Under the SPMD model, the programmer writes a program that mostly
appears to be a regular serial program, though the execution model is
actually that a number of program instances execute in parallel on the
hardware. ispc compiles a C-based SPMD programming language to run on the
SIMD units of CPUs; it frequently provides a a 3x or more speedup on CPUs
with 4-wide SSE units, without any of the difficulty of writing intrinsics
code.

ispc is an open source compiler under the BSD license; see the file
LICENSE.txt.  ispc supports Windows, Mac, and Linux, with both x86 and
x86-64 targets.  It currently supports the SSE2, SSE4, and AVX instruction
sets.

For more information and examples, as well as a wiki and the bug database,
see the ispc distribution site, http://ispc.github.com.

Languages

C++ 63.5%

LLVM 19.1%

M4 11.6%

Python 4.5%

Makefile 0.5%

Other 0.6%