aaron/ispc - ispc - git.frat.tech

aaron/ispc

Author	SHA1	Message	Date
Matt Pharr	cb7976bbf6	Added updated task launch implementation that now tracks task groups. Within each function that launches tasks, we now can easily track which tasks that function launched, so that the sync at the end of the function can just sync on the tasks launched by that function (not all tasks launched by all functions.) Implementing this led to a rework of the task system API that ispc generates code to call; the example task systems in examples/tasksys.cpp have been updated to conform to this API. (The updated API is also documented in the ispc user's guide.) As part of this, "launch[n]" syntax was added to launch a number of tasks in a single launch statement, rather than requiring a loop over 'n' to launch n tasks. This commit thus fixes issue #84 (enhancement to launch multiple tasks from a single launch statement) as well as issue #105 (recursive task launches were broken).	2011-09-30 11:20:53 -07:00
Matt Pharr	b3d3e8987b	Provide a properly initialized TextDiagnosticPrinter to clang's preprocessor. Fixes issue #100 (crash when the preprocessor was trying to emit a diagnostic about a mismatched #if/#endif).	2011-09-23 15:50:18 -07:00
Matt Pharr	9921b8e530	Predicated 'if' statement performance improvements. Go back to running both sides of 'if' statements with masking and without branching if we can determine that the code is relatively simple (as per the simple cost model), and is safe to run even if the mask is 'all off'. This gives a bit of a performance improvement for some of the examples (most notably, the ray tracer), and is the code that one wants generated in this case anyhow.	2011-09-19 09:54:09 -07:00
Matt Pharr	2405dae8e6	Use malloc() to get space for task arguments when compiling to AVX. This is to work around the LLVM bug/limitation discused in LLVM bug 10841 (http://llvm.org/bugs/show_bug.cgi?id=10841).	2011-09-17 13:38:51 -07:00
Matt Pharr	3607f3e045	Remove support for building with LLVM 2.8. Fixes issue #66 . Both 2.9 and top-of-tree generate substantially better code than LLVM 2.8 did, so it's not worth fixing the 2.8 build.	2011-09-17 13:18:59 -07:00
Matt Pharr	a501ab1aa6	Fix parenthesization bugs in cost estimates. Also added the debugging print that helped find these issues. Revert inlining some functions in examples	2011-09-16 19:07:07 -07:00
Matt Pharr	ca87579f23	Add a very simple cost model to estimate runtime cost of running code. This is currently only used to decide whether it's worth doing an "are all lanes running" check at the start of functions--for small functions, it's not worth the overhead. The cost is estimated relatively early in compilation (e.g. before we know if an array access is a scatter/gather or not, before constant folding, etc.), so there are many known shortcomings.	2011-09-16 15:09:17 -07:00
Matt Pharr	1147b53dcd	Add #define with target vector width in emitted headers	2011-09-09 09:33:56 -07:00
Matt Pharr	4cf831a651	When --fast-math is enabled, tell LLVM about it, too.	2011-09-09 09:32:59 -07:00
Matt Pharr	46d2bad231	Fix malformed program crash	2011-09-09 09:24:43 -07:00
Matt Pharr	32da8e11b4	Fix crash with varying global vector types when emitting header file.	2011-09-09 09:16:59 -07:00
Matt Pharr	54ec56c81d	Clean up and centralize LLVM target initialization	2011-08-26 10:15:33 -07:00
Matt Pharr	a322398c62	When emitting header files, put 'extern' declarations of globals used in ispc code outside of the ispc namespace. Fixes issue #64.	2011-08-26 10:03:06 -07:00
Matt Pharr	b67498766e	Big rewrite / improvement of target handling. If no CPU is specified, use the host CPU type, not just a default of "nehalem". Provide better features strings to the LLVM target machinery. -> Thus ensuring that LLVM doesn't generate SSE>2 instructions for the SSE2 target (Fixes issue #82). -> Slight code improvements from using cmovs in generated code now Use the llvm popcnt intrinsic for the SSE2 target now (it now generates code that doesn't call the popcnt instruction now that we properly tell LLVM which instructions are and aren't available for SSE2.)	2011-08-26 09:54:45 -07:00
Matt Pharr	c340ff3893	Fixes to build with LLVM ToT	2011-08-25 08:53:56 +01:00
Matt Pharr	e14208f489	Update to call DIBuilder::finalize() with LLVM 3.0	2011-08-24 22:28:20 +01:00
Matt Pharr	fe54f1ad8e	Fixes to build with latest LLVM ToT	2011-08-18 08:34:49 +01:00
Matt Pharr	04c93043d6	Target handling fixes. Set the Module's target appropriately when it's first created. Compile separate 32 and 64 bit versions of the builtins-c bitcocde and load the appropriate one based on the target we're compiling for.	2011-08-15 16:03:50 +01:00
Matt Pharr	230a0fadea	Attempt to generate debug info for task parameters.	2011-08-15 12:31:56 +01:00
Matt Pharr	25676d5643	When --debug is specified, only print the entire module bitcode twice. Fixes issue #77. Previously, it dumped out the entire module every time a new function was defined, which got to be quite a lot of output by the time the stdlib functions were all added!	2011-07-29 07:26:37 +02:00
Matt Pharr	158bd6ef9e	Fix bug with initializer expression lists for globlal/static array-typed variables.	2011-07-28 11:38:56 +01:00
Matt Pharr	8aea4a836d	Fix crash when trying to generate debug info with program source from stdin	2011-07-27 07:42:47 +01:00
Matt Pharr	0932dcd98b	Fix build with llvm top-of-tree	2011-07-23 08:35:45 +01:00
Matt Pharr	da0fd93315	AVX fixes: add missing 8/16-bit gathers and scatters, set features string appropriately when AVX is enabled.	2011-07-22 12:36:44 +01:00
Matt Pharr	2d573acd17	Another LLVM dev tree API change fix	2011-07-18 17:28:49 +01:00
Matt Pharr	654cfb4b4b	Many fixes for recent LLVM dev tree API changes	2011-07-18 15:54:39 +01:00
Matt Pharr	6b0a6c0124	Fix issue #67 : don't crash ungracefully if target ISA not supported on system. - In the ispc-generated header files, a #define now indicates which compilation target was used. - The examples use utility routines from the new file examples/cpuid.h to check the system's CPU's capabilities to see if it supports the ISA that was used for compiling the example code and print error messages if things aren't going to work...	2011-07-18 12:29:43 +01:00
Matt Pharr	f0f876c3ec	Add support for enums.	2011-07-17 16:43:05 +02:00
Andreas Wendleder	ae6ee3ea46	Cmake based LLVM builds don't have svn in their identification.	2011-07-08 16:44:22 +01:00
Matt Pharr	6e4c165c7e	Use malloc to allocate storage for task parameters on Windows. Fixes bug #55. A number of tests were crashing on Windows due to the task launch code using alloca to allocate space for the tasks' parameters. On Windows, the stack isn't generally big enough for this to be a good idea. Also added an alignment parmaeter to ISPCMalloc() to pass the alignment requirement along.	2011-07-06 05:53:25 -07:00
Matt Pharr	4d733af3c7	Add check to make sure file exists before running preprocessor. (If the file doesn't exist, clang ends up crashing, so we'd like to avoid that.)	2011-07-06 11:33:33 +01:00
Matt Pharr	b0658549c5	Fix crash when no input filename was provided.	2011-07-04 12:52:03 +01:00
Pete Couperus	fac50ba454	Use clang's preprocessor, rather than forking a process to run cpp on Mac/Linux (and not having a built-in preprocessor solution at all on Windows.) Fixes issue #32.	2011-07-04 08:35:31 +01:00
Matt Pharr	a2940d63b4	Update call to llvm::Target::createTargetMachine() for LLVM dev tree build to handle recent change to API. If building with LLVM tot, a version starting with or after this change must be used: commit 276365dd4bc0c2160f91fd8062ae1fc90c86c324 Author: Evan Cheng <evan.cheng@apple.com> Date: Thu Jun 30 01:53:36 2011 +0000 Fix the ridiculous SubtargetFeatures API where it implicitly expects CPU name to be the first encoded as the first feature. It then uses the CPU name to look up features / scheduling itineray even though clients know full well the CPU name being used to query these properties. The fix is to just have the clients explictly pass the CPU name! git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@134127 91177308-0d34-0410-b5e6-96231b3b80d8	2011-07-01 08:32:58 +01:00
Matt Pharr	214fb3197a	Initial plumbing to add CollectionType base-class as common ancestor to StructTypes, ArrayTypes, and VectorTypes. Issue #37.	2011-06-29 07:42:09 +01:00
Matt Pharr	2ced56736e	small comment changes, remove dead code	2011-06-22 14:38:49 -07:00
Matt Pharr	bf74a3360f	Merge pull request #24 from petecoup/master LLVM 2.8 mods	2011-06-22 12:03:19 -07:00
Matt Pharr	6086d3597c	Fix more instances of incorrect PI constants	2011-06-22 05:27:56 -07:00
Pete Couperus	af435e52c1	Minor mods to build on Fedora 15, LLVM 2.8	2011-06-21 22:57:36 -07:00
Matt Pharr	18af5226ba	Initial commit.	2011-06-21 12:48:50 -07:00

40 Commits