aaron/ispc - ispc - git.frat.tech

aaron/ispc

Author	SHA1	Message	Date
Matt Pharr	0f8eee9809	Fix cases in optimization code to not inadvertently match calls to func ptrs. If we call a function pointer, CallInst::getCalledFunction() returns NULL; we need to be careful about this case when we're matching various function calls in optimization passes. (Fixes a crash.)	2012-01-12 10:33:06 -08:00
Matt Pharr	0740299860	Fix switch test	2012-01-12 09:45:31 -08:00
Matt Pharr	652215861e	Update dynamic target dispatch code to support AVX2.	2012-01-12 08:37:18 -08:00
Matt Pharr	602209e5a8	Tiny updates to documentation, comment for switch stuff.	2012-01-12 05:55:42 -08:00
Matt Pharr	b60f8b4f70	Fix merge conflicts	2012-01-11 17:13:51 -08:00
Matt Pharr	b67446d998	Add support for "switch" statements. Switches with both uniform and varying "switch" expressions are supported. Switch statements with varying expressions and very large numbers of labels may not perform well; some issues to be filed shortly will track opportunities for improving these.	2012-01-11 09:16:31 -08:00
Matt Pharr	9670ab0887	Add missing cases to watch out for in lCheckAllOffSafety() Previously, we weren't checking for member expressions that dereferenced a pointer or pointer dereference expressions--only array indexing!	2012-01-11 09:16:31 -08:00
Matt Pharr	0223bb85ee	Fix bug in StmtList::EmitCode() Previously, we would return immediately if the current basic block was NULL; however, this is the wrong thing to do in that goto labels and case/default labels in switch statements will establish a new current basic block even if the current one is NULL.	2012-01-11 09:14:39 -08:00
Jean-Luc Duprat	fd81255db1	Removed mutex support for OSX 10.5 Allow to run from the build directory even if it is not on the path properly decode subprocess stdout/stderr as UTF-8 Added newlines that were mistakenly left out of print->sys.stdout.wriote() conversion in previous CL Python 3: - fixed error message comparison - explicit list creation Windows: - forward/back slash annoyances - added stdint.h with definitions for int32_t, int64_t - compile_error_files and run_error_files were being appended to improperly	2012-01-10 16:55:00 -08:00
Matt Pharr	8a8e1a7f73	Fix bug with multiple EmitCode() calls due to missing braces. In short, we were inadvertently trying to emit each function's code a second time if the function had a mask check at the start of it. StmtList::EmitCode() was covering this error up by not emitting code if the current basic block is NULL.	2012-01-10 16:50:13 -08:00
Jean-Luc Duprat	ef05fbf424	run_tests.py more compatible with python 3.x except for the mutex class...	2012-01-10 13:12:38 -08:00
Jean-Luc Duprat	fa01b63fa5	Remove assumption that . is in the PATH in run_tests.py	2012-01-10 11:41:08 -08:00
Jean-Luc Duprat	63d3d25030	Fixed off by one error in array size generated by bitcode2cpp.py	2012-01-10 11:22:13 -08:00
Jean-Luc Duprat	a8db866228	Python build compatible on both python 2 and 3	2012-01-10 10:42:15 -08:00
Jean-Luc Duprat	0519eea951	Makefile does not hardcode link paths on Linux Link statically for both x86 and x86-64	2012-01-10 10:34:57 -08:00
Matt Pharr	f4653ecd11	Release notes for 1.1.2 and doxygen version number bump v1.1.2	2012-01-09 16:05:40 -08:00
Jean-Luc Duprat	5d67252ed0	Python scripts now compatible with both 2.x and 3.x releases of python	2012-01-09 13:56:05 -08:00
Matt Pharr	5134de71c0	Fix Windows build (inttypes.h not available)	2012-01-09 09:05:20 -08:00
Matt Pharr	2be1251c70	Fix Makefile on OSX (uname -o not supported)	2012-01-09 07:40:47 -08:00
Matt Pharr	c0161aa17f	Merge pull request #154 from palacaze/mingw Mingw support	2012-01-09 07:37:02 -08:00
Pierre-Antoine Lacaze	b683aa11b1	Fix linking under mingw, libdl is Linux only.	2012-01-09 10:52:46 +01:00
Pierre-Antoine Lacaze	2654bb0112	Handle python installations in non-standards locations.	2012-01-09 10:29:54 +01:00
Pierre-Antoine Lacaze	d8728104b4	Handle the case whereby BUILD_DATE is already defined.	2012-01-09 10:29:16 +01:00
Pierre-Antoine Lacaze	0be1b70fba	Mingw has strtoull, make use of it.	2012-01-09 10:28:52 +01:00
Pierre-Antoine Lacaze	a0e9793de3	Shut up warning wrt CONSOLE_SCREEN_BUFFER_INFO initialization	2012-01-09 10:19:46 +01:00
Pierre-Antoine Lacaze	da9200fcee	Fix alloca use on mingw.	2012-01-09 10:19:09 +01:00
Pierre-Antoine Lacaze	54e8e8022b	suppress warnings about long long arguments	2012-01-09 10:18:39 +01:00
Pierre-Antoine Lacaze	d84cf781da	Mingw does not have sysconf, use the msc way of finding processors.	2012-01-09 09:45:40 +01:00
Pierre-Antoine Lacaze	002f27a30f	Implement vasprintf and asprintf for platforms lacking them.	2012-01-09 09:44:58 +01:00
Matt Pharr	86d88e9773	run_tests.py: fix to use multiple cores on windows, ignore non ispc inputs	2012-01-08 15:29:20 -08:00
Matt Pharr	fda00afe6e	Rename .txt files in docs to .rst (which is what they actually are).	2012-01-08 14:11:04 -08:00
Matt Pharr	be0c77d556	Detect more gather/scatter cases that are actually base+offsets. We now recognize patterns like (ptr + offset1 + offset2) as being cases we can handle with the base_offsets variants of the gather/scatter functions. (This can come up with multidimensional array indexing, for example.) Issue #150.	2012-01-08 14:06:44 -08:00
Matt Pharr	0ed11a7832	Compute SizeOf() and OffsetOf() at compile time in more cases with the generic target. Really, we only have to be careful about the case where there is a vector of bools (i.e. a mask) involved, since the size of that isn't known at compile-time. (Currently, at least.)	2012-01-08 14:06:44 -08:00
Matt Pharr	ff6971fb15	Use Assert() rather than assert()	2012-01-08 14:06:44 -08:00
Matt Pharr	5b4dbc8167	Fix build of aobench_instrumented example on OSX/Linux	2012-01-08 10:02:43 -08:00
Jean-Luc Duprat	59f4c9985e	Python files compatible with python 3	2012-01-06 16:56:09 -08:00
Matt Pharr	8da9be1a09	Add support for 'k', 'M', and 'G' suffixes to integer constants. (Denoting units of 1024, 10241024, and 10241024*1024, respectively.) Issue #128.	2012-01-06 14:47:47 -08:00
Matt Pharr	11033e108e	Fix bug that prohibited assignments with pointer expressions on the LHS Previously, code like "*(ptr+1) = foo" would claim that the LHS was invalid for an assignment expression. Issue #138.	2012-01-06 14:21:03 -08:00
Matt Pharr	4f97262cf2	Support function declarations in the definitions of other functions. As part of this, function declarations are no longer scoped (this is permitted by the C standard, as it turns out.) So code like: void foo() { void bar(); } void bat() { bar(); } Compiles correctly; the declaration of bar() in foo() is still available in the definition of bar(). Fixes issue #129.	2012-01-06 13:50:10 -08:00
Matt Pharr	9b68b9087a	Fix crash with anonymous function parameters in function definitions. Issue #135.	2012-01-06 13:28:06 -08:00
Matt Pharr	15cc812e37	Add notion of "unbound" variability to the type system. Now, when a type is declared without an explicit "uniform" or "varying" qualifier, its variability is unbound; depending on the context of the declaration, the variability is later finalized. Currently, in almost all cases, types with unbound variability are resolved to varying types; the one exception is typecasts like: "(int)1"; in this case, the fact that (int) has unbound variability carries through to the TypeCastExpr, which in turn notices that the expression being type cast has uniform type and in turn will resolve (int) to (uniform int). Fixes issue #127.	2012-01-06 11:52:58 -08:00
Matt Pharr	71317e6aa6	Fix bug in gather/scatter optimization passes. When flattening chains of insertelement instructions, we didn't handle the case where the initial insertelement was to a constant vector (with one value set and the other values undef). Also generalized the "do all of the instances access the same location" check to handle the case where some of them are accessing undef locations; these are ignored in this check, as they should correspond to the mask being off for that lane anyway. Fixes issue #149.	2012-01-06 09:19:18 -08:00
Matt Pharr	1abaaee73e	Fix bug where we'd sometimes inadvertently lose cv-qualifiers on pointers. Fixes issue #142.	2012-01-06 08:41:01 -08:00
Matt Pharr	78c6d3c02f	Add initial support for 'goto' statements. ispc now supports goto, but only under uniform control flow--i.e. it must be possible for the compiler to statically determine that all program instances will follow the goto. An error is issued at compile time if a goto is used when this is not the case.	2012-01-05 12:22:36 -08:00
Matt Pharr	48e9d4af39	Emit code for #includes in emitted C++ code all at the start of the file.	2012-01-05 12:22:35 -08:00
Matt Pharr	cb7ad371c6	Run tests using -O2. Disconcertingly, this seems to fix some gcc-only crashes with the generic-16 target (specifically, for half.ispc and for goto-[23].ispc-- those tests run fine with other compilers with generic-16.)	2012-01-05 12:22:35 -08:00
Matt Pharr	2951589825	Redo readme in better-looking rst form	2012-01-04 15:32:29 -08:00
Matt Pharr	f23dc5366a	Updates to run_tests.py script. Add support for the generic targets (using the headers in examples/intrinsics if none is provided.) Provide option to run valgrind on the compiled code. Print a list of all failing tests at the end.	2012-01-04 12:59:03 -08:00
Matt Pharr	e3341176c5	Redo makefiles for the examples. They're all based off a common examples/common.mk file, so that individual makefiles are quite simple now. The common.mk file also provides targets to build the examples using C++ output with the generic-16h or sse4.h files. These targets don't run by default, but do run if 'make all' is run.	2012-01-04 12:59:03 -08:00
Matt Pharr	8938e14442	Add support for emitting ~generic vectorized C++ code. The compiler now supports an --emit-c++ option, which generates generic vector C++ code. To actually compile this code, the user must provide C++ code that implements a variety of types and operations (e.g. adding two floating-point vector values together, comparing them, etc). There are two examples of this required code in examples/intrinsics: generic-16.h is a "generic" 16-wide implementation that does all required with scalar math; it's useful for demonstrating the requirements of the implementation. Then, sse4.h shows a simple implementation of a SSE4 target that maps the emitted function calls to SSE intrinsics. When using these example implementations with the ispc test suite, all but one or two tests pass with gcc and clang on Linux and OSX. There are currently ~10 failures with icc on Linux, and ~50 failures with MSVC 2010. (To be fixed in coming days.) Performance varies: when running the examples through the sse4.h target, some have the same performance as when compiled with --target=sse4 from ispc directly (options), while noise is 12% slower, rt is 26% slower, and aobench is 2.2x slower. The details of this haven't yet been carefully investigated, but will be in coming days as well. Issue #92.	2012-01-04 12:59:03 -08:00

... 4 5 6 7 8 ...

750 Commits