aaron/ispc - ispc - git.frat.tech

Author	SHA1	Message	Date
Matt Pharr	a501ab1aa6	Fix parenthesization bugs in cost estimates. Also added the debugging print that helped find these issues. Revert inlining some functions in examples	2011-09-16 19:07:07 -07:00
Matt Pharr	cdc850f98c	Inline some functions in examples	2011-09-16 17:02:21 -07:00
Matt Pharr	30f9dcd4f5	Unroll loops by default, add --opt=disable-loop-unroll to disable. Issue #78.	2011-09-13 15:37:18 -07:00
Matt Pharr	0c344b6755	Fix Linux build of mandelbrot_tasks example	2011-09-13 15:17:30 -07:00
Matt Pharr	5dedb6f836	Add --scale command line argument to mandelbrot and rt examples. This applies a floating-point scale factor to the image resolution; it's useful for experiments with many-core systems where the base image resolution may not give enough work for good load-balancing with tasks.	2011-09-07 20:07:51 -07:00
Matt Pharr	2ea6d249d5	Fix mapping to 8, 16 program instances in AO bench example. With this, we now compute a correct image with AVX.	2011-09-07 11:34:24 -07:00
Matt Pharr	375f1cb8e8	Make octaves and octaves loop uniform in noise example	2011-09-07 10:34:23 -07:00
Matt Pharr	effe901890	Add task-parallel version of aobench	2011-09-07 05:43:21 -07:00
Matt Pharr	b5bfa43e92	Fix error with float suffixes	2011-09-02 13:09:25 -07:00
Matt Pharr	99221f7d17	Fix a few places in examples where the C reference implementation had a double-precision fp constant undesirably causing computation to be done in double precision. Makes C scalar versions of the options pricing models, rt, and aobench 3-5% faster. Makes scalar version of noise about 15% faster. Others are unchanged.	2011-09-01 16:31:22 -07:00
Matt Pharr	a94cabc692	Modify stencil example to do separate runs with and without task parallelism.	2011-08-30 05:08:21 -07:00
Matt Pharr	33feeffe5d	Update timing header so it works with C code	2011-08-29 11:23:43 -07:00
Matt Pharr	74c2c8ae07	Linux build fixes	2011-08-17 07:08:44 -07:00
Matt Pharr	206c851146	Various improvements to example task systems in examples/. - Only have a single copy of all of the tasks_*.cpp sample implementations, stored in examples/. - Reduce dynamic storage allocation and locking in task launch code paths. - Don't have a hard limit of the number of tasks that can be launched on Windows (fix issue #85).	2011-08-17 14:31:45 +01:00
Matt Pharr	60bdf1ef8a	Modify rt example to also do a set of runs with tasks + SPMD together.	2011-08-17 13:14:32 +01:00
Matt Pharr	d7662b3eb9	Use reduce_equal() in volume rendering example to avoid some gathers. Modified this example to use reduce_equal() to see if all of the program instances want to load the 8 sample values around the same voxel. When this is the case, we can just do 8 scalar loads, rather than needing to do a fully general gather. Once this check fails, it isn't done again, since it's not likely to start succeeding in the future. This gives a ~10% speedup with the low-res data set, and basically no performance difference with the high-res one. (It makes sense that the lower the resolution of the voxel sampling, the longer all of the rays will stay in the same set of voxels.)	2011-08-17 12:37:07 +01:00
Matt Pharr	ecaa57c7c6	Add volume rendering example. (~2.3x speedup from SIMD vs serial code.)	2011-08-17 12:05:37 +01:00
Matt Pharr	fce183c244	Merge branch 'master' of github.com:ispc/ispc	2011-08-17 10:32:49 +01:00
Matt Pharr	7a92f8b3f9	Add MSVC build support for stencil example	2011-08-17 02:28:49 -07:00
Matt Pharr	96af08e789	Print notices about image files being written	2011-08-16 06:31:26 +01:00
Matt Pharr	c570108026	Fix linux build of stencil example	2011-08-15 04:44:17 -07:00
Matt Pharr	137ea7bde6	Rename semaphore filename to be more generic	2011-08-04 05:28:00 -07:00
Matt Pharr	e05b3981d9	Add stencil example	2011-08-03 13:49:02 -07:00
Matt Pharr	a4bb6b5520	Add new example with implementation of Perlin Noise. ~4.2x speedup versus serial on OSX / gcc. ~2.9x speedup versus serial on Windows / MSVC.	2011-08-01 10:33:18 +01:00
Matt Pharr	80ca02af58	Add missing #include, fix Linux build. Fixes issue #75.	2011-07-27 10:51:13 +01:00
Matt Pharr	bba7211654	Add support for int8/int16 types. Addresses issues #9 and #42.	2011-07-21 06:57:40 +01:00
Matt Pharr	6b0a6c0124	Fix issue #67: don't crash ungracefully if target ISA not supported on system. - In the ispc-generated header files, a #define now indicates which compilation target was used. - The examples use utility routines from the new file examples/cpuid.h to check the system's CPU's capabilities to see if it supports the ISA that was used for compiling the example code and print error messages if things aren't going to work...	2011-07-18 12:29:43 +01:00
Matt Pharr	213c3a9666	Release notes, bump doxygen version # for next release. Add more .gitignore stuff.	2011-07-17 16:52:36 +02:00
Matt Pharr	6e4c165c7e	Use malloc to allocate storage for task parameters on Windows. Fixes bug #55. A number of tests were crashing on Windows due to the task launch code using alloca to allocate space for the tasks' parameters. On Windows, the stack isn't generally big enough for this to be a good idea. Also added an alignment parameter to ISPCMalloc() to pass the alignment requirement along.	2011-07-06 05:53:25 -07:00
Matt Pharr	46ccc251c8	Added C preprocessor support for Windows. Link the appropriate clang libraries to make the preprocessor stuff work on Windows builds. Also updated the solution files for the examples to stop using cl.exe for preprocessing but to just call ispc directly. Finishes fixes for issue #32.	2011-07-04 05:01:04 -07:00
Matt Pharr	6ed6961958	Add checks to sample task systems to ensure that TasksInit has been called; if not, print an informative error message.	2011-07-01 14:11:16 +01:00
Matt Pharr	bcae21dbca	Update examples to use fpmath:fast and to enable intrinsics on Windows	2011-06-30 13:17:14 -07:00
Matt Pharr	ff76c2334e	small doc fix, removed incorrect comment from example	2011-06-24 16:19:51 -07:00
Matt Pharr	865e430b56	Finished updating alignment issues for vector types; don't assume pointers are aligned to the natural vector width.	2011-06-23 18:51:15 -07:00
Matt Pharr	990bee5a86	Merge branch 'master' of github.com:ispc/ispc	2011-06-23 18:21:02 -07:00
Matt Pharr	b84167dddd	Fixed a number of issues related to memory alignment; a number of places were expecting vector-width-aligned pointers where in point of fact, there's no guarantee that they would have been in general. Removed the aligned memory allocation routines from some of the examples; they're no longer needed. No perf. difference on Core2/Core i5 CPUs; older CPUs may see some regressions. Still need to update the documentation for this change and finish reviewing alignment issues in Load/Store instructions generated by .cpp files.	2011-06-23 18:18:33 -07:00
Andreas Wendleder	39542f420a	Ignore built files.	2011-06-23 16:06:38 -07:00
Matt Pharr	e5bc6cd67c	Update examples/ Makefiles to make x86-64 explicit in compiler flags	2011-06-23 10:00:07 -07:00
Matt Pharr	18af5226ba	Initial commit.	2011-06-21 12:48:50 -07:00

339 Commits