aaron/ispc - ispc - git.frat.tech

aaron/ispc

Author	SHA1	Message	Date
Evghenii	9861375f0c	renamed avx-i64x4 -> avx1-i64x4	2013-09-13 15:07:14 +02:00
Tomasz Koziara	ed825b3773	Uniform memory allocation fixed.	2013-09-13 13:14:31 +01:00
egaburov	a9913c8337	changed lexer/parser to be able to read float constants, if they have "f"-suffix	2013-09-13 10:26:15 +02:00
Evghenii	a97eb7b7cb	added clamp in double precision	2013-09-13 09:32:59 +02:00
egaburov	715b828266	fixed float constants to be read as doubles	2013-09-13 09:25:52 +02:00
Evghenii	40af8d6ed5	fixed segfault in tests/launch-*.ispc. nativeVectoWidth in avx-i64x4 was set to 4. Fixed	2013-09-12 20:25:44 +02:00
Evghenii	059d80cc11	included suggested changes, ./tests/launch-*.ispc still fails. something is mask64 related, not sure what. help...	2013-09-12 17:18:12 +02:00
egaburov	7364e06387	added mask64	2013-09-12 12:02:42 +02:00
egaburov	efc20c2110	added svml support to all sse/avx modes	2013-09-11 17:07:54 +02:00
egaburov	19379db3b6	svml cleanup	2013-09-11 16:48:56 +02:00
egaburov	9cf8e8cbf3	builtins fix for double precision svml and __stdlib_asin	2013-09-11 15:23:45 +02:00
egaburov	7a32699573	added svml.m4	2013-09-11 15:18:03 +02:00
egaburov	320c41ffcf	added svml support. experimental. for some reason all sybmols are visible..	2013-09-11 15:16:50 +02:00
egaburov	9c79d4d182	addded avxh with vectorWidth=4 support, use --target=avxh to enable it	2013-09-11 12:58:02 +02:00
jbrodman	582cfe55b6	Merge pull request #575 from jbrodman/master Revert "Remove support for using SVML for math lib routines."	2013-09-05 10:31:23 -07:00
james.brodman	8db378b265	Revert "Remove support for using SVML for math lib routines." This reverts commit `d9c38b5c1f`.	2013-09-04 16:01:58 -04:00
jbrodman	71a7564317	Merge pull request #574 from jbrodman/uniftypedef Fix to respect uniform/varying qualifiers inside of typedefs.	2013-09-03 13:14:00 -07:00
jbrodman	c14b035a46	Merge pull request #572 from ifilippov/master correction of adding -Werror option	2013-08-30 11:17:01 -07:00
jbrodman	cf2eaa0014	Merge pull request #569 from dbabokin/unmasked Fix for incorrect implementation of reduce_[min\|max]_[float\|double]	2013-08-30 11:16:42 -07:00
jbrodman	cb92d54808	Merge pull request #570 from dbabokin/docs Minor docs fixes.	2013-08-30 11:16:26 -07:00
james.brodman	97d430d5cd	Fix to respect uniform/varying qualifiers inside of typedefs.	2013-08-30 14:13:08 -04:00
Ilia Filippov	320b1700ff	correction of adding -Werror option	2013-08-30 16:01:01 +04:00
Dmitry Babokin	e06267ef1b	Fix for incorrect implementation of reduce_[min\|max]_[float\|double], it showed up as -O0	2013-08-29 16:16:02 +04:00
Dmitry Babokin	501a23ad20	Typos fixes in docs	2013-08-29 14:48:09 +04:00
Dmitry Babokin	c1cc80b1d5	Merge pull request #568 from jbrodman/master Fix against LLVM ToT	2013-08-27 14:08:12 -07:00
james.brodman	28080b0c22	Fix build against 3.4	2013-08-27 16:56:00 -04:00
james.brodman	be3a40e70b	Fix for 3.4	2013-08-27 15:15:16 -04:00
Dmitry Babokin	5d8ebf3ca1	Fixing r183327-AVX2-GATHER.patch file permissions	2013-08-27 18:27:06 +04:00
Dmitry Babokin	443987f536	fixing ispc.rst file properties (should not be executable)	2013-08-27 15:33:44 +04:00
Dmitry Babokin	f6ce969d9f	Merge pull request #567 from ifilippov/master Changes in perf.py functionality, unification of examples, correction build warnings	2013-08-26 03:26:28 -07:00
Ilia Filippov	f620cdbaa1	Changes in perf.py functionality, unification of examples, correction build warnings	2013-08-26 14:04:59 +04:00
Dmitry Babokin	3f2217646e	Merge pull request #562 from mmp/arm New target naming scheme, new targets (SSE4-i8x16 and SSE4-i16x8), plus some cleanup and improvements.	2013-08-22 08:33:25 -07:00
Matt Pharr	611477e214	Revert change to lEmitVaryingSelect(). Using vector select versus a store and masked load for varying vector selects seems to give worse code. This may be related to http://llvm.org/bugs/show_bug.cgi?id=16941.	2013-08-22 07:50:25 -07:00
Dmitry Babokin	9bb5c314cd	Merge pull request #565 from dbabokin/run_tests run_tests.py fix and new switch.	2013-08-22 01:48:22 -07:00
Dmitry Babokin	f31a31478b	Moving time calculation earlier	2013-08-22 12:41:57 +04:00
Dmitry Babokin	5fb30939be	Fix for #564 , using wrong ispc in run_tests.py	2013-08-21 19:46:18 +04:00
Dmitry Babokin	60b413a9cb	Adding --non-interactive switch to run_tests.py	2013-08-21 19:25:30 +04:00
Matt Pharr	502f8fd76b	Reduce debug spew on failing idiv.ispc tests	2013-08-20 09:22:09 -07:00
Matt Pharr	2b2905b567	Fix (preexisting) bugs in generic-32/64.h with type of "__any", etc. This should be a bool, not a one-wide vector of bools. The equivalent fix was previously made in generic-16.h, but not made here. (Note that many tests are still failing with these targets, but at least they compile properly now.)	2013-08-20 09:05:50 -07:00
Matt Pharr	e7f067d70c	Fix handling of __clock() builtin for "generic" targets.	2013-08-20 09:04:52 -07:00
Matt Pharr	d976da7559	Speed up idiv test (dont test int32 as thoroughly)	2013-08-20 08:49:51 -07:00
Dmitry Babokin	84dbd66d10	Merge pull request #563 from jbrodman/debugopt Separate -O and -g	2013-08-15 13:10:13 -07:00
james.brodman	6be3c24ee5	Separate -O and -g	2013-08-15 15:24:46 -04:00
Matt Pharr	42f31aed69	Another attempt at fixing the Windows build (added sse4-8/sse4-16 targets).	2013-08-14 11:02:45 -07:00
Matt Pharr	ed017c42f1	Fix ispc.vcxproj for Windows builds	2013-08-11 07:47:20 -07:00
Matt Pharr	4766467271	Revert ispc.vcxproj to version from top-of-tree.	2013-08-10 11:23:39 -07:00
Matt Pharr	ea8591a85a	Fix build with LLVM top-of-tree (link libcurses)	2013-08-10 11:22:43 -07:00
Matt Pharr	7ab4c5391c	Fix build with LLVM 3.2 and generic-4 / examples/sse4.h target.	2013-08-09 19:56:43 -07:00
Matt Pharr	0c5742b6f8	Implement new naming scheme for --target. Now targets are named like "<isa>-i<mask size>x<gang size>", e.g. "sse4-i8x16", or "avx2-i32x16". The old target names are still supported.	2013-08-08 19:23:44 -07:00
Matt Pharr	1d76f74b16	Fix compiler warnings	2013-08-07 12:53:39 -07:00

1 2 3 4 5 ...

1517 Commits