Commit Graph

1517 Commits

Author SHA1 Message Date
Evghenii
9861375f0c renamed avx-i64x4 -> avx1-i64x4 2013-09-13 15:07:14 +02:00
Tomasz Koziara
ed825b3773 Uniform memory allocation fixed. 2013-09-13 13:14:31 +01:00
egaburov
a9913c8337 changed lexer/parser to be able to read float constants, if they have "f"-suffix 2013-09-13 10:26:15 +02:00
Evghenii
a97eb7b7cb added clamp in double precision 2013-09-13 09:32:59 +02:00
egaburov
715b828266 fixed float constants to be read as doubles 2013-09-13 09:25:52 +02:00
Evghenii
40af8d6ed5 fixed segfault in tests/launch-*.ispc. nativeVectoWidth in avx-i64x4 was set to 4. Fixed 2013-09-12 20:25:44 +02:00
Evghenii
059d80cc11 included suggested changes, ./tests/launch-*.ispc still fails. something is mask64 related, not sure what. help... 2013-09-12 17:18:12 +02:00
egaburov
7364e06387 added mask64 2013-09-12 12:02:42 +02:00
egaburov
efc20c2110 added svml support to all sse/avx modes 2013-09-11 17:07:54 +02:00
egaburov
19379db3b6 svml cleanup 2013-09-11 16:48:56 +02:00
egaburov
9cf8e8cbf3 builtins fix for double precision svml and __stdlib_asin 2013-09-11 15:23:45 +02:00
egaburov
7a32699573 added svml.m4 2013-09-11 15:18:03 +02:00
egaburov
320c41ffcf added svml support. experimental. for some reason all sybmols are visible.. 2013-09-11 15:16:50 +02:00
egaburov
9c79d4d182 addded avxh with vectorWidth=4 support, use --target=avxh to enable it 2013-09-11 12:58:02 +02:00
jbrodman
582cfe55b6 Merge pull request #575 from jbrodman/master
Revert "Remove support for using SVML for math lib routines."
2013-09-05 10:31:23 -07:00
james.brodman
8db378b265 Revert "Remove support for using SVML for math lib routines."
This reverts commit d9c38b5c1f.
2013-09-04 16:01:58 -04:00
jbrodman
71a7564317 Merge pull request #574 from jbrodman/uniftypedef
Fix to respect uniform/varying qualifiers inside of typedefs.
2013-09-03 13:14:00 -07:00
jbrodman
c14b035a46 Merge pull request #572 from ifilippov/master
correction of adding -Werror option
2013-08-30 11:17:01 -07:00
jbrodman
cf2eaa0014 Merge pull request #569 from dbabokin/unmasked
Fix for incorrect implementation of reduce_[min|max]_[float|double]
2013-08-30 11:16:42 -07:00
jbrodman
cb92d54808 Merge pull request #570 from dbabokin/docs
Minor docs fixes.
2013-08-30 11:16:26 -07:00
james.brodman
97d430d5cd Fix to respect uniform/varying qualifiers inside of typedefs. 2013-08-30 14:13:08 -04:00
Ilia Filippov
320b1700ff correction of adding -Werror option 2013-08-30 16:01:01 +04:00
Dmitry Babokin
e06267ef1b Fix for incorrect implementation of reduce_[min|max]_[float|double], it showed up as -O0 2013-08-29 16:16:02 +04:00
Dmitry Babokin
501a23ad20 Typos fixes in docs 2013-08-29 14:48:09 +04:00
Dmitry Babokin
c1cc80b1d5 Merge pull request #568 from jbrodman/master
Fix against LLVM ToT
2013-08-27 14:08:12 -07:00
james.brodman
28080b0c22 Fix build against 3.4 2013-08-27 16:56:00 -04:00
james.brodman
be3a40e70b Fix for 3.4 2013-08-27 15:15:16 -04:00
Dmitry Babokin
5d8ebf3ca1 Fixing r183327-AVX2-GATHER.patch file permissions 2013-08-27 18:27:06 +04:00
Dmitry Babokin
443987f536 fixing ispc.rst file properties (should not be executable) 2013-08-27 15:33:44 +04:00
Dmitry Babokin
f6ce969d9f Merge pull request #567 from ifilippov/master
Changes in perf.py functionality, unification of examples, correction build warnings
2013-08-26 03:26:28 -07:00
Ilia Filippov
f620cdbaa1 Changes in perf.py functionality, unification of examples, correction build warnings 2013-08-26 14:04:59 +04:00
Dmitry Babokin
3f2217646e Merge pull request #562 from mmp/arm
New target naming scheme, new targets (SSE4-i8x16 and SSE4-i16x8), plus some cleanup and improvements.
2013-08-22 08:33:25 -07:00
Matt Pharr
611477e214 Revert change to lEmitVaryingSelect().
Using vector select versus a store and masked load for varying vector
selects seems to give worse code.  This may be related to
http://llvm.org/bugs/show_bug.cgi?id=16941.
2013-08-22 07:50:25 -07:00
Dmitry Babokin
9bb5c314cd Merge pull request #565 from dbabokin/run_tests
run_tests.py fix and new switch.
2013-08-22 01:48:22 -07:00
Dmitry Babokin
f31a31478b Moving time calculation earlier 2013-08-22 12:41:57 +04:00
Dmitry Babokin
5fb30939be Fix for #564, using wrong ispc in run_tests.py 2013-08-21 19:46:18 +04:00
Dmitry Babokin
60b413a9cb Adding --non-interactive switch to run_tests.py 2013-08-21 19:25:30 +04:00
Matt Pharr
502f8fd76b Reduce debug spew on failing idiv.ispc tests 2013-08-20 09:22:09 -07:00
Matt Pharr
2b2905b567 Fix (preexisting) bugs in generic-32/64.h with type of "__any", etc.
This should be a bool, not a one-wide vector of bools.  The equivalent
fix was previously made in generic-16.h, but not made here.  (Note that
many tests are still failing with these targets, but at least they
compile properly now.)
2013-08-20 09:05:50 -07:00
Matt Pharr
e7f067d70c Fix handling of __clock() builtin for "generic" targets. 2013-08-20 09:04:52 -07:00
Matt Pharr
d976da7559 Speed up idiv test (dont test int32 as thoroughly) 2013-08-20 08:49:51 -07:00
Dmitry Babokin
84dbd66d10 Merge pull request #563 from jbrodman/debugopt
Separate -O and -g
2013-08-15 13:10:13 -07:00
james.brodman
6be3c24ee5 Separate -O and -g 2013-08-15 15:24:46 -04:00
Matt Pharr
42f31aed69 Another attempt at fixing the Windows build (added sse4-8/sse4-16 targets). 2013-08-14 11:02:45 -07:00
Matt Pharr
ed017c42f1 Fix ispc.vcxproj for Windows builds 2013-08-11 07:47:20 -07:00
Matt Pharr
4766467271 Revert ispc.vcxproj to version from top-of-tree. 2013-08-10 11:23:39 -07:00
Matt Pharr
ea8591a85a Fix build with LLVM top-of-tree (link libcurses) 2013-08-10 11:22:43 -07:00
Matt Pharr
7ab4c5391c Fix build with LLVM 3.2 and generic-4 / examples/sse4.h target. 2013-08-09 19:56:43 -07:00
Matt Pharr
0c5742b6f8 Implement new naming scheme for --target.
Now targets are named like "<isa>-i<mask size>x<gang size>", e.g.
"sse4-i8x16", or "avx2-i32x16".

The old target names are still supported.
2013-08-08 19:23:44 -07:00
Matt Pharr
1d76f74b16 Fix compiler warnings 2013-08-07 12:53:39 -07:00