Dmitry Babokin
ce99b17616
Fix for Windows builds to include new target: avx-i64x4
2013-09-14 02:00:23 +04:00
Dmitry Babokin
06aa2067d9
Merge pull request #578 from egaburov/master
...
added --target=avx-i64x4 & svml support for all sse/avx modes
2013-09-13 09:40:24 -07:00
Evghenii
36886971e3
revert lex.ll parse.yy stdlib.ispc to state when all constants are floats
2013-09-13 16:02:53 +02:00
Evghenii
9861375f0c
renamed avx-i64x4 -> avx1-i64x4
2013-09-13 15:07:14 +02:00
Tomasz Koziara
ed825b3773
Uniform memory allocation fixed.
2013-09-13 13:14:31 +01:00
egaburov
a9913c8337
changed lexer/parser to be able to read float constants, if they have "f"-suffix
2013-09-13 10:26:15 +02:00
Evghenii
a97eb7b7cb
added clamp in double precision
2013-09-13 09:32:59 +02:00
egaburov
715b828266
fixed float constants to be read as doubles
2013-09-13 09:25:52 +02:00
Evghenii
40af8d6ed5
fixed segfault in tests/launch-*.ispc. nativeVectorWidth in avx-i64x4 was set to 4. Fixed
2013-09-12 20:25:44 +02:00
Evghenii
059d80cc11
included suggested changes, ./tests/launch-*.ispc still fails. something is mask64 related, not sure what. help...
2013-09-12 17:18:12 +02:00
egaburov
7364e06387
added mask64
2013-09-12 12:02:42 +02:00
egaburov
efc20c2110
added svml support to all sse/avx modes
2013-09-11 17:07:54 +02:00
egaburov
19379db3b6
svml cleanup
2013-09-11 16:48:56 +02:00
egaburov
9cf8e8cbf3
builtins fix for double precision svml and __stdlib_asin
2013-09-11 15:23:45 +02:00
egaburov
7a32699573
added svml.m4
2013-09-11 15:18:03 +02:00
egaburov
320c41ffcf
added svml support. experimental. for some reason all symbols are visible..
2013-09-11 15:16:50 +02:00
egaburov
9c79d4d182
added avxh with vectorWidth=4 support, use --target=avxh to enable it
2013-09-11 12:58:02 +02:00
jbrodman
582cfe55b6
Merge pull request #575 from jbrodman/master
...
Revert "Remove support for using SVML for math lib routines."
2013-09-05 10:31:23 -07:00
james.brodman
8db378b265
Revert "Remove support for using SVML for math lib routines."
...
This reverts commit d9c38b5c1f.
2013-09-04 16:01:58 -04:00
jbrodman
71a7564317
Merge pull request #574 from jbrodman/uniftypedef
...
Fix to respect uniform/varying qualifiers inside of typedefs.
2013-09-03 13:14:00 -07:00
jbrodman
c14b035a46
Merge pull request #572 from ifilippov/master
...
correction of adding -Werror option
2013-08-30 11:17:01 -07:00
jbrodman
cf2eaa0014
Merge pull request #569 from dbabokin/unmasked
...
Fix for incorrect implementation of reduce_[min|max]_[float|double]
2013-08-30 11:16:42 -07:00
jbrodman
cb92d54808
Merge pull request #570 from dbabokin/docs
...
Minor docs fixes.
2013-08-30 11:16:26 -07:00
james.brodman
97d430d5cd
Fix to respect uniform/varying qualifiers inside of typedefs.
2013-08-30 14:13:08 -04:00
Ilia Filippov
320b1700ff
correction of adding -Werror option
2013-08-30 16:01:01 +04:00
Dmitry Babokin
e06267ef1b
Fix for incorrect implementation of reduce_[min|max]_[float|double], it showed up as -O0
2013-08-29 16:16:02 +04:00
Dmitry Babokin
501a23ad20
Typos fixes in docs
2013-08-29 14:48:09 +04:00
Dmitry Babokin
c1cc80b1d5
Merge pull request #568 from jbrodman/master
...
Fix against LLVM ToT
2013-08-27 14:08:12 -07:00
james.brodman
28080b0c22
Fix build against 3.4
2013-08-27 16:56:00 -04:00
james.brodman
be3a40e70b
Fix for 3.4
2013-08-27 15:15:16 -04:00
Dmitry Babokin
5d8ebf3ca1
Fixing r183327-AVX2-GATHER.patch file permissions
2013-08-27 18:27:06 +04:00
Dmitry Babokin
443987f536
fixing ispc.rst file properties (should not be executable)
2013-08-27 15:33:44 +04:00
Dmitry Babokin
f6ce969d9f
Merge pull request #567 from ifilippov/master
...
Changes in perf.py functionality, unification of examples, correction build warnings
2013-08-26 03:26:28 -07:00
Ilia Filippov
f620cdbaa1
Changes in perf.py functionality, unification of examples, correction build warnings
2013-08-26 14:04:59 +04:00
Dmitry Babokin
3f2217646e
Merge pull request #562 from mmp/arm
...
New target naming scheme, new targets (SSE4-i8x16 and SSE4-i16x8), plus some cleanup and improvements.
2013-08-22 08:33:25 -07:00
Matt Pharr
611477e214
Revert change to lEmitVaryingSelect().
...
Using vector select versus a store and masked load for varying vector
selects seems to give worse code. This may be related to
http://llvm.org/bugs/show_bug.cgi?id=16941.
2013-08-22 07:50:25 -07:00
Dmitry Babokin
9bb5c314cd
Merge pull request #565 from dbabokin/run_tests
...
run_tests.py fix and new switch.
2013-08-22 01:48:22 -07:00
Dmitry Babokin
f31a31478b
Moving time calculation earlier
2013-08-22 12:41:57 +04:00
Dmitry Babokin
5fb30939be
Fix for #564, using wrong ispc in run_tests.py
2013-08-21 19:46:18 +04:00
Dmitry Babokin
60b413a9cb
Adding --non-interactive switch to run_tests.py
2013-08-21 19:25:30 +04:00
Matt Pharr
502f8fd76b
Reduce debug spew on failing idiv.ispc tests
2013-08-20 09:22:09 -07:00
Matt Pharr
2b2905b567
Fix (preexisting) bugs in generic-32/64.h with type of "__any", etc.
...
This should be a bool, not a one-wide vector of bools. The equivalent
fix was previously made in generic-16.h, but not made here. (Note that
many tests are still failing with these targets, but at least they
compile properly now.)
2013-08-20 09:05:50 -07:00
Matt Pharr
e7f067d70c
Fix handling of __clock() builtin for "generic" targets.
2013-08-20 09:04:52 -07:00
Matt Pharr
d976da7559
Speed up idiv test (don't test int32 as thoroughly)
2013-08-20 08:49:51 -07:00
Dmitry Babokin
84dbd66d10
Merge pull request #563 from jbrodman/debugopt
...
Separate -O and -g
2013-08-15 13:10:13 -07:00
james.brodman
6be3c24ee5
Separate -O and -g
2013-08-15 15:24:46 -04:00
Matt Pharr
42f31aed69
Another attempt at fixing the Windows build (added sse4-8/sse4-16 targets).
2013-08-14 11:02:45 -07:00
Matt Pharr
ed017c42f1
Fix ispc.vcxproj for Windows builds
2013-08-11 07:47:20 -07:00
Matt Pharr
4766467271
Revert ispc.vcxproj to version from top-of-tree.
2013-08-10 11:23:39 -07:00
Matt Pharr
ea8591a85a
Fix build with LLVM top-of-tree (link libcurses)
2013-08-10 11:22:43 -07:00