129 Commits

Author SHA1 Message Date
Andrey Shishpanov
3bc99c1da1 solution for asin trouble from report 2015-07-22 14:26:51 +03:00
evghenii
9238c72e08 Merge branch 'master' into nvptx_clean_master 2014-10-14 14:27:00 +02:00
Vsevolod Livinskiy
eb61d5df72 Support for cache 2/3 and all targets 2014-10-02 16:25:23 +04:00
Vsevolod Livinskiy
0a6eb61ad0 Extend gather-scatter optimization with prefetch optimization 2014-10-02 15:21:43 +04:00
Evghenii
4641a15287 Merge branch 'master' into nvptx 2014-03-19 10:53:07 +01:00
Dmitry Babokin
8f8a9d89ef Removing trailing spaces in stdlib.ispc 2014-03-12 19:43:30 +04:00
Dmitry Babokin
1c0729df59 Clarifying comment on new functions with saturated arithmetics 2014-03-12 19:40:52 +04:00
Vsevolod Livinskij
dc00b4dd64 Undefined operation -INT64_MIN was fixed. 2014-03-08 20:11:04 +04:00
Vsevolod Livinskij
c2e05e2231 Algorithm was modified and division was changed to bit operations. 2014-02-28 20:06:46 +04:00
Vsevolod Livinskij
af836cda27 Saturating multiplication for int64 was added. 2014-02-23 19:48:03 +04:00
Evghenii
42c4d3246c Merge branch 'master' into nvptx_clean 2014-02-21 12:45:01 +01:00
Dmitry Babokin
e8680760bf Merge pull request #741 from Vsevolod-Livinskij/master
Saturation arithmetic.
2014-02-21 12:30:58 +03:00
Evghenii
ac05de6835 merged with master 2014-02-21 08:25:28 +01:00
Dmitry Babokin
f280b32fa4 Merge pull request #736 from egaburov/native_trigonometry
Native trigonometry
2014-02-20 19:18:35 +03:00
Evghenii
690a8acb30 merged with master 2014-02-20 15:22:09 +01:00
Evghenii
4196c723eb merged with nvptx 2014-02-20 11:01:58 +01:00
Vsevolod Livinskij
735e6a8ab3 Saturation arithmetic mul and div for int8/int16/int32 and div for int64 was added 2014-02-18 02:07:13 +04:00
Vsevolod Livinskij
f5508db24f Saturation arithmetic (sub and add) was added for int32/int64. 2014-02-17 18:55:40 +04:00
Vsevolod Livinskij
cef5b2eb04 Some changes in saturation arithmetic 2014-02-10 12:40:53 +04:00
Vsevolod Livinskij
1c1614d207 Some errors in comments and code were fixed 2014-02-09 21:39:42 +04:00
Evghenii
70a9b286e5 added support for native and double precision trigonometry/transendentals 2014-02-07 15:28:39 +01:00
Evghenii
81aa19a8f0 added use of native_transendentals, need to add IR 2014-02-07 11:49:24 +01:00
evghenii
732a315a4b removed __declspec(safe) duplicate 2014-02-05 13:04:45 +01:00
Evghenii
686c1d676d improvements 2014-02-05 12:04:36 +01:00
Evghenii
d3a6693eef adding __have_native_{rsqrtd,rcpd} to select between native support for double precision reciprocals and using slower but safe version in stdlib 2014-02-04 16:29:23 +01:00
Evghenii
fe98fe8cdc added fast approximate rcp(double) accurate to 15 digits 2014-02-04 15:23:34 +01:00
Evghenii
eb1a495a7a added support for fast approximate rsqrt(double). Provide 16 digit accurancy but is over 3x faster than 1/sqrt(double) 2014-02-04 14:44:54 +01:00
Evghenii
b0753dc93d added double-version for rcp 2014-02-02 18:20:05 +01:00
evghenii
3a72e05c3e +1 2014-02-02 18:16:48 +01:00
Evghenii
5a6b650d8b restored nonptx atomic_*_local 2014-01-28 15:56:30 +01:00
Evghenii
a3b00fdcd6 added support for global atomics 2014-01-26 14:23:26 +01:00
Evghenii
a7d4a3f922 fix for __any 2014-01-26 13:15:13 +01:00
Evghenii
fcbdd93043 half/scan for 64 bit/clock/num_cores and other additions 2014-01-25 16:43:33 +01:00
Evghenii
be6ac0408a added compile-time constant __is_nvptx_traget that can be used with stdlib.ispc 2014-01-24 09:02:12 +01:00
Evghenii
1cf1dab649 fixed foreach_unique and local_atomics 2014-01-23 21:57:20 +01:00
Vsevolod Livinskij
da02236b3a Scalar realization of no-vec functions was replaced from builtins to stdlib.ispc. 2014-01-20 16:06:34 +04:00
Evghenii
f86de2be78 fix: laneIndex() must be varying 2014-01-09 09:41:57 +01:00
Evghenii
d77789d8fe +merged with master 2013-12-18 11:37:01 +01:00
Ilia Filippov
473f1cb4d2 packed_store_active2 2013-12-17 21:14:29 +04:00
Vsevolod Livinskij
9a135c48d9 Functions name change 2013-12-09 00:20:52 +04:00
Vsevolod Livinskij
65768c20ae Added tests for saturation and some fixes for generic and avx target 2013-12-05 00:34:14 +04:00
Vsevolod Livinskij
35a4d1b3a2 Add some AVX2 intrinsics 2013-11-27 00:55:57 +04:00
Vsevolod Livinskij
19f73b2ede uniform signed/unsigned int8/16 2013-11-25 19:16:02 +04:00
Evghenii
589538bf39 added stencil code 2013-11-18 12:04:00 +01:00
Evghenii
3dd6173a65 added packed_store_active that can be called with active flag 2013-11-11 12:25:15 +01:00
Evghenii
426afc7377 added workable .cu files for stencil & mandelbrot 2013-11-08 10:00:49 +01:00
egaburov
f19cf9274e Merge remote-tracking branch 'upstream/master' into nvptx 2013-10-29 15:24:40 +01:00
Evghenii
8391d05697 added blockIndex computations 2013-10-28 10:18:30 +01:00
james.brodman
4d289b16c2 Redesign after being hit with the KISS bat. 2013-10-23 14:25:43 -04:00
james.brodman
899f85ce9c Initial Support for new stdlib shift operator 2013-10-22 18:06:54 -04:00