Andrey Shishpanov
|
3bc99c1da1
|
solution for asin trouble from report
|
2015-07-22 14:26:51 +03:00 |
|
evghenii
|
9238c72e08
|
Merge branch 'master' into nvptx_clean_master
|
2014-10-14 14:27:00 +02:00 |
|
Vsevolod Livinskiy
|
eb61d5df72
|
Support for cache 2/3 and all targets
|
2014-10-02 16:25:23 +04:00 |
|
Vsevolod Livinskiy
|
0a6eb61ad0
|
Extend gather-scatter optimization with prefetch optimization
|
2014-10-02 15:21:43 +04:00 |
|
Evghenii
|
4641a15287
|
Merge branch 'master' into nvptx
|
2014-03-19 10:53:07 +01:00 |
|
Dmitry Babokin
|
8f8a9d89ef
|
Removing trailing spaces in stdlib.ispc
|
2014-03-12 19:43:30 +04:00 |
|
Dmitry Babokin
|
1c0729df59
|
Clarifying comment on new functions with saturated arithmetics
|
2014-03-12 19:40:52 +04:00 |
|
Vsevolod Livinskij
|
dc00b4dd64
|
Undefined operation -INT64_MIN was fixed.
|
2014-03-08 20:11:04 +04:00 |
|
Vsevolod Livinskij
|
c2e05e2231
|
Algorithm was modified and division was changed to bit operations.
|
2014-02-28 20:06:46 +04:00 |
|
Vsevolod Livinskij
|
af836cda27
|
Saturating multiplication for int64 was added.
|
2014-02-23 19:48:03 +04:00 |
|
Evghenii
|
42c4d3246c
|
Merge branch 'master' into nvptx_clean
|
2014-02-21 12:45:01 +01:00 |
|
Dmitry Babokin
|
e8680760bf
|
Merge pull request #741 from Vsevolod-Livinskij/master
Saturation arithmetic.
|
2014-02-21 12:30:58 +03:00 |
|
Evghenii
|
ac05de6835
|
merged with master
|
2014-02-21 08:25:28 +01:00 |
|
Dmitry Babokin
|
f280b32fa4
|
Merge pull request #736 from egaburov/native_trigonometry
Native trigonometry
|
2014-02-20 19:18:35 +03:00 |
|
Evghenii
|
690a8acb30
|
merged with master
|
2014-02-20 15:22:09 +01:00 |
|
Evghenii
|
4196c723eb
|
merged with nvptx
|
2014-02-20 11:01:58 +01:00 |
|
Vsevolod Livinskij
|
735e6a8ab3
|
Saturation arithmetic mul and div for int8/int16/int32 and div for int64 was added
|
2014-02-18 02:07:13 +04:00 |
|
Vsevolod Livinskij
|
f5508db24f
|
Saturation arithmetic (sub and add) was added for int32/int64.
|
2014-02-17 18:55:40 +04:00 |
|
Vsevolod Livinskij
|
cef5b2eb04
|
Some changes in saturation arithmetic
|
2014-02-10 12:40:53 +04:00 |
|
Vsevolod Livinskij
|
1c1614d207
|
Some errors in comments and code were fixed
|
2014-02-09 21:39:42 +04:00 |
|
Evghenii
|
70a9b286e5
|
added support for native and double precision trigonometry/transendentals
|
2014-02-07 15:28:39 +01:00 |
|
Evghenii
|
81aa19a8f0
|
added use of native_transendentals, need to add IR
|
2014-02-07 11:49:24 +01:00 |
|
evghenii
|
732a315a4b
|
removed __declspec(safe) duplicate
|
2014-02-05 13:04:45 +01:00 |
|
Evghenii
|
686c1d676d
|
improvements
|
2014-02-05 12:04:36 +01:00 |
|
Evghenii
|
d3a6693eef
|
adding __have_native_{rsqrtd,rcpd} to select between native support for double precision reciprocals and using slower but safe version in stdlib
|
2014-02-04 16:29:23 +01:00 |
|
Evghenii
|
fe98fe8cdc
|
added fast approximate rcp(double) accurate to 15 digits
|
2014-02-04 15:23:34 +01:00 |
|
Evghenii
|
eb1a495a7a
|
added support for fast approximate rsqrt(double). Provide 16 digit accurancy but is over 3x faster than 1/sqrt(double)
|
2014-02-04 14:44:54 +01:00 |
|
Evghenii
|
b0753dc93d
|
added double-version for rcp
|
2014-02-02 18:20:05 +01:00 |
|
evghenii
|
3a72e05c3e
|
+1
|
2014-02-02 18:16:48 +01:00 |
|
Evghenii
|
5a6b650d8b
|
restored nonptx atomic_*_local
|
2014-01-28 15:56:30 +01:00 |
|
Evghenii
|
a3b00fdcd6
|
added support for global atomics
|
2014-01-26 14:23:26 +01:00 |
|
Evghenii
|
a7d4a3f922
|
fix for __any
|
2014-01-26 13:15:13 +01:00 |
|
Evghenii
|
fcbdd93043
|
half/scan for 64 bit/clock/num_cores and other additions
|
2014-01-25 16:43:33 +01:00 |
|
Evghenii
|
be6ac0408a
|
added compile-time constant __is_nvptx_traget that can be used with stdlib.ispc
|
2014-01-24 09:02:12 +01:00 |
|
Evghenii
|
1cf1dab649
|
fixed foreach_unique and local_atomics
|
2014-01-23 21:57:20 +01:00 |
|
Vsevolod Livinskij
|
da02236b3a
|
Scalar realization of no-vec functions was replaced from builtins to stdlib.ispc.
|
2014-01-20 16:06:34 +04:00 |
|
Evghenii
|
f86de2be78
|
fix: laneIndex() must be varying
|
2014-01-09 09:41:57 +01:00 |
|
Evghenii
|
d77789d8fe
|
+merged with master
|
2013-12-18 11:37:01 +01:00 |
|
Ilia Filippov
|
473f1cb4d2
|
packed_store_active2
|
2013-12-17 21:14:29 +04:00 |
|
Vsevolod Livinskij
|
9a135c48d9
|
Functions name change
|
2013-12-09 00:20:52 +04:00 |
|
Vsevolod Livinskij
|
65768c20ae
|
Added tests for saturation and some fixes for generic and avx target
|
2013-12-05 00:34:14 +04:00 |
|
Vsevolod Livinskij
|
35a4d1b3a2
|
Add some AVX2 intrinsics
|
2013-11-27 00:55:57 +04:00 |
|
Vsevolod Livinskij
|
19f73b2ede
|
uniform signed/unsigned int8/16
|
2013-11-25 19:16:02 +04:00 |
|
Evghenii
|
589538bf39
|
added stencil code
|
2013-11-18 12:04:00 +01:00 |
|
Evghenii
|
3dd6173a65
|
added packed_store_active that can be called with active flag
|
2013-11-11 12:25:15 +01:00 |
|
Evghenii
|
426afc7377
|
added workable .cu files for stencil & mandelbrot
|
2013-11-08 10:00:49 +01:00 |
|
egaburov
|
f19cf9274e
|
Merge remote-tracking branch 'upstream/master' into nvptx
|
2013-10-29 15:24:40 +01:00 |
|
Evghenii
|
8391d05697
|
added blockIndex computations
|
2013-10-28 10:18:30 +01:00 |
|
james.brodman
|
4d289b16c2
|
Redesign after being hit with the KISS bat.
|
2013-10-23 14:25:43 -04:00 |
|
james.brodman
|
899f85ce9c
|
Initial Support for new stdlib shift operator
|
2013-10-22 18:06:54 -04:00 |
|