Commit Graph

186 Commits

Author SHA1 Message Date
Evghenii
4641a15287 Merge branch 'master' into nvptx 2014-03-19 10:53:07 +01:00
Ilia Filippov
e1524891fc Merge pull request #751 from Vsevolod-Livinskij/master
Saturating multiplication for int64 was added.
2014-03-12 00:12:34 -07:00
Ilia Filippov
6738af0a0c changing uniform_min and uniform_max implementations for avx targets 2014-03-06 12:05:24 +04:00
Vsevolod Livinskij
af836cda27 Saturating multiplication for int64 was added. 2014-02-23 19:48:03 +04:00
Evghenii
42c4d3246c Merge branch 'master' into nvptx_clean 2014-02-21 12:45:01 +01:00
Dmitry Babokin
e8680760bf Merge pull request #741 from Vsevolod-Livinskij/master
Saturation arithmetic.
2014-02-21 12:30:58 +03:00
Dmitry Babokin
17d8047a93 Merge pull request #746 from ifilippov/master
Adding cases of 'cast' instructions in optimizations
2014-02-21 12:25:58 +03:00
Ilia Filippov
42e00ebb24 adding cases of 'cast' instructions in optimizations 2014-02-21 13:00:16 +04:00
Evghenii
ac05de6835 merged with master 2014-02-21 08:25:28 +01:00
Vsevolod Livinskij
7dd7020c5f Dec constants was changed with hex constants. 2014-02-20 22:57:24 +04:00
Dmitry Babokin
f280b32fa4 Merge pull request #736 from egaburov/native_trigonometry
Native trigonometry
2014-02-20 19:18:35 +03:00
Evghenii
690a8acb30 merged with master 2014-02-20 15:22:09 +01:00
Evghenii
4196c723eb merged with nvptx 2014-02-20 11:01:58 +01:00
Vsevolod Livinskij
735e6a8ab3 Saturation arithmetic mul and div for int8/int16/int32 and div for int64 was added 2014-02-18 02:07:13 +04:00
Vsevolod Livinskij
f5508db24f Saturation arithmetic (sub and add) was added for int32/int64. 2014-02-17 18:55:40 +04:00
evghenii
193bba77b0 accuracy fix 2014-02-11 11:49:03 +01:00
Evghenii
f0779f95a3 added double precision tests 2014-02-11 11:40:40 +01:00
Vsevolod Livinskij
cef5b2eb04 Some changes in saturation arithmetic 2014-02-10 12:40:53 +04:00
Evghenii
fe98fe8cdc added fast approximate rcp(double) accurate to 15 digits 2014-02-04 15:23:34 +01:00
Evghenii
eb1a495a7a added support for fast approximate rsqrt(double). Provide 16 digit accurancy but is over 3x faster than 1/sqrt(double) 2014-02-04 14:44:54 +01:00
Evghenii
4515dd5c89 added tests for rcp/rsqrt double 2014-02-02 18:19:56 +01:00
Evghenii
b7b5c9ad1d it is illegal to pass varying parapamter to a task function with nvptx target 2014-01-27 10:30:09 +01:00
Evghenii
1c2dbd6a27 a fix for .b0 ptx and some other code improvements 2014-01-27 08:51:05 +01:00
Evghenii
4ecf30530a fixed for operator2 with nvptx target 2014-01-26 15:08:25 +01:00
Evghenii
fcbdd93043 half/scan for 64 bit/clock/num_cores and other additions 2014-01-25 16:43:33 +01:00
Evghenii
9090d8b128 added support for assert 2014-01-24 12:18:20 +01:00
Evghenii
5a8351d7ea added varying new/delete 2014-01-24 09:22:55 +01:00
Evghenii
da7a2c0c7f added emulation of "soa" data types via shared-memory 2014-01-23 16:17:06 +01:00
Evghenii
2e7609156a fixes for exclclusive_scan_and/or_i32 and shuffle2 and __movmsk 2014-01-23 10:24:44 +01:00
Evghenii
06313e0ec3 exclusive_scan_and is supported, but must be called outside if-statements. in pricniple other must do the same 2014-01-22 22:12:51 +01:00
Evghenii
5376743281 added "const" before "static unfiform" in constant folding tests 2014-01-21 14:59:25 +01:00
Evghenii
215abab544 bugfix 2014-01-21 14:55:41 +01:00
Evghenii
bc99897fbb +fixed some example, found some bugs, and bugs in ptxas/cuda 2014-01-21 14:51:27 +01:00
Evghenii
5a773ed62a some cfor tests fixes for > 16 lanes 2014-01-20 16:42:33 +01:00
Evghenii
4581f10207 some changes 2014-01-20 13:46:49 +01:00
Evghenii
de4d66c56f added addrspace(4)/constant memory for const uniform declarations 2014-01-08 13:27:24 +01:00
Evghenii
8347c766f0 added uniform memory test. 2014-01-08 11:16:51 +01:00
Vsevolod Livinskij
97cc5b7f48 Added varying CFG and non-overflow part of the tests. 2014-01-06 15:24:09 +04:00
Vsevolod Livinskij
07c6f1714a Some fixes in function names and more tests was added. 2013-12-22 19:28:26 +04:00
Dmitry Babokin
d666fc3f8f Merge pull request #686 from ifilippov/ttt
packed_store_active2() - tuned version of packed_store_active()
2013-12-17 09:23:39 -08:00
Ilia Filippov
473f1cb4d2 packed_store_active2 2013-12-17 21:14:29 +04:00
Dmitry Babokin
6d51987e67 Merge pull request #642 from egaburov/launch3d
concept of 3d tasking
2013-12-17 08:40:07 -08:00
Evghenii
59b989d243 fix for --target=sse4-i18x16 2013-12-17 16:06:20 +01:00
Vsevolod Livinskij
9a135c48d9 Functions name change 2013-12-09 00:20:52 +04:00
Vsevolod Livinskij
ea94658411 Some saturation tests fixes 2013-12-06 17:20:37 +04:00
Vsevolod Livinskij
65768c20ae Added tests for saturation and some fixes for generic and avx target 2013-12-05 00:34:14 +04:00
Ilia Filippov
4579d339ea patch for LLVM 3.3 and test correction at avx2 2013-11-18 13:53:21 +04:00
james.brodman
ec17082864 Add unittest. 2013-10-30 17:21:10 -04:00
Dmitry Babokin
6585a925be Merge pull request #641 from jbrodman/stdlibshift
Add a "shift" operator to the stdlib.
2013-10-28 14:18:31 -07:00
Evghenii
84a7a5d1cb added tests for 3d launch 2013-10-26 16:16:28 +02:00