Commit Graph

175 Commits

Author SHA1 Message Date
Evghenii
ac05de6835 merged with master 2014-02-21 08:25:28 +01:00
Dmitry Babokin
f280b32fa4 Merge pull request #736 from egaburov/native_trigonometry
Native trigonometry
2014-02-20 19:18:35 +03:00
Evghenii
690a8acb30 merged with master 2014-02-20 15:22:09 +01:00
Evghenii
4196c723eb merged with nvptx 2014-02-20 11:01:58 +01:00
evghenii
193bba77b0 accuracy fix 2014-02-11 11:49:03 +01:00
Evghenii
f0779f95a3 added double precision tests 2014-02-11 11:40:40 +01:00
Vsevolod Livinskij
cef5b2eb04 Some changes in saturation arithmetic 2014-02-10 12:40:53 +04:00
Evghenii
fe98fe8cdc added fast approximate rcp(double) accurate to 15 digits 2014-02-04 15:23:34 +01:00
Evghenii
eb1a495a7a added support for fast approximate rsqrt(double). Provide 16 digit accurancy but is over 3x faster than 1/sqrt(double) 2014-02-04 14:44:54 +01:00
Evghenii
4515dd5c89 added tests for rcp/rsqrt double 2014-02-02 18:19:56 +01:00
Evghenii
b7b5c9ad1d it is illegal to pass varying parapamter to a task function with nvptx target 2014-01-27 10:30:09 +01:00
Evghenii
1c2dbd6a27 a fix for .b0 ptx and some other code improvements 2014-01-27 08:51:05 +01:00
Evghenii
4ecf30530a fixed for operator2 with nvptx target 2014-01-26 15:08:25 +01:00
Evghenii
fcbdd93043 half/scan for 64 bit/clock/num_cores and other additions 2014-01-25 16:43:33 +01:00
Evghenii
9090d8b128 added support for assert 2014-01-24 12:18:20 +01:00
Evghenii
5a8351d7ea added varying new/delete 2014-01-24 09:22:55 +01:00
Evghenii
da7a2c0c7f added emulation of "soa" data types via shared-memory 2014-01-23 16:17:06 +01:00
Evghenii
2e7609156a fixes for exclclusive_scan_and/or_i32 and shuffle2 and __movmsk 2014-01-23 10:24:44 +01:00
Evghenii
06313e0ec3 exclusive_scan_and is supported, but must be called outside if-statements. in pricniple other must do the same 2014-01-22 22:12:51 +01:00
Evghenii
5376743281 added "const" before "static unfiform" in constant folding tests 2014-01-21 14:59:25 +01:00
Evghenii
215abab544 bugfix 2014-01-21 14:55:41 +01:00
Evghenii
bc99897fbb +fixed some example, found some bugs, and bugs in ptxas/cuda 2014-01-21 14:51:27 +01:00
Evghenii
5a773ed62a some cfor tests fixes for > 16 lanes 2014-01-20 16:42:33 +01:00
Evghenii
4581f10207 some changes 2014-01-20 13:46:49 +01:00
Evghenii
de4d66c56f added addrspace(4)/constant memory for const uniform declarations 2014-01-08 13:27:24 +01:00
Evghenii
8347c766f0 added uniform memory test. 2014-01-08 11:16:51 +01:00
Vsevolod Livinskij
97cc5b7f48 Added varying CFG and non-overflow part of the tests. 2014-01-06 15:24:09 +04:00
Vsevolod Livinskij
07c6f1714a Some fixes in function names and more tests was added. 2013-12-22 19:28:26 +04:00
Dmitry Babokin
d666fc3f8f Merge pull request #686 from ifilippov/ttt
packed_store_active2() - tuned version of packed_store_active()
2013-12-17 09:23:39 -08:00
Ilia Filippov
473f1cb4d2 packed_store_active2 2013-12-17 21:14:29 +04:00
Dmitry Babokin
6d51987e67 Merge pull request #642 from egaburov/launch3d
concept of 3d tasking
2013-12-17 08:40:07 -08:00
Evghenii
59b989d243 fix for --target=sse4-i18x16 2013-12-17 16:06:20 +01:00
Vsevolod Livinskij
9a135c48d9 Functions name change 2013-12-09 00:20:52 +04:00
Vsevolod Livinskij
ea94658411 Some saturation tests fixes 2013-12-06 17:20:37 +04:00
Vsevolod Livinskij
65768c20ae Added tests for saturation and some fixes for generic and avx target 2013-12-05 00:34:14 +04:00
Ilia Filippov
4579d339ea patch for LLVM 3.3 and test correction at avx2 2013-11-18 13:53:21 +04:00
james.brodman
ec17082864 Add unittest. 2013-10-30 17:21:10 -04:00
Dmitry Babokin
6585a925be Merge pull request #641 from jbrodman/stdlibshift
Add a "shift" operator to the stdlib.
2013-10-28 14:18:31 -07:00
Evghenii
84a7a5d1cb added tests for 3d launch 2013-10-26 16:16:28 +02:00
Ilia Filippov
814ee67519 patch and regression test for problem with vzeroupper 2013-10-24 16:03:55 +04:00
james.brodman
f97a2d68c8 Bugfix for non-const shift amt and unit tests. 2013-10-22 18:29:20 -04:00
Ilia Filippov
2e724b095e support of operators 2013-10-18 13:45:15 +04:00
Dmitry Babokin
b2678b4338 Typo fix is tests/double-consts.ispc 2013-09-19 17:27:58 +04:00
Dmitry Babokin
1c527ae34c Adding tests and vim support for double constant of the form .1d41 2013-09-19 12:49:45 +04:00
Dmitry Babokin
f45f6cb32a Test, documentation and vim support for double precision constants 2013-09-19 12:49:45 +04:00
Matt Pharr
502f8fd76b Reduce debug spew on failing idiv.ispc tests 2013-08-20 09:22:09 -07:00
Matt Pharr
d976da7559 Speed up idiv test (dont test int32 as thoroughly) 2013-08-20 08:49:51 -07:00
Matt Pharr
5b20b06bd9 Add avg_{up,down}_int{8,16} routines to stdlib
These compute the average of two given values, rounding up and down,
respectively, if the result isn't exact.  When possible, these are
mapped to target-specific intrinsics (PADD[BW] on IA and VH[R]ADD[US]
on NEON.)

A subsequent commit will add pattern-matching to generate calls to
these intrinsincs when the corresponding patterns are detected in the
IR.)
2013-08-06 08:41:12 -07:00
Matt Pharr
b6df447b55 Add reduce_add() for int8 and int16 types.
This maps to specialized instructions (e.g. PSADBW) when available.
2013-07-25 09:46:01 -07:00
Matt Pharr
15a3ef370a Use @llvm.readcyclecounter to implement stdlib clock() function.
Also added a test for the clock builtin.
2013-07-23 17:24:57 -07:00