Evghenii
ac05de6835
merged with master
2014-02-21 08:25:28 +01:00
Dmitry Babokin
f280b32fa4
Merge pull request #736 from egaburov/native_trigonometry
...
Native trigonometry
2014-02-20 19:18:35 +03:00
Evghenii
690a8acb30
merged with master
2014-02-20 15:22:09 +01:00
Evghenii
4196c723eb
merged with nvptx
2014-02-20 11:01:58 +01:00
evghenii
193bba77b0
accuracy fix
2014-02-11 11:49:03 +01:00
Evghenii
f0779f95a3
added double precision tests
2014-02-11 11:40:40 +01:00
Vsevolod Livinskij
cef5b2eb04
Some changes in saturation arithmetic
2014-02-10 12:40:53 +04:00
Evghenii
fe98fe8cdc
added fast approximate rcp(double) accurate to 15 digits
2014-02-04 15:23:34 +01:00
Evghenii
eb1a495a7a
added support for fast approximate rsqrt(double). Provides 16-digit accuracy and is over 3x faster than 1/sqrt(double)
2014-02-04 14:44:54 +01:00
Evghenii
4515dd5c89
added tests for rcp/rsqrt double
2014-02-02 18:19:56 +01:00
Evghenii
b7b5c9ad1d
it is illegal to pass a varying parameter to a task function with the nvptx target
2014-01-27 10:30:09 +01:00
Evghenii
1c2dbd6a27
a fix for .b0 ptx and some other code improvements
2014-01-27 08:51:05 +01:00
Evghenii
4ecf30530a
fixed for operator2 with nvptx target
2014-01-26 15:08:25 +01:00
Evghenii
fcbdd93043
half/scan for 64 bit/clock/num_cores and other additions
2014-01-25 16:43:33 +01:00
Evghenii
9090d8b128
added support for assert
2014-01-24 12:18:20 +01:00
Evghenii
5a8351d7ea
added varying new/delete
2014-01-24 09:22:55 +01:00
Evghenii
da7a2c0c7f
added emulation of "soa" data types via shared-memory
2014-01-23 16:17:06 +01:00
Evghenii
2e7609156a
fixes for exclusive_scan_and/or_i32, shuffle2 and __movmsk
2014-01-23 10:24:44 +01:00
Evghenii
06313e0ec3
exclusive_scan_and is supported, but must be called outside if-statements. in principle, the others must do the same
2014-01-22 22:12:51 +01:00
Evghenii
5376743281
added "const" before "static uniform" in constant folding tests
2014-01-21 14:59:25 +01:00
Evghenii
215abab544
bugfix
2014-01-21 14:55:41 +01:00
Evghenii
bc99897fbb
fixed some examples; found some bugs, as well as bugs in ptxas/cuda
2014-01-21 14:51:27 +01:00
Evghenii
5a773ed62a
some cfor test fixes for > 16 lanes
2014-01-20 16:42:33 +01:00
Evghenii
4581f10207
some changes
2014-01-20 13:46:49 +01:00
Evghenii
de4d66c56f
added addrspace(4)/constant memory for const uniform declarations
2014-01-08 13:27:24 +01:00
Evghenii
8347c766f0
added uniform memory test.
2014-01-08 11:16:51 +01:00
Vsevolod Livinskij
97cc5b7f48
Added varying CFG and non-overflow part of the tests.
2014-01-06 15:24:09 +04:00
Vsevolod Livinskij
07c6f1714a
Some fixes in function names; more tests were added.
2013-12-22 19:28:26 +04:00
Dmitry Babokin
d666fc3f8f
Merge pull request #686 from ifilippov/ttt
...
packed_store_active2() - tuned version of packed_store_active()
2013-12-17 09:23:39 -08:00
Ilia Filippov
473f1cb4d2
packed_store_active2
2013-12-17 21:14:29 +04:00
Dmitry Babokin
6d51987e67
Merge pull request #642 from egaburov/launch3d
...
concept of 3d tasking
2013-12-17 08:40:07 -08:00
Evghenii
59b989d243
fix for --target=sse4-i18x16
2013-12-17 16:06:20 +01:00
Vsevolod Livinskij
9a135c48d9
Functions name change
2013-12-09 00:20:52 +04:00
Vsevolod Livinskij
ea94658411
Some saturation tests fixes
2013-12-06 17:20:37 +04:00
Vsevolod Livinskij
65768c20ae
Added tests for saturation and some fixes for generic and avx target
2013-12-05 00:34:14 +04:00
Ilia Filippov
4579d339ea
patch for LLVM 3.3 and test correction at avx2
2013-11-18 13:53:21 +04:00
james.brodman
ec17082864
Add unittest.
2013-10-30 17:21:10 -04:00
Dmitry Babokin
6585a925be
Merge pull request #641 from jbrodman/stdlibshift
...
Add a "shift" operator to the stdlib.
2013-10-28 14:18:31 -07:00
Evghenii
84a7a5d1cb
added tests for 3d launch
2013-10-26 16:16:28 +02:00
Ilia Filippov
814ee67519
patch and regression test for problem with vzeroupper
2013-10-24 16:03:55 +04:00
james.brodman
f97a2d68c8
Bugfix for non-const shift amt and unit tests.
2013-10-22 18:29:20 -04:00
Ilia Filippov
2e724b095e
support of operators
2013-10-18 13:45:15 +04:00
Dmitry Babokin
b2678b4338
Typo fix in tests/double-consts.ispc
2013-09-19 17:27:58 +04:00
Dmitry Babokin
1c527ae34c
Adding tests and vim support for double constant of the form .1d41
2013-09-19 12:49:45 +04:00
Dmitry Babokin
f45f6cb32a
Test, documentation and vim support for double precision constants
2013-09-19 12:49:45 +04:00
Matt Pharr
502f8fd76b
Reduce debug spew on failing idiv.ispc tests
2013-08-20 09:22:09 -07:00
Matt Pharr
d976da7559
Speed up idiv test (don't test int32 as thoroughly)
2013-08-20 08:49:51 -07:00
Matt Pharr
5b20b06bd9
Add avg_{up,down}_int{8,16} routines to stdlib
...
These compute the average of two given values, rounding up and down,
respectively, if the result isn't exact. When possible, these are
mapped to target-specific intrinsics (PADD[BW] on IA and VH[R]ADD[US]
on NEON.)
A subsequent commit will add pattern-matching to generate calls to
these intrinsics when the corresponding patterns are detected in the
IR.
2013-08-06 08:41:12 -07:00
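The rounding behaviour described for avg_{up,down}_int{8,16} can be illustrated with a minimal scalar sketch in C. This is not the ispc stdlib code: the names avg_up_u8 and avg_down_u8 are hypothetical, only the unsigned 8-bit case is shown, and the sum is widened to avoid overflow as an assumption about how "average, rounding up/down" is meant.

```c
#include <stdint.h>

/* Hypothetical scalar illustration (not the ispc stdlib implementation).
 * The sum is computed in a wider type so that it cannot overflow. */

/* Average of a and b, rounded up when (a + b) is odd. */
static inline uint8_t avg_up_u8(uint8_t a, uint8_t b) {
    return (uint8_t)(((uint16_t)a + (uint16_t)b + 1u) >> 1);
}

/* Average of a and b, rounded down when (a + b) is odd. */
static inline uint8_t avg_down_u8(uint8_t a, uint8_t b) {
    return (uint8_t)(((uint16_t)a + (uint16_t)b) >> 1);
}
```

The two helpers differ only when the sum is odd; for an exact average, such as that of 2 and 4, both return the same value.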
Matt Pharr
b6df447b55
Add reduce_add() for int8 and int16 types.
...
This maps to specialized instructions (e.g. PSADBW) when available.
2013-07-25 09:46:01 -07:00
Matt Pharr
15a3ef370a
Use @llvm.readcyclecounter to implement stdlib clock() function.
...
Also added a test for the clock builtin.
2013-07-23 17:24:57 -07:00