evghenii
69f3898a61
Merge branch 'master' into nvptx_merge
2014-07-07 16:30:12 +02:00
Ilia Filippov
76ea59b40b
support LLVM build
2014-06-18 17:53:42 +04:00
Evghenii
4641a15287
Merge branch 'master' into nvptx
2014-03-19 10:53:07 +01:00
Dmitry Babokin
31b95b665b
Copyright update
2014-03-12 20:19:16 +04:00
Ilia Filippov
ead5cc741d
support LLVM trunk after 203559 203213 and 203381 revisions
2014-03-12 12:58:50 +04:00
Evghenii
ac05de6835
merged with master
2014-02-21 08:25:28 +01:00
Dmitry Babokin
f280b32fa4
Merge pull request #736 from egaburov/native_trigonometry
...
Native trigonometry
2014-02-20 19:18:35 +03:00
Evghenii
690a8acb30
merged with master
2014-02-20 15:22:09 +01:00
Evghenii
4196c723eb
merged with nvptx
2014-02-20 11:01:58 +01:00
Dmitry Babokin
ea0a514e03
Fix for generic-1
2014-02-11 15:33:23 +04:00
Vsevolod Livinskij
65d947e449
Else branch with error report was added
2014-02-10 15:18:48 +04:00
Vsevolod Livinskij
cef5b2eb04
Some changes in saturation arithmetic
2014-02-10 12:40:53 +04:00
Vsevolod Livinskij
1c1614d207
Some errors in comments and code were fixed
2014-02-09 21:39:42 +04:00
Evghenii
70a9b286e5
added support for native and double precision trigonometry/transendentals
2014-02-07 15:28:39 +01:00
evghenii
09e8381ec7
change {rsqrt,rcp}_double to {rsqrt,rcp}d_decl
2014-02-05 13:05:04 +01:00
Evghenii
686c1d676d
improvements
2014-02-05 12:04:36 +01:00
Evghenii
d3a6693eef
adding __have_native_{rsqrtd,rcpd} to select between native support for double precision reciprocals and using slower but safe version in stdlib
2014-02-04 16:29:23 +01:00
Evghenii
fe98fe8cdc
added fast approximate rcp(double) accurate to 15 digits
2014-02-04 15:23:34 +01:00
Evghenii
eb1a495a7a
added support for fast approximate rsqrt(double). Provide 16 digit accurancy but is over 3x faster than 1/sqrt(double)
2014-02-04 14:44:54 +01:00
Evghenii
c2ed214a74
added declaretion for movmsk_ptx
2014-02-03 08:57:27 +01:00
evghenii
3a72e05c3e
+1
2014-02-02 18:16:48 +01:00
Evghenii
a3b00fdcd6
added support for global atomics
2014-01-26 14:23:26 +01:00
Vsevolod Livinskij
da02236b3a
Scalar realization of no-vec functions was replaced from builtins to stdlib.ispc.
2014-01-20 16:06:34 +04:00
evghenii
9053eed4b4
added basic optimization pass that promotes uniform into varying variables (not array) for nvptx target
2014-01-10 06:32:57 +01:00
Evghenii
de4d66c56f
added addrspace(4)/constant memory for const uniform declarations
2014-01-08 13:27:24 +01:00
Evghenii
9b74e60185
added conversion from addrspace(3)/__local/__shared__ to addspace(0)/generic when PtrToInt is called
2014-01-07 14:29:55 +01:00
Evghenii
a8a2cf9bdb
change nvptx64->nvptx
2014-01-06 14:04:27 +01:00
Vsevolod Livinskij
323587f10f
Scalar implementation and implementation for targets which don't have h/w instructions
2014-01-02 16:48:56 +04:00
Vsevolod Livinskij
07c6f1714a
Some fixes in function names and more tests was added.
2013-12-22 19:28:26 +04:00
Evghenii
d77789d8fe
+merged with master
2013-12-18 11:37:01 +01:00
Dmitry Babokin
d666fc3f8f
Merge pull request #686 from ifilippov/ttt
...
packed_store_active2() - tuned version of packed_store_active()
2013-12-17 09:23:39 -08:00
Ilia Filippov
473f1cb4d2
packed_store_active2
2013-12-17 21:14:29 +04:00
Dmitry Babokin
6d51987e67
Merge pull request #642 from egaburov/launch3d
...
concept of 3d tasking
2013-12-17 08:40:07 -08:00
evghenii
c06ec92d0d
added commas, added multi-dimensional tasking to mandelbrot_tasks & removed mandelbrot_task3d. Also adjusted documentaiton a bit
2013-12-13 11:49:11 +01:00
Vsevolod Livinskij
65768c20ae
Added tests for saturation and some fixes for generic and avx target
2013-12-05 00:34:14 +04:00
Vsevolod Livinskij
4faff1a63c
structural change
2013-11-30 10:48:18 +04:00
Vsevolod Livinskij
4c330bc38b
Add code generation of saturation
2013-11-29 18:40:04 +04:00
Evghenii
f1fc3bdfba
added nvptx declaration to other target & fixed nvptx64 recognition
2013-11-14 20:12:58 +01:00
Evghenii
f433aa3ad5
CDP works now
2013-11-13 10:43:52 +01:00
Evghenii
f9ec1a0097
.. work in programm to embed PTX into host code ..
2013-10-30 16:47:30 +01:00
egaburov
f19cf9274e
Merge remote-tracking branch 'upstream/master' into nvptx
2013-10-29 15:24:40 +01:00
Dmitry Babokin
6585a925be
Merge pull request #641 from jbrodman/stdlibshift
...
Add a "shift" operator to the stdlib.
2013-10-28 14:18:31 -07:00
james.brodman
4d289b16c2
Redesign after being hit with the KISS bat.
2013-10-23 14:25:43 -04:00
egaburov
f89bad1e94
launch now passes the right info into tasking
2013-10-23 12:51:06 +02:00
james.brodman
f97a2d68c8
Bugfix for non-const shift amt and unit tests.
2013-10-22 18:29:20 -04:00
james.brodman
899f85ce9c
Initial Support for new stdlib shift operator
2013-10-22 18:06:54 -04:00
Ilia Filippov
92773ada6d
fix for ISPC for compfails at sse4-i8 and sse4-i16
2013-10-11 15:23:40 +04:00
egaburov
7364e06387
added mask64
2013-09-12 12:02:42 +02:00
egaburov
320c41ffcf
added svml support. experimental. for some reason all sybmols are visible..
2013-09-11 15:16:50 +02:00
Matt Pharr
5b20b06bd9
Add avg_{up,down}_int{8,16} routines to stdlib
...
These compute the average of two given values, rounding up and down,
respectively, if the result isn't exact. When possible, these are
mapped to target-specific intrinsics (PADD[BW] on IA and VH[R]ADD[US]
on NEON.)
A subsequent commit will add pattern-matching to generate calls to
these intrinsincs when the corresponding patterns are detected in the
IR.)
2013-08-06 08:41:12 -07:00