Evghenii
f12826bac5
+added approx rcp/rsqrt/rtz with ftz=true
2013-11-14 22:17:57 +01:00
Evghenii
f1fc3bdfba
added nvptx declaration to other target & fixed nvptx64 recognition
2013-11-14 20:12:58 +01:00
Evghenii
e162d5a99d
programIndex still not working, found where change is needed...
2013-11-14 19:46:08 +01:00
Evghenii
1f13a236bf
small tuning
2013-11-13 13:03:26 +01:00
Evghenii
268be7f0b5
fixed ISPCSync functionality
2013-11-13 11:19:10 +01:00
Evghenii
f433aa3ad5
CDP works now
2013-11-13 10:43:52 +01:00
Evghenii
f587e0a459
handwired CDP launch
2013-11-12 21:20:10 +01:00
Evghenii
1d91a626f2
ISPC sync is not added
2013-11-12 17:02:31 +01:00
Evghenii
dbde936c3c
bugfix in inlined ptx, now NVCC also compiles the ptx
2013-11-12 16:47:47 +01:00
Evghenii
cf679187b1
added CDP calls into IR, next step ... check :)
2013-11-12 16:39:22 +01:00
Evghenii
f2c66dc4c3
added any/none/all for bool
2013-11-11 12:59:40 +01:00
Evghenii
a91c8e15e2
added reduce_min/max_float, packed_store_active for CUDA, and now kerenls1.ispc just work :)
2013-11-11 12:33:39 +01:00
Evghenii
e9bc2b7b54
added uniform_new/uniform_delete in util_ptx.m4 and __shfl intrinsics
2013-11-11 09:18:15 +01:00
Evghenii
9d23c10475
deffered_shading probilem identified. need solution
2013-11-10 13:59:41 +01:00
Evghenii
78d509dba5
working on deferred shading
2013-11-10 12:10:10 +01:00
Evghenii
1a37135f98
+1
2013-11-09 21:23:34 +01:00
Evghenii
b3c68af40a
added volume rendering to run on GPU
2013-11-08 13:57:16 +01:00
Evghenii
6fae459847
a+1
2013-11-04 10:22:05 +01:00
Evghenii
f9ec1a0097
.. work in programm to embed PTX into host code ..
2013-10-30 16:47:30 +01:00
egaburov
60881499dc
Merge branch 'nvptx' of github.com:egaburov/ispc into nvptx
2013-10-29 15:25:14 +01:00
egaburov
f19cf9274e
Merge remote-tracking branch 'upstream/master' into nvptx
2013-10-29 15:24:40 +01:00
Evghenii
b2baa35c3d
added correct datalayout for nvptx64
2013-10-29 11:34:01 +01:00
Dmitry Babokin
6585a925be
Merge pull request #641 from jbrodman/stdlibshift
...
Add a "shift" operator to the stdlib.
2013-10-28 14:18:31 -07:00
Evghenii
ff98271a43
using mask i1 for nvptx64
2013-10-28 17:03:00 +01:00
Evghenii
500ad7fb51
using mask i1 for nvptx64
2013-10-28 17:01:03 +01:00
Evghenii
4f486333ed
now nvptx allows extern "C" task void, which is emits a kernel that should (?) be callable by driver API from external code
2013-10-28 16:47:40 +01:00
Evghenii
68ced6ce46
automatically adds -D__NVPTX__ when nvptx64 target is chosen
2013-10-28 14:08:32 +01:00
Evghenii
8391d05697
added blockIndex computations
2013-10-28 10:18:30 +01:00
Evghenii
ac095dbf3e
working on nvptx
2013-10-26 16:12:33 +02:00
james.brodman
4d289b16c2
Redesign after being hit with the KISS bat.
2013-10-23 14:25:43 -04:00
egaburov
f89bad1e94
launch now passes the right info into tasking
2013-10-23 12:51:06 +02:00
james.brodman
f97a2d68c8
Bugfix for non-const shift amt and unit tests.
2013-10-22 18:29:20 -04:00
james.brodman
899f85ce9c
Initial Support for new stdlib shift operator
2013-10-22 18:06:54 -04:00
egaburov
1710b9171f
removed LLVM_3_0 legacy part and changed copyright to 2013
2013-10-18 08:53:01 +02:00
egaburov
7e9b4c0924
added avx2-i64x4 and avx1.1-i64x4 targets
2013-10-15 10:02:10 +02:00
egaburov
8808a8cc9c
Merge remote-tracking branch 'upstream/master' into nvptx
2013-10-13 13:03:00 +02:00
Ilia Filippov
92773ada6d
fix for ISPC for compfails at sse4-i8 and sse4-i16
2013-10-11 15:23:40 +04:00
egaburov
5d56d29240
merged with master
2013-10-08 19:13:30 +02:00
Dmitry Babokin
43245bbc11
Adding check for OS AVX support to auto-dispatch code
2013-09-19 15:39:56 +04:00
Evghenii
9861375f0c
renamed avx-i64x4 -> avx1-i64x4
2013-09-13 15:07:14 +02:00
Evghenii
059d80cc11
included suggested changes, ./tests/launch-*.ispc still fails. something is mask64 related, not sure what. help...
2013-09-12 17:18:12 +02:00
egaburov
7364e06387
added mask64
2013-09-12 12:02:42 +02:00
egaburov
efc20c2110
added svml support to all sse/avx modes
2013-09-11 17:07:54 +02:00
egaburov
19379db3b6
svml cleanup
2013-09-11 16:48:56 +02:00
egaburov
7a32699573
added svml.m4
2013-09-11 15:18:03 +02:00
egaburov
320c41ffcf
added svml support. experimental. for some reason all sybmols are visible..
2013-09-11 15:16:50 +02:00
egaburov
9c79d4d182
addded avxh with vectorWidth=4 support, use --target=avxh to enable it
2013-09-11 12:58:02 +02:00
james.brodman
8db378b265
Revert "Remove support for using SVML for math lib routines."
...
This reverts commit d9c38b5c1f .
2013-09-04 16:01:58 -04:00
Matt Pharr
1276ea9844
Revert "Remove support for building with LLVM 3.1"
...
This reverts commit d3c567503b .
Conflicts:
opt.cpp
2013-08-06 17:00:35 -07:00
Matt Pharr
5b20b06bd9
Add avg_{up,down}_int{8,16} routines to stdlib
...
These compute the average of two given values, rounding up and down,
respectively, if the result isn't exact. When possible, these are
mapped to target-specific intrinsics (PADD[BW] on IA and VH[R]ADD[US]
on NEON.)
A subsequent commit will add pattern-matching to generate calls to
these intrinsincs when the corresponding patterns are detected in the
IR.)
2013-08-06 08:41:12 -07:00