Evghenii
6f200d310f
fixed to work with LLVM 3.2
2013-11-21 11:03:03 +01:00
Evghenii
c93e71698e
restored intrinsics and added tuning options to ptxgen
2013-11-15 15:04:04 +01:00
Evghenii
f12826bac5
+added approx rcp/rsqrt/rtz with ftz=true
2013-11-14 22:17:57 +01:00
Evghenii
e162d5a99d
programIndex still not working, found where change is needed...
2013-11-14 19:46:08 +01:00
Evghenii
1f13a236bf
small tuning
2013-11-13 13:03:26 +01:00
Evghenii
268be7f0b5
fixed ISPCSync functionality
2013-11-13 11:19:10 +01:00
Evghenii
f433aa3ad5
CDP works now
2013-11-13 10:43:52 +01:00
Evghenii
f587e0a459
handwired CDP launch
2013-11-12 21:20:10 +01:00
Evghenii
1d91a626f2
ISPC sync is not added
2013-11-12 17:02:31 +01:00
Evghenii
dbde936c3c
bugfix in inlined ptx, now NVCC also compiles the ptx
2013-11-12 16:47:47 +01:00
Evghenii
cf679187b1
added CDP calls into IR, next step ... check :)
2013-11-12 16:39:22 +01:00
Evghenii
f2c66dc4c3
added any/none/all for bool
2013-11-11 12:59:40 +01:00
Evghenii
a91c8e15e2
added reduce_min/max_float, packed_store_active for CUDA, and now kerenls1.ispc just works :)
2013-11-11 12:33:39 +01:00
Evghenii
e9bc2b7b54
added uniform_new/uniform_delete in util_ptx.m4 and __shfl intrinsics
2013-11-11 09:18:15 +01:00
Evghenii
9d23c10475
deferred_shading problem identified. need solution
2013-11-10 13:59:41 +01:00
Evghenii
78d509dba5
working on deferred shading
2013-11-10 12:10:10 +01:00
Evghenii
1a37135f98
+1
2013-11-09 21:23:34 +01:00
Evghenii
b3c68af40a
added volume rendering to run on GPU
2013-11-08 13:57:16 +01:00
Evghenii
b2baa35c3d
added correct datalayout for nvptx64
2013-10-29 11:34:01 +01:00
Evghenii
ff98271a43
using mask i1 for nvptx64
2013-10-28 17:03:00 +01:00
Evghenii
500ad7fb51
using mask i1 for nvptx64
2013-10-28 17:01:03 +01:00
Evghenii
4f486333ed
now nvptx allows extern "C" task void, which emits a kernel that should (?) be callable by driver API from external code
2013-10-28 16:47:40 +01:00
Evghenii
8391d05697
added blockIndex computations
2013-10-28 10:18:30 +01:00
egaburov
5d56d29240
merged with master
2013-10-08 19:13:30 +02:00
egaburov
153fbc3d7d
some changes
2013-07-29 11:05:05 +02:00
egaburov
af61c9bae3
working on target-nvptx64... need to add nvptx64
2013-07-28 15:50:08 +02:00
egaburov
67b549a937
Added nvptx64 target. Things to do:
...
1. builtins/target-nvptx64.ll to write, now it is just a copy of target-generic-1.ll
2. add __global__ & __device__ scope
3. make code work for a single cuda thread
4. use tasks to work as a block grid and programIndex as laneIdx, programCount as warpSize
5. ... and more...
2013-07-28 14:31:43 +02:00