Commit Graph

149 Commits

Author SHA1 Message Date
Evghenii
fd429e4fda added common_gpu makefile helper 2014-01-05 10:00:44 +01:00
Evghenii
2d8da306a1 merged with master 2013-12-25 21:32:34 +01:00
Ilia Filippov
15816eb07e adding __packed_store_active2 to generic targets 2013-12-19 17:50:18 +04:00
Evghenii
d4b8a0f2eb added packed_store_active2 declaration in IR 2013-12-18 11:40:03 +01:00
Evghenii
d77789d8fe +merged with master 2013-12-18 11:37:01 +01:00
Dmitry Babokin
d666fc3f8f Merge pull request #686 from ifilippov/ttt
packed_store_active2() - tuned version of packed_store_active()
2013-12-17 09:23:39 -08:00
Ilia Filippov
473f1cb4d2 packed_store_active2 2013-12-17 21:14:29 +04:00
Dmitry Babokin
6d51987e67 Merge pull request #642 from egaburov/launch3d
concept of 3d tasking
2013-12-17 08:40:07 -08:00
evghenii
c06ec92d0d added commas, added multi-dimensional tasking to mandelbrot_tasks & removed mandelbrot_task3d. Also adjusted documentation a bit 2013-12-13 11:49:11 +01:00
evghenii
bb46b561fd Merged with upstream/master 2013-11-22 08:13:16 +01:00
Evghenii
6f200d310f fixed to work with LLVM 3.2 2013-11-21 11:03:03 +01:00
Evghenii
c93e71698e restored intrinsics and added tuning options to ptxgen 2013-11-15 15:04:04 +01:00
Evghenii
f12826bac5 +added approx rcp/rsqrt/rtz with ftz=true 2013-11-14 22:17:57 +01:00
Evghenii
f1fc3bdfba added nvptx declaration to other target & fixed nvptx64 recognition 2013-11-14 20:12:58 +01:00
Evghenii
e162d5a99d programIndex still not working, found where change is needed... 2013-11-14 19:46:08 +01:00
Dmitry Babokin
d2c7b356cc Ordering functions in target-[avx|sse2].ll to be in the same order. No real changes, except adding a few alwaysinline in SSE4 target 2013-11-14 15:34:30 +04:00
Dmitry Babokin
af58955140 target-[sse4|avx]_common.ll are twin brothers, which differ only cosmetically. This commit makes them diffable. No real changes, except adding alwaysinline to sse version of __max_uniform_int32/__max_uniform_uint32 2013-11-14 15:34:30 +04:00
Evghenii
1f13a236bf small tuning 2013-11-13 13:03:26 +01:00
Evghenii
268be7f0b5 fixed ISPCSync functionality 2013-11-13 11:19:10 +01:00
Evghenii
f433aa3ad5 CDP works now 2013-11-13 10:43:52 +01:00
Evghenii
f587e0a459 handwired CDP launch 2013-11-12 21:20:10 +01:00
Evghenii
1d91a626f2 ISPC sync is not added 2013-11-12 17:02:31 +01:00
Evghenii
dbde936c3c bugfix in inlined ptx, now NVCC also compiles the ptx 2013-11-12 16:47:47 +01:00
Evghenii
cf679187b1 added CDP calls into IR, next step ... check :) 2013-11-12 16:39:22 +01:00
Evghenii
f2c66dc4c3 added any/none/all for bool 2013-11-11 12:59:40 +01:00
Evghenii
a91c8e15e2 added reduce_min/max_float, packed_store_active for CUDA, and now kerenls1.ispc just work :) 2013-11-11 12:33:39 +01:00
Evghenii
e9bc2b7b54 added uniform_new/uniform_delete in util_ptx.m4 and __shfl intrinsics 2013-11-11 09:18:15 +01:00
Evghenii
9d23c10475 deffered_shading problem identified. need solution 2013-11-10 13:59:41 +01:00
Evghenii
78d509dba5 working on deferred shading 2013-11-10 12:10:10 +01:00
Evghenii
1a37135f98 +1 2013-11-09 21:23:34 +01:00
Evghenii
b3c68af40a added volume rendering to run on GPU 2013-11-08 13:57:16 +01:00
Evghenii
6fae459847 a+1 2013-11-04 10:22:05 +01:00
Evghenii
f9ec1a0097 .. work in programm to embed PTX into host code .. 2013-10-30 16:47:30 +01:00
egaburov
60881499dc Merge branch 'nvptx' of github.com:egaburov/ispc into nvptx 2013-10-29 15:25:14 +01:00
egaburov
f19cf9274e Merge remote-tracking branch 'upstream/master' into nvptx 2013-10-29 15:24:40 +01:00
Evghenii
b2baa35c3d added correct datalayout for nvptx64 2013-10-29 11:34:01 +01:00
Dmitry Babokin
6585a925be Merge pull request #641 from jbrodman/stdlibshift
Add a "shift" operator to the stdlib.
2013-10-28 14:18:31 -07:00
Evghenii
ff98271a43 using mask i1 for nvptx64 2013-10-28 17:03:00 +01:00
Evghenii
500ad7fb51 using mask i1 for nvptx64 2013-10-28 17:01:03 +01:00
Evghenii
4f486333ed now nvptx allows extern "C" task void, which emits a kernel that should (?) be callable by driver API from external code 2013-10-28 16:47:40 +01:00
Evghenii
68ced6ce46 automatically adds -D__NVPTX__ when nvptx64 target is chosen 2013-10-28 14:08:32 +01:00
Evghenii
8391d05697 added blockIndex computations 2013-10-28 10:18:30 +01:00
Evghenii
ac095dbf3e working on nvptx 2013-10-26 16:12:33 +02:00
james.brodman
4d289b16c2 Redesign after being hit with the KISS bat. 2013-10-23 14:25:43 -04:00
egaburov
f89bad1e94 launch now passes the right info into tasking 2013-10-23 12:51:06 +02:00
james.brodman
f97a2d68c8 Bugfix for non-const shift amt and unit tests. 2013-10-22 18:29:20 -04:00
james.brodman
899f85ce9c Initial Support for new stdlib shift operator 2013-10-22 18:06:54 -04:00
egaburov
1710b9171f removed LLVM_3_0 legacy part and changed copyright to 2013 2013-10-18 08:53:01 +02:00
egaburov
7e9b4c0924 added avx2-i64x4 and avx1.1-i64x4 targets 2013-10-15 10:02:10 +02:00
egaburov
8808a8cc9c Merge remote-tracking branch 'upstream/master' into nvptx 2013-10-13 13:03:00 +02:00