Commit Graph

1685 Commits

Author SHA1 Message Date
Evghenii
268be7f0b5 fixed ISPCSync functionality 2013-11-13 11:19:10 +01:00
Evghenii
55bf0d23c2 resotred non-ptx functionality 2013-11-13 11:08:58 +01:00
Evghenii
f433aa3ad5 CDP works now 2013-11-13 10:43:52 +01:00
Evghenii
f587e0a459 handwired CDP launch 2013-11-12 21:20:10 +01:00
Evghenii
76bfcc29c2 ao1.ispc is not functional just yet :S 2013-11-12 19:30:41 +01:00
Evghenii
1d91a626f2 ISPC sync is not added 2013-11-12 17:02:31 +01:00
Evghenii
dbde936c3c bugfix in inlined ptx, now NVCC also compiles the ptx 2013-11-12 16:47:47 +01:00
Evghenii
cf679187b1 added CDP calls into IR, next step ... check :) 2013-11-12 16:39:22 +01:00
Evghenii
fd17ad236a export functions are now also generated... next add proper CDP calls.. 2013-11-12 14:05:12 +01:00
Evghenii
dbb96c1885 need to fix launch code 2013-11-12 13:41:03 +01:00
Evghenii
4cd7e10ad3 reversed to original changes. Here is the plan to use CDP and genarate only device code with host wrapper.. 2013-11-12 12:51:56 +01:00
Evghenii
3fd76d59ea +1 2013-11-12 11:32:42 +01:00
Evghenii
f445a470df handwired CDP launch 2013-11-12 11:25:43 +01:00
Evghenii
4e5299a9bf added CDP 2013-11-12 11:19:23 +01:00
Evghenii
a6afef9f3f +added some more mem management stuff 2013-11-12 08:31:45 +01:00
Evghenii
6a1fb8ea31 some kernel tuning 2013-11-11 14:24:13 +01:00
Evghenii
f2c66dc4c3 added any/none/all for bool 2013-11-11 12:59:40 +01:00
Evghenii
a91c8e15e2 added reduce_min/max_float, packed_store_active for CUDA, and now kerenls1.ispc just work :) 2013-11-11 12:33:39 +01:00
Evghenii
9c7a842163 ptx has support for half-float 2013-11-11 12:25:47 +01:00
Evghenii
3dd6173a65 added packed_store_active that can be called with active flag 2013-11-11 12:25:15 +01:00
Evghenii
e9bc2b7b54 added uniform_new/uniform_delete in util_ptx.m4 and __shfl intrinsics 2013-11-11 09:18:15 +01:00
Evghenii
38947ab71b made CU version working 2013-11-10 20:10:37 +01:00
Evghenii
8a7801264a added tuned code 2013-11-10 16:02:10 +01:00
Evghenii
66edc180be working on aobench 2013-11-10 14:29:53 +01:00
Evghenii
17809992d7 working on ao 2013-11-10 14:26:00 +01:00
evghenii
c10033211b removed 2013-11-10 14:17:59 +01:00
Evghenii
7d4ea1b6f0 added wc-timer 2013-11-10 14:15:16 +01:00
Evghenii
0dfe823c32 added kernels that use shared memory 2013-11-10 14:06:06 +01:00
Evghenii
bef275f62c amadded drv_api_error_String.h 2013-11-10 14:05:34 +01:00
evghenii
edb4c57e3d +added host code as well and restored original main.cpp 2013-11-10 14:07:15 +01:00
evghenii
c1b3face8f change time from sec to ms 2013-11-10 14:04:01 +01:00
Evghenii
9d23c10475 deffered_shading probilem identified. need solution 2013-11-10 13:59:41 +01:00
Evghenii
78d509dba5 working on deferred shading 2013-11-10 12:10:10 +01:00
Evghenii
1a37135f98 +1 2013-11-09 21:23:34 +01:00
Evghenii
dbd0581cb3 +added CUDA code 2013-11-09 21:05:28 +01:00
Evghenii
946530019a Merge branch 'nvptx' of github.com:egaburov/ispc into nvptx 2013-11-09 20:56:55 +01:00
Evghenii
8f6f6d10e7 +some tuning 2013-11-09 20:56:48 +01:00
evghenii
3a549e5c2f xeonphi tests added for rt 2013-11-09 19:26:19 +01:00
evghenii
dc7015c5f2 added wc-timer for host code 2013-11-09 19:08:08 +01:00
Evghenii
356e9c6810 +fixed rt.cpp to compile with nvvm 2013-11-09 19:02:14 +01:00
egaburov
d0ddec469a Merge branch 'master' into nvptx 2013-11-08 15:42:58 +01:00
evghenii
87de3a2d06 added wc-timer for host code 2013-11-08 15:39:57 +01:00
Evghenii
eb8e1a2160 +added wc-timer 2013-11-08 15:27:51 +01:00
Evghenii
ce5f8cd46f replaced with fresh examples 2013-11-08 14:17:26 +01:00
Evghenii
b2f62d51b0 tuned stencil 2013-11-08 14:15:27 +01:00
Evghenii
b3c68af40a added volume rendering to run on GPU 2013-11-08 13:57:16 +01:00
Evghenii
348100ba42 remove stencil.cubin 2013-11-08 10:01:15 +01:00
Evghenii
426afc7377 added workable .cu files for stencil & mandelbrot 2013-11-08 10:00:49 +01:00
jbrodman
b04caabf39 Merge pull request #656 from egaburov/knc-fix
restored ISPC_FORCE_ALIGNED_MEMORY
2013-11-07 17:36:36 -08:00
evghenii
32cfdd52d3 Merge branch 'master' into knc-fix 2013-11-05 15:46:54 +01:00