Commit Graph

1764 Commits

Author SHA1 Message Date
Evghenii
8bb8f0eda4 +1 2013-11-14 17:04:50 +01:00
Evghenii
be2cc8f946 restored foreach in sort 2013-11-14 16:51:59 +01:00
Evghenii
599ada8354 added deferred shading foreach_tile 2013-11-14 16:49:47 +01:00
Evghenii
83b9cc5c0a +1 2013-11-14 16:44:09 +01:00
Evghenii
af75afeb7a foreach[_tiled] seems to work now 2013-11-14 16:29:40 +01:00
Evghenii
48644813d4 stmt.cpp forking on foreach 2013-11-14 11:30:22 +01:00
evghenii
c81821ed28 +1 2013-11-13 21:17:21 +01:00
Evghenii
42cfe97427 using now cuda_ispc.h 2013-11-13 21:06:40 +01:00
Evghenii
09a2c12ea0 added cuda_ispc.h & cuda eror_strings 2013-11-13 21:04:59 +01:00
Evghenii
a0f6f264f6 fixed problem with new/delete and added Mel/sec counter 2013-11-13 20:34:01 +01:00
Evghenii
6f9cea5b58 removed binary 2013-11-13 19:43:45 +01:00
Evghenii
dd4ac42491 added print m 2013-11-13 19:43:32 +01:00
Evghenii
01df6ed4a9 added ispc timers w/o task 2013-11-13 19:13:04 +01:00
Evghenii
e71259006c +1 2013-11-13 19:06:02 +01:00
Evghenii
0f161b500f +1 2013-11-13 19:02:45 +01:00
Evghenii
e442139c39 runs, next check correctness 2013-11-13 18:15:52 +01:00
Evghenii
8b0f871c06 +1 2013-11-13 17:23:23 +01:00
Evghenii
61fab0340c working on sort 2013-11-13 17:07:55 +01:00
Evghenii
525eacd035 +1 2013-11-13 16:32:56 +01:00
Evghenii
780e9f31fe some tuning 2013-11-13 16:23:05 +01:00
Evghenii
c0b54aa58c added Makefile_gpu 2013-11-13 16:20:51 +01:00
Evghenii
c0c1cc1ba7 +added Makefile and some fixes 2013-11-13 14:16:48 +01:00
Evghenii
dededd1929 cleaned 2013-11-13 13:56:45 +01:00
Evghenii
d3ade0654e added Makefile 2013-11-13 13:45:24 +01:00
Evghenii
2dd7128db5 added Makefile 2013-11-13 13:40:08 +01:00
Evghenii
1f13a236bf small tuning 2013-11-13 13:03:26 +01:00
Evghenii
ca1dbc3d3b fixed cuda kernel 2013-11-13 12:54:52 +01:00
Evghenii
74db8cbab3 +1 2013-11-13 12:12:09 +01:00
Evghenii
62bc39e600 +CDP works with deferred shading 2013-11-13 11:57:37 +01:00
Evghenii
268be7f0b5 fixed ISPCSync functionality 2013-11-13 11:19:10 +01:00
Evghenii
55bf0d23c2 resotred non-ptx functionality 2013-11-13 11:08:58 +01:00
Evghenii
f433aa3ad5 CDP works now 2013-11-13 10:43:52 +01:00
Evghenii
f587e0a459 handwired CDP launch 2013-11-12 21:20:10 +01:00
Evghenii
76bfcc29c2 ao1.ispc is not functional just yet :S 2013-11-12 19:30:41 +01:00
Evghenii
1d91a626f2 ISPC sync is not added 2013-11-12 17:02:31 +01:00
Evghenii
dbde936c3c bugfix in inlined ptx, now NVCC also compiles the ptx 2013-11-12 16:47:47 +01:00
Evghenii
cf679187b1 added CDP calls into IR, next step ... check :) 2013-11-12 16:39:22 +01:00
Evghenii
fd17ad236a export functions are now also generated... next add proper CDP calls.. 2013-11-12 14:05:12 +01:00
Evghenii
dbb96c1885 need to fix launch code 2013-11-12 13:41:03 +01:00
Evghenii
4cd7e10ad3 reversed to original changes. Here is the plan to use CDP and genarate only device code with host wrapper.. 2013-11-12 12:51:56 +01:00
Evghenii
3fd76d59ea +1 2013-11-12 11:32:42 +01:00
Evghenii
f445a470df handwired CDP launch 2013-11-12 11:25:43 +01:00
Evghenii
4e5299a9bf added CDP 2013-11-12 11:19:23 +01:00
Evghenii
a6afef9f3f +added some more mem management stuff 2013-11-12 08:31:45 +01:00
Evghenii
6a1fb8ea31 some kernel tuning 2013-11-11 14:24:13 +01:00
Evghenii
f2c66dc4c3 added any/none/all for bool 2013-11-11 12:59:40 +01:00
Evghenii
a91c8e15e2 added reduce_min/max_float, packed_store_active for CUDA, and now kerenls1.ispc just work :) 2013-11-11 12:33:39 +01:00
Evghenii
9c7a842163 ptx has support for half-float 2013-11-11 12:25:47 +01:00
Evghenii
3dd6173a65 added packed_store_active that can be called with active flag 2013-11-11 12:25:15 +01:00
Evghenii
e9bc2b7b54 added uniform_new/uniform_delete in util_ptx.m4 and __shfl intrinsics 2013-11-11 09:18:15 +01:00