Evghenii
|
268be7f0b5
|
fixed ISPCSync functionality
|
2013-11-13 11:19:10 +01:00 |
|
Evghenii
|
55bf0d23c2
|
resotred non-ptx functionality
|
2013-11-13 11:08:58 +01:00 |
|
Evghenii
|
f433aa3ad5
|
CDP works now
|
2013-11-13 10:43:52 +01:00 |
|
Evghenii
|
f587e0a459
|
handwired CDP launch
|
2013-11-12 21:20:10 +01:00 |
|
Evghenii
|
76bfcc29c2
|
ao1.ispc is not functional just yet :S
|
2013-11-12 19:30:41 +01:00 |
|
Evghenii
|
1d91a626f2
|
ISPC sync is not added
|
2013-11-12 17:02:31 +01:00 |
|
Evghenii
|
dbde936c3c
|
bugfix in inlined ptx, now NVCC also compiles the ptx
|
2013-11-12 16:47:47 +01:00 |
|
Evghenii
|
cf679187b1
|
added CDP calls into IR, next step ... check :)
|
2013-11-12 16:39:22 +01:00 |
|
Evghenii
|
fd17ad236a
|
export functions are now also generated... next add proper CDP calls..
|
2013-11-12 14:05:12 +01:00 |
|
Evghenii
|
dbb96c1885
|
need to fix launch code
|
2013-11-12 13:41:03 +01:00 |
|
Evghenii
|
4cd7e10ad3
|
reversed to original changes. Here is the plan to use CDP and genarate only device code with host wrapper..
|
2013-11-12 12:51:56 +01:00 |
|
Evghenii
|
3fd76d59ea
|
+1
|
2013-11-12 11:32:42 +01:00 |
|
Evghenii
|
f445a470df
|
handwired CDP launch
|
2013-11-12 11:25:43 +01:00 |
|
Evghenii
|
4e5299a9bf
|
added CDP
|
2013-11-12 11:19:23 +01:00 |
|
Evghenii
|
a6afef9f3f
|
+added some more mem management stuff
|
2013-11-12 08:31:45 +01:00 |
|
Evghenii
|
6a1fb8ea31
|
some kernel tuning
|
2013-11-11 14:24:13 +01:00 |
|
Evghenii
|
f2c66dc4c3
|
added any/none/all for bool
|
2013-11-11 12:59:40 +01:00 |
|
Evghenii
|
a91c8e15e2
|
added reduce_min/max_float, packed_store_active for CUDA, and now kerenls1.ispc just work :)
|
2013-11-11 12:33:39 +01:00 |
|
Evghenii
|
9c7a842163
|
ptx has support for half-float
|
2013-11-11 12:25:47 +01:00 |
|
Evghenii
|
3dd6173a65
|
added packed_store_active that can be called with active flag
|
2013-11-11 12:25:15 +01:00 |
|
Evghenii
|
e9bc2b7b54
|
added uniform_new/uniform_delete in util_ptx.m4 and __shfl intrinsics
|
2013-11-11 09:18:15 +01:00 |
|
Evghenii
|
38947ab71b
|
made CU version working
|
2013-11-10 20:10:37 +01:00 |
|
Evghenii
|
8a7801264a
|
added tuned code
|
2013-11-10 16:02:10 +01:00 |
|
Evghenii
|
66edc180be
|
working on aobench
|
2013-11-10 14:29:53 +01:00 |
|
Evghenii
|
17809992d7
|
working on ao
|
2013-11-10 14:26:00 +01:00 |
|
evghenii
|
c10033211b
|
removed
|
2013-11-10 14:17:59 +01:00 |
|
Evghenii
|
7d4ea1b6f0
|
added wc-timer
|
2013-11-10 14:15:16 +01:00 |
|
Evghenii
|
0dfe823c32
|
added kernels that use shared memory
|
2013-11-10 14:06:06 +01:00 |
|
Evghenii
|
bef275f62c
|
amadded drv_api_error_String.h
|
2013-11-10 14:05:34 +01:00 |
|
evghenii
|
edb4c57e3d
|
+added host code as well and restored original main.cpp
|
2013-11-10 14:07:15 +01:00 |
|
evghenii
|
c1b3face8f
|
change time from sec to ms
|
2013-11-10 14:04:01 +01:00 |
|
Evghenii
|
9d23c10475
|
deffered_shading probilem identified. need solution
|
2013-11-10 13:59:41 +01:00 |
|
Evghenii
|
78d509dba5
|
working on deferred shading
|
2013-11-10 12:10:10 +01:00 |
|
Evghenii
|
1a37135f98
|
+1
|
2013-11-09 21:23:34 +01:00 |
|
Evghenii
|
dbd0581cb3
|
+added CUDA code
|
2013-11-09 21:05:28 +01:00 |
|
Evghenii
|
946530019a
|
Merge branch 'nvptx' of github.com:egaburov/ispc into nvptx
|
2013-11-09 20:56:55 +01:00 |
|
Evghenii
|
8f6f6d10e7
|
+some tuning
|
2013-11-09 20:56:48 +01:00 |
|
evghenii
|
3a549e5c2f
|
xeonphi tests added for rt
|
2013-11-09 19:26:19 +01:00 |
|
evghenii
|
dc7015c5f2
|
added wc-timer for host code
|
2013-11-09 19:08:08 +01:00 |
|
Evghenii
|
356e9c6810
|
+fixed rt.cpp to compile with nvvm
|
2013-11-09 19:02:14 +01:00 |
|
egaburov
|
d0ddec469a
|
Merge branch 'master' into nvptx
|
2013-11-08 15:42:58 +01:00 |
|
evghenii
|
87de3a2d06
|
added wc-timer for host code
|
2013-11-08 15:39:57 +01:00 |
|
Evghenii
|
eb8e1a2160
|
+added wc-timer
|
2013-11-08 15:27:51 +01:00 |
|
Evghenii
|
ce5f8cd46f
|
replaced with fresh examples
|
2013-11-08 14:17:26 +01:00 |
|
Evghenii
|
b2f62d51b0
|
tuned stencil
|
2013-11-08 14:15:27 +01:00 |
|
Evghenii
|
b3c68af40a
|
added volume rendering to run on GPU
|
2013-11-08 13:57:16 +01:00 |
|
Evghenii
|
348100ba42
|
remove stencil.cubin
|
2013-11-08 10:01:15 +01:00 |
|
Evghenii
|
426afc7377
|
added workable .cu files for stencil & mandelbrot
|
2013-11-08 10:00:49 +01:00 |
|
jbrodman
|
b04caabf39
|
Merge pull request #656 from egaburov/knc-fix
restored ISPC_FORCE_ALIGNED_MEMORY
|
2013-11-07 17:36:36 -08:00 |
|
evghenii
|
32cfdd52d3
|
Merge branch 'master' into knc-fix
|
2013-11-05 15:46:54 +01:00 |
|