Commit Graph

1727 Commits

Author SHA1 Message Date
Evghenii
fd17ad236a export functions are now also generated... next add proper CDP calls.. 2013-11-12 14:05:12 +01:00
Evghenii
dbb96c1885 need to fix launch code 2013-11-12 13:41:03 +01:00
Evghenii
4cd7e10ad3 reversed to original changes. Here is the plan to use CDP and genarate only device code with host wrapper.. 2013-11-12 12:51:56 +01:00
Evghenii
3fd76d59ea +1 2013-11-12 11:32:42 +01:00
Evghenii
f445a470df handwired CDP launch 2013-11-12 11:25:43 +01:00
Evghenii
4e5299a9bf added CDP 2013-11-12 11:19:23 +01:00
Evghenii
a6afef9f3f +added some more mem management stuff 2013-11-12 08:31:45 +01:00
Evghenii
6a1fb8ea31 some kernel tuning 2013-11-11 14:24:13 +01:00
Evghenii
f2c66dc4c3 added any/none/all for bool 2013-11-11 12:59:40 +01:00
Evghenii
a91c8e15e2 added reduce_min/max_float, packed_store_active for CUDA, and now kerenls1.ispc just work :) 2013-11-11 12:33:39 +01:00
Evghenii
9c7a842163 ptx has support for half-float 2013-11-11 12:25:47 +01:00
Evghenii
3dd6173a65 added packed_store_active that can be called with active flag 2013-11-11 12:25:15 +01:00
Evghenii
e9bc2b7b54 added uniform_new/uniform_delete in util_ptx.m4 and __shfl intrinsics 2013-11-11 09:18:15 +01:00
Evghenii
38947ab71b made CU version working 2013-11-10 20:10:37 +01:00
Evghenii
8a7801264a added tuned code 2013-11-10 16:02:10 +01:00
Evghenii
66edc180be working on aobench 2013-11-10 14:29:53 +01:00
Evghenii
17809992d7 working on ao 2013-11-10 14:26:00 +01:00
evghenii
c10033211b removed 2013-11-10 14:17:59 +01:00
Evghenii
7d4ea1b6f0 added wc-timer 2013-11-10 14:15:16 +01:00
Evghenii
0dfe823c32 added kernels that use shared memory 2013-11-10 14:06:06 +01:00
Evghenii
bef275f62c amadded drv_api_error_String.h 2013-11-10 14:05:34 +01:00
evghenii
edb4c57e3d +added host code as well and restored original main.cpp 2013-11-10 14:07:15 +01:00
evghenii
c1b3face8f change time from sec to ms 2013-11-10 14:04:01 +01:00
Evghenii
9d23c10475 deffered_shading probilem identified. need solution 2013-11-10 13:59:41 +01:00
Evghenii
78d509dba5 working on deferred shading 2013-11-10 12:10:10 +01:00
Evghenii
1a37135f98 +1 2013-11-09 21:23:34 +01:00
Evghenii
dbd0581cb3 +added CUDA code 2013-11-09 21:05:28 +01:00
Evghenii
946530019a Merge branch 'nvptx' of github.com:egaburov/ispc into nvptx 2013-11-09 20:56:55 +01:00
Evghenii
8f6f6d10e7 +some tuning 2013-11-09 20:56:48 +01:00
evghenii
3a549e5c2f xeonphi tests added for rt 2013-11-09 19:26:19 +01:00
evghenii
dc7015c5f2 added wc-timer for host code 2013-11-09 19:08:08 +01:00
Evghenii
356e9c6810 +fixed rt.cpp to compile with nvvm 2013-11-09 19:02:14 +01:00
egaburov
d0ddec469a Merge branch 'master' into nvptx 2013-11-08 15:42:58 +01:00
evghenii
87de3a2d06 added wc-timer for host code 2013-11-08 15:39:57 +01:00
Evghenii
eb8e1a2160 +added wc-timer 2013-11-08 15:27:51 +01:00
Evghenii
ce5f8cd46f replaced with fresh examples 2013-11-08 14:17:26 +01:00
Evghenii
b2f62d51b0 tuned stencil 2013-11-08 14:15:27 +01:00
Evghenii
b3c68af40a added volume rendering to run on GPU 2013-11-08 13:57:16 +01:00
Evghenii
348100ba42 remove stencil.cubin 2013-11-08 10:01:15 +01:00
Evghenii
426afc7377 added workable .cu files for stencil & mandelbrot 2013-11-08 10:00:49 +01:00
jbrodman
b04caabf39 Merge pull request #656 from egaburov/knc-fix
restored ISPC_FORCE_ALIGNED_MEMORY
2013-11-07 17:36:36 -08:00
evghenii
32cfdd52d3 Merge branch 'master' into knc-fix 2013-11-05 15:46:54 +01:00
evghenii
015af03bdc changed back to #define ISPC_FORCE_ALIGNED_MEMORY aligned_ld/st #else unaligned ld/st #endif. However load<64>/store<64> will still be unaliged w/o this define because of fails related to the issue #632 2013-11-05 15:41:14 +01:00
Dmitry Babokin
99946ae0e6 Merge pull request #655 from ifilippov/windows
Windows support in alloy
2013-11-05 05:51:37 -08:00
Ilia Filippov
a910bfb539 Windows support 2013-11-05 16:31:01 +04:00
Evghenii
cb7cbec0d5 added cuda examples 2013-11-04 11:44:49 +01:00
Evghenii
cb6614da42 fixed the code that non-task code is also emitted for the host 2013-11-04 11:33:37 +01:00
Evghenii
6fae459847 a+1 2013-11-04 10:22:05 +01:00
Dmitry Babokin
bc7c54e5c8 Merge pull request #652 from jbrodman/ptrarithlvalues
Fix a case for a missing lvalue for pointer arithmetic
2013-11-01 10:39:27 -07:00
Dmitry Babokin
ab835d8086 Merge pull request #653 from jbrodman/master
Fix for Visual Studio compilation error.
2013-11-01 10:39:13 -07:00