Evghenii
|
dbd0581cb3
|
+added CUDA code
|
2013-11-09 21:05:28 +01:00 |
|
Evghenii
|
946530019a
|
Merge branch 'nvptx' of github.com:egaburov/ispc into nvptx
|
2013-11-09 20:56:55 +01:00 |
|
Evghenii
|
8f6f6d10e7
|
+some tuning
|
2013-11-09 20:56:48 +01:00 |
|
evghenii
|
3a549e5c2f
|
xeonphi tests added for rt
|
2013-11-09 19:26:19 +01:00 |
|
evghenii
|
dc7015c5f2
|
added wc-timer for host code
|
2013-11-09 19:08:08 +01:00 |
|
Evghenii
|
356e9c6810
|
+fixed rt.cpp to compile with nvvm
|
2013-11-09 19:02:14 +01:00 |
|
egaburov
|
d0ddec469a
|
Merge branch 'master' into nvptx
|
2013-11-08 15:42:58 +01:00 |
|
evghenii
|
87de3a2d06
|
added wc-timer for host code
|
2013-11-08 15:39:57 +01:00 |
|
Evghenii
|
eb8e1a2160
|
+added wc-timer
|
2013-11-08 15:27:51 +01:00 |
|
Evghenii
|
ce5f8cd46f
|
replaced with fresh examples
|
2013-11-08 14:17:26 +01:00 |
|
Evghenii
|
b2f62d51b0
|
tuned stencil
|
2013-11-08 14:15:27 +01:00 |
|
Evghenii
|
b3c68af40a
|
added volume rendering to run on GPU
|
2013-11-08 13:57:16 +01:00 |
|
Evghenii
|
348100ba42
|
remove stencil.cubin
|
2013-11-08 10:01:15 +01:00 |
|
Evghenii
|
426afc7377
|
added workable .cu files for stencil & mandelbrot
|
2013-11-08 10:00:49 +01:00 |
|
jbrodman
|
b04caabf39
|
Merge pull request #656 from egaburov/knc-fix
restored ISPC_FORCE_ALIGNED_MEMORY
|
2013-11-07 17:36:36 -08:00 |
|
evghenii
|
32cfdd52d3
|
Merge branch 'master' into knc-fix
|
2013-11-05 15:46:54 +01:00 |
|
evghenii
|
015af03bdc
|
changed back to #define ISPC_FORCE_ALIGNED_MEMORY aligned_ld/st #else unaligned ld/st #endif. However load<64>/store<64> will still be unaliged w/o this define because of fails related to the issue #632
|
2013-11-05 15:41:14 +01:00 |
|
Dmitry Babokin
|
99946ae0e6
|
Merge pull request #655 from ifilippov/windows
Windows support in alloy
|
2013-11-05 05:51:37 -08:00 |
|
Ilia Filippov
|
a910bfb539
|
Windows support
|
2013-11-05 16:31:01 +04:00 |
|
Evghenii
|
cb7cbec0d5
|
added cuda examples
|
2013-11-04 11:44:49 +01:00 |
|
Evghenii
|
cb6614da42
|
fixed the code that non-task code is also emitted for the host
|
2013-11-04 11:33:37 +01:00 |
|
Evghenii
|
6fae459847
|
a+1
|
2013-11-04 10:22:05 +01:00 |
|
Dmitry Babokin
|
bc7c54e5c8
|
Merge pull request #652 from jbrodman/ptrarithlvalues
Fix a case for a missing lvalue for pointer arithmetic
|
2013-11-01 10:39:27 -07:00 |
|
Dmitry Babokin
|
ab835d8086
|
Merge pull request #653 from jbrodman/master
Fix for Visual Studio compilation error.
|
2013-11-01 10:39:13 -07:00 |
|
james.brodman
|
0f7050d3aa
|
More stds compliant. VS doesn't like non constant length local arrays.
|
2013-10-31 19:51:13 -04:00 |
|
Evghenii
|
2cef101022
|
now emits host object file with ptx in it... next step .. testing
|
2013-10-31 18:05:04 +01:00 |
|
Evghenii
|
0a069f7de2
|
added comment
|
2013-10-31 16:06:44 +01:00 |
|
Evghenii
|
dcf9c280ee
|
some cleaning
|
2013-10-31 16:05:06 +01:00 |
|
Evghenii
|
a2fd124997
|
forced module name & ptx string to be generaetd nly once
|
2013-10-31 16:04:30 +01:00 |
|
Evghenii
|
63917f8cc2
|
now generates CUDALaunch call. Few tweaks are still necesary
|
2013-10-31 16:01:34 +01:00 |
|
Evghenii
|
e7ddb9e642
|
now adds function&module name. next step adding pointer to parameter list
|
2013-10-30 22:41:01 +01:00 |
|
james.brodman
|
ec17082864
|
Add unittest.
|
2013-10-30 17:21:10 -04:00 |
|
james.brodman
|
9ce6fbe1fa
|
Support using pointer arithmetic as lvalue
|
2013-10-30 17:07:26 -04:00 |
|
Evghenii
|
8db3d25844
|
moved PtxString to Globals
|
2013-10-30 21:05:22 +01:00 |
|
Evghenii
|
f9ec1a0097
|
.. work in programm to embed PTX into host code ..
|
2013-10-30 16:47:30 +01:00 |
|
jbrodman
|
61c33969a2
|
Merge pull request #651 from jbrodman/shiftfix
Fix logic that looks for shift builtins.
|
2013-10-29 11:08:52 -07:00 |
|
james.brodman
|
85eb4cf0d6
|
Fix logic that looks for shift builtins.
|
2013-10-29 14:02:32 -04:00 |
|
Evghenii
|
47cc470bf6
|
change nativeVectorWidth from 1 -> 32 for nvptx64
|
2013-10-29 16:07:12 +01:00 |
|
egaburov
|
60881499dc
|
Merge branch 'nvptx' of github.com:egaburov/ispc into nvptx
|
2013-10-29 15:25:14 +01:00 |
|
egaburov
|
f19cf9274e
|
Merge remote-tracking branch 'upstream/master' into nvptx
|
2013-10-29 15:24:40 +01:00 |
|
Evghenii
|
ed9bca0e12
|
add __soa_to_aos*_float1 and __aos_to_soa*_float1 builtins
|
2013-10-29 15:06:08 +01:00 |
|
Evghenii
|
f15cdc03e3
|
nvptx64 generates 2 targets: task and normal function for nvptx64 and export for avx only
|
2013-10-29 14:46:51 +01:00 |
|
Evghenii
|
b31fc6f66d
|
now can generate both targets for npvtx64. m_isPTX is set true, to distuish when to either skip or exlcusive euse export
|
2013-10-29 14:17:11 +01:00 |
|
Evghenii
|
8baef6daa3
|
+1
|
2013-10-29 14:01:53 +01:00 |
|
Evghenii
|
ac700d4860
|
checkpoint
|
2013-10-29 13:36:31 +01:00 |
|
Evghenii
|
b2baa35c3d
|
added correct datalayout for nvptx64
|
2013-10-29 11:34:01 +01:00 |
|
Evghenii
|
b50d3944ea
|
allow easy switch between llvm
|
2013-10-29 10:22:07 +01:00 |
|
Evghenii
|
f115a32073
|
fix llvm 3.2 compilation
|
2013-10-29 10:21:56 +01:00 |
|
Evghenii
|
57aefdf830
|
accepts ptx extension when target is nvptx64
|
2013-10-29 10:21:48 +01:00 |
|
Dmitry Babokin
|
0d48ace15e
|
Merge pull request #649 from dbabokin/dispatch2
Typo fix
|
2013-10-28 14:35:55 -07:00 |
|