Evghenii
|
b3c68af40a
|
added volume rendering to run on GPU
|
2013-11-08 13:57:16 +01:00 |
|
Evghenii
|
348100ba42
|
remove stencil.cubin
|
2013-11-08 10:01:15 +01:00 |
|
Evghenii
|
426afc7377
|
added workable .cu files for stencil & mandelbrot
|
2013-11-08 10:00:49 +01:00 |
|
Evghenii
|
cb7cbec0d5
|
added cuda examples
|
2013-11-04 11:44:49 +01:00 |
|
Evghenii
|
cb6614da42
|
fixed the code that non-task code is also emitted for the host
|
2013-11-04 11:33:37 +01:00 |
|
Evghenii
|
6fae459847
|
a+1
|
2013-11-04 10:22:05 +01:00 |
|
Evghenii
|
2cef101022
|
now emits host object file with ptx in it... next step .. testing
|
2013-10-31 18:05:04 +01:00 |
|
Evghenii
|
0a069f7de2
|
added comment
|
2013-10-31 16:06:44 +01:00 |
|
Evghenii
|
dcf9c280ee
|
some cleaning
|
2013-10-31 16:05:06 +01:00 |
|
Evghenii
|
a2fd124997
|
forced module name & ptx string to be generaetd nly once
|
2013-10-31 16:04:30 +01:00 |
|
Evghenii
|
63917f8cc2
|
now generates CUDALaunch call. Few tweaks are still necesary
|
2013-10-31 16:01:34 +01:00 |
|
Evghenii
|
e7ddb9e642
|
now adds function&module name. next step adding pointer to parameter list
|
2013-10-30 22:41:01 +01:00 |
|
Evghenii
|
8db3d25844
|
moved PtxString to Globals
|
2013-10-30 21:05:22 +01:00 |
|
Evghenii
|
f9ec1a0097
|
.. work in programm to embed PTX into host code ..
|
2013-10-30 16:47:30 +01:00 |
|
Evghenii
|
47cc470bf6
|
change nativeVectorWidth from 1 -> 32 for nvptx64
|
2013-10-29 16:07:12 +01:00 |
|
egaburov
|
60881499dc
|
Merge branch 'nvptx' of github.com:egaburov/ispc into nvptx
|
2013-10-29 15:25:14 +01:00 |
|
egaburov
|
f19cf9274e
|
Merge remote-tracking branch 'upstream/master' into nvptx
|
2013-10-29 15:24:40 +01:00 |
|
Evghenii
|
ed9bca0e12
|
add __soa_to_aos*_float1 and __aos_to_soa*_float1 builtins
|
2013-10-29 15:06:08 +01:00 |
|
Evghenii
|
f15cdc03e3
|
nvptx64 generates 2 targets: task and normal function for nvptx64 and export for avx only
|
2013-10-29 14:46:51 +01:00 |
|
Evghenii
|
b31fc6f66d
|
now can generate both targets for npvtx64. m_isPTX is set true, to distuish when to either skip or exlcusive euse export
|
2013-10-29 14:17:11 +01:00 |
|
Evghenii
|
8baef6daa3
|
+1
|
2013-10-29 14:01:53 +01:00 |
|
Evghenii
|
ac700d4860
|
checkpoint
|
2013-10-29 13:36:31 +01:00 |
|
Evghenii
|
b2baa35c3d
|
added correct datalayout for nvptx64
|
2013-10-29 11:34:01 +01:00 |
|
Evghenii
|
b50d3944ea
|
allow easy switch between llvm
|
2013-10-29 10:22:07 +01:00 |
|
Evghenii
|
f115a32073
|
fix llvm 3.2 compilation
|
2013-10-29 10:21:56 +01:00 |
|
Evghenii
|
57aefdf830
|
accepts ptx extension when target is nvptx64
|
2013-10-29 10:21:48 +01:00 |
|
Dmitry Babokin
|
0d48ace15e
|
Merge pull request #649 from dbabokin/dispatch2
Typo fix
|
2013-10-28 14:35:55 -07:00 |
|
Dmitry Babokin
|
362ee06b9f
|
Typo fix
|
2013-10-29 01:35:26 +04:00 |
|
jbrodman
|
9004f090c5
|
Merge pull request #646 from dbabokin/docs
Docs fix in memory management section
|
2013-10-28 14:26:17 -07:00 |
|
jbrodman
|
6948120094
|
Merge pull request #647 from dbabokin/dispatch2
CPU check fix
|
2013-10-28 14:25:41 -07:00 |
|
Dmitry Babokin
|
6585a925be
|
Merge pull request #641 from jbrodman/stdlibshift
Add a "shift" operator to the stdlib.
|
2013-10-28 14:18:31 -07:00 |
|
james.brodman
|
e682b19eda
|
Remove zero initialization for __vec4_i32
|
2013-10-28 17:13:07 -04:00 |
|
james.brodman
|
8ee3178166
|
Add Performance Warning
|
2013-10-28 16:51:02 -04:00 |
|
james.brodman
|
09a6e37154
|
Source cleanup.
|
2013-10-28 16:37:33 -04:00 |
|
james.brodman
|
1b8e745ffe
|
remove condition. Don't use gcc 4.7 for tests.
|
2013-10-28 16:36:59 -04:00 |
|
james.brodman
|
6a1952d1f9
|
Merge branch 'stdlibshift' of https://github.com/jbrodman/ispc into stdlibshift
|
2013-10-28 16:16:30 -04:00 |
|
james.brodman
|
9ba7b96825
|
Make the new optimization play nicely with the other.s
|
2013-10-28 16:14:31 -04:00 |
|
Dmitry Babokin
|
ef5d2dc043
|
Merge pull request #648 from dbabokin/fail_db
fail_db update on all platforms
|
2013-10-28 12:08:30 -07:00 |
|
Dmitry Babokin
|
1f0f852fda
|
Standalone checker for detecting the best ISA supported on the host
|
2013-10-28 22:54:14 +04:00 |
|
Dmitry Babokin
|
a166eb7ea1
|
Check AVX OS support in host cpu check code
|
2013-10-28 22:41:23 +04:00 |
|
james.brodman
|
02681d531e
|
Minor tweak for interface.
|
2013-10-28 12:56:43 -04:00 |
|
james.brodman
|
641d882ea6
|
Add shift support for knc targets. This is not optimized.
|
2013-10-28 12:43:42 -04:00 |
|
james.brodman
|
1e80b3b0d7
|
Add shift support for generic-16 target.
|
2013-10-28 12:20:32 -04:00 |
|
Evghenii
|
ff98271a43
|
using mask i1 for nvptx64
|
2013-10-28 17:03:00 +01:00 |
|
Evghenii
|
500ad7fb51
|
using mask i1 for nvptx64
|
2013-10-28 17:01:03 +01:00 |
|
Evghenii
|
4f486333ed
|
now nvptx allows extern "C" task void, which is emits a kernel that should (?) be callable by driver API from external code
|
2013-10-28 16:47:40 +01:00 |
|
Evghenii
|
9a677b62ab
|
Merge branch 'launch3d' into nvptx
|
2013-10-28 14:28:21 +01:00 |
|
Evghenii
|
68ced6ce46
|
automatically adds -D__NVPTX__ when nvptx64 target is chosen
|
2013-10-28 14:08:32 +01:00 |
|
Evghenii
|
1bd5360d3b
|
added now NVPTX64 automatically emits unmasked extern "C" for task function with kernel attributes
|
2013-10-28 13:58:01 +01:00 |
|
Evghenii
|
a7aa1ac1cf
|
now nvptx allows extern "C" task void, which is emits a kernel that should (?) be callable by driver API from external code
|
2013-10-28 12:57:09 +01:00 |
|