Commit Graph

1764 Commits

Author SHA1 Message Date
Evghenii
47cc470bf6 change nativeVectorWidth from 1 -> 32 for nvptx64 2013-10-29 16:07:12 +01:00
egaburov
60881499dc Merge branch 'nvptx' of github.com:egaburov/ispc into nvptx 2013-10-29 15:25:14 +01:00
egaburov
f19cf9274e Merge remote-tracking branch 'upstream/master' into nvptx 2013-10-29 15:24:40 +01:00
Evghenii
ed9bca0e12 add __soa_to_aos*_float1 and __aos_to_soa*_float1 builtins 2013-10-29 15:06:08 +01:00
Evghenii
f15cdc03e3 nvptx64 generates 2 targets: task and normal function for nvptx64 and export for avx only 2013-10-29 14:46:51 +01:00
Evghenii
b31fc6f66d now can generate both targets for nvptx64. m_isPTX is set true, to distinguish when to either skip or exclusively use export 2013-10-29 14:17:11 +01:00
Evghenii
8baef6daa3 +1 2013-10-29 14:01:53 +01:00
Evghenii
ac700d4860 checkpoint 2013-10-29 13:36:31 +01:00
Evghenii
b2baa35c3d added correct datalayout for nvptx64 2013-10-29 11:34:01 +01:00
Evghenii
b50d3944ea allow easy switch between llvm 2013-10-29 10:22:07 +01:00
Evghenii
f115a32073 fix llvm 3.2 compilation 2013-10-29 10:21:56 +01:00
Evghenii
57aefdf830 accepts ptx extension when target is nvptx64 2013-10-29 10:21:48 +01:00
Dmitry Babokin
0d48ace15e Merge pull request #649 from dbabokin/dispatch2: Typo fix 2013-10-28 14:35:55 -07:00
Dmitry Babokin
362ee06b9f Typo fix 2013-10-29 01:35:26 +04:00
jbrodman
9004f090c5 Merge pull request #646 from dbabokin/docs: Docs fix in memory management section 2013-10-28 14:26:17 -07:00
jbrodman
6948120094 Merge pull request #647 from dbabokin/dispatch2: CPU check fix 2013-10-28 14:25:41 -07:00
Dmitry Babokin
6585a925be Merge pull request #641 from jbrodman/stdlibshift: Add a "shift" operator to the stdlib. 2013-10-28 14:18:31 -07:00
james.brodman
e682b19eda Remove zero initialization for __vec4_i32 2013-10-28 17:13:07 -04:00
james.brodman
8ee3178166 Add Performance Warning 2013-10-28 16:51:02 -04:00
james.brodman
09a6e37154 Source cleanup. 2013-10-28 16:37:33 -04:00
james.brodman
1b8e745ffe remove condition. Don't use gcc 4.7 for tests. 2013-10-28 16:36:59 -04:00
james.brodman
6a1952d1f9 Merge branch 'stdlibshift' of https://github.com/jbrodman/ispc into stdlibshift 2013-10-28 16:16:30 -04:00
james.brodman
9ba7b96825 Make the new optimization play nicely with the others. 2013-10-28 16:14:31 -04:00
Dmitry Babokin
ef5d2dc043 Merge pull request #648 from dbabokin/fail_db: fail_db update on all platforms 2013-10-28 12:08:30 -07:00
Dmitry Babokin
1f0f852fda Standalone checker for detecting the best ISA supported on the host 2013-10-28 22:54:14 +04:00
Dmitry Babokin
a166eb7ea1 Check AVX OS support in host cpu check code 2013-10-28 22:41:23 +04:00
james.brodman
02681d531e Minor tweak for interface. 2013-10-28 12:56:43 -04:00
james.brodman
641d882ea6 Add shift support for knc targets. This is not optimized. 2013-10-28 12:43:42 -04:00
james.brodman
1e80b3b0d7 Add shift support for generic-16 target. 2013-10-28 12:20:32 -04:00
Evghenii
ff98271a43 using mask i1 for nvptx64 2013-10-28 17:03:00 +01:00
Evghenii
500ad7fb51 using mask i1 for nvptx64 2013-10-28 17:01:03 +01:00
Evghenii
4f486333ed now nvptx allows extern "C" task void, which emits a kernel that should (?) be callable by driver API from external code 2013-10-28 16:47:40 +01:00
Evghenii
9a677b62ab Merge branch 'launch3d' into nvptx 2013-10-28 14:28:21 +01:00
Evghenii
68ced6ce46 automatically adds -D__NVPTX__ when nvptx64 target is chosen 2013-10-28 14:08:32 +01:00
Evghenii
1bd5360d3b added now NVPTX64 automatically emits unmasked extern "C" for task function with kernel attributes 2013-10-28 13:58:01 +01:00
Evghenii
a7aa1ac1cf now nvptx allows extern "C" task void, which emits a kernel that should (?) be callable by driver API from external code 2013-10-28 12:57:09 +01:00
Evghenii
ae23320417 added metadata for tasks with nvptx64 target. not tasks are kernel callable from host 2013-10-28 12:10:40 +01:00
Evghenii
b68a751f4e generating proper tasking function for nvptx 2013-10-28 11:36:08 +01:00
Evghenii
8391d05697 added blockIndex computations 2013-10-28 10:18:30 +01:00
Dmitry Babokin
63a3214cc6 Removing fails with g++4.4/g++4.7, as we are using clang by default now 2013-10-28 12:45:39 +04:00
Dmitry Babokin
4382902894 Fail_db update on Windows: 3.3 update and adding 3.4 2013-10-28 12:31:24 +04:00
Dmitry Babokin
103ef25f12 Docs fix in memory management section 2013-10-27 23:01:20 +04:00
Evghenii
84a7a5d1cb added tests for 3d launch 2013-10-26 16:16:28 +02:00
Evghenii
ac095dbf3e working on nvptx 2013-10-26 16:12:33 +02:00
Dmitry Babokin
a508bd4290 MacOS fails update 2013-10-26 14:50:45 +04:00
Dmitry Babokin
c1de753db6 Merge pull request #645 from ifilippov/636: fixing problem 644 2013-10-25 12:01:20 -07:00
Ilia Filippov
621679245a fixing problem 644 2013-10-25 22:44:37 +04:00
Dmitry Babokin
58aea1b61c Fail_db update with Linux passed with LLVM 3.4 2013-10-25 21:42:57 +04:00
Dmitry Babokin
9b5ee1b31b fail_db update on Linux 2013-10-25 02:07:51 +04:00
Evghenii
383e804ec1 changed notation from taskIndex1,2,3 -> taskIndex0,1,2 2013-10-24 17:20:56 +02:00