Evghenii
f011b3cb22
added Error when varying variable is defined in global scope with nvptx target
2014-01-08 11:46:36 +01:00
Evghenii
8347c766f0
added uniform memory test.
2014-01-08 11:16:51 +01:00
Evghenii
0a66f17897
experimental support for non-constant [non-static] uniform arrays mapped to addrspace(3)
2014-01-08 11:06:14 +01:00
Evghenii
3972d740a6
added mask for tasking function
2014-01-06 16:18:28 +01:00
Evghenii
91d4ae46f6
sort --fails
2014-01-06 15:38:30 +01:00
Evghenii
546f9cb409
MAJOR CHANGE--- STOP WITH THIS BRANCH--
2014-01-06 13:51:02 +01:00
Evghenii
ddfe782151
merged
2013-12-13 11:56:43 +01:00
Ilia Filippov
8b972f2ed6
Changing error to warning: mismatch in size/layout of global variable
2013-11-26 17:08:06 +04:00
evghenii
bb46b561fd
Merged with upstream/master
2013-11-22 08:13:16 +01:00
Evghenii
1445202e0e
identified bug due to llvm-3.4
2013-11-14 21:18:25 +01:00
Evghenii
f1fc3bdfba
added nvptx declaration to other target & fixed nvptx64 recognition
2013-11-14 20:12:58 +01:00
Evghenii
7aa37b19a9
added some more macros as quick hack...
2013-11-14 20:04:05 +01:00
Evghenii
25df23fed3
workaround for programIndex via preprocessor
2013-11-14 19:48:50 +01:00
Dmitry Babokin
e100040f28
Fix bug with fail when --target=avx1.1-i32x8,avx2-i32x8 - avx11 is not a valid target anymore, need more complete string
2013-11-14 15:37:11 +04:00
Evghenii
fd17ad236a
export functions are now also generated... next add proper CDP calls..
2013-11-12 14:05:12 +01:00
Evghenii
dbb96c1885
need to fix launch code
2013-11-12 13:41:03 +01:00
Evghenii
4cd7e10ad3
reversed to original changes. Here is the plan to use CDP and genarate only device code with host wrapper..
2013-11-12 12:51:56 +01:00
Evghenii
2cef101022
now emits host object file with ptx in it... next step .. testing
2013-10-31 18:05:04 +01:00
Evghenii
8db3d25844
moved PtxString to Globals
2013-10-30 21:05:22 +01:00
Evghenii
f9ec1a0097
.. work in programm to embed PTX into host code ..
2013-10-30 16:47:30 +01:00
Evghenii
f15cdc03e3
nvptx64 generates 2 targets: task and normal function for nvptx64 and export for avx only
2013-10-29 14:46:51 +01:00
Evghenii
b31fc6f66d
now can generate both targets for npvtx64. m_isPTX is set true, to distuish when to either skip or exlcusive euse export
2013-10-29 14:17:11 +01:00
Evghenii
ac700d4860
checkpoint
2013-10-29 13:36:31 +01:00
Evghenii
b2baa35c3d
added correct datalayout for nvptx64
2013-10-29 11:34:01 +01:00
Evghenii
57aefdf830
accepts ptx extension when target is nvptx64
2013-10-29 10:21:48 +01:00
Evghenii
68ced6ce46
automatically adds -D__NVPTX__ when nvptx64 target is chosen
2013-10-28 14:08:32 +01:00
Evghenii
1bd5360d3b
added now NVPTX64 automatically emits unmasked extern "C" for task function with kernel attributes
2013-10-28 13:58:01 +01:00
Evghenii
a7aa1ac1cf
now nvptx allows extern "C" task void, which is emits a kernel that should (?) be callable by driver API from external code
2013-10-28 12:57:09 +01:00
james.brodman
44912e6b1e
Fix segfault when using both -g and -MMM
2013-10-08 18:27:03 -04:00
Matt Pharr
1276ea9844
Revert "Remove support for building with LLVM 3.1"
...
This reverts commit d3c567503b .
Conflicts:
opt.cpp
2013-08-06 17:00:35 -07:00
Matt Pharr
d3c567503b
Remove support for building with LLVM 3.1
2013-07-31 06:46:45 -07:00
Matt Pharr
ab3b633733
Add 8-bit and 16-bit specialized NEON targets.
...
Like SSE4-8 and SSE4-16, these use 8-bit and 16-bit values for mask
elements, respectively, and thus should generate the best code when used
for computation with datatypes of those sizes.
2013-07-30 08:44:16 -07:00
Matt Pharr
753c001e69
Merge branch 'master' of https://github.com/ispc/ispc into constfold
2013-07-23 16:12:04 -07:00
Matt Pharr
564e61c828
Improvements to constant folding.
...
We can now do constant folding with all basic datatypes (the previous
implementation handled int32 well, but had limited, if any, coverage
for other datatypes.)
Reduced a bit of repeated code in the constant folding implementation
through template helper functions.
2013-07-22 16:12:02 -07:00
Matt Pharr
946c39a5df
Fix build with LLVM top-of-tree.
...
The DIBuilder::getCU() method has been removed; we now just store the
compilation unit returned when we call DIBuilder::createCompileUnit.
2013-07-22 15:42:52 -07:00
Matt Pharr
f1cce0ef5f
Fix build with LLVM top-of-tree
2013-07-17 09:25:00 -07:00
Dmitry Babokin
481bcc732b
Changes for 1.4.0 release
2013-05-27 16:48:41 +04:00
Dmitry Babokin
23ba61e76f
Fix for #462 : broken debug infor support with LLVM 3.3+
2013-05-20 22:28:47 +04:00
Dmitry Babokin
b6b9daa3c5
Enabling llvm 3.4
2013-05-13 19:25:31 +04:00
Dmitry Babokin
549655bff4
Adding new line to error/warning message on Windows and fixing some typos.
2013-05-01 20:22:01 +02:00
Dmitry Babokin
a0462fe1ee
#469 : Fix for multi-target compilation
2013-04-12 14:06:12 +04:00
Dmitry Babokin
0af2a13349
DataLayout is changed to be managed from single place. v4-128-128 is added to generic DataLayout
2013-03-23 14:38:51 +04:00
Dmitry Babokin
0f86255279
Target class redesign: data moved to private. Also empty target-feature attribute is not added anymore (generic targets).
2013-03-23 14:28:05 +04:00
Dmitry Babokin
3f8a678c5a
Editorial change: fixing trailing white spaces and tabs
2013-03-18 16:17:55 +04:00
Dmitry Babokin
f2dcad27bb
Fix for LLVM 3.1 and #441
2013-03-12 21:13:08 +04:00
Dmitry Babokin
01992006b2
Fix for #441 : Prepocessor complains on code commented out by //
2013-03-12 18:56:32 +04:00
Dmitry Babokin
524939dc5b
Fix for issue #430
2013-02-27 18:03:07 +04:00
james.brodman
775ecd6dfe
Tracking ToT changes. Clang PP APIs changed.
2013-01-30 11:57:33 -05:00
james.brodman
3aaf2ef2d4
ToT Fixes / M4 macro fix
2013-01-14 14:55:10 -05:00
jbrodman
dc939eba78
Merge pull request #418 from mmp/master
...
Fix build with LLVM top-of-tree, fix warnings, remove LLVM 3.0 support
2013-01-08 10:28:02 -08:00