Dmitry Babokin
6d51987e67
Merge pull request #642 from egaburov/launch3d
...
concept of 3d tasking
2013-12-17 08:40:07 -08:00
james.brodman
01432670fd
Fix header file for multi-target output with pointers to varying in exported functions.
2013-12-12 13:27:23 -05:00
james.brodman
d10c0d9545
Add dynamic dispatch support for export functions with pointers to varying data as arguments
2013-12-05 17:47:58 -05:00
james.brodman
a448ccf20c
Merge branch 'master' into nomosoa
2013-12-04 13:52:44 -05:00
evghenii
bb46b561fd
Merged with upstream/master
2013-11-22 08:13:16 +01:00
Ilia Filippov
3fd9d5a025
support of LLVM 3.5
2013-11-21 19:09:43 +04:00
Evghenii
55bf0d23c2
resotred non-ptx functionality
2013-11-13 11:08:58 +01:00
Evghenii
dbb96c1885
need to fix launch code
2013-11-12 13:41:03 +01:00
Evghenii
4cd7e10ad3
reversed to original changes. Here is the plan to use CDP and genarate only device code with host wrapper..
2013-11-12 12:51:56 +01:00
Evghenii
dcf9c280ee
some cleaning
2013-10-31 16:05:06 +01:00
Evghenii
63917f8cc2
now generates CUDALaunch call. Few tweaks are still necesary
2013-10-31 16:01:34 +01:00
Evghenii
f9ec1a0097
.. work in programm to embed PTX into host code ..
2013-10-30 16:47:30 +01:00
Evghenii
f15cdc03e3
nvptx64 generates 2 targets: task and normal function for nvptx64 and export for avx only
2013-10-29 14:46:51 +01:00
Evghenii
1bd5360d3b
added now NVPTX64 automatically emits unmasked extern "C" for task function with kernel attributes
2013-10-28 13:58:01 +01:00
Evghenii
a7aa1ac1cf
now nvptx allows extern "C" task void, which is emits a kernel that should (?) be callable by driver API from external code
2013-10-28 12:57:09 +01:00
Evghenii
b68a751f4e
generating proper tasking function for nvptx
2013-10-28 11:36:08 +01:00
Evghenii
383e804ec1
changed notation form taskIndex1,2,3 -> taskIndex0,1,2
2013-10-24 17:20:56 +02:00
Evghenii
43761173ec
changed notation, task[Index,Count]_[x,y,z] -> task[Index,Count][1,2,3]. Change launch <<< nx,ny,nz >>> into launch [nx,ny,nz] or equivalent launch [nz][ny][nx]. Programmer can pick the one the is liked the most
2013-10-24 13:16:23 +02:00
egaburov
78a05777bc
added taskIndex_x,y,z and taskCount_x,y,z
2013-10-22 16:18:40 +02:00
james.brodman
ecb1174a18
leaving myself notes for later
2013-09-27 14:23:04 -04:00
james.brodman
090dec8549
Output regular header for multiple targets + fix exported varying types.
2013-08-22 13:23:22 -04:00
JCB
3e9d784013
Support exported arrays of varyings
2013-08-20 16:14:29 -04:00
JCB
10b8c481f5
initial support for exported varying
2013-08-20 15:14:15 -04:00
Matt Pharr
564e61c828
Improvements to constant folding.
...
We can now do constant folding with all basic datatypes (the previous
implementation handled int32 well, but had limited, if any, coverage
for other datatypes.)
Reduced a bit of repeated code in the constant folding implementation
through template helper functions.
2013-07-22 16:12:02 -07:00
Ilia Filippov
560acd5017
changes to support createFunction() with DICompositeType argument in LLVM_3_4
2013-06-04 15:48:39 +04:00
Dmitry Babokin
0f86255279
Target class redesign: data moved to private. Also empty target-feature attribute is not added anymore (generic targets).
2013-03-23 14:28:05 +04:00
Dmitry Babokin
3f8a678c5a
Editorial change: fixing trailing white spaces and tabs
2013-03-18 16:17:55 +04:00
Dmitry Babokin
51fdff208e
Tracking ToT changes in DIBuilder interface
2013-02-25 14:50:33 +04:00
jbrodman
dc939eba78
Merge pull request #418 from mmp/master
...
Fix build with LLVM top-of-tree, fix warnings, remove LLVM 3.0 support
2013-01-08 10:28:02 -08:00
Matt Pharr
0bf1320a32
Remove support for building with LLVM 3.0
2013-01-06 12:27:53 -08:00
Matt Pharr
63dd7d9859
Fix build to work with LLVM top-of-tree again
2013-01-06 12:02:08 -08:00
Peng Tu
6ba7368ab0
Fix two compile time errors to allow SOA pointer and array be passed as function argument.
2012-12-11 17:20:15 -08:00
Matt Pharr
172a189c6f
Fix build with LLVM top-of-tree
2012-10-17 11:11:50 -07:00
Jean-Luc Duprat
aecd6e0878
All the smear(), setzero() and undef() APIs are now templated on the return type.
...
Modified ISPC's internal mangling to pass these through unchanged.
Tried hard to make sure this is not going to introduce an ABI change.
2012-07-17 17:06:36 -07:00
Matt Pharr
4186ef204d
Fix build with LLVM top of tree.
2012-07-05 13:35:01 -07:00
Matt Pharr
f38770bf2a
Fix build with LLVM ToT
2012-06-28 07:36:10 -07:00
Matt Pharr
5a2c8342eb
Allow structs with no members.
...
Issue #289 .
2012-06-21 16:07:31 -07:00
Matt Pharr
007a734595
Add support for 'unmasked' function qualifier.
2012-06-20 15:36:00 -07:00
Matt Pharr
b2f6ed7209
Fix usage of CastType
2012-06-18 16:26:31 -07:00
Ingo Wald
789e04ce90
Add support for host/device stub functions for offload.
2012-06-12 10:23:49 -07:00
Matt Pharr
96aaf6d53b
Fix build with LLVM top of tree.
2012-06-05 12:28:05 -07:00
Matt Pharr
6118643232
Handle more error cases if the user tries to declare a method.
2012-06-04 09:07:13 -07:00
Matt Pharr
22cb80399f
Issue error if user tries to declare a method.
2012-06-04 08:50:13 -07:00
Matt Pharr
fd03ba7586
Export reference parameters as C++ references, not pointers.
2012-05-24 07:12:48 -07:00
Matt Pharr
333f901187
Fix build with LLVM 3.2 dev top-of-tree
2012-05-23 14:19:50 -07:00
Matt Pharr
7dd4d6c75e
Update for LLVM 3.2dev API change
2012-05-22 15:53:14 -07:00
Matt Pharr
8006589828
Use llvm::SmallVectors for struct member types and function types.
...
Further reduction of dynamic memory allocation...
2012-05-04 13:55:38 -07:00
Matt Pharr
7db8824da2
Reduce dynamic memory allocation in getting unif/varying variants of AtomicTypes
2012-05-04 13:55:38 -07:00
Matt Pharr
bff02017da
Cache const/non-const variants of Atomic and ReferenceTypes.
...
More reduction of dynamic memory allocation.
2012-05-04 13:55:38 -07:00
Matt Pharr
78d62705cc
Cache element types in StructType.
...
Previously, GetElementType() would end up causing dynamic allocation to
happen to compute the final element type (turning types with unbound
variability into the same type with the struct's variability) each it was
called, which was wasteful and slow. Now we cache the result.
Another 20% perf on compiling that problematic program.
2012-05-04 13:55:38 -07:00