Evghenii
|
a3b00fdcd6
|
added support for global atomics
|
2014-01-26 14:23:26 +01:00 |
|
Evghenii
|
a7d4a3f922
|
fix for __any
|
2014-01-26 13:15:13 +01:00 |
|
Evghenii
|
3e86dfe480
|
fix for __any
|
2014-01-25 17:09:11 +01:00 |
|
Evghenii
|
fcbdd93043
|
half/scan for 64 bit/clock/num_cores and other additions
|
2014-01-25 16:43:33 +01:00 |
|
Evghenii
|
805196a6a0
|
fixed doubles
|
2014-01-25 15:31:56 +01:00 |
|
Evghenii
|
bd34729217
|
added floor/ceil/round for float/double
|
2014-01-25 12:20:38 +01:00 |
|
Evghenii
|
6917c161c8
|
fixed reduce_equal
|
2014-01-25 11:39:37 +01:00 |
|
Evghenii
|
156aa4c139
|
partial support for reduce equal
|
2014-01-24 17:29:26 +01:00 |
|
Evghenii
|
ddb9b2fc47
|
added basic printing from ptx
|
2014-01-24 13:44:38 +01:00 |
|
Evghenii
|
c76c916475
|
removed insert/extract_void
|
2014-01-24 12:37:18 +01:00 |
|
Evghenii
|
9090d8b128
|
added support for assert
|
2014-01-24 12:18:20 +01:00 |
|
Evghenii
|
5a8351d7ea
|
added varying new/delete
|
2014-01-24 09:22:55 +01:00 |
|
Evghenii
|
1a07aed6aa
|
foreach_unique will work now on atomic data types, not pointers yet. enum is not tested. All tests/foreach-unique-*.ispc pass
|
2014-01-24 08:30:50 +01:00 |
|
Evghenii
|
1cf1dab649
|
fixed foreach_unique and local_atomics
|
2014-01-23 21:57:20 +01:00 |
|
Evghenii
|
f0d3501dbd
|
atomic globals now fail compilations.
|
2014-01-23 19:57:58 +01:00 |
|
Evghenii
|
0091973bca
|
packed_load and packed_store2 added
|
2014-01-23 14:34:00 +01:00 |
|
Evghenii
|
e87e332d2f
|
identified issue with __movmsk.
|
2014-01-23 10:45:10 +01:00 |
|
Evghenii
|
2e7609156a
|
fixes for exclclusive_scan_and/or_i32 and shuffle2 and __movmsk
|
2014-01-23 10:24:44 +01:00 |
|
Evghenii
|
06313e0ec3
|
exclusive_scan_and is supported, but must be called outside if-statements. in pricniple other must do the same
|
2014-01-22 22:12:51 +01:00 |
|
Evghenii
|
08d78e6be5
|
partial exclusive_scan support
|
2014-01-22 21:55:22 +01:00 |
|
Evghenii
|
11964a8ce8
|
added broadcast
|
2014-01-22 20:46:41 +01:00 |
|
Evghenii
|
7d0aa7a336
|
added shift
|
2014-01-22 20:43:53 +01:00 |
|
Evghenii
|
39962623cc
|
added shuffle
|
2014-01-22 19:18:45 +01:00 |
|
Evghenii
|
5cde87ce80
|
added reduce_add/min/max
|
2014-01-22 16:55:08 +01:00 |
|
Evghenii
|
bc99897fbb
|
+fixed some example, found some bugs, and bugs in ptxas/cuda
|
2014-01-21 14:51:27 +01:00 |
|
evghenii
|
9053eed4b4
|
added basic optimization pass that promotes uniform into varying variables (not array) for nvptx target
|
2014-01-10 06:32:57 +01:00 |
|
Evghenii
|
de4d66c56f
|
added addrspace(4)/constant memory for const uniform declarations
|
2014-01-08 13:27:24 +01:00 |
|
Evghenii
|
9b74e60185
|
added conversion from addrspace(3)/__local/__shared__ to addspace(0)/generic when PtrToInt is called
|
2014-01-07 14:29:55 +01:00 |
|
evghenii
|
7e63cafc85
|
fixed common-1
|
2014-01-06 20:24:37 +01:00 |
|
Evghenii
|
a8a2cf9bdb
|
change nvptx64->nvptx
|
2014-01-06 14:04:27 +01:00 |
|
Evghenii
|
546f9cb409
|
MAJOR CHANGE--- STOP WITH THIS BRANCH--
|
2014-01-06 13:51:02 +01:00 |
|
Evghenii
|
fd429e4fda
|
added common_gpu makefile helper
|
2014-01-05 10:00:44 +01:00 |
|
Evghenii
|
2d8da306a1
|
merged with master
|
2013-12-25 21:32:34 +01:00 |
|
Ilia Filippov
|
15816eb07e
|
adding __packed_store_active2 to generic targets
|
2013-12-19 17:50:18 +04:00 |
|
Evghenii
|
d4b8a0f2eb
|
added packed_store_active2 declartion in IR
|
2013-12-18 11:40:03 +01:00 |
|
Evghenii
|
d77789d8fe
|
+merged with master
|
2013-12-18 11:37:01 +01:00 |
|
Dmitry Babokin
|
d666fc3f8f
|
Merge pull request #686 from ifilippov/ttt
packed_store_active2() - tuned version of packed_store_active()
|
2013-12-17 09:23:39 -08:00 |
|
Ilia Filippov
|
473f1cb4d2
|
packed_store_active2
|
2013-12-17 21:14:29 +04:00 |
|
Dmitry Babokin
|
6d51987e67
|
Merge pull request #642 from egaburov/launch3d
concept of 3d tasking
|
2013-12-17 08:40:07 -08:00 |
|
evghenii
|
c06ec92d0d
|
added commas, added multi-dimensional tasking to mandelbrot_tasks & removed mandelbrot_task3d. Also adjusted documentaiton a bit
|
2013-12-13 11:49:11 +01:00 |
|
evghenii
|
bb46b561fd
|
Merged with upstream/master
|
2013-11-22 08:13:16 +01:00 |
|
Evghenii
|
6f200d310f
|
fixed to work with LLVM 3.2
|
2013-11-21 11:03:03 +01:00 |
|
Evghenii
|
c93e71698e
|
restored intrinsics and added tuning options to ptxgen
|
2013-11-15 15:04:04 +01:00 |
|
Evghenii
|
f12826bac5
|
+added approx rcp/rsqrt/rtz with ftz=true
|
2013-11-14 22:17:57 +01:00 |
|
Evghenii
|
f1fc3bdfba
|
added nvptx declaration to other target & fixed nvptx64 recognition
|
2013-11-14 20:12:58 +01:00 |
|
Evghenii
|
e162d5a99d
|
programIndex still not working, found where change is needed...
|
2013-11-14 19:46:08 +01:00 |
|
Dmitry Babokin
|
d2c7b356cc
|
Ordering functions in target-[avx|sse2].ll to be in the same order. No real changes, except adding a few alwaysinline in SSE4 target
|
2013-11-14 15:34:30 +04:00 |
|
Dmitry Babokin
|
af58955140
|
target-[sse4|avx]_common.ll are twin brothers, which diffes only cosmetically. This commit makes them diffable. No real changes, except adding alwaysinline to sse version iof __max_uniform_int32/__max_uniform_uint32
|
2013-11-14 15:34:30 +04:00 |
|
Evghenii
|
1f13a236bf
|
small tuning
|
2013-11-13 13:03:26 +01:00 |
|
Evghenii
|
268be7f0b5
|
fixed ISPCSync functionality
|
2013-11-13 11:19:10 +01:00 |
|