Commit Graph

180 Commits

Author SHA1 Message Date
Evghenii
a3b00fdcd6 added support for global atomics 2014-01-26 14:23:26 +01:00
Evghenii
a7d4a3f922 fix for __any 2014-01-26 13:15:13 +01:00
Evghenii
3e86dfe480 fix for __any 2014-01-25 17:09:11 +01:00
Evghenii
fcbdd93043 half/scan for 64 bit/clock/num_cores and other additions 2014-01-25 16:43:33 +01:00
Evghenii
805196a6a0 fixed doubles 2014-01-25 15:31:56 +01:00
Evghenii
bd34729217 added floor/ceil/round for float/double 2014-01-25 12:20:38 +01:00
Evghenii
6917c161c8 fixed reduce_equal 2014-01-25 11:39:37 +01:00
Evghenii
156aa4c139 partial support for reduce equal 2014-01-24 17:29:26 +01:00
Evghenii
ddb9b2fc47 added basic printing from ptx 2014-01-24 13:44:38 +01:00
Evghenii
c76c916475 removed insert/extract_void 2014-01-24 12:37:18 +01:00
Evghenii
9090d8b128 added support for assert 2014-01-24 12:18:20 +01:00
Evghenii
5a8351d7ea added varying new/delete 2014-01-24 09:22:55 +01:00
Evghenii
1a07aed6aa foreach_unique will work now on atomic data types, not pointers yet. enum is not tested. All tests/foreach-unique-*.ispc pass 2014-01-24 08:30:50 +01:00
Evghenii
1cf1dab649 fixed foreach_unique and local_atomics 2014-01-23 21:57:20 +01:00
Evghenii
f0d3501dbd atomic globals now fail compilations. 2014-01-23 19:57:58 +01:00
Evghenii
0091973bca packed_load and packed_store2 added 2014-01-23 14:34:00 +01:00
Evghenii
e87e332d2f identified issue with __movmsk. 2014-01-23 10:45:10 +01:00
Evghenii
2e7609156a fixes for exclusive_scan_and/or_i32 and shuffle2 and __movmsk 2014-01-23 10:24:44 +01:00
Evghenii
06313e0ec3 exclusive_scan_and is supported, but must be called outside if-statements. in principle others must do the same 2014-01-22 22:12:51 +01:00
Evghenii
08d78e6be5 partial exclusive_scan support 2014-01-22 21:55:22 +01:00
Evghenii
11964a8ce8 added broadcast 2014-01-22 20:46:41 +01:00
Evghenii
7d0aa7a336 added shift 2014-01-22 20:43:53 +01:00
Evghenii
39962623cc added shuffle 2014-01-22 19:18:45 +01:00
Evghenii
5cde87ce80 added reduce_add/min/max 2014-01-22 16:55:08 +01:00
Evghenii
bc99897fbb +fixed some example, found some bugs, and bugs in ptxas/cuda 2014-01-21 14:51:27 +01:00
evghenii
9053eed4b4 added basic optimization pass that promotes uniform into varying variables (not array) for nvptx target 2014-01-10 06:32:57 +01:00
Evghenii
de4d66c56f added addrspace(4)/constant memory for const uniform declarations 2014-01-08 13:27:24 +01:00
Evghenii
9b74e60185 added conversion from addrspace(3)/__local/__shared__ to addrspace(0)/generic when PtrToInt is called 2014-01-07 14:29:55 +01:00
evghenii
7e63cafc85 fixed common-1 2014-01-06 20:24:37 +01:00
Evghenii
a8a2cf9bdb change nvptx64->nvptx 2014-01-06 14:04:27 +01:00
Evghenii
546f9cb409 MAJOR CHANGE--- STOP WITH THIS BRANCH-- 2014-01-06 13:51:02 +01:00
Evghenii
fd429e4fda added common_gpu makefile helper 2014-01-05 10:00:44 +01:00
Evghenii
2d8da306a1 merged with master 2013-12-25 21:32:34 +01:00
Ilia Filippov
15816eb07e adding __packed_store_active2 to generic targets 2013-12-19 17:50:18 +04:00
Evghenii
d4b8a0f2eb added packed_store_active2 declaration in IR 2013-12-18 11:40:03 +01:00
Evghenii
d77789d8fe +merged with master 2013-12-18 11:37:01 +01:00
Dmitry Babokin
d666fc3f8f Merge pull request #686 from ifilippov/ttt: packed_store_active2() - tuned version of packed_store_active() 2013-12-17 09:23:39 -08:00
Ilia Filippov
473f1cb4d2 packed_store_active2 2013-12-17 21:14:29 +04:00
Dmitry Babokin
6d51987e67 Merge pull request #642 from egaburov/launch3d: concept of 3d tasking 2013-12-17 08:40:07 -08:00
evghenii
c06ec92d0d added commas, added multi-dimensional tasking to mandelbrot_tasks & removed mandelbrot_task3d. Also adjusted documentation a bit 2013-12-13 11:49:11 +01:00
evghenii
bb46b561fd Merged with upstream/master 2013-11-22 08:13:16 +01:00
Evghenii
6f200d310f fixed to work with LLVM 3.2 2013-11-21 11:03:03 +01:00
Evghenii
c93e71698e restored intrinsics and added tuning options to ptxgen 2013-11-15 15:04:04 +01:00
Evghenii
f12826bac5 +added approx rcp/rsqrt/rtz with ftz=true 2013-11-14 22:17:57 +01:00
Evghenii
f1fc3bdfba added nvptx declaration to other target & fixed nvptx64 recognition 2013-11-14 20:12:58 +01:00
Evghenii
e162d5a99d programIndex still not working, found where change is needed... 2013-11-14 19:46:08 +01:00
Dmitry Babokin
d2c7b356cc Ordering functions in target-[avx|sse2].ll to be in the same order. No real changes, except adding a few alwaysinline in SSE4 target 2013-11-14 15:34:30 +04:00
Dmitry Babokin
af58955140 target-[sse4|avx]_common.ll are twin brothers, which differ only cosmetically. This commit makes them diffable. No real changes, except adding alwaysinline to sse version of __max_uniform_int32/__max_uniform_uint32 2013-11-14 15:34:30 +04:00
Evghenii
1f13a236bf small tuning 2013-11-13 13:03:26 +01:00
Evghenii
268be7f0b5 fixed ISPCSync functionality 2013-11-13 11:19:10 +01:00