Commit Graph

1938 Commits

Author SHA1 Message Date
evghenii
0df23312ac +1 2013-11-18 22:00:01 +01:00
evghenii
e8fc32a7dc Merge branch 'sm35_foreach' of github.com:egaburov/ispc into sm35_foreach 2013-11-18 21:59:09 +01:00
evghenii
e4ee1692fb added KNC Makfile 2013-11-18 21:58:56 +01:00
Evghenii
4bc8c79bd3 fixed cuda kernel 2013-11-18 13:28:12 +01:00
Evghenii
915dc4be7f +1 2013-11-18 13:24:01 +01:00
Evghenii
cf2116e167 +1 2013-11-18 13:16:30 +01:00
Evghenii
64762c5acd +1 2013-11-18 13:15:05 +01:00
Evghenii
4f9b8ebc73 +1 2013-11-18 13:11:57 +01:00
Evghenii
ee61a265f4 fixed kernel 2013-11-18 13:01:36 +01:00
Evghenii
db4abfe198 +1 2013-11-18 12:58:30 +01:00
Evghenii
4b7dbbf43b added cuda kernel 2013-11-18 12:46:30 +01:00
Evghenii
db13639460 change # options 2013-11-18 12:42:23 +01:00
evghenii
1cdb4c21c2 makefile fix 2013-11-18 12:25:36 +01:00
evghenii
008d9371b1 added Makefile for KNC 2013-11-18 12:19:45 +01:00
evghenii
6640dd0a6c added Makefile for KNC 2013-11-18 12:19:33 +01:00
Evghenii
589538bf39 added stencil code 2013-11-18 12:04:00 +01:00
Evghenii
8d4dd13750 changes 2013-11-18 11:58:19 +01:00
Dmitry Babokin
754a3208f2 Merge pull request #659 from ifilippov/master: patch for LLVM 3.3 and test correction at avx2 2013-11-18 02:46:15 -08:00
Evghenii
4d278a654e +1 2013-11-18 10:58:24 +01:00
Evghenii
9eaec6d58a changed avx->avx-x2 2013-11-18 10:56:22 +01:00
Evghenii
616cd3316c +1 2013-11-18 10:54:29 +01:00
Ilia Filippov
4579d339ea patch for LLVM 3.3 and test correction at avx2 2013-11-18 13:53:21 +04:00
Evghenii
36f584d341 +1 2013-11-18 10:51:54 +01:00
Evghenii
6387bbf193 Merge branch 'sm35_foreach' of github.com:egaburov/ispc into sm35_foreach 2013-11-18 10:45:02 +01:00
Evghenii
de177e7529 +using kernels1 in deferred 2013-11-18 10:44:52 +01:00
evghenii
f17fdfdbef added KNC makefile 2013-11-18 10:41:52 +01:00
Evghenii
45d6bf196a change timings to ms 2013-11-18 10:37:17 +01:00
evghenii
5e2fff91f8 +1 2013-11-18 10:33:04 +01:00
evghenii
e13f8dd359 +1 2013-11-18 10:30:18 +01:00
Evghenii
0897f3c1c4 Merge branch 'sm35_foreach' of github.com:egaburov/ispc into sm35_foreach 2013-11-18 10:17:02 +01:00
Evghenii
a82368956e added ms timer 2013-11-18 10:16:24 +01:00
evghenii
6841d8ba9a added Makefile_mic 2013-11-18 09:59:41 +01:00
Evghenii
927da8e861 change register allocation, makes code much faster 2013-11-18 09:46:51 +01:00
Evghenii
3c220a2813 loop unrolling, maks code 10x faster 2013-11-18 09:37:25 +01:00
Evghenii
5a01819fdc unrolled loops in binomial options cuda version 2013-11-18 09:16:09 +01:00
Dmitry Babokin
4977933d81 Merge pull request #658 from dbabokin/fail_db: fail_db.txt update on Linux 2013-11-17 15:42:51 -08:00
Dmitry Babokin
953e467a85 fail_db.txt update on Linux 2013-11-18 03:39:09 +04:00
jbrodman
131ab07c2b Merge pull request #657 from dbabokin/avx-i32x4: avx1-i32x4 target 2013-11-15 16:00:57 -08:00
Evghenii
1d94667a15 +speed-up binomial options via use of shared memory 2013-11-15 20:46:55 +01:00
Evghenii
4421fb7e19 +1 2013-11-15 20:21:31 +01:00
Dmitry Babokin
131ff50333 Adding avx1-i32x4 to alloy.py testing 2013-11-15 22:09:13 +04:00
Evghenii
bc8b5b3896 added cuda versino 2013-11-15 18:09:03 +01:00
Evghenii
a2d12517e7 +options 2013-11-15 17:59:04 +01:00
Evghenii
95d6647dce +1 2013-11-15 17:32:59 +01:00
Evghenii
3454f51d2c added some ptx options 2013-11-15 17:23:22 +01:00
Evghenii
6b65f6d9f4 +1 2013-11-15 16:21:09 +01:00
Evghenii
c93e71698e restored intrinsics and added tuning options to ptxgen 2013-11-15 15:04:04 +01:00
Evghenii
f9d2ede83c +1 2013-11-14 23:15:51 +01:00
Evghenii
53bf4573f0 +fix 2013-11-14 23:05:44 +01:00
Evghenii
86652738c0 working on rt 2013-11-14 22:54:37 +01:00