Evghenii
|
4b7dbbf43b
|
added cuda kernel
|
2013-11-18 12:46:30 +01:00 |
|
Evghenii
|
db13639460
|
change # options
|
2013-11-18 12:42:23 +01:00 |
|
evghenii
|
1cdb4c21c2
|
makefile fix
|
2013-11-18 12:25:36 +01:00 |
|
evghenii
|
008d9371b1
|
added Makefile for KNC
|
2013-11-18 12:19:45 +01:00 |
|
evghenii
|
6640dd0a6c
|
added Makefile for KNC
|
2013-11-18 12:19:33 +01:00 |
|
Evghenii
|
589538bf39
|
added stencil code
|
2013-11-18 12:04:00 +01:00 |
|
Evghenii
|
8d4dd13750
|
changes
|
2013-11-18 11:58:19 +01:00 |
|
Dmitry Babokin
|
754a3208f2
|
Merge pull request #659 from ifilippov/master
patch for LLVM 3.3 and test correction at avx2
|
2013-11-18 02:46:15 -08:00 |
|
Evghenii
|
4d278a654e
|
+1
|
2013-11-18 10:58:24 +01:00 |
|
Evghenii
|
9eaec6d58a
|
changed avx->avx-x2
|
2013-11-18 10:56:22 +01:00 |
|
Evghenii
|
616cd3316c
|
+1
|
2013-11-18 10:54:29 +01:00 |
|
Ilia Filippov
|
4579d339ea
|
patch for LLVM 3.3 and test correction at avx2
|
2013-11-18 13:53:21 +04:00 |
|
Evghenii
|
36f584d341
|
+1
|
2013-11-18 10:51:54 +01:00 |
|
Evghenii
|
6387bbf193
|
Merge branch 'sm35_foreach' of github.com:egaburov/ispc into sm35_foreach
|
2013-11-18 10:45:02 +01:00 |
|
Evghenii
|
de177e7529
|
+using kernels1 in deferred
|
2013-11-18 10:44:52 +01:00 |
|
evghenii
|
f17fdfdbef
|
added KNC makefile
|
2013-11-18 10:41:52 +01:00 |
|
Evghenii
|
45d6bf196a
|
change timings to ms
|
2013-11-18 10:37:17 +01:00 |
|
evghenii
|
5e2fff91f8
|
+1
|
2013-11-18 10:33:04 +01:00 |
|
evghenii
|
e13f8dd359
|
+1
|
2013-11-18 10:30:18 +01:00 |
|
Evghenii
|
0897f3c1c4
|
Merge branch 'sm35_foreach' of github.com:egaburov/ispc into sm35_foreach
|
2013-11-18 10:17:02 +01:00 |
|
Evghenii
|
a82368956e
|
added ms timer
|
2013-11-18 10:16:24 +01:00 |
|
evghenii
|
6841d8ba9a
|
added Makefile_mic
|
2013-11-18 09:59:41 +01:00 |
|
Evghenii
|
927da8e861
|
change register allocation, makes code much faster
|
2013-11-18 09:46:51 +01:00 |
|
Evghenii
|
3c220a2813
|
loop unrolling, makes code 10x faster
|
2013-11-18 09:37:25 +01:00 |
|
Evghenii
|
5a01819fdc
|
unrolled loops in binomial options cuda version
|
2013-11-18 09:16:09 +01:00 |
|
Dmitry Babokin
|
4977933d81
|
Merge pull request #658 from dbabokin/fail_db
fail_db.txt update on Linux
|
2013-11-17 15:42:51 -08:00 |
|
Dmitry Babokin
|
953e467a85
|
fail_db.txt update on Linux
|
2013-11-18 03:39:09 +04:00 |
|
jbrodman
|
131ab07c2b
|
Merge pull request #657 from dbabokin/avx-i32x4
avx1-i32x4 target
|
2013-11-15 16:00:57 -08:00 |
|
Evghenii
|
1d94667a15
|
+speed-up binomial options via use of shared memory
|
2013-11-15 20:46:55 +01:00 |
|
Evghenii
|
4421fb7e19
|
+1
|
2013-11-15 20:21:31 +01:00 |
|
Dmitry Babokin
|
131ff50333
|
Adding avx1-i32x4 to alloy.py testing
|
2013-11-15 22:09:13 +04:00 |
|
Evghenii
|
bc8b5b3896
|
added cuda version
|
2013-11-15 18:09:03 +01:00 |
|
Evghenii
|
a2d12517e7
|
+options
|
2013-11-15 17:59:04 +01:00 |
|
Evghenii
|
95d6647dce
|
+1
|
2013-11-15 17:32:59 +01:00 |
|
Evghenii
|
3454f51d2c
|
added some ptx options
|
2013-11-15 17:23:22 +01:00 |
|
Evghenii
|
6b65f6d9f4
|
+1
|
2013-11-15 16:21:09 +01:00 |
|
Evghenii
|
c93e71698e
|
restored intrinsics and added tuning options to ptxgen
|
2013-11-15 15:04:04 +01:00 |
|
Evghenii
|
f9d2ede83c
|
+1
|
2013-11-14 23:15:51 +01:00 |
|
Evghenii
|
53bf4573f0
|
+fix
|
2013-11-14 23:05:44 +01:00 |
|
Evghenii
|
86652738c0
|
working on rt
|
2013-11-14 22:54:37 +01:00 |
|
Evghenii
|
294fb039fe
|
some tuning, adding cuda kernels
|
2013-11-14 22:33:58 +01:00 |
|
Evghenii
|
f12826bac5
|
+added approx rcp/rsqrt/rtz with ftz=true
|
2013-11-14 22:17:57 +01:00 |
|
Evghenii
|
2c8afde6d9
|
changing MF
|
2013-11-14 21:38:25 +01:00 |
|
Evghenii
|
1445202e0e
|
identified bug due to llvm-3.4
|
2013-11-14 21:18:25 +01:00 |
|
Evghenii
|
1b940fd41e
|
+1
|
2013-11-14 20:19:59 +01:00 |
|
Evghenii
|
f1fc3bdfba
|
added nvptx declaration to other target & fixed nvptx64 recognition
|
2013-11-14 20:12:58 +01:00 |
|
Evghenii
|
7aa37b19a9
|
added some more macros as quick hack...
|
2013-11-14 20:04:05 +01:00 |
|
Evghenii
|
967a49dd66
|
+1
|
2013-11-14 19:54:18 +01:00 |
|
Evghenii
|
25df23fed3
|
workaround for programIndex via preprocessor
|
2013-11-14 19:48:50 +01:00 |
|
Evghenii
|
e162d5a99d
|
programIndex still not working, found where change is needed...
|
2013-11-14 19:46:08 +01:00 |
|