Commit Graph

97 Commits

Author SHA1 Message Date
Evghenii
616cd3316c +1 2013-11-18 10:54:29 +01:00
Evghenii
36f584d341 +1 2013-11-18 10:51:54 +01:00
Evghenii
6387bbf193 Merge branch 'sm35_foreach' of github.com:egaburov/ispc into sm35_foreach 2013-11-18 10:45:02 +01:00
Evghenii
de177e7529 +using kernels1 in deferred 2013-11-18 10:44:52 +01:00
evghenii
f17fdfdbef added KNC makefile 2013-11-18 10:41:52 +01:00
Evghenii
45d6bf196a change timings to ms 2013-11-18 10:37:17 +01:00
evghenii
5e2fff91f8 +1 2013-11-18 10:33:04 +01:00
evghenii
e13f8dd359 +1 2013-11-18 10:30:18 +01:00
Evghenii
0897f3c1c4 Merge branch 'sm35_foreach' of github.com:egaburov/ispc into sm35_foreach 2013-11-18 10:17:02 +01:00
Evghenii
a82368956e added ms timer 2013-11-18 10:16:24 +01:00
evghenii
6841d8ba9a added Makefile_mic 2013-11-18 09:59:41 +01:00
Evghenii
927da8e861 change register allocation, makes code much faster 2013-11-18 09:46:51 +01:00
Evghenii
3c220a2813 loop unrolling, makes code 10x faster 2013-11-18 09:37:25 +01:00
Evghenii
5a01819fdc unrolled loops in binomial options cuda version 2013-11-18 09:16:09 +01:00
Evghenii
1d94667a15 +speed-up binomial options via use of shared memory 2013-11-15 20:46:55 +01:00
Evghenii
4421fb7e19 +1 2013-11-15 20:21:31 +01:00
Evghenii
bc8b5b3896 added cuda version 2013-11-15 18:09:03 +01:00
Evghenii
a2d12517e7 +options 2013-11-15 17:59:04 +01:00
Evghenii
95d6647dce +1 2013-11-15 17:32:59 +01:00
Evghenii
3454f51d2c added some ptx options 2013-11-15 17:23:22 +01:00
Evghenii
6b65f6d9f4 +1 2013-11-15 16:21:09 +01:00
Evghenii
c93e71698e restored intrinsics and added tuning options to ptxgen 2013-11-15 15:04:04 +01:00
Evghenii
f9d2ede83c +1 2013-11-14 23:15:51 +01:00
Evghenii
53bf4573f0 +fix 2013-11-14 23:05:44 +01:00
Evghenii
86652738c0 working on rt 2013-11-14 22:54:37 +01:00
Evghenii
294fb039fe some tuning, adding cuda kernels 2013-11-14 22:33:58 +01:00
Evghenii
f12826bac5 +added approx rcp/rsqrt/rtz with ftz=true 2013-11-14 22:17:57 +01:00
Evghenii
2c8afde6d9 changing MF 2013-11-14 21:38:25 +01:00
Evghenii
1445202e0e identified bug due to llvm-3.4 2013-11-14 21:18:25 +01:00
Evghenii
1b940fd41e +1 2013-11-14 20:19:59 +01:00
Evghenii
7aa37b19a9 added some more macros as quick hack... 2013-11-14 20:04:05 +01:00
Evghenii
8bb8f0eda4 +1 2013-11-14 17:04:50 +01:00
Evghenii
be2cc8f946 restored foreach in sort 2013-11-14 16:51:59 +01:00
Evghenii
599ada8354 added deferred shading foreach_tile 2013-11-14 16:49:47 +01:00
Evghenii
83b9cc5c0a +1 2013-11-14 16:44:09 +01:00
Evghenii
af75afeb7a foreach[_tiled] seems to work now 2013-11-14 16:29:40 +01:00
evghenii
c81821ed28 +1 2013-11-13 21:17:21 +01:00
Evghenii
42cfe97427 using now cuda_ispc.h 2013-11-13 21:06:40 +01:00
Evghenii
09a2c12ea0 added cuda_ispc.h & cuda eror_strings 2013-11-13 21:04:59 +01:00
Evghenii
a0f6f264f6 fixed problem with new/delete and added Mel/sec counter 2013-11-13 20:34:01 +01:00
Evghenii
6f9cea5b58 removed binary 2013-11-13 19:43:45 +01:00
Evghenii
dd4ac42491 added print m 2013-11-13 19:43:32 +01:00
Evghenii
01df6ed4a9 added ispc timers w/o task 2013-11-13 19:13:04 +01:00
Evghenii
e71259006c +1 2013-11-13 19:06:02 +01:00
Evghenii
0f161b500f +1 2013-11-13 19:02:45 +01:00
Evghenii
e442139c39 runs, next check correctness 2013-11-13 18:15:52 +01:00
Evghenii
8b0f871c06 +1 2013-11-13 17:23:23 +01:00
Evghenii
61fab0340c working on sort 2013-11-13 17:07:55 +01:00
Evghenii
525eacd035 +1 2013-11-13 16:32:56 +01:00
Evghenii
780e9f31fe some tuning 2013-11-13 16:23:05 +01:00