Dmitry Babokin
30270584aa
Merge pull request #749 from egaburov/nvptx_clean
...
Experimental support for PTX with examples
2014-10-16 15:56:02 +04:00
evghenii
92377426bd
changed progress bar implementation
2014-10-16 11:53:05 +02:00
Anton Mitrokhin
3d71932ca6
knc.h: added [] overloads to type defenitions
2014-10-16 12:57:55 +04:00
Dmitry Babokin
ccf4d00385
Merge pull request #874 from dbabokin/alloy-gnu-toolchain
...
alloy.py script upgrade
2014-10-15 23:09:50 +04:00
Dmitry Babokin
f7390aaec9
Adding default sysroot detection for MacOS 10.9 (and newer), which
...
allows working out of the box. Otherwise C standard headers are not
found.
2014-10-15 23:02:59 +04:00
Dmitry Babokin
d79529cada
Adding --with-gcc-toolchain alloy switch, which is passed to configure.
...
This is the only way to build functional clang on old Linux system,
which has old system gcc and new gcc (4.7+) installed in non-system
location.
Also adding switch to build only x86 target in clang.
2014-10-15 20:40:37 +04:00
Dmitry Babokin
8f9a935132
Merge pull request #873 from dbabokin/knc
...
KNC prefetch:
2014-10-14 20:59:48 +04:00
Dmitry Babokin
3f24c8dedc
KNC prefetch:
...
- make L3 prefetch to hit L2$, instead of being nop.
- fix vector prefetch to use single intrinsic, instead of two (this is
caused by bug in Composer 14.0 documentation).
2014-10-14 20:24:16 +04:00
Dmitry Babokin
741df13eb8
Merge pull request #871 from ncos/knc-backend-merge
...
Replaced vec16_i1 type definition in knc.h from typedef to struct version, adding a constructor
2014-10-14 19:24:53 +04:00
Dmitry Babokin
8e31d7e99c
Merge pull request #872 from ncos/alloy_modifications
...
Changed the 'newest_LLVM' variable to 3.5 and fixed an error in help message
2014-10-14 19:24:38 +04:00
Anton Mitrokhin
7ac8a9ea04
changed vec16_i1 type definition in knc.h from typedef to struct
2014-10-14 18:52:03 +04:00
Anton Mitrokhin
dcf70d887b
Changed the 'newest_LLVM' variable to 3.5 and fixed an error in help message
2014-10-14 18:23:47 +04:00
evghenii
558d7ee1d3
fix compilation with gcc481
2014-10-14 15:49:50 +02:00
Dmitry Babokin
2e49ecd56f
Merge pull request #870 from aguskov/master
...
Added a patch for LLVM 3.5 to enable proper debugging under Windows
2014-10-14 17:31:55 +04:00
evghenii
47b1f2182f
fix documentation typo
2014-10-14 15:24:30 +02:00
aguskov
efee86fa2e
Update 3_5_win_coff_debug_info.patch
2014-10-14 17:17:49 +04:00
evghenii
4e7ae5269b
added pseudo_prefetch definitions
2014-10-14 14:48:02 +02:00
evghenii
9238c72e08
Merge branch 'master' into nvptx_clean_master
2014-10-14 14:27:00 +02:00
evghenii
83a863ea83
fix compilation for llvm 3.5
2014-10-14 14:17:36 +02:00
Andrey Guskov
75759254c3
Added a patch for LLVM 3.5 to enable proper debugging under Windows
2014-10-14 15:58:23 +04:00
Dmitry Babokin
44ed10b6f2
Merge pull request #869 from dbabokin/typo
...
Typo fix
2014-10-14 14:54:05 +04:00
Dmitry Babokin
82ed2ce416
Typo fix
2014-10-14 14:51:12 +04:00
Dmitry Babokin
29c0f75306
Merge pull request #868 from dbabokin/win_build
...
Adding missing dependency for LLVM 3.6 (Windows specific problem) and fi...
2014-10-13 19:36:37 +04:00
Dmitry Babokin
481224bbcd
Adding missing dependency for LLVM 3.6 (Windows specific problem) and fixing end-of-lines in ispc.vcxproj
2014-10-13 19:31:11 +04:00
Dmitry Babokin
743f89f93f
Merge pull request #867 from jbrodman/master
...
Fix warnings in knc.h with appropriate casting
2014-10-09 11:52:10 +04:00
James Brodman
3aa2cce504
Fix warnings in knc.h with appropriate casting
2014-10-08 17:21:36 -07:00
Dmitry Babokin
a83fec3dd0
Merge pull request #865 from ncos/ispc_build_fails
...
Build fail fix for ISPC with LLVM 3.6 (current trunk)
2014-10-02 18:29:20 +04:00
Dmitry Babokin
b8a9139f8e
Merge pull request #864 from Vsevolod-Livinskij/opt_prefetch
...
Optimized prefetch
2014-10-02 17:16:51 +04:00
Vsevolod Livinskiy
eb61d5df72
Support for cache 2/3 and all targets
2014-10-02 16:25:23 +04:00
Anton Mitrokhin
57fb2a75ec
new interface for 'DIBuilder::insertDeclare' and 'DIBuilder::createGlobalVariable' in LLVM 3.6 (ids: 076fd5dfc1f0600183bbc7db974dc7b39086136d and bc88cfc3512326d6c8f8dbf3c15d880c0e21feb0 correspondingly
2014-10-02 15:35:19 +04:00
Vsevolod Livinskiy
0a6eb61ad0
Extend gather-scatter optimization with prefetch optimization
2014-10-02 15:21:43 +04:00
Dmitry Babokin
df38b862c2
Merge pull request #863 from ncos/knc-fails
...
Changed the default knc header file from 'knc-i1x16.h' to 'knc.h'
2014-10-02 13:57:59 +04:00
Dmitry Babokin
ddc6e33bc0
Merge pull request #862 from ncos/knc-backend-merge
...
Modification of 'knc.h'
2014-10-02 13:57:45 +04:00
Anton Mitrokhin
700fe244e7
removed debug macros
2014-10-02 13:08:47 +04:00
Anton Mitrokhin
c934a68bc4
changed 'const int' to 'const int8_t' in '__vec16_i8' constructor
2014-10-02 13:00:55 +04:00
Anton Mitrokhin
8295df5a1e
fixed header in 'knc.h'
2014-09-25 22:54:29 +04:00
Anton Mitrokhin
3b16cd8c56
fixed 'knc-i1x16.h' to compile with icc v.15 beta
2014-09-25 21:29:49 +04:00
Anton Mitrokhin
4560df284b
changed codestyle to 2 spaces in 'knc.h'
2014-09-25 21:29:49 +04:00
Anton Mitrokhin
832aff5d76
commented out '__vec16_i1' type as there is no '__cast_uitofp' which produces compfails
2014-09-25 21:29:49 +04:00
Anton Mitrokhin
46bd353027
fixed 'INT_MIN' bug in '__gather64_i64'
2014-09-25 21:29:49 +04:00
Anton Mitrokhin
1a2979aa7f
start fixing gather/scatter functions (INT_MIN fix)
2014-09-25 21:29:49 +04:00
Anton Mitrokhin
8b8e313dc6
starded work on gather/scatter instructions
2014-09-25 21:29:48 +04:00
Anton Mitrokhin
0881463d69
changed '__vec16_i1 __equal_i64(const __vec16_i64 &a, const __vec16_i6_i64 &b, __vec16_i1 mask)' function
2014-09-25 21:29:48 +04:00
Anton Mitrokhin
2e92989101
changed '__vec16_i64 __load(const __vec16_i64 *p)'
2014-09-25 21:29:48 +04:00
Anton Mitrokhin
1d69b954bd
added several memory functions for i8
2014-09-25 21:29:48 +04:00
Anton Mitrokhin
90843b3bff
changed a constructor of '__vec16_i8'
2014-09-25 21:29:48 +04:00
Anton Mitrokhin
78a7ef9fc5
added several math functions, new runfails: gather-int8-2/4; ldexp-double (previously compfailed)
2014-09-25 21:29:47 +04:00
Anton Mitrokhin
efa0ea01f3
add 'void __masked_store_i64(void *p, const __vec16_i64 &v, __vec16_i1 mask)' function. now 450 compfails and 12 runfails
2014-09-25 21:29:47 +04:00
Anton Mitrokhin
ddf5df6193
made 'int64_t __extract_element(const __vec16_i64 &v, uint32_t index)' function
2014-09-25 21:29:47 +04:00
Anton Mitrokhin
4fff0ab571
added __vec16_i32 __cast_trunc(__vec16_i32, const __vec16_i64 i64) function. ptr-diff-3/5/6 tests are apparently runfailing because of it
2014-09-25 21:29:47 +04:00