Vsevolod Livinskiy
3b1445c660
__smear_i8/16 was added
2014-11-09 00:20:58 +04:00
Vsevolod Livinskiy
39b1e4a204
masked_load/store_i16 was added
2014-11-09 00:20:24 +04:00
Dmitry Babokin
df7d65b076
Merge pull request #891 from ncos/knc-backend-merge
...
Add missing _cast_uitofp/fptoui, setzero and min_varying functions
2014-11-06 09:03:54 -08:00
Anton Mitrokhin
15e1f86711
- Removed the debug code occasionally sneaked to the repo
...
- Put 12 knc runfails to the fail_db.txt (were invisible due to debug code)
- Several minor fixes not affecting functionality
2014-11-06 16:48:51 +04:00
Vsevolod Livinskiy
9a8782d122
Fix for load/store i16
2014-10-30 18:37:38 +04:00
Anton Mitrokhin
e2249365f3
setzero functions for i8 and i16
2014-10-30 18:36:39 +04:00
Anton Mitrokhin
1f2079f2a8
fixed __cast_fpto<...> set of functions
2014-10-30 17:49:57 +04:00
Anton Mitrokhin
a4bcb1e1e2
__min_varying_<...> intrinsics added
2014-10-29 20:16:34 +04:00
Anton Mitrokhin
8b34fb853e
Commented out the faulty functions. Our code generator does not provide the necessary alligning to use them.
2014-10-21 15:42:14 +04:00
Anton Mitrokhin
09b8f65246
changed the names of 'reduce' functions (ixx -> intxx) to match the names generated by code generator
2014-10-21 15:26:07 +04:00
Anton Mitrokhin
3d71932ca6
knc.h: added [] overloads to type defenitions
2014-10-16 12:57:55 +04:00
Dmitry Babokin
3f24c8dedc
KNC prefetch:
...
- make L3 prefetch to hit L2$, instead of being nop.
- fix vector prefetch to use single intrinsic, instead of two (this is
caused by bug in Composer 14.0 documentation).
2014-10-14 20:24:16 +04:00
Anton Mitrokhin
7ac8a9ea04
changed vec16_i1 type definition in knc.h from typedef to struct
2014-10-14 18:52:03 +04:00
James Brodman
3aa2cce504
Fix warnings in knc.h with appropriate casting
2014-10-08 17:21:36 -07:00
Dmitry Babokin
b8a9139f8e
Merge pull request #864 from Vsevolod-Livinskij/opt_prefetch
...
Optimized prefetch
2014-10-02 17:16:51 +04:00
Vsevolod Livinskiy
eb61d5df72
Support for cache 2/3 and all targets
2014-10-02 16:25:23 +04:00
Vsevolod Livinskiy
0a6eb61ad0
Extend gather-scatter optimization with prefetch optimization
2014-10-02 15:21:43 +04:00
Anton Mitrokhin
700fe244e7
removed debug macros
2014-10-02 13:08:47 +04:00
Anton Mitrokhin
c934a68bc4
changed 'const int' to 'const int8_t' in '__vec16_i8' constructor
2014-10-02 13:00:55 +04:00
Anton Mitrokhin
8295df5a1e
fixed header in 'knc.h'
2014-09-25 22:54:29 +04:00
Anton Mitrokhin
4560df284b
changed codestyle to 2 spaces in 'knc.h'
2014-09-25 21:29:49 +04:00
Anton Mitrokhin
832aff5d76
commented out '__vec16_i1' type as there is no '__cast_uitofp' which produces compfails
2014-09-25 21:29:49 +04:00
Anton Mitrokhin
46bd353027
fixed 'INT_MIN' bug in '__gather64_i64'
2014-09-25 21:29:49 +04:00
Anton Mitrokhin
1a2979aa7f
start fixing gather/scatter functions (INT_MIN fix)
2014-09-25 21:29:49 +04:00
Anton Mitrokhin
8b8e313dc6
starded work on gather/scatter instructions
2014-09-25 21:29:48 +04:00
Anton Mitrokhin
0881463d69
changed '__vec16_i1 __equal_i64(const __vec16_i64 &a, const __vec16_i6_i64 &b, __vec16_i1 mask)' function
2014-09-25 21:29:48 +04:00
Anton Mitrokhin
2e92989101
changed '__vec16_i64 __load(const __vec16_i64 *p)'
2014-09-25 21:29:48 +04:00
Anton Mitrokhin
1d69b954bd
added several memory functions for i8
2014-09-25 21:29:48 +04:00
Anton Mitrokhin
90843b3bff
changed a constructor of '__vec16_i8'
2014-09-25 21:29:48 +04:00
Anton Mitrokhin
78a7ef9fc5
added several math functions, new runfails: gather-int8-2/4; ldexp-double (previously compfailed)
2014-09-25 21:29:47 +04:00
Anton Mitrokhin
efa0ea01f3
add 'void __masked_store_i64(void *p, const __vec16_i64 &v, __vec16_i1 mask)' function. now 450 compfails and 12 runfails
2014-09-25 21:29:47 +04:00
Anton Mitrokhin
ddf5df6193
made 'int64_t __extract_element(const __vec16_i64 &v, uint32_t index)' function
2014-09-25 21:29:47 +04:00
Anton Mitrokhin
4fff0ab571
added __vec16_i32 __cast_trunc(__vec16_i32, const __vec16_i64 i64) function. ptr-diff-3/5/6 tests are apparently runfailing because of it
2014-09-25 21:29:47 +04:00
Anton Mitrokhin
85b703981d
added several helper functions
2014-09-25 21:29:46 +04:00
Anton Mitrokhin
45114d3283
No actual code change: changed the code style for '__vec16_i8' struct in 'knc.h'
2014-09-25 21:29:46 +04:00
Anton Mitrokhin
723baca4c2
added several useful defines and a warning that '__vec16_i1' might not be working with embree
2014-09-25 21:29:46 +04:00
Vsevolod Livinskiy
0a5b16dbe6
some minor fixes for knc.h
2014-09-25 21:29:46 +04:00
Vsevolod Livinskiy
57f29e5035
Fix to support icc v.15 beta
2014-08-26 15:20:10 +04:00
Anton Mitrokhin
bd8d02527b
removed ugly INT32_MIN define (included limits.h) and updated the copyright
2014-08-15 16:18:29 +04:00
Anton Mitrokhin
7adacf5a7b
64 bit gather/scatter fix for knc.h
2014-08-15 16:04:31 +04:00
jbrodman
fbb34b3f3a
Merge pull request #747 from dbabokin/knc_extract_element
...
Knc.h fix
2014-02-27 10:51:17 -08:00
Dmitry Babokin
f0a7baf340
Remove conflicting __extract_element(__vec16_i64 ..., ...)
2014-02-22 01:10:55 +04:00
evghenii
8490efe0ad
fix for knc.h. Due to a bug in ICC (tested with 13.1.3 & 14.0.1) ,the resulting .cpp file fails to compile
2014-02-07 16:00:21 +01:00
evghenii
438cee4e21
added support for double precision/native transendentals/trigonometry
2014-02-07 15:43:42 +01:00
evghenii
c59cff396d
added {rsqrt,rcp}d support for knc.h. test-147.ispc & test-148.ispc pass.
2014-02-05 13:55:38 +01:00
Evghenii
eb01ffd4e6
first commit for {rsqrt,rcp}d knc support. going to test on other node now
2014-02-05 13:43:07 +01:00
james.brodman
9f933b500b
Add missing __cast_sext(__vec16_i32,__vec16_i1)
2013-12-20 16:45:27 -05:00
Ilia Filippov
15816eb07e
adding __packed_store_active2 to generic targets
2013-12-19 17:50:18 +04:00
Matt Pharr
e7f067d70c
Fix handling of __clock() builtin for "generic" targets.
2013-08-20 09:04:52 -07:00
Matt Pharr
b6df447b55
Add reduce_add() for int8 and int16 types.
...
This maps to specialized instructions (e.g. PSADBW) when available.
2013-07-25 09:46:01 -07:00