Vsevolod Livinskiy
28b49837fc
round2to16 was added
2015-05-21 16:53:18 +03:00
Dmitry Babokin
8e47273186
Copyright refresh
2015-04-22 16:39:11 +03:00
Vsevolod Livinskiy
f92d351cf0
Some codestyle changes
2015-03-05 18:04:39 +03:00
Vsevolod Livinskiy
a216b2bb9c
New LLVM IR load instruction
2015-03-05 16:00:30 +03:00
Dmitry Babokin
f280b32fa4
Merge pull request #736 from egaburov/native_trigonometry
Native trigonometry
2014-02-20 19:18:35 +03:00
Vsevolod Livinskij
cef5b2eb04
Some changes in saturation arithmetic
2014-02-10 12:40:53 +04:00
Evghenii
70a9b286e5
added support for native and double precision trigonometry/transcendentals
2014-02-07 15:28:39 +01:00
evghenii
09e8381ec7
change {rsqrt,rcp}_double to {rsqrt,rcp}d_decl
2014-02-05 13:05:04 +01:00
evghenii
3a72e05c3e
+1
2014-02-02 18:16:48 +01:00
Vsevolod Livinskij
07c6f1714a
Some fixes in function names and more tests were added.
2013-12-22 19:28:26 +04:00
Vsevolod Livinskij
4faff1a63c
structural change
2013-11-30 10:48:18 +04:00
Vsevolod Livinskij
42c148bf75
Changes for sse2 and sse4 in saturation
2013-11-29 03:33:40 +04:00
Evghenii
059d80cc11
included suggested changes, ./tests/launch-*.ispc still fails. something is mask64 related, not sure what. help...
2013-09-12 17:18:12 +02:00
egaburov
19379db3b6
svml cleanup
2013-09-11 16:48:56 +02:00
egaburov
320c41ffcf
added svml support. experimental. for some reason all symbols are visible..
2013-09-11 15:16:50 +02:00
james.brodman
8db378b265
Revert "Remove support for using SVML for math lib routines."
This reverts commit d9c38b5c1f.
2013-09-04 16:01:58 -04:00
Matt Pharr
5b20b06bd9
Add avg_{up,down}_int{8,16} routines to stdlib
These compute the average of two given values, rounding up and down,
respectively, if the result isn't exact. When possible, these are
mapped to target-specific intrinsics (PADD[BW] on IA and VH[R]ADD[US]
on NEON.)
A subsequent commit will add pattern-matching to generate calls to
these intrinsics when the corresponding patterns are detected in the
IR.
2013-08-06 08:41:12 -07:00
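The rounding behaviour described in the avg_{up,down}_int{8,16} commit above can be illustrated with a small scalar sketch. This is not the stdlib implementation; the function names `avg_up_u8` and `avg_down_u8` are hypothetical, only the unsigned 8-bit case is shown, and the sum is widened to `unsigned` on the assumption that the intermediate value must not overflow.

```c
#include <stdint.h>

/* Hypothetical scalar model of an "average, rounding up" routine.
   The sum is formed in a wider type so that 255 + 255 + 1 cannot wrap. */
static uint8_t avg_up_u8(uint8_t a, uint8_t b) {
    return (uint8_t)(((unsigned)a + (unsigned)b + 1u) >> 1);
}

/* Hypothetical scalar model of an "average, rounding down" routine.
   Same widening, but without the +1 bias, so inexact results truncate. */
static uint8_t avg_down_u8(uint8_t a, uint8_t b) {
    return (uint8_t)(((unsigned)a + (unsigned)b) >> 1);
}
```

For inputs 3 and 4 the exact average is 3.5, so the first function yields 4 and the second yields 3; when the sum is even, both agree.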
Matt Pharr
d9c38b5c1f
Remove support for using SVML for math lib routines.
This path was poorly maintained and wasn't actually available on most
targets.
2013-07-31 06:56:48 -07:00
Matt Pharr
b6df447b55
Add reduce_add() for int8 and int16 types.
This maps to specialized instructions (e.g. PSADBW) when available.
2013-07-25 09:46:01 -07:00
Matt Pharr
2d063925a1
Explicitly call the PBLENDVB intrinsic for i8 blending with sse4-8.
This is slightly cleaner than trunc-ing the i8 mask to i1 and using
a vector select. (And is probably more safe in terms of good code.)
2013-07-25 09:46:01 -07:00
Matt Pharr
53414f12e6
Add SSE4 target optimized for computation with 8-bit datatypes.
This change adds a new 'sse4-8' target, where programCount is 16 and
the mask element size is 8 bits (i.e. the most appropriate sizing of
the mask for SIMD computation with 8-bit datatypes).
2013-07-23 17:30:32 -07:00