aaron/ispc - ispc - git.frat.tech


Author	SHA1	Message	Date
Ilia Filippov	ead5cc741d	support LLVM trunk after 203559 203213 and 203381 revisions	2014-03-12 12:58:50 +04:00
Ilia Filippov	6738af0a0c	changing uniform_min and uniform_max implementations for avx targets	2014-03-06 12:05:24 +04:00
Dmitry Babokin	f280b32fa4	Merge pull request #736 from egaburov/native_trigonometry Native trigonometry	2014-02-20 19:18:35 +03:00
Dmitry Babokin	ea0a514e03	Fix for generic-1	2014-02-11 15:33:23 +04:00
Vsevolod Livinskij	65d947e449	Else branch with error report was added	2014-02-10 15:18:48 +04:00
Vsevolod Livinskij	cef5b2eb04	Some changes in saturation arithmetic	2014-02-10 12:40:53 +04:00
Vsevolod Livinskij	1c1614d207	Some errors in comments and code were fixed	2014-02-09 21:39:42 +04:00
Evghenii	70a9b286e5	added support for native and double precision trigonometry/transcendentals	2014-02-07 15:28:39 +01:00
evghenii	09e8381ec7	change {rsqrt,rcp}_double to {rsqrt,rcp}d_decl	2014-02-05 13:05:04 +01:00
Evghenii	d3a6693eef	adding __have_native_{rsqrtd,rcpd} to select between native support for double precision reciprocals and using slower but safe version in stdlib	2014-02-04 16:29:23 +01:00
Evghenii	fe98fe8cdc	added fast approximate rcp(double) accurate to 15 digits	2014-02-04 15:23:34 +01:00
Evghenii	eb1a495a7a	added support for fast approximate rsqrt(double). Provides 16 digit accuracy but is over 3x faster than 1/sqrt(double)	2014-02-04 14:44:54 +01:00
evghenii	3a72e05c3e	+1	2014-02-02 18:16:48 +01:00
Vsevolod Livinskij	da02236b3a	Scalar realization of no-vec functions was moved from builtins to stdlib.ispc.	2014-01-20 16:06:34 +04:00
Vsevolod Livinskij	323587f10f	Scalar implementation and implementation for targets which don't have h/w instructions	2014-01-02 16:48:56 +04:00
Vsevolod Livinskij	07c6f1714a	Some fixes in function names and more tests were added.	2013-12-22 19:28:26 +04:00
Ilia Filippov	15816eb07e	adding __packed_store_active2 to generic targets	2013-12-19 17:50:18 +04:00
Dmitry Babokin	d666fc3f8f	Merge pull request #686 from ifilippov/ttt packed_store_active2() - tuned version of packed_store_active()	2013-12-17 09:23:39 -08:00
Ilia Filippov	473f1cb4d2	packed_store_active2	2013-12-17 21:14:29 +04:00
Dmitry Babokin	6d51987e67	Merge pull request #642 from egaburov/launch3d concept of 3d tasking	2013-12-17 08:40:07 -08:00
evghenii	c06ec92d0d	added commas, added multi-dimensional tasking to mandelbrot_tasks & removed mandelbrot_task3d. Also adjusted documentation a bit	2013-12-13 11:49:11 +01:00
Vsevolod Livinskij	65768c20ae	Added tests for saturation and some fixes for generic and avx target	2013-12-05 00:34:14 +04:00
Vsevolod Livinskij	4faff1a63c	structural change	2013-11-30 10:48:18 +04:00
Vsevolod Livinskij	4c330bc38b	Add code generation of saturation	2013-11-29 18:40:04 +04:00
Vsevolod Livinskij	bec6662338	Some changes for avx1 and avx1.1 in saturation	2013-11-29 03:45:25 +04:00
Vsevolod Livinskij	42c148bf75	Changes for sse2 and sse4 in saturation	2013-11-29 03:33:40 +04:00
Vsevolod Livinskij	35a4d1b3a2	Add some AVX2 intrinsics	2013-11-27 00:55:57 +04:00
Vsevolod Livinskij	19f73b2ede	uniform signed/unsigned int8/16	2013-11-25 19:16:02 +04:00
Dmitry Babokin	d2c7b356cc	Ordering functions in target-[avx|sse2].ll to be in the same order. No real changes, except adding a few alwaysinline in SSE4 target	2013-11-14 15:34:30 +04:00
Dmitry Babokin	af58955140	target-[sse4|avx]_common.ll are twin brothers, which differ only cosmetically. This commit makes them diffable. No real changes, except adding alwaysinline to sse version of __max_uniform_int32/__max_uniform_uint32	2013-11-14 15:34:30 +04:00
Dmitry Babokin	6585a925be	Merge pull request #641 from jbrodman/stdlibshift Add a "shift" operator to the stdlib.	2013-10-28 14:18:31 -07:00
james.brodman	4d289b16c2	Redesign after being hit with the KISS bat.	2013-10-23 14:25:43 -04:00
egaburov	f89bad1e94	launch now passes the right info into tasking	2013-10-23 12:51:06 +02:00
james.brodman	f97a2d68c8	Bugfix for non-const shift amt and unit tests.	2013-10-22 18:29:20 -04:00
james.brodman	899f85ce9c	Initial Support for new stdlib shift operator	2013-10-22 18:06:54 -04:00
egaburov	1710b9171f	removed LLVM_3_0 legacy part and changed copyright to 2013	2013-10-18 08:53:01 +02:00
egaburov	7e9b4c0924	added avx2-i64x4 and avx1.1-i64x4 targets	2013-10-15 10:02:10 +02:00
Ilia Filippov	92773ada6d	fix for ISPC for compfails at sse4-i8 and sse4-i16	2013-10-11 15:23:40 +04:00
Dmitry Babokin	43245bbc11	Adding check for OS AVX support to auto-dispatch code	2013-09-19 15:39:56 +04:00
Evghenii	9861375f0c	renamed avx-i64x4 -> avx1-i64x4	2013-09-13 15:07:14 +02:00
Evghenii	059d80cc11	included suggested changes, ./tests/launch-*.ispc still fails. something is mask64 related, not sure what. help...	2013-09-12 17:18:12 +02:00
egaburov	7364e06387	added mask64	2013-09-12 12:02:42 +02:00
egaburov	efc20c2110	added svml support to all sse/avx modes	2013-09-11 17:07:54 +02:00
egaburov	19379db3b6	svml cleanup	2013-09-11 16:48:56 +02:00
egaburov	7a32699573	added svml.m4	2013-09-11 15:18:03 +02:00
egaburov	320c41ffcf	added svml support. experimental. for some reason all symbols are visible..	2013-09-11 15:16:50 +02:00
egaburov	9c79d4d182	added avxh with vectorWidth=4 support, use --target=avxh to enable it	2013-09-11 12:58:02 +02:00
james.brodman	8db378b265	Revert "Remove support for using SVML for math lib routines." This reverts commit `d9c38b5c1f`.	2013-09-04 16:01:58 -04:00
Matt Pharr	1276ea9844	Revert "Remove support for building with LLVM 3.1" This reverts commit `d3c567503b`. Conflicts: opt.cpp	2013-08-06 17:00:35 -07:00
Matt Pharr	5b20b06bd9	Add avg_{up,down}_int{8,16} routines to stdlib. These compute the average of two given values, rounding up and down, respectively, if the result isn't exact. When possible, these are mapped to target-specific intrinsics (PADD[BW] on IA and VH[R]ADD[US] on NEON.) A subsequent commit will add pattern-matching to generate calls to these intrinsics when the corresponding patterns are detected in the IR.	2013-08-06 08:41:12 -07:00


132 Commits