aaron/ispc - ispc - git.frat.tech

Author	SHA1	Message	Date
Dmitry Babokin	89fa6a4e75	Updating Makefile warning message to use LLVM 3.8.	2016-05-10 23:03:35 +03:00
Andrey Shishpanov	1324e6cdd5	added SKX target definition	2016-02-25 00:43:58 +03:00
Dmitry Babokin	f6dbffd58c	Updating copyright dates for recently modified files	2016-02-04 15:14:19 +03:00
Dmitry Babokin	06b9b46fb3	Adding support for using LLVM build with -DNDEBUG	2016-01-29 23:23:59 +03:00
Anton Mitrokhin	9f083f99ac	classof implementations for all classes	2015-07-10 12:27:16 +03:00
Anton Mitrokhin	8217448ee5	Id's for Stmt-inherited classes	2015-07-09 14:45:33 +03:00
Anton Mitrokhin	26a93bc733	Id's for Expr-inherited classes	2015-07-09 12:38:58 +03:00
Anton Mitrokhin	3eccce5e4f	[AVX512]: new .ll file for knl target	2015-05-21 14:50:51 +03:00
Dmitry Babokin	82d425da04	Typo fix in Makefile	2015-04-09 14:02:49 +03:00
Dmitry Babokin	3e8f33da68	Addressing build issue coming from LLVM bug 16902	2015-04-02 19:19:57 +03:00
Dmitry Babokin	6c766be68c	Update warning	2014-10-20 21:10:40 +04:00
Dmitry Babokin	100285a325	Typo. Keep LDFLAGS commented.	2014-10-16 20:25:31 +04:00
Dmitry Babokin	8283cfc3ff	Fixing comment in Makefile and adding tips for building binaries for distribution on Linux.	2014-10-16 20:17:02 +04:00
Dmitry Babokin	17ee085396	PTX support is off by default. Fix to make it compile with PTX support off by default.	2014-10-16 17:20:26 +04:00
evghenii	4e7ae5269b	added pseudo_prefetch definitions	2014-10-14 14:48:02 +02:00
evghenii	8745888ce9	merged with master	2014-08-11 10:04:54 +02:00
Anton Mitrokhin	02e584d932	fixed broken Makefile (failed to build 3.5)	2014-08-01 17:06:34 +04:00
Anton Mitrokhin	d64e1bcd82	updated Makefile for reverse LLVM_3_6 flag	2014-08-01 16:07:52 +04:00
Anton Mitrokhin	c0fc9b7aca	removed duplicates in Makefile	2014-07-30 15:15:15 +04:00
Anton Mitrokhin	725be222ac	added LLVM_3_6 var	2014-07-30 11:50:15 +04:00
Anton Mitrokhin	d56d690d77	Fixed Makefile target gcc to use g++ instead of clang	2014-07-11 18:25:33 +04:00
evghenii	b3c5a9c4d6	added #ifdef ISPC_NVPTX_ENALED ... #endif guards	2014-07-09 12:32:18 +02:00
evghenii	69f3898a61	Merge branch 'master' into nvptx_merge	2014-07-07 16:30:12 +02:00
Christoph Junghans	1a8002cf65	fix LLVM_VERSION for minor versions != 0. LLVM version 3.4.2 got converted to 3_4.2 and not 3_4 as intended. See https://bugs.gentoo.org/show_bug.cgi?id=515114	2014-06-25 23:44:00 -06:00
Ilia Filippov	114f58bb0b	support LLVM trunk after r204934 and zlib commits	2014-04-01 18:51:05 +04:00
Ilia Filippov	cc8bae2f2c	Adding warning about LLVM_HOME in Makefile	2014-04-01 16:46:26 +04:00
Evghenii	4641a15287	Merge branch 'master' into nvptx	2014-03-19 10:53:07 +01:00
Ilia Filippov	38ce3f368c	support LLVM trunk after r202720 revision	2014-03-04 16:02:01 +04:00
Ilia Filippov	c4e35050b0	support of building with C++11	2014-03-04 16:01:18 +04:00
Evghenii	a8c5da0ae0	adjusted Makefile	2014-02-20 11:04:11 +01:00
Ilia Filippov	3fd9d5a025	support of LLVM 3.5	2013-11-21 19:09:43 +04:00
egaburov	7e9b4c0924	added avx2-i64x4 and avx1.1-i64x4 targets	2013-10-15 10:02:10 +02:00
Dmitry Babokin	17b54cb0c8	Fix problem with building ISPC by clang 3.4	2013-10-11 16:29:17 +04:00
Dmitry Babokin	8297edd251	Switching default compiler on Unix from g++ to clang++	2013-10-11 16:29:16 +04:00
Evghenii	9861375f0c	renamed avx-i64x4 -> avx1-i64x4	2013-09-13 15:07:14 +02:00
egaburov	7364e06387	added mask64	2013-09-12 12:02:42 +02:00
egaburov	320c41ffcf	added svml support. experimental. for some reason all symbols are visible..	2013-09-11 15:16:50 +02:00
egaburov	9c79d4d182	added avxh with vectorWidth=4 support, use --target=avxh to enable it	2013-09-11 12:58:02 +02:00
Ilia Filippov	320b1700ff	correction of adding -Werror option	2013-08-30 16:01:01 +04:00
Ilia Filippov	f620cdbaa1	Changes in perf.py functionality, unification of examples, correction of build warnings	2013-08-26 14:04:59 +04:00
Matt Pharr	ea8591a85a	Fix build with LLVM top-of-tree (link libcurses)	2013-08-10 11:22:43 -07:00
Matt Pharr	cd9afe946c	Merge branch 'master' into arm. Conflicts: Makefile, builtins.cpp, ispc.cpp, ispc.h, ispc.vcxproj, opt.cpp	2013-08-06 17:39:21 -07:00
Dmitry Babokin	dff7735af9	Fix for Windows build and making NEON target optional	2013-08-02 19:24:34 -07:00
Matt Pharr	ab3b633733	Add 8-bit and 16-bit specialized NEON targets. Like SSE4-8 and SSE4-16, these use 8-bit and 16-bit values for mask elements, respectively, and thus should generate the best code when used for computation with datatypes of those sizes.	2013-07-30 08:44:16 -07:00
Matt Pharr	780b0dfe47	Add SSE4-16 target. Along the lines of sse4-8, this is an 8-wide target for SSE4, using 16-bit elements for the mask. It's thus (in principle) the best target for SIMD computation with 16-bit datatypes.	2013-07-25 09:46:01 -07:00
Matt Pharr	53414f12e6	Add SSE4 target optimized for computation with 8-bit datatypes. This change adds a new 'sse4-8' target, where programCount is 16 and the mask element size is 8-bits. (i.e. the most appropriate sizing of the mask for SIMD computation with 8-bit datatypes.)	2013-07-23 17:30:32 -07:00
Matt Pharr	e7abf3f2ea	Add support for mask vectors of 8 and 16-bit element types. There were a number of places throughout the system that assumed that the execution mask would only have either 32-bit or 1-bit elements. This commit makes it possible to have a target with an 8- or 16-bit mask.	2013-07-23 16:50:11 -07:00
Matt Pharr	d7b0c5794e	Add support for ARM NEON targets. Initial support for ARM NEON on Cortex-A9 and A15 CPUs. All but ~10 tests pass, and all examples compile and run correctly. Most of the examples show a ~2x speedup on a single A15 core versus scalar code. Current open issues/TODOs: - Code quality looks decent, but hasn't been carefully examined. Known issues/opportunities for improvement include: - fp32 vector divide is done as a series of scalar divides rather than a vector divide (which I believe exists, but I may be mistaken.) This is particularly harmful to examples/rt, which only runs ~1.5x faster with ispc, likely due to long chains of scalar divides. - The compiler isn't generating a vmin.f32 for e.g. the final scalar min in reduce_min(); instead it's generating a compare and then a select instruction (and similarly elsewhere). - There are some additional FIXMEs in builtins/target-neon.ll that include both a few pieces of missing functionality (e.g. rounding doubles) as well as places that deserve attention for possible code quality improvements. - Currently only the "cortex-a9" and "cortex-a15" CPU targets are supported; LLVM supports many other ARM CPUs and ispc should provide access to all of the ones that have NEON support (and aren't too obscure.) - ~5 of the reduce-* tests hit an assertion inside LLVM (unfortunately only when the compiler runs on an ARM host, though). - The Windows build hasn't been tested (though I've tried to update ispc.vcxproj appropriately). It may just work, but will more likely have various small issues. - Anything related to 64-bit ARM has seen no attention.	2013-07-19 23:07:24 -07:00
Dmitry Babokin	95fcdc36ee	Tracking ToT changes, which now require linking the option library. This is Unix only. Windows will be fixed separately	2013-06-18 22:12:33 +04:00
Dmitry Babokin	4b388edca9	Splitting .ll files to be compiled in two versions - 32 and 64 bit. Unix only	2013-05-24 10:29:00 +04:00

103 Commits