Evghenii
|
24e1a98275
|
compiles
|
2014-02-20 11:20:13 +01:00 |
|
Evghenii
|
a8c5da0ae0
|
adjusted Makefile
|
2014-02-20 11:04:11 +01:00 |
|
Evghenii
|
4196c723eb
|
merged with nvptx
|
2014-02-20 11:01:58 +01:00 |
|
Evghenii
|
11612a24ee
|
+added addition usage info
|
2014-02-20 10:53:23 +01:00 |
|
Evghenii
|
c54a91eab3
|
added ptxgen
|
2014-02-20 10:50:17 +01:00 |
|
Evghenii
|
dea856b7e3
|
added ptxgen
|
2014-02-20 08:52:37 +01:00 |
|
Evghenii
|
01cbe3d289
|
changed to Makefile for gpu
|
2014-02-20 08:45:52 +01:00 |
|
Evghenii
|
ce6ca49d21
|
adding ptxtools
|
2014-02-20 08:18:18 +01:00 |
|
evghenii
|
c325c0e085
|
added layout
|
2014-02-19 13:40:06 +01:00 |
|
Evghenii
|
67be0a85c0
|
use rand instead of drand48
|
2014-02-19 12:52:12 +01:00 |
|
Evghenii
|
3ec910ff85
|
+1
|
2014-02-19 12:50:54 +01:00 |
|
Evghenii
|
946d2c17c8
|
use time for seed value
|
2014-02-19 11:57:21 +01:00 |
|
Evghenii
|
c513493757
|
+1
|
2014-02-19 11:51:14 +01:00 |
|
Evghenii
|
869379020d
|
some filename changes
|
2014-02-19 11:44:08 +01:00 |
|
Evghenii
|
a10be8087f
|
+1
|
2014-02-19 11:38:15 +01:00 |
|
Evghenii
|
4d61c04e5c
|
added working ptxc file
|
2014-02-19 11:26:30 +01:00 |
|
Evghenii
|
07fe1c5659
|
added anyoption
|
2014-02-19 09:28:31 +01:00 |
|
Evghenii
|
14e76108cb
|
optimization for _all
|
2014-02-06 14:24:50 +01:00 |
|
Evghenii
|
9ecb4f4ac8
|
added tunnings for aobench
|
2014-02-06 10:13:18 +01:00 |
|
Evghenii
|
9e1ab7c6b6
|
allow to add ISPC_FLAGS
|
2014-02-06 10:13:01 +01:00 |
|
Evghenii
|
8ffa84f875
|
added some #ifdef .. #endif for control flow tests
|
2014-02-06 10:12:31 +01:00 |
|
Evghenii
|
c8e92feb14
|
added additional optimizaotion passes for PTX target
|
2014-02-06 10:11:58 +01:00 |
|
Evghenii
|
c23dd8a951
|
fixed __puts_nvptx
|
2014-02-05 17:48:04 +01:00 |
|
Evghenii
|
7b2ceba128
|
added "internal" for helper functions to avoid them being exported to PTX
|
2014-02-05 17:02:05 +01:00 |
|
Dmitry Babokin
|
2570385770
|
Merge pull request #730 from egaburov/double_math
Added double precision support for reciprocals: rsqrt rcp
|
2014-02-05 17:57:39 +03:00 |
|
Evghenii
|
aeb2f01a15
|
some performance fix. it works, but have no idea why. checkpoint
|
2014-02-05 15:36:06 +01:00 |
|
evghenii
|
c59cff396d
|
added {rsqrt,rcp}d support for knc.h. test-147.ispc & test-148.ispc pass.
|
2014-02-05 13:55:38 +01:00 |
|
evghenii
|
ecc9c88ff8
|
fix packed_store_active2 for knc-i1x8.h
|
2014-02-05 13:52:24 +01:00 |
|
Evghenii
|
eb01ffd4e6
|
first commit for {rsqrt,rcp}d knc support. going to test on other node now
|
2014-02-05 13:43:07 +01:00 |
|
Evghenii
|
f225b558ec
|
added {rsqrt,rcp}d support for sse4.h
|
2014-02-05 13:42:45 +01:00 |
|
Evghenii
|
688d9c9a82
|
added support for rsqrtd/rcpd for generic-*.h
|
2014-02-05 13:20:44 +01:00 |
|
evghenii
|
09e8381ec7
|
change {rsqrt,rcp}_double to {rsqrt,rcp}d_decl
|
2014-02-05 13:05:04 +01:00 |
|
evghenii
|
732a315a4b
|
removed __declspec(safe) duplicate
|
2014-02-05 13:04:45 +01:00 |
|
Evghenii
|
686c1d676d
|
improvements
|
2014-02-05 12:04:36 +01:00 |
|
Evghenii
|
048da693c5
|
fix sqrt
|
2014-02-05 10:52:08 +01:00 |
|
Dmitry Babokin
|
9a3b949687
|
Merge pull request #734 from dbabokin/run_test_compiler
Update list of accepted system compilers in run_test.py
|
2014-02-05 12:40:07 +03:00 |
|
Dmitry Babokin
|
40186d3813
|
Update list of accepted system compilers in run_test.py
|
2014-02-05 13:39:28 +04:00 |
|
Dmitry Babokin
|
66c986ba13
|
Merge pull request #733 from jbrodman/master
Modify alloy.py to put dbg llvm builds in different folders. Disallow initializing void * with ptr to const. (#731)
|
2014-02-05 11:32:25 +03:00 |
|
jbrodman
|
98cfc17843
|
Fix bug with printing due to uneven handling of bool types
|
2014-02-04 08:12:02 -08:00 |
|
Evghenii
|
d3a6693eef
|
adding __have_native_{rsqrtd,rcpd} to select between native support for double precision reciprocals and using slower but safe version in stdlib
|
2014-02-04 16:29:23 +01:00 |
|
Evghenii
|
fe98fe8cdc
|
added fast approximate rcp(double) accurate to 15 digits
|
2014-02-04 15:23:34 +01:00 |
|
Evghenii
|
eb1a495a7a
|
added support for fast approximate rsqrt(double). Provide 16 digit accurancy but is over 3x faster than 1/sqrt(double)
|
2014-02-04 14:44:54 +01:00 |
|
jbrodman
|
720975dff4
|
Disallow initializing void * with ptr to const.
|
2014-02-04 03:36:19 -08:00 |
|
jbrodman
|
4ee0e6996a
|
Merge branch 'master' of https://github.com/ispc/ispc
|
2014-02-04 02:48:41 -08:00 |
|
jbrodman
|
47bdca1041
|
Modify alloy.py to put dbg llvm builds in different folders.
|
2014-02-04 02:46:07 -08:00 |
|
Evghenii
|
c2ed214a74
|
added declaretion for movmsk_ptx
|
2014-02-03 08:57:27 +01:00 |
|
Evghenii
|
1a56fbc101
|
+1
|
2014-02-03 08:51:55 +01:00 |
|
Evghenii
|
98c82242c5
|
allowed static and disable memcpy/memmove/memset operations
|
2014-02-03 08:02:50 +01:00 |
|
Evghenii
|
e6a6df1052
|
+1
|
2014-02-02 19:04:26 +01:00 |
|
Evghenii
|
6d034596d3
|
+1
|
2014-02-02 19:01:10 +01:00 |
|