release notes, housekeeping for 1.0.6 release
This commit is contained in:
@@ -1,3 +1,34 @@
|
|||||||
|
=== v1.0.6 === (17 August 2011)
|
||||||
|
|
||||||
|
Some additional cross-program instance operations have been added to the
|
||||||
|
standard library. reduce_equal() checks to see if the given value is the
|
||||||
|
same across all running program instances, and exclusive_scan_{and,or,and}()
|
||||||
|
computes a scan over the given value in the running program instances.
|
||||||
|
See the documentation of these new routines for more information:
|
||||||
|
http://ispc.github.com/ispc.html#cross-program-instance-operations.
|
||||||
|
|
||||||
|
The simple task system implementations used in the examples have been
|
||||||
|
improved. The Windows version no nlonger has a hard limit on the number of
|
||||||
|
tasks that can be launched, and all versions have less dynamic memory
|
||||||
|
allocation and less locking. More of the examples now have paths that also
|
||||||
|
measure performance using tasks along with SPMD vectorization.
|
||||||
|
|
||||||
|
Two new examples have been added: one that shows the implementation of a
|
||||||
|
ray-marching volume rendering algorithm, and one that shows a 3D stencil
|
||||||
|
computation, as might be done for PDE solutions.
|
||||||
|
|
||||||
|
Standard library routines to issue prefetches have been added. See the
|
||||||
|
documentation for more details: http://ispc.github.com/ispc.html#prefetches.
|
||||||
|
|
||||||
|
Fast versions of the float to half-precision float conversion routines have
|
||||||
|
been added. For more details, see:
|
||||||
|
http://ispc.github.com/ispc.html#conversions-to-and-from-half-precision-floats.
|
||||||
|
|
||||||
|
There is the usual set of small bug fixes. Notably, a number of details
|
||||||
|
related to handling 32 versus 64 bit targets have been fixed, which in turn
|
||||||
|
has fixed a bug related to tasks having incorrect values for pointers
|
||||||
|
passed to them.
|
||||||
|
|
||||||
=== v1.0.5 === (1 August 2011)
|
=== v1.0.5 === (1 August 2011)
|
||||||
|
|
||||||
Multi-element vector swizzles are supported; for example, given a 3-wide
|
Multi-element vector swizzles are supported; for example, given a 3-wide
|
||||||
|
|||||||
@@ -1988,6 +1988,18 @@ function returns the 16 bits that are the closest match to the given
|
|||||||
int16 float_to_half(float f)
|
int16 float_to_half(float f)
|
||||||
uniform int16 float_to_half(uniform float f)
|
uniform int16 float_to_half(uniform float f)
|
||||||
|
|
||||||
|
There are also faster versions of these functions that don't worry about
|
||||||
|
handling floating point infinity, "not a number" and denormalized numbers
|
||||||
|
correctly. These are faster than the above functions, but are less
|
||||||
|
precise.
|
||||||
|
|
||||||
|
::
|
||||||
|
|
||||||
|
float half_to_float_fast(unsigned int16 h)
|
||||||
|
uniform float half_to_float_fast(uniform unsigned int16 h)
|
||||||
|
int16 float_to_half_fast(float f)
|
||||||
|
uniform int16 float_to_half_fast(uniform float f)
|
||||||
|
|
||||||
|
|
||||||
Atomic Operations and Memory Fences
|
Atomic Operations and Memory Fences
|
||||||
-----------------------------------
|
-----------------------------------
|
||||||
|
|||||||
@@ -31,7 +31,7 @@ PROJECT_NAME = "Intel SPMD Program Compiler"
|
|||||||
# This could be handy for archiving the generated documentation or
|
# This could be handy for archiving the generated documentation or
|
||||||
# if some version control system is used.
|
# if some version control system is used.
|
||||||
|
|
||||||
PROJECT_NUMBER = 1.0.5
|
PROJECT_NUMBER = 1.0.6
|
||||||
|
|
||||||
# The OUTPUT_DIRECTORY tag is used to specify the (relative or absolute)
|
# The OUTPUT_DIRECTORY tag is used to specify the (relative or absolute)
|
||||||
# base path where the generated documentation will be put.
|
# base path where the generated documentation will be put.
|
||||||
|
|||||||
Reference in New Issue
Block a user