Is there any support for half-precision floating point values in CUDA?
Background: I want to manipulate an OpenGL texture using glTexSubImage3D with data from a PBO which I generate using CUDA. The texture is stored in GL_INTENSITY16 format (which is a half-precision floating type AFAIK). I don't want to use glPixelTransferf(GL_x_SCALE,...) to scale from integer values, since it seems to be much faster without the scaling.
Any advice?