
Re: Broken output from my algorithm on nVidia OpenCL implementation

Posted by Michael Bien on Apr 21, 2011; 2:41pm
URL: https://forum.jogamp.org/Broken-output-from-my-algorithm-on-nVidia-OpenCL-implementation-tp2843828p2847257.html

Hello Piotr,

Your code works fine with the AMD drivers on x86_64 (running on the CPU device):
CLContext [id: 139933794781904, platform: ATI Stream, profile: FULL_PROFILE, devices: 1]
CLDevice [id: 139933794718016 name: Intel(R) Core(TM) i7 CPU         940  @ 2.93GHz type: CPU profile: FULL_PROFILE]
driver: 2.0
Compiling kernel.
(...)

However, the kernel does not build for my NVIDIA GPU; ptxas reports that it uses too much shared (local) memory:
CLContext [id: 139763470826864, platform: NVIDIA CUDA, profile: FULL_PROFILE, devices: 1]
CLDevice [id: 139763471694336 name: GeForce GTX 295 type: GPU profile: FULL_PROFILE]
driver: 270.41.06
Exception in thread "main" com.jogamp.opencl.CLException$CLInvalidBinaryException:
CLDevice [id: 140152635904016 name: GeForce GTX 295 type: GPU profile: FULL_PROFILE] build log:
ptxas error   : Entry function 'sort16PairsPlusLocal' uses too much shared data (0x4030 bytes + 0x10 bytes system, 0x4000 max) [error: CL_INVALID_BINARY]
        at com.jogamp.opencl.CLException.newException(CLException.java:78)
        at com.jogamp.opencl.CLProgram.build(CLProgram.java:381)
        at com.jogamp.opencl.CLProgramBuilder.build(CLProgramBuilder.java:247)
        at com.jogamp.opencl.CLProgramBuilder.build(CLProgramBuilder.java:224)
        at streampacker.Main.main(Main.java:67)
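
To put numbers on the ptxas message: 0x4030 is 16432 bytes requested by the kernel, plus 0x10 (16) bytes of system use, against a limit of 0x4000 (16384 bytes, i.e. 16 KiB). So the kernel is over by 64 bytes. Below is a rough Java sketch of that arithmetic. The class and method names are made up for illustration and are not part of JOCL; it only parses the message text shown above.

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

public class SharedMemCheck {

    // Matches the three hex values in a ptxas "uses too much shared data" message:
    // (0x<kernel> bytes + 0x<system> bytes system, 0x<max> max)
    private static final Pattern SIZES = Pattern.compile(
            "\\(0x([0-9a-fA-F]+) bytes \\+ 0x([0-9a-fA-F]+) bytes system, 0x([0-9a-fA-F]+) max\\)");

    /** Returns by how many bytes kernel + system usage exceeds the reported maximum. */
    static long overflowBytes(String buildLog) {
        Matcher m = SIZES.matcher(buildLog);
        if (!m.find()) {
            throw new IllegalArgumentException("no shared data sizes found in build log");
        }
        long kernel = Long.parseLong(m.group(1), 16); // shared data used by the kernel
        long system = Long.parseLong(m.group(2), 16); // shared data reserved by the system
        long max = Long.parseLong(m.group(3), 16);    // per-work-group limit on this device
        return kernel + system - max;
    }

    public static void main(String[] args) {
        String log = "ptxas error   : Entry function 'sort16PairsPlusLocal' uses too much "
                + "shared data (0x4030 bytes + 0x10 bytes system, 0x4000 max)";
        System.out.println(overflowBytes(log) + " bytes over the limit");
    }
}
```

If that reading is right, reducing the __local memory used by sort16PairsPlusLocal by at least 64 bytes per work-group should be enough to get it to build on this device.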

Why don't you start a GitHub project? It would make sharing and collaboration easier.

regards,
michael