Re: Broken output from my algorithm on nVidia OpenCL implementation
Posted by Michael Bien on Apr 21, 2011; 2:41pm
URL: https://forum.jogamp.org/Broken-output-from-my-algorithm-on-nVidia-OpenCL-implementation-tp2843828p2847257.html
hello Piotr,
works fine with AMD drivers on x86_64
CLContext [id: 139933794781904, platform: ATI Stream, profile: FULL_PROFILE, devices: 1]
CLDevice [id: 139933794718016 name: Intel(R) Core(TM) i7 CPU 940 @ 2.93GHz type: CPU profile: FULL_PROFILE]
driver: 2.0
Compiling kernel.
(...)
however the kernel does not compile cleanly for my NV GPU:
CLContext [id: 139763470826864, platform: NVIDIA CUDA, profile: FULL_PROFILE, devices: 1]
CLDevice [id: 139763471694336 name: GeForce GTX 295 type: GPU profile: FULL_PROFILE]
driver: 270.41.06
Exception in thread "main" com.jogamp.opencl.CLException$CLInvalidBinaryException:
CLDevice [id: 140152635904016 name: GeForce GTX 295 type: GPU profile: FULL_PROFILE] build log:
ptxas error : Entry function 'sort16PairsPlusLocal' uses too much shared data (0x4030 bytes + 0x10 bytes system, 0x4000 max) [error: CL_INVALID_BINARY]
at com.jogamp.opencl.CLException.newException(CLException.java:78)
at com.jogamp.opencl.CLProgram.build(CLProgram.java:381)
at com.jogamp.opencl.CLProgramBuilder.build(CLProgramBuilder.java:247)
at com.jogamp.opencl.CLProgramBuilder.build(CLProgramBuilder.java:224)
at streampacker.Main.main(Main.java:67)
why don't you start a github project? Makes sharing and collaboration easier.
regards,
michael