> Thanks a lot for the explanation!
> I tried on snow leopard with LWS = 0 and not rounding up and it works
> but it is many times slower as expected (because it may be not
> optimal, up to 10 times slower in my case).
> Will give it a shot tomorrow on Lion.
> What's the best way to figure out the optimal LWS on a generic system
> using some reasonable assumptions (I considered WGS=256 a reasonable
> assumption ... I must assume Lion drivers somehow screwed up that
i don't know. Check the CLInfo tool to get some experience with common
values for certain hardware (max WG size and for getMaxWorkItemSizes).
Thats what i do also - CL is still new its hard to make those
assumptions right now, esp if you want to run on every type of hardware
(CPU, GPU etc).
so, don't hardcode those values and don't use anything larger than those
(device dependent) max values and hope for the best :)
On 08/31/2011 01:44 AM, Giovanni Idili [via jogamp] wrote: