Using an CPU as OpenCL device on macOS leads to a floating point exception
After !4 (merged) was merged, also CPU device are possible. On linux this is no problem, but on macOS this leads to a floating point exception, because
CL_KERNEL_WORK_GROUP_SIZE is 1 for CPU OpenCL devices, which seems to be macOS specific, according to this post: https://stackoverflow.com/a/25287001. This leads to a division by zero here in the kernel generation code, because
batchSize depends on the work group size, and seems to be expected to be at least 8.