
I'm very curious to see how good this will look on a more complex interior scene - if it looks good and fast, I'll start dreaming of having this in Blender's viewport someday.
Moderators: Dade, jromang, tomb, coordinators
OpenCL Device 0: Type = TYPE_GPU
OpenCL Device 0: Name = GeForce 8800 GT
OpenCL Device 0: Compute units = 14
OpenCL Device 0: Max. work group size = 512
Failed to open file 'rendering_kernel.cl'
Lord Crc wrote:Yeah Dade forgot (or didn't want?) to include the kernel file
doug@l64dev:~/dev/SmallptGPU-v1.0$ ./smallptGPU 0 1 1024 768 scenes/simple.scn
Usage: ./smallptGPU
Usage: ./smallptGPU <use CPU device (0 or 1)> <use GPU device (0 or 1)> <window width> <window height> <scene file>
Reading scene: scenes/simple.scn
Scene size: 5
OpenCL Device 0: Type = TYPE_GPU
OpenCL Device 0: Name = GeForce 8400 GS
OpenCL Device 0: Compute units = 1
OpenCL Device 0: Max. work group size = 512
Reading file 'rendering_kernel.cl' (size 2634 bytes)
Failed to build OpenCL kernel: -42
OpenCL Programm Build Log: ptxas ptx input, line 2088; fatal : Parsing error near '.': syntax error
: Retrieving binary for 'anonymous_jit_identity', for gpu='sm_11', usage mode='
__kernel void RadianceGPU(
__global Vec *colors, __global unsigned int *seedsInput,
__global Sphere *Spheres, __global Camera *camera,
const unsigned int SphereCount, const int width, const int height,
const int currentSample,
__global int *pixels) {
const int gid = get_global_id(0);
const int lid = get_local_id(0);
const int gid2 = 2 * gid;
const int x = gid % width;
const int y = gid / width;
// move seed to local store
unsigned int seeds[2];
mem_fence(CLK_GLOBAL_MEM_FENCE);
seeds[0] = seedsInput[gid2+0];
seeds[1] = seedsInput[gid2+1];
...
Radiance(Spheres, SphereCount, &ray,
&seeds[0], &r);
...
mem_fence(CLK_GLOBAL_MEM_FENCE);
seedsInput[gid2+0] = seeds[0];
seedsInput[gid2+1] = seeds[1];
}
Lord Crc wrote:Not sure if the fences are necessary but didn't affect performance, so I'd say better safe than sorry
Dade wrote:Dougal2: your error is really strange, it looks like in internal error of the compiler (bug ?). -42 stands for CL_INVALID_BINARY. It looks like the compiler is generating something wrong.
Users browsing this forum: No registered users and 2 guests