http://voxelium.wordpress.com/2013/08/1 ... tructions/
Unfortunately the improvement is small compared to SSE, but at least it's consistently faster and easy to implement.
Faster Incoherent Ray Traversal Using 8-Wide AVX Instruction
Re: Faster Incoherent Ray Traversal Using 8-Wide AVX Instruc
Looks nice.
I'll dig deeper, but do you think such a strategy could also help on the GPU?
Re: Faster Incoherent Ray Traversal Using 8-Wide AVX Instruc
Thanks! I'm not sure about the GPU. I think it's worth investigating, but going beyond 8-way branching is probably not a good idea.
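
For anyone reading along who wants to picture what "8-way branching" means in practice, here is a rough sketch of testing one ray against the eight child boxes of a node. It is not the code from the linked post: it is plain scalar C++ rather than AVX intrinsics, and the names `Node8`, `Ray` and `intersect8` are made up for illustration. The assumption is a structure-of-arrays layout where index i of every array belongs to child i, so each loop iteration corresponds to what one lane of an 8-wide register would compute.

```cpp
#include <algorithm>

// Hypothetical node layout: bounds of 8 children stored structure-of-arrays,
// so element i of each array describes child i (one SIMD lane per child).
struct Node8 {
    float minX[8], minY[8], minZ[8];
    float maxX[8], maxY[8], maxZ[8];
};

// Hypothetical ray layout with the reciprocal direction precomputed.
struct Ray {
    float org[3];
    float invDir[3];  // 1 / direction, per axis
    float tMin, tMax; // valid parametric interval along the ray
};

// Slab test of one ray against all 8 child boxes.
// Returns a bitmask: bit i is set when the ray overlaps child i within [tMin, tMax].
unsigned intersect8(const Node8& n, const Ray& r) {
    unsigned mask = 0;
    for (int i = 0; i < 8; ++i) {
        // Parametric distances to the two bounding planes on each axis.
        float tx0 = (n.minX[i] - r.org[0]) * r.invDir[0];
        float tx1 = (n.maxX[i] - r.org[0]) * r.invDir[0];
        float ty0 = (n.minY[i] - r.org[1]) * r.invDir[1];
        float ty1 = (n.maxY[i] - r.org[1]) * r.invDir[1];
        float tz0 = (n.minZ[i] - r.org[2]) * r.invDir[2];
        float tz1 = (n.maxZ[i] - r.org[2]) * r.invDir[2];

        // Latest entry and earliest exit across the three slabs.
        float tNear = std::max({std::min(tx0, tx1), std::min(ty0, ty1),
                                std::min(tz0, tz1), r.tMin});
        float tFar  = std::min({std::max(tx0, tx1), std::max(ty0, ty1),
                                std::max(tz0, tz1), r.tMax});

        if (tNear <= tFar) mask |= (1u << i);
    }
    return mask;
}
```

In a real 8-wide version the loop body would become a handful of vector multiplies, min/max operations and one compare, with the mask extracted from the compare result. The sketch ignores child ordering and NaN handling for rays that lie exactly on a box face.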