doing this now > 20 years all this ads seems to be odd (at least for me).
every time i look into the details of impl. its boring crap and never optimized for the used hw.
so the good old times are over since years...
Search found 54 matches
- Thu Mar 22, 2018 3:00 pm
- Forum: General Development
- Topic: DXR
- Replies: 6
- Views: 1067
- Wed Jan 31, 2018 12:18 pm
- Forum: Links & Papers
- Topic: Stackless MBVH Traversal for CPU, MIC and GPU Ray Tracing
- Replies: 15
- Views: 14208
Re: Stackless MBVH Traversal for CPU, MIC and GPU Ray Tracing
again: this is 10 years old crap. go with state of the art -> google is your friend !
- Wed Nov 29, 2017 9:36 am
- Forum: General Development
- Topic: Path tracing benchmark
- Replies: 3
- Views: 3259
Re: Path tracing benchmark
just a quick update: we did some tests on intels knights mill. to make it short: the machine is boring.
no perf. progress at all. all the vector ext. are simple crap. for graphics intel seems to be a dead end. gpus will dominate the next years.
so all the cpu stuff was wasted time.
mp
no perf. progress at all. all the vector ext. are simple crap. for graphics intel seems to be a dead end. gpus will dominate the next years.
so all the cpu stuff was wasted time.
mp
- Tue Nov 07, 2017 9:21 am
- Forum: Tools, Demos & Sources
- Topic: SIMDy
- Replies: 7
- Views: 2377
Re: SIMDy
sorry, but absolutely useless. the n+1 invocation of an "auto-vectorizer" ... and pathon ? what is it really good for ?
- Wed Jan 25, 2017 1:16 pm
- Forum: General Development
- Topic: Path tracing benchmark
- Replies: 3
- Views: 3259
Path tracing benchmark
It took some time after we build up our first coherent traversal kernel for avx512 http://ompf2.com/viewtopic.php?f=3&t=2103 to get a competitive incoherent kernel ready for prime time. Here it is ! For the benchmark we used a full pipeline available easily on any architecture: 1) camera ray gen...
- Tue Oct 04, 2016 10:25 am
- Forum: General Development
- Topic: AVX512 MBVH4 Traversal
- Replies: 8
- Views: 6372
Re: AVX512 MBVH4 Traversal
yes (nn sampling and bi-linear sampling). keep in mind that knl has 90gb/s on pretty large mem and extra 400gb/s on 16gb. atm we are playing around with all the diff. mem. options. beside this, we try to run the pt as a special kind of "stand-alone-app" on knl without os noise. a lot o new...
- Fri Sep 30, 2016 11:53 am
- Forum: General Development
- Topic: AVX512 MBVH4 Traversal
- Replies: 8
- Views: 6372
Re: AVX512 MBVH4 Traversal
yes, i would like to run the bench on latest gpu gen. but titan is all i have around.concerning the incoherent transport: yes it will be a different story for shure. first of all, the implementation is not straight forward on avx512 (avx512 is pretty inflexible when it comes to random access streami...
- Fri Sep 23, 2016 1:05 pm
- Forum: General Development
- Topic: AVX512 MBVH4 Traversal
- Replies: 8
- Views: 6372
AVX512 MBVH4 Traversal
with intels knl being now available to everyone, people start asking for a native avx512 port of clpt ( http://ompf2.com/viewtopic.php?f=3&t=2075 ). knl seems to be the first accelerator from intel with some kind of power under the hood (knf and knc have been simple nonstarters). so i did an opt...
- Thu Jul 14, 2016 9:28 am
- Forum: General Development
- Topic: super fast parallel sbvh construction
- Replies: 6
- Views: 9575
Re: super fast parallel sbvh construction
spectral wrote:Is there some patent on it ... or we are free to use it or even add it to embree ?
Which Embree kernel do you use for the comparison ?
no patent. you are free to use it as you like. paper describes everything in detail.
we compared against all of them and took the best performing one.
mp
- Tue Jun 07, 2016 12:03 pm
- Forum: General Development
- Topic: super fast parallel sbvh construction
- Replies: 6
- Views: 9575
super fast parallel sbvh construction
some month ago we presented early results of our mbvh4 traversal and intersection routines for manycore cpus. these implementations outperform latest embree kernels by 4x for coherent ray transport and > 2x for incoherent transport. the underlying accl. structure is a mbvh4 built with spatial splits...