PyCUDA implementation of a GPU-accelerated fractal flame renderer.
Go to file
Steven Robertson 094890c324 Use shared memory for iter_count and have each CP processed by only one CTA.
Slower, but the code is a bit simpler conceptually, and the difference will be
more than accounted for by better scheduling towards the end of the process.
2010-09-07 14:54:50 -04:00
cuburnlib Use shared memory for iter_count and have each CP processed by only one CTA. 2010-09-07 14:54:50 -04:00
helpers Initial commit. 2010-08-27 12:28:02 -04:00
bench.py Add the first of many microbenchmarks 2010-09-07 12:44:12 -04:00
main.py Finally runs again 2010-09-06 11:18:20 -04:00
TODO Switch from to_inject() to object insertion. One less kludge to deal with. 2010-09-06 16:09:37 -04:00