Steven Robertson 094890c324 Use shared memory for iter_count and have each CP processed by only one CTA.
Slower, but the code is a bit simpler conceptually, and the slowdown will be
more than offset by better scheduling towards the end of the process.
2010-09-07 14:54:50 -04:00
Description
PyCUDA implementation of a GPU-accelerated fractal flame renderer.
2.3 MiB
Languages
Python 92.8%
Cuda 6%
Shell 0.6%
C 0.6%