Steven Robertson 618b51b1b1 Speed enhancement: alpha packing.
When the alpha channel is used in a color palette, the code now replaces
the blue channel in the accumulation buffer with a pair of U16s
that encode the blue and alpha channels as fractions of the
density value. When the alpha channel is always 1.0, the blue
channel works as normal. Density is now always the last element in the
accumulation buffer.
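
The packing described above can be sketched on the host side in plain Python. This is only an illustration, not the renderer's actual device code: the helper names `pack_blue_alpha` and `unpack_blue_alpha`, the bit order (alpha in the high 16 bits, blue in the low 16 bits), and the clamping and rounding choices are all assumptions made for the example.

```python
U16_MAX = 0xFFFF  # largest value representable in an unsigned 16-bit integer


def pack_blue_alpha(blue, alpha, density):
    """Pack blue and alpha, each as a fraction of density, into one 32-bit word.

    Hypothetical helper: the bit layout (alpha high, blue low) is an assumption.
    """
    if density <= 0.0:
        return 0  # nothing accumulated yet, so both fractions are zero

    def quantize(value):
        # Express the channel as a fraction of density, clamped to [0, 1],
        # then scale to the U16 range.
        frac = min(max(value / density, 0.0), 1.0)
        return int(round(frac * U16_MAX))

    return (quantize(alpha) << 16) | quantize(blue)


def unpack_blue_alpha(word, density):
    """Recover approximate blue and alpha values from a packed word."""
    blue_frac = (word & U16_MAX) / U16_MAX
    alpha_frac = (word >> 16) / U16_MAX
    return blue_frac * density, alpha_frac * density


# One accumulator cell in this sketch: (red, green, packed blue/alpha, density),
# with density as the last element.
red, green, blue, alpha, density = 1.0, 3.0, 2.0, 4.0, 8.0
cell = (red, green, pack_blue_alpha(blue, alpha, density), density)
blue_out, alpha_out = unpack_blue_alpha(cell[2], cell[3])
```

Because each fraction is quantized to 16 bits, the recovered values are approximations; the error is bounded by about `density / 65535` per channel.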

Eliminating the separate IO operations improved total runtime by more
than 30% on my card, while the extra calculations reduced that to 20%
when alpha was present (though that can be optimized further).
2011-10-11 09:57:37 -04:00

Cuburn

This project is a fractal flame renderer. It is still under development, but it already implements most of the genome parameters that flam3 supports and outperforms CPU rendering by a healthy margin (20-40x in most cases).

This project is licensed under the GPL version 3.

Description
PyCUDA implementation of a GPU-accelerated fractal flame renderer.
Languages
Python 92.8%
Cuda 6%
Shell 0.6%
C 0.6%