Create a map assigning two bits to every output bin. During the atomic flush, compute a threshold for discarding writes altogether that would keep us under 2% error: discard 1 of every 2 writes if we've already accumulated 64 writes (hotspot value 1), 7 of 8 if we're above 256 (hotspot value 2), or 31 of 32 at 2048 (hotspot value 3). Pack this value into a read-only buffer that can often be cached in L2, and for particularly concentrated flames (which historically choke cuburn), in L1. During writeback, discard writes at the appropriate rate. During the flush of the integer accumulator to the float buffer, scale the integer accumulators by the discard rate.

This works because most flames don't have much interesting structure in the middle regimes: either a region is very well defined, in which case we pretty much know exactly what the color is going to be (remember, the maximum 2% relative error gets log-scaled as well), or it's loosely defined and should be kept at full accuracy.

Of course, a 10x boost is close to best-case: a long, high-res render. I realized, though, that I really didn't care about low-quality output and should go for broke optimizing for my use case, which is ridiculously high-res HDR work. (On pathological flames, on the other hand, 10x is conservative; this easily gives us 100x.)
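To make the scheme concrete, here's a minimal NumPy sketch of the map construction, the per-write keep test, and the compensating flush scale. The function names and the deterministic modulo keep test are illustrative assumptions; cuburn's actual implementation lives in CUDA device code.

```python
import numpy as np

# Thresholds at which the discard rate steps up, and the keep rate for each
# two-bit hotspot value: value 0 keeps everything, value 1 keeps 1 of 2
# writes past 64 hits, value 2 keeps 1 of 8 past 256, value 3 keeps 1 of 32
# past 2048. Each rate is chosen to keep the relative error under 2%.
THRESHOLDS = (64, 256, 2048)
KEEP_SHIFT = np.array([0, 1, 3, 5])  # keep 1 of every 2**shift writes

def build_hotspot_map(hit_counts):
    """Pack a two-bit hotspot value per output bin from current hit counts."""
    hotspot = np.zeros(hit_counts.shape, dtype=np.uint8)
    for value, thresh in enumerate(THRESHOLDS, start=1):
        hotspot[hit_counts >= thresh] = value
    return hotspot

def keep_write(write_index, hotspot_value):
    """Decide during writeback whether this write survives for its bin."""
    mask = (1 << int(KEEP_SHIFT[hotspot_value])) - 1
    return (write_index & mask) == 0

def flush_scale(hotspot):
    """Per-bin factor applied when flushing the integer accumulator to
    float, compensating for the writes that were discarded."""
    return (1 << KEEP_SHIFT[hotspot]).astype(np.float32)
```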
Cuburn
This project is a fractal flame renderer, similar to flam3, built for CUDA GPUs. It also includes a new blending system, the ability to perform spline-based animation of individual parameters over time, and new, HDR-aware filtering mechanisms. The overall result is a distinctive, dreamlike atmosphere, rendered up to one hundred times faster than on a CPU.
This project is licensed under the GPL version 3.
Getting started
To use cuburn, you'll need the following dependencies:
- A CUDA-compatible graphics card with SM 2.0 (GTX 400 series) or newer
- A recent CUDA toolkit (at least v4.1) and drivers
- pycuda
- numpy
- scipy
- tempita
- Maybe some other stuff; I'll come back to check later
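Assuming a working CUDA toolkit and driver, the Python-side dependencies can usually be installed in one shot with pip (unverified against any particular cuburn revision):

```
pip install pycuda numpy scipy tempita
```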
Perform a git checkout of cuburn, cd to cuburn's directory, and run `python main.py -g input.flam3`; your GPU will start rendering your fractal flame, saving the results to the current working directory. `main.py` is the primary interface to cuburn, and it includes built-in help; run `python main.py --help` to find out what it supports.
Once the program runs, check out the sample flock for more details on the design of the JSON flame format and how to use it to manage and render a collection of flames.
Differences between flam3 and cuburn
Gorgeous, fluid interpolation
Erik Reckase and Vitor Bosshard came up with a phenomenal new approach to applying the typical rotational parameter interpolation used by flam3's animation system, which served as a template for the current spline-based system used by cuburn (which Erik also worked on). The result can literally stop you in your tracks when you see it, and I'm thrilled to have been able to implement their ideas.
Everything is an animation
Cuburn is built for beautiful, fluid animations, and the interface is designed with that in mind. If you want a single image, with no motion blur or velocity interpolation, you can pass the `--still` argument to any cuburn command that accepts profile arguments.
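For example, something like `python main.py --still -g input.flam3` should produce a single frame; this exact combination of flags is a guess, so check `python main.py --help` for the authoritative list.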
Temporally-aware, graphical, JSON-encoded flame representation
Cuburn can read flam3-style XML flame descriptions, but it really shines when used with its native JSON dialect. This method of representing flames has temporality in every expression, and was designed from the ground up for nondestructive composition and blending, so that everything from individual frames to composed sequences to entire flocks can be lovingly tended by a community of artists and editors.
The format is also amenable to composition with less directed input, such as that produced by a genetic algorithm, which can add variety and spark inspiration without destroying the original intent or severing the "anchor points" which tie a flock together.
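As a purely hypothetical illustration of what temporality in every expression can look like (the field names below are invented, not cuburn's actual schema), a parameter might carry spline knots rather than a bare scalar:

```python
import json

# Invented schema for illustration: each animatable parameter is a list of
# (time, value) knots, so blending two flames can operate on whole curves
# instead of single-frame snapshots.
flame = {
    'xforms': {
        '0': {
            'weight': [[0.0, 0.5], [0.5, 0.9], [1.0, 0.7]],
            'variations': {'spherical': [[0.0, 1.0], [1.0, 1.0]]},
        },
    },
}
print(json.dumps(flame, indent=2))
```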
It still needs a frontend to realize this humanely, so right now this might as well be vaporware.
Output profiles
Cuburn separates the description of the underlying mathematical system from the particulars of rendering. Instead of containing information such as output resolution within the flame XML, cuburn allows flames to specify the camera in terms of IFS coordinates, and then applies an output profile to convert camera coordinates to image-space coordinates for rendering.
Filtering parameters and sampling patterns are specified the same way, and in almost every case, parameters are continuously valued. This means the same splines that give animators complete control over the movement of the frame can also describe its filtering relative to the other sheep in a flock.
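A rough sketch of the split, with invented field names rather than cuburn's real camera and profile keys: the flame's camera fixes which region of IFS space is visible, and the output profile decides how many pixels that view receives.

```python
def camera_to_image(x, y, camera, profile):
    """Map a point from IFS space through the flame's camera into the
    pixel grid chosen by the output profile."""
    # Camera (stored with the flame): where we look and how wide, in IFS units.
    u = (x - camera['center_x']) / camera['scale']
    v = (y - camera['center_y']) / camera['scale']
    # Profile (separate from the flame): the pixel grid for this render.
    return (u + 0.5) * profile['width'], (v + 0.5) * profile['height']

# The same flame renders at any resolution by swapping profiles:
print(camera_to_image(0.25, -0.1,
                      {'center_x': 0.0, 'center_y': 0.0, 'scale': 2.0},
                      {'width': 1920, 'height': 1080}))
```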
Totally revised filtering
I sat down with a notebook and a laptop running Maxima, and came out with some weird, inefficient shear-sampled hybrid directional-bilateral-gradient nonsense filter and a new tone-mapping algorithm that intentionally adds spatial distortion around clipped regions. Looks great, tho.