Steven Robertson 185823ad55 Rearrange the main render loop... again.
Using one stream with two pagelocked host buffers allows us to keep the
GPU work queue full without pegging the CPU, and also reduces the
incidences where a host buffer will get overwritten before it can be
written. devtid() was flaky, so this patch also introduces a ringbuffer
to handle the 'slots' concept. It also introduces an adaptive number of
temporal samples, which improves efficiency but also killed the
assumption that (ntemporal_samples % 256 == 0), which required some
additional fixes.
2011-10-28 08:30:36 -04:00
2011-10-17 22:35:22 -04:00
2011-10-03 17:37:32 -04:00

Cuburn

This project is a fractal flame renderer. It is still under development, but already implements most of the genome parameters that flam3 supports, and beats CPU by a healthy margin (20-40x in most cases).

This project is licensed under the GPL version 3.

Description
PyCUDA implementation of a GPU-accelerated fractal flame renderer.
Readme 2.3 MiB
Languages
Python 92.8%
Cuda 6%
Shell 0.6%
C 0.6%