mirror of
https://github.com/stevenrobertson/cuburn.git
synced 2025-02-05 11:40:04 -05:00
Add README
This commit is contained in:
parent
6959729ea1
commit
a052f7f4c5
8
README.md
Normal file
8
README.md
Normal file
@ -0,0 +1,8 @@
|
|||||||
|
# Cuburn
|
||||||
|
|
||||||
|
This project is a fractal flame renderer. It is still under development, but
|
||||||
|
already implements most of the genome parameters that [flam3](http://flam3.com)
|
||||||
|
supports, and beats CPU by a healthy margin (20-40x in most cases).
|
||||||
|
|
||||||
|
This project is licensed under the GPL version 3.
|
||||||
|
|
48
TODO
48
TODO
@ -1,48 +0,0 @@
|
|||||||
Status: passes rudimentary tests
|
|
||||||
|
|
||||||
Current goals:
|
|
||||||
|
|
||||||
- Start xforms
|
|
||||||
- xform selection, pre- and post-transform in xform
|
|
||||||
- first of the variations
|
|
||||||
|
|
||||||
Things to do (rather severely incomplete):
|
|
||||||
|
|
||||||
- LaunchContext thread distribution based on generated code register count and
|
|
||||||
shared memory size
|
|
||||||
- qlocal storage
|
|
||||||
- Performance implications of different state spaces
|
|
||||||
- Shared / cache projected usage and its effect on above
|
|
||||||
- Implement qlocal storage, and hide the complexity
|
|
||||||
- The `Feature` class
|
|
||||||
- Transform count and per-transform code layout
|
|
||||||
- Filter size, oversample, final buffer size
|
|
||||||
- Buffer allocation, clearing, reading from device
|
|
||||||
- Preview window
|
|
||||||
- When/how to sample?
|
|
||||||
- OpenGL interop worth it?
|
|
||||||
- Implement
|
|
||||||
- Implement xforms
|
|
||||||
- Shuffle
|
|
||||||
- State space implications, you know the drill
|
|
||||||
- Implement
|
|
||||||
- Test effects on quality by masking off writes on all but one lane and
|
|
||||||
boosting the sample density to compensate (muuuuuch later on)
|
|
||||||
- DE
|
|
||||||
- Clean up code (particularly DSL stuff incl. injector)
|
|
||||||
|
|
||||||
Things to test:
|
|
||||||
|
|
||||||
- Debug flag/dict/whatever for entire project in general
|
|
||||||
- Iteration counters for IterThread
|
|
||||||
|
|
||||||
Things to benchmark:
|
|
||||||
|
|
||||||
- Kernel invocation and/or interrupt times (will high load freeze X?)
|
|
||||||
- MWC float conversion
|
|
||||||
- The entire scatter process
|
|
||||||
- Radix sort of writeback coordinates
|
|
||||||
- Log-copy-histogram approach
|
|
||||||
- Direct reductions
|
|
||||||
- Surface loads, stores, reductions
|
|
||||||
|
|
Loading…
Reference in New Issue
Block a user