Commit Graph

71 Commits

Author SHA1 Message Date
Steven Robertson
1398706886 Remove SS from DE, and improve performance. 2012-01-20 11:17:07 -05:00
Steven Robertson
8c29212821 Experimental supersampling and DE changes 2012-01-09 21:15:05 -05:00
Steven Robertson
693a7a6dc3 Go back to smaller dither radius factor 2011-12-21 16:20:41 -05:00
Steven Robertson
09725ba794 Correct dither fail. 2011-12-21 11:59:40 -05:00
Steven Robertson
4ab18c06e0 Name a few more templates 2011-12-17 20:30:21 -05:00
Steven Robertson
b43481e374 New genome format to support flockutil 2011-12-15 11:11:05 -05:00
Steven Robertson
c59829ad86 Fix buffer overflow 2011-12-10 16:23:24 -05:00
Steven Robertson
411d8b6263 Be more conservative in running overflow check 2011-12-10 12:42:40 -05:00
Steven Robertson
6c50e6dadc New atomic write mode 2011-12-10 12:18:00 -05:00
Steven Robertson
a0fd3f965f More template names 2011-12-10 12:16:42 -05:00
Steven Robertson
a509e58b43 Fix overflow-related bug (hopefully) 2011-12-09 18:46:40 -05:00
Steven Robertson
c5da1efc74 Lockless lossy shared memory writeback.
Barely tested! And yet it's going straight into master. Lucky you!
2011-12-09 16:13:23 -05:00
Steven Robertson
6bac3b3a95 Use reordered, lossy bit handling 2011-12-09 14:14:36 -05:00
Steven Robertson
d3ee6f36c2 Flat (pre-packed int) palettes in deferred mode. 2011-12-08 20:55:07 -05:00
Steven Robertson
b76208078f Deferred works again. Time to break it. 2011-12-08 15:28:10 -05:00
Steven Robertson
b73461132c Use consts for image size instead of immediates.
This saves us from having to recompile if the frame size changes.
2011-12-08 12:07:22 -05:00
erik
5ce5763da7 changed sense (and name) of values for affine xforms to match smoulder renders. 2011-11-23 07:54:48 -07:00
Steven Robertson
22fdc98128 Fix point swapping. 2011-11-20 10:08:13 -05:00
Steven Robertson
0f848b8bb8 Dither color when packing for deferred write. 2011-11-12 11:06:44 -05:00
Steven Robertson
24c0c8ee56 Fix some color foibles (more yet remain) 2011-11-12 10:42:02 -05:00
Steven Robertson
9ef5363652 Fix dumb overflow bug 2011-11-11 17:54:33 -05:00
Steven Robertson
eb43b151dc Deferred writeback. 2011-11-11 17:37:27 -05:00
Steven Robertson
7815c13ba4 Fix camera offset WRT flam3 2011-11-06 10:01:26 -05:00
Steven Robertson
0936e34b88 Fix cards stalling out on degenerate flames 2011-10-29 11:14:11 -04:00
Steven Robertson
185823ad55 Rearrange the main render loop... again.
Using one stream with two pagelocked host buffers allows us to keep the
GPU work queue full without pegging the CPU, and also reduces the
incidences where a host buffer will get overwritten before it can be
written. devtid() was flaky, so this patch also introduces a ringbuffer
to handle the 'slots' concept. It also introduces an adaptive number of
temporal samples, which improves efficiency but also killed the
assumption that (ntemporal_samples % 256 == 0), which required some
additional fixes.
2011-10-28 08:30:36 -04:00
Steven Robertson
f3a79b200c New badvals mechanism. 2011-10-27 12:59:58 -04:00
Steven Robertson
7c84c6a7a9 Final xform color *is* used after all 2011-10-27 08:46:55 -04:00
Steven Robertson
8939a6343a New genome representation, and device interp. 2011-10-25 15:44:39 -04:00
Steven Robertson
6c2df777b0 Remove a TODO 2011-10-16 13:52:01 -04:00
Steven Robertson
0cc904c4f1 Do post affine transforms. How did I miss this? 2011-10-16 13:43:46 -04:00
Steven Robertson
5111a0f05c Eliminate needless pre_ var separation 2011-10-16 13:42:37 -04:00
Steven Robertson
9ff018de87 Actually fix dithering. (I've seen this before...) 2011-10-15 19:08:16 -04:00
Steven Robertson
63483480d0 Bias the radius to avoid very large dither offsets 2011-10-15 00:50:24 -04:00
Steven Robertson
3be14547ea Use 3*256 instead of 2*512 blocks; faster on GF104 2011-10-15 00:33:37 -04:00
Steven Robertson
c7728d3507 Add faster no-L1 accum 2011-10-15 00:32:30 -04:00
Steven Robertson
dd645bcbf6 Use one dither offset per block. 2011-10-15 00:29:22 -04:00
Steven Robertson
83670df2c7 Fix random seeds. 2011-10-14 11:56:58 -04:00
Steven Robertson
b081bc9378 Remove a sync from iter.
A small but consistent improvement.
2011-10-11 14:56:23 -04:00
Steven Robertson
618b51b1b1 Speed enhancement: alpha packing.
When the alpha channel is used in a color palette, the code now replaces
the blue channel in the accumulation buffer with a pair of two U16s,
which encode the values of the blue and alpha channels as a fraction of
the value of the density. When the alpha channel is always 1.0, the blue
channel works as normal. Density is now always the last element in the
accumulation buffer.

Eliminating the separate IO operations improved total runtime by more
than 30% on my card, while the extra calculations reduced that to 20%
when alpha was present (though that can be optimized further).
2011-10-11 09:57:37 -04:00
Steven Robertson
46c6074b92 Use C++ pass-by-reference to explicitly share. 2011-10-03 16:53:29 -04:00
Steven Robertson
18a60ec066 Major bugfix. Also include thread-swapping that works. 2011-06-25 20:37:08 -04:00
Erik Reckase
44f897f28e fixed enable/disable chaos 'if chain' in iter kernel function 2011-06-24 09:59:14 -06:00
Erik Reckase
b732a3c244 now the chaos 'if chain' is only used if there are non-unity chaos entries in the genome. 2011-06-24 08:18:08 -06:00
Erik Reckase
50b664b1f9 chaos support \0/ 2011-06-24 06:09:04 -06:00
Erik Reckase
746185ce4d added support for pre_blur. all variations that start with 'pre_' will be applied to the
output of the affine transform before the other variation contributions are calculated.
2011-06-20 14:05:00 -06:00
Steven Robertson
702e303509 Gaussian dither 2011-06-13 23:20:39 -04:00
Steven Robertson
89b6732752 Skip the final xform when (re)joining the attractor 2011-06-12 19:29:10 -04:00
Steven Robertson
e79df46c66 Refactor API
--HG--
rename : cuburn/code/filter.py => cuburn/code/filtering.py
2011-06-11 15:59:10 -04:00
Steven Robertson
6f3c27007a Remove outdated MemBench stuff 2011-06-11 15:58:37 -04:00
Steven Robertson
94c453d153 Filter adjustments (density prefilter, gutter) 2011-06-11 15:58:15 -04:00