Commit Graph

206 Commits

Author SHA1 Message Date
be31708c09 Fix memory corruption bug (overshoot in colorclip) 2011-10-25 15:43:05 -04:00
efc2ac23e2 Fix rendering at insane resolutions 2011-10-19 14:17:01 -04:00
3466113d64 Fix --renumber 2011-10-17 22:35:22 -04:00
68f0e0aa9e Use milliseconds in sleep timeouts, not seconds 2011-10-17 20:51:09 -04:00
20520d2f69 Open primes.bin in binary mode. 2011-10-17 19:31:09 -04:00
6144601259 Add --nopause option for batch with preview 2011-10-16 13:52:27 -04:00
6c2df777b0 Remove a TODO 2011-10-16 13:52:01 -04:00
966d794881 More correct sequence options 2011-10-16 13:51:39 -04:00
0fc80889c9 Show FPS, and don't clobber the time module 2011-10-16 13:51:04 -04:00
8ce2470dfb Relax FUSE a little (no visible impact so far) 2011-10-16 13:45:27 -04:00
c4ce3cf4c2 Don't crash on empty render(times) 2011-10-16 13:44:22 -04:00
0cc904c4f1 Do post affine transforms. How did I miss this? 2011-10-16 13:43:46 -04:00
5111a0f05c Eliminate needless pre_ var separation 2011-10-16 13:42:37 -04:00
9bafbda81a Refactor host rendering code for better load 2011-10-15 22:22:43 -04:00
8e99c9c463 Add --resume 2011-10-15 22:21:49 -04:00
6e6321bb76 Fix reading from stdin 2011-10-15 22:20:00 -04:00
702137dfda Fix inconsistent width detection. 2011-10-15 22:18:17 -04:00
9ff018de87 Actually fix dithering. (I've seen this before...) 2011-10-15 19:08:16 -04:00
92708b32ba Add raw output for direct x264 encoding 2011-10-15 14:31:56 -04:00
63483480d0 Bias the radius to avoid very large dither offsets 2011-10-15 00:50:24 -04:00
3be14547ea Use 3*256 instead of 2*512 blocks; faster on GF104 2011-10-15 00:33:37 -04:00
c7728d3507 Add faster no-L1 accum 2011-10-15 00:32:30 -04:00
dd645bcbf6 Use one dither offset per block. 2011-10-15 00:29:22 -04:00
83670df2c7 Fix random seeds. 2011-10-14 11:56:58 -04:00
7f1cd5888e More documentation updates 2011-10-13 17:32:55 -04:00
30a8d2bfa8 Put sleep with the debug opts 2011-10-13 17:23:48 -04:00
3714e2b170 Fix filename formatter to be actually helpful. 2011-10-13 17:12:28 -04:00
b168a2431e 32-bit compatibility (I think?) 2011-10-13 16:56:20 -04:00
14872ee6ed Add --sleep for slightly more usable system 2011-10-13 16:55:26 -04:00
e6e2c4a8d7 Add --sync option. 2011-10-13 07:53:55 -04:00
9a348f0517 Remove obsolete bench.py 2011-10-12 14:40:41 -04:00
53127ffe7f Move sortbench to helpers/ 2011-10-12 14:39:10 -04:00
d08b983542 Modernize the runner; add preview support; etc. 2011-10-12 14:09:13 -04:00
4834c9fdfa Change synchronization model. 2011-10-12 14:08:13 -04:00
81f61d4d5d Improve asynchrony; improve palette interp perf. 2011-10-12 14:07:28 -04:00
7b9bb165ac Change the way compile options are handled 2011-10-12 14:02:32 -04:00
f04ad7ab68 Performance improvements in Genome.__init__() 2011-10-12 13:57:43 -04:00
0f615bd98b Performance improvements in affine helpers 2011-10-12 13:56:34 -04:00
d409f02e4a Precompile accessors. 2011-10-12 11:50:07 -04:00
    This improves packing speed by 8x, which is visible on small or
    low-quality frames.
a12714f4c4 Fix MWC test 2011-10-12 07:36:07 -04:00
72ca02032a Fix flicker (and a few other things). 2011-10-11 20:58:09 -04:00
9b03f557c2 Fix missing control points in async version. 2011-10-11 20:54:33 -04:00
    The allocation pool was reallocating the same frame as soon as it had
    left the current scope, before it had been copied. We just reallocate
    the same chunks. I don't think this has any real performance impact but
    this can be verified.
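
The failure mode described in 9b03f557c2 (a pooled buffer handed out again before its contents were copied) can be reproduced with a toy model. This is only an illustrative sketch in plain Python, not the project's actual allocation pool: `BufferPool`, `render_frames`, and the `release_early` flag are hypothetical names, and the "copy" is a deferred `bytes()` call standing in for an asynchronous transfer.

```python
class BufferPool:
    """Hypothetical fixed-size buffer pool that reuses released buffers."""

    def __init__(self, size):
        self._size = size
        self._free = []

    def acquire(self):
        # Reuse a released buffer if one exists, otherwise allocate.
        return self._free.pop() if self._free else bytearray(self._size)

    def release(self, buf):
        self._free.append(buf)


def render_frames(pool, values, release_early):
    """Fill one buffer per value, then copy all buffers out afterwards.

    The deferred copy stands in for an asynchronous transfer that has
    not finished by the time the next frame is prepared.
    """
    pending = []
    for v in values:
        buf = pool.acquire()
        buf[:] = bytes([v]) * len(buf)
        pending.append(buf)
        if release_early:
            # Bug: the buffer goes back to the pool before it is copied,
            # so the next frame overwrites it.
            pool.release(buf)
    results = [bytes(b) for b in pending]  # the deferred copy
    if not release_early:
        for b in pending:
            pool.release(b)  # safe: contents were already copied
    return results


buggy = render_frames(BufferPool(4), [1, 2, 3], release_early=True)
fixed = render_frames(BufferPool(4), [1, 2, 3], release_early=False)
```

With early release every result holds the last frame's data, because all three frames shared one buffer. Holding each buffer until after its copy keeps the three frames distinct.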
b081bc9378 Remove a sync from iter. 2011-10-11 14:56:23 -04:00
    A small but consistent improvement.
095936666e Actually asynchronous rendering. 2011-10-11 11:27:40 -04:00
    This change didn't affect GPU performance at all, but it did improve CPU
    startup time, and should also improve time for long-running animations.
8c7e86c7c7 Fixed fraction to not exceed range 2011-10-11 11:26:38 -04:00
618b51b1b1 Speed enhancement: alpha packing. 2011-10-11 09:57:37 -04:00
    When the alpha channel is used in a color palette, the code now replaces
    the blue channel in the accumulation buffer with a pair of two U16s,
    which encode the values of the blue and alpha channels as a fraction of
    the value of the density. When the alpha channel is always 1.0, the blue
    channel works as normal. Density is now always the last element in the
    accumulation buffer.

    Eliminating the separate IO operations improved total runtime by more
    than 30% on my card, while the extra calculations reduced that to 20%
    when alpha was present (though that can be optimized further).
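
The packing described in 618b51b1b1 can be sketched roughly as follows. This is an illustration under assumptions, not the commit's actual code: the function names are hypothetical, and the bit layout (blue in the high 16 bits, alpha in the low 16), the clamping, and the rounding are guesses. The commit message only says that blue and alpha are stored as two U16 fractions of the density.

```python
U16_MAX = 0xFFFF


def pack_blue_alpha(blue, alpha, density):
    """Pack blue and alpha, each as a fraction of density, into 32 bits.

    Assumed layout: blue fraction in the high 16 bits, alpha fraction
    in the low 16 bits.
    """
    if density <= 0:
        return 0
    # Clamp the fractions to [0, 1] so each fits in an unsigned 16-bit field.
    blue_frac = min(max(blue / density, 0.0), 1.0)
    alpha_frac = min(max(alpha / density, 0.0), 1.0)
    blue_u16 = round(blue_frac * U16_MAX)
    alpha_u16 = round(alpha_frac * U16_MAX)
    return (blue_u16 << 16) | alpha_u16


def unpack_blue_alpha(word, density):
    """Recover approximate blue and alpha sums from the packed word."""
    blue_u16 = (word >> 16) & U16_MAX
    alpha_u16 = word & U16_MAX
    return (blue_u16 / U16_MAX * density, alpha_u16 / U16_MAX * density)


word = pack_blue_alpha(5.0, 10.0, 10.0)
blue, alpha = unpack_blue_alpha(word, 10.0)
```

Storing fractions rather than absolute sums lets both channels share one 32-bit slot, at the cost of 16-bit precision relative to the density and the extra arithmetic the commit message mentions.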
a052f7f4c5 Add README 2011-10-03 17:37:32 -04:00
6959729ea1 Add quick debug option 2011-10-03 17:10:38 -04:00
46c6074b92 Use C++ pass-by-reference to explicitly share. 2011-10-03 16:53:29 -04:00
72dbae1ebe Done. The Boost version is much faster, alas. 2011-08-31 13:24:44 -04:00