cuburn

mirror of https://github.com/stevenrobertson/cuburn.git synced 2025-08-01 13:05:25 -04:00

Go to file

Steven Robertson 094890c324 Use shared memory for iter_count and have each CP processed by only one CTA.

Slower, but the code is a bit simpler conceptually, and the difference will be
more than accounted for by better scheduling towards the end of the process.

2010-09-07 14:54:50 -04:00

cuburnlib

Use shared memory for iter_count and have each CP processed by only one CTA.

2010-09-07 14:54:50 -04:00

helpers

Initial commit.

2010-08-27 12:28:02 -04:00

bench.py

Add the first of many microbenchmarks

2010-09-07 12:44:12 -04:00

main.py

Finally runs again

2010-09-06 11:18:20 -04:00

TODO

Switch from to_inject() to object insertion. One less kludge to deal with.

2010-09-06 16:09:37 -04:00