mirror of
https://github.com/stevenrobertson/cuburn.git
synced 2026-05-11 18:40:20 -04:00
There are 16 bar.sync() registers available per *chip*, not per block, and I was using number 8 in the shuffle code. Evidently the driver rewrites them per SM, but does not compact their range. Good to know.