There appears to be some bad interaction with the append/consume counters on cayman (and compute shaders at least). I traced fglrx and it appears it directly uses GDS memory. This adds cayman specific paths to directly use GDS memory for these atomics. Dave.