a dual G5. This was on a simple microbenchmark that made use of smp_wmb for store ordering, but it did not involve any IO access (which presumably would disadvantage eieio further). Given the G5 speedup, I'd be surprised if there is not an improvment on POWER4 and 5 as well, although no idea about POWER6 or cell...