I'm also reading this
2009 paper by Klein where he explain how he managed to run PTAM realtime on iPhone, despite the 15x-30x slower CPU.
Combining a few tricks both from this paper and from the ROS team, I bet you could easily reduce the PTAM computation cost from 20ms to 8-10ms, meaning a global latency of 20ms (PS3 Eye+PTAM) without using gyros
Also we can reduce the mapping thread priority without affecting the realtime tracking performance (as Klein did on the iPhone port), meaning the tracking process would only take one full core, leaving the other cores to video game computation.
edit: I estimated the performance gain from the ROS team:
ROS team benchmarked their implementation @20Hz on an ATOM 1.6Ghz (source: ROS website)
ATOM 1.6Ghz scored 303 on PassMark benchmark (source: cpubenchmark.net)
The original PTAM was benchmarked @50Hz on a Core2Duo 2.66Ghz (source: 2007 paper)
Core2 Duo 2.66Ghz scored 1675 on PassMark benchmark (source: cpubenchmark.net)
Thus, the performance gain is 20/50*1675/303=2,2.
So on a Core2 Duo 2.66Ghz their implementation should take 20ms/2.2=9ms
So just with the ROS PTAM implementation, we're already reaching the 20ms latency
Nick3DvB, you talked about a win32 implementation of ROS PTAM, have you seen it somewhere ?