The more latch activity, the more CPU is used and the less scalable is a system, since the concurrent processes will have to wait for each other at the serialization points.So the inefficient statement reported more than 3,000,000 latches, whereas the simple table scan required only approx. 50,000.