Capturing video structure with mixture of probabilistic index maps