. . . "a) acquiring facial images of the people from first input images captured by at least a first means for capturing images, including a visual sensing device," .