. . . "a) means for acquiring facial images of the people from first input images captured by at least a first means for capturing images, including a visual sensing device," .