As another alternative, a learning processing is effected in such a manner as to extract an object directly from an image by use of a neural network, and a candidate image of a moving object is directly extracted by use of the result of learning, so that a shade image of an input corresponding to the particular area is extracted as a template.