"Therefore, in general, it is possible to detect the characteristic portion of this image information on the basis of the voice information that is attached to this image information. [0036] Accordingly, in the case where the image information is identified as that with a story such as a drama or the like, a producer of the image information, namely, a human being is capable of generating the summa" . . . . .