Therefore, since the TFIDF value has the property of being small for a character string that appears frequently and in many documents (a conjunction, an auxiliary word, etc.) and large for a character string that appears only in a specific document and appears frequently in that document, the character strings in a document can be converted into numerical values by the TFIDF, and thus the document