Particular embodiments relate to electronic archive data management and more specifically to a data management system configured to classify, analyze and query data maintained in unstructured format such as file systems, web logs, wikis, email text, image, audio, video and other multimedia data