The data collector may have various handlers that may interact with a data source, which may be a database, web service, file system, collaboration system, or other source, and may store a identifying signature and content signature for each document or item.