This sounds like an attractive solution, but even if we overlook the untested legal situation and the notable financial issues (a scraper would require access to a form of the work in which the text can be copied, which may not be free), there remain some very serious and difficult problems in automatically extracting meaningful data from it.