At its simplest level, this approach breaks down web pages into different blocks of meaning, based upon the way we might actually see a page, with blocks of text and pictures, and line breaks and white space, and other separators of text and images and other content.