receiving a spoken query from a user of an information retrieval system, the system including a language model and an index constructed from tokenized phrases and at least one tokenized start phrase marker or end phrase marker, the phrases created by parsing a plurality of documents into a hierarchical set of phrases, each of the plurality of documents including text;