The underlying process for determining the special vocabulary used in the corpus domain model is term and collocation parsing (FIG. 6).

[0064] Term parsing 110, 115 is a process of uncovering the specialized vocabulary of a particular subject domain.
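
By way of non-limiting illustration only, one way term and collocation parsing might be sketched is shown below. The description above does not specify a particular algorithm, so everything in this sketch is an assumption made for illustration: the function names `tokenize`, `candidate_terms`, and `candidate_collocations`, the use of a general reference text for comparison, the frequency-ratio test, and the `min_count` and `ratio` thresholds are all hypothetical and are not taken from the disclosed embodiment.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphabetic word tokens."""
    return re.findall(r"[a-z]+", text.lower())


def candidate_terms(domain_text, reference_text, min_count=2, ratio=2.0):
    """Return words that are markedly more frequent in the domain text
    than in a general reference text (an assumed heuristic, not the
    method of the description)."""
    dom = Counter(tokenize(domain_text))
    ref = Counter(tokenize(reference_text))
    dom_total = sum(dom.values()) or 1
    ref_total = sum(ref.values()) or 1
    terms = []
    for word, count in dom.items():
        if count < min_count:
            continue  # too rare in the domain text to be a reliable candidate
        dom_rate = count / dom_total
        # Add-one smoothing so words absent from the reference text
        # do not cause a division by zero.
        ref_rate = (ref[word] + 1) / (ref_total + 1)
        if dom_rate / ref_rate >= ratio:
            terms.append(word)
    return sorted(terms)


def candidate_collocations(domain_text, min_count=2):
    """Return adjacent word pairs that recur in the domain text.
    Pairs are counted across sentence boundaries and without any
    stop-word filtering, which a fuller implementation would likely add."""
    tokens = tokenize(domain_text)
    pairs = Counter(zip(tokens, tokens[1:]))
    return sorted(pair for pair, count in pairs.items() if count >= min_count)
```

In this sketch a word is treated as a candidate domain term when its relative frequency in the domain text exceeds its smoothed relative frequency in the reference text by the chosen ratio, and a word pair is treated as a candidate collocation when it recurs at least `min_count` times. Other selection criteria could equally be used.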