. . . "Such tests and tasks should be pre-tested to see, for example, whether they produce the expected kinds of language, and whether the marking criteria are appropriate. (For more about marking criteria see Alderson, Clapham & Wall 1995: Chapter 5, and Weigle 2002.)" .