. "For each sample, the text is first pre-processed by deleting spaces and characters that are not language-specific, such as URLs or numbers." . . . . . .