Department of Computer Sciences, College of Computer and Information Sciences, Nourah bint Abdulrahman University, P.O. Box 84428, Saudi Arabia;
Department of Information Systems, College of Computer and Information Sciences, Saud University, Saudi Arabia;
Text classification; language-independent tokenization; subword tokenization;