The quality of text mining or text analytics relies on good natural language processing (linguistic) tools if you want to go beyond classification to actually grasp the meaning of the words and be able to retrieve, extract and translate text. There are different methodologies and technologies for building such tools, one of which is finite-state technology (FST). We've made FST tools available on the Open Xerox virtual lab to tokenize text, perform morphological analysis and do part-of-speech disambiguation. Try them here