text processing
TextGuru
Posted August 8th, 2007 by thoughtworks

TextGuru is a text processing library for summarization, categorization, key phrase generation, part-of-speech tagging, identification of place and personal names in input text, and sentence boundary detection. It works on a variety of document formats (Microsoft Word™, Microsoft PowerPoint™, PDF™, OpenOffice.org Writer, AbiWord™, HTML, and plain text).