RSS Feed for This PostCurrent Article

Apache UIMA: Unstructured Information Management Architecture

UIMA is a framework and SDK for developing software systems that analyze large volumes of unstructured information in order to discover knowledge that is relevant to an end user.

As quoted from the website, an example UIM application might ingest plain text and identify entities, such as persons, places, organizations; or relations, such as works-for or located-at. UIMA enables such an application to be decomposed into components, for example “language identification” -> “language specific segmentation” -> “sentence boundary detection” -> “entity detection (person/place names etc.)”.

UIMA is a component framework for analysing unstructured content such as text, audio and video. It comprises an SDK and tooling for composing and running analytic components written in Java and C++, with some support for Perl, Python and TCL.


Trackback URL


Sorry, comments for this entry are closed at this time.