Apache UIMA Java SDK 3.3.1
Apache UIMA (Unstructured Information Management Architecture) v3.3.1 Release Notes
Contents
What is UIMA?
Notable changes in this release
How to Get Involved
How to Report Issues
What is UIMA?
Unstructured Information Management applications are software systems that analyze large volumes of
unstructured information in order to discover knowledge that is relevant to an end user. UIMA is a
framework and SDK for developing such applications. An example UIM application might ingest plain
text and identify entities, such as persons, places, organizations; or relations, such as works-for
or located-at. UIMA enables such an application to be decomposed into components, for example
"language identification" -> "language specific segmentation" -> "sentence boundary detection" ->
"entity detection (person/place names etc.)". Each component must implement interfaces defined by
the framework and must provide self-describing metadata via XML descriptor files. The framework
manages these components and the data flow between them. Components are written in Java or C++; the
data that flows between components is designed for efficient mapping between these languages. UIMA
additionally provides capabilities to wrap components as network services, and can scale to very
large volumes by replicating processing pipelines over a cluster of networked nodes.
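The decomposition described above can be illustrated with a minimal plain-Java sketch. It deliberately does not use the UIMA API: the `Document` and `Component` types, the three toy components, and the `run` method are hypothetical names invented for this illustration, and in real UIMA the shared data structure and the component interfaces are defined by the framework and configured via XML descriptors.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

public class PipelineSketch {

    /** Hypothetical shared data passed between components (a stand-in, not the UIMA data model). */
    static final class Document {
        final String text;
        final List<String> annotations = new ArrayList<>();

        Document(String text) {
            this.text = text;
        }
    }

    /** Hypothetical component contract; real UIMA components implement framework interfaces. */
    interface Component {
        void process(Document doc);
    }

    /** Toy language identification: always reports English. */
    static final class LanguageIdentifier implements Component {
        public void process(Document doc) {
            doc.annotations.add("language=en");
        }
    }

    /** Toy sentence boundary detection: splits on whitespace that follows '.', '!' or '?'. */
    static final class SentenceDetector implements Component {
        public void process(Document doc) {
            for (String sentence : doc.text.split("(?<=[.!?])\\s+")) {
                if (!sentence.isEmpty()) {
                    doc.annotations.add("sentence=" + sentence);
                }
            }
        }
    }

    /** Toy entity detection: looks tokens up in a small fixed name list. */
    static final class EntityDetector implements Component {
        private final Set<String> knownNames = new HashSet<>(Arrays.asList("Alice", "Paris"));

        public void process(Document doc) {
            for (String token : doc.text.split("\\W+")) {
                if (knownNames.contains(token)) {
                    doc.annotations.add("entity=" + token);
                }
            }
        }
    }

    /** Runs the components in a fixed order over one document and returns the annotations. */
    public static List<String> run(String text) {
        Document doc = new Document(text);
        List<Component> pipeline = Arrays.asList(
                new LanguageIdentifier(), new SentenceDetector(), new EntityDetector());
        for (Component component : pipeline) {
            component.process(doc);
        }
        return doc.annotations;
    }

    public static void main(String[] args) {
        for (String annotation : run("Alice works in Paris. She likes it.")) {
            System.out.println(annotation);
        }
    }
}
```

Each component only reads from and appends to the shared document, so components can be reordered or swapped without touching one another. This is the property the framework relies on when it manages the data flow between them.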
Apache UIMA is an Apache-licensed open source implementation of the UIMA specification (that
specification is, in turn, being developed concurrently by a technical committee within
OASIS, a standards organization). We invite and encourage you to
participate in both the implementation and specification efforts.
UIMA is a component framework for analysing unstructured content such as text, audio and video. It
comprises an SDK and tooling for composing and running analytic components written in Java and C++,
with some support for Perl, Python and TCL.
Notable changes in this release
This is a bug fix release.
Bugs fixed
- 🦟 Issue #255: File handle leak accessing performanceTuning.properties
- 🦟 Issue #240: Helper annotation created by SelectFS should not survive
- 🦟 Issue #238: Form 6 serializes non-reachable FSes but should not
- 🦟 Issue #235: Misleading error message when JCas type is not registered
- 🦟 [UIMA-6479] PearPackagingMavenPlugin has ancient JUnit dependency
- 🦟 [UIMA-6473] CasToComparableText is broken
Improvements
- ⭐️ Issue #222: Support comparing test files irrespective of line endings
- ⭐️ [UIMA-6480] Add tests with empty arrays to CAS de/ser-suite
Refactoring
- ⚙️ [UIMA-6454] Update dependencies
- ⚙️ [UIMA-6463] Use toolchains to ensure compatibility with Java 1.8
- ⚙️ [UIMA-6469] Cleaning up file handling code
For a full list of issues affecting this release, please see:
How to Get Involved
The Apache UIMA project really needs and appreciates any contributions, including documentation
help, source code and feedback. If you are interested in contributing, please visit
http://uima.apache.org/get-involved.html.
How to Report Issues
The Apache UIMA project uses GitHub for issue tracking. Please report any issues you find at
https://github.com/apache/uima-uimaj/issues.
Download
The official release comes with verifiable signatures and hashes and can be downloaded from the Apache UIMA homepage: https://uima.apache.org/downloads.cgi
Note in particular that the "Source code" links below do not provide official release artifacts. They are provided only as a convenience by GitHub.
Convenience binaries are provided via Maven Central: https://search.maven.org/search?q=g:org.apache.uima
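As a rough illustration of checking a downloaded artifact against its published hash, the following sketch computes a file's digest with the standard `java.security.MessageDigest` class and compares it to an expected value. It assumes the published hash is a SHA-512 hex string; the class name `ChecksumSketch` and its methods are hypothetical, and the file path and expected hash are supplied by the caller. It does not verify the PGP signature, which requires a separate tool such as GnuPG.

```java
import java.io.IOException;
import java.io.InputStream;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.security.MessageDigest;
import java.security.NoSuchAlgorithmException;

public class ChecksumSketch {

    /** Computes the SHA-512 digest of a file and returns it as lowercase hex. */
    public static String sha512Hex(Path file) throws IOException, NoSuchAlgorithmException {
        MessageDigest digest = MessageDigest.getInstance("SHA-512");
        try (InputStream in = Files.newInputStream(file)) {
            byte[] buffer = new byte[8192];
            int read;
            // Stream the file so large archives need not fit in memory.
            while ((read = in.read(buffer)) != -1) {
                digest.update(buffer, 0, read);
            }
        }
        StringBuilder hex = new StringBuilder();
        for (byte b : digest.digest()) {
            hex.append(String.format("%02x", b));
        }
        return hex.toString();
    }

    /** Compares the file's digest with the expected hex string, ignoring case and outer whitespace. */
    public static boolean matches(Path file, String expectedHex)
            throws IOException, NoSuchAlgorithmException {
        return sha512Hex(file).equalsIgnoreCase(expectedHex.trim());
    }

    public static void main(String[] args) throws Exception {
        // args[0]: path to the downloaded artifact; args[1]: hex hash copied from the download page.
        if (args.length != 2) {
            System.err.println("usage: java ChecksumSketch <file> <expected-sha512-hex>");
            return;
        }
        boolean ok = matches(Paths.get(args[0]), args[1]);
        System.out.println(ok ? "checksum OK" : "checksum MISMATCH");
    }
}
```

The sketch assumes the expected value is the bare hex string. If the checksum file also contains a file name, only the hash portion should be passed in.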