Skip to content

v2.4.0

Compare
Choose a tag to compare
@khituras khituras released this 29 Mar 13:09
· 697 commits to master since this release

This release sees quite a few bug fixes and streamline in various components.
Most importantly, there are a few new components:

  • jcore-annotation-adder-ae: Reads annotations from external sources and adds them to the CAS. Helpful for the integration of annotations that could not be created from within UIMA.
  • jcore-db-checkpoint-ae: This is a JeDIS component. It helps to set checkpoints within a pipeline that are stored in the database and thus provide collection processing progression information. This component can also mark the end of a pipeline, setting documents that have reached this point in a pipeline to is_processed and removing the is_in_process flag.
  • jcore-bc2gm-reader: Allows to read the original format used for the data of the BioCreative II Gene Mention challenge.
  • jcore-bc2gmformat-writer writes CAS annotation in the BC2GM format.
  • jcore-lingscope-ae wraps the Lingscope code as a UIMA AE. The component needs to be further configured regarding which models to use, possibly requiring downloads from the original project. One concrete configuration has been made available for JCoRe in the jcore-projects repository.
  • jcore-mutationfinder-ae wraps the MutationFinder algorithm as a UIMA component. Finds single point gene mutations via regular expressions in text and normalizes them.
  • jcore-ppd-writer writes CAS annotation data into the piped format (that is, tokens that are annotated with meta data, separated by the pipe character: The|DET blue|ADJ house|NN.|.). This format is used be the JNET named entity recognizer for training.