Train models for each major entity class #6

Open · 9 tasks
JohnGiorgi opened this issue Aug 2, 2018 · 3 comments
Labels: enhancement (New feature or request), production
@JohnGiorgi (Contributor) commented Aug 2, 2018

Need to train models for each major entity class: PRGE, LIVB, DISO, CHED. The first three are fairly straightforward. As for the last, the entity annotations have multiple levels of granularity; for now, we might just cheat and collapse everything under the CHED tag.
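
To make the collapsing step concrete, something like the sketch below might work. The fine-grained label names are hypothetical placeholders, not the actual tag set of any particular corpus:

```python
# Minimal sketch of collapsing fine-grained chemical annotations under a
# single CHED tag. The fine-grained labels below are hypothetical examples.
FINE_TO_COARSE = {
    "CHEMICAL": "CHED",
    "SIMPLE_CHEMICAL": "CHED",
    "DRUG": "CHED",
}

def collapse_tag(bio_tag: str) -> str:
    """Map a BIO tag like 'B-SIMPLE_CHEMICAL' to its coarse form 'B-CHED'."""
    if bio_tag == "O":
        return bio_tag
    prefix, _, label = bio_tag.partition("-")
    return f"{prefix}-{FINE_TO_COARSE.get(label, label)}"
```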

For relations, we are at the mercy of what datasets are available. Right now, we could train a model for adverse drug events using the ADE corpus.

There should be a base and large version for each model. In the case of BERT, this would correspond to whether the BERT base or large model was used. Any model not implemented should raise a NotImplementedError (see #155).

Finally, the model names should follow a convention. Maybe [model-name]-[entity or relation]-[base or large], e.g. bert-for-ner-prge, bert-for-ner-prge-lg. See PyTorch Transformers or spaCy for inspiration.
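
As a rough sketch of how the naming convention and the NotImplementedError behaviour could fit together (the registry contents and the load_model helper below are hypothetical, not existing Saber code):

```python
# Hypothetical registry of pretrained model names following the
# [model-name]-[entity or relation]-[base or large] convention.
AVAILABLE_MODELS = {
    "bert-for-ner-prge",
    "bert-for-ner-prge-lg",
}

def load_model(name: str):
    """Return the pretrained model registered under `name`."""
    if name not in AVAILABLE_MODELS:
        raise NotImplementedError(
            f"'{name}' is not an available pretrained model. "
            f"Choose one of: {sorted(AVAILABLE_MODELS)}"
        )
    # ... download / load the weights and return the model here
```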

BERT

Entities

  • Train PRGE-base
  • Train PRGE-large
  • Train LIVB-base
  • Train LIVB-large
  • Train DISO-base
  • Train DISO-large
  • Train CHED-base
  • Train CHED-large

Relations

  • Train ADE
JohnGiorgi added the enhancement label Aug 2, 2018
JohnGiorgi self-assigned this Aug 2, 2018
JohnGiorgi pinned this issue Jan 22, 2019
@nleguillarme commented:

Hi @JohnGiorgi.

I am currently working on a review of taxon mention recognition tools for ecological information extraction, and I have just discovered Saber, which I'd like to include as an example of a state-of-the-art deep-learning-based approach.

Unfortunately, it seems that the LIVB pre-trained model does not exist at the moment. Any idea when it might be available? Or should I consider training my own model?

Thank you for your help.

@JohnGiorgi (Contributor, Author) commented:

Hi @nleguillarme,

Thanks for your interest. Unfortunately, we are no longer maintaining the project. I would suggest checking out AllenNLP, Transformers, or scispaCy for state-of-the-art NER. scispaCy has pretrained models that will detect organism names (see the model trained on BIONLP13CG specifically).
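
For example, something along these lines should work with scispaCy, assuming the en_ner_bionlp13cg_md model is installed; ORGANISM is, as far as I know, the label that tag set uses for taxa:

```python
# Minimal sketch: organism mention detection with a scispaCy model trained
# on BIONLP13CG. Assumes the en_ner_bionlp13cg_md package has been installed
# (pip install scispacy plus the model wheel from the scispaCy releases).
import spacy

nlp = spacy.load("en_ner_bionlp13cg_md")
doc = nlp("Arabidopsis thaliana is a model organism for plant biology.")

for ent in doc.ents:
    # Keep only taxon mentions (ORGANISM label in the BIONLP13CG tag set).
    if ent.label_ == "ORGANISM":
        print(ent.text, ent.label_)
```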

@nleguillarme commented:

Too bad the project is dead; it seemed like a great tool.
Thanks for the pointers.
