d38f63d
amedsaid1831.dw042.perseus-eng1.conllu was run through a processing pipeline from https://github.com/scaife-viewer/ogl-pdl-annotations.
The pipeline has no knowledge of the original structure of the input document, so it creates a "virtual" exemplar in which each chunk is a sentence.
e.g. https://beyond-translation.perseus.org/reader/urn:cts:greekLit:tlg0012.tlg001.parrish-eng1-trees:1
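To illustrate what "each chunk is a sentence" means in practice, here is a minimal sketch in Python. It is not the code from ogl-pdl-annotations; the function names `split_sentences` and `sentence_chunks` are hypothetical, and the sequential numbering of refs is an assumption. It relies only on the CoNLL-U conventions that sentences are separated by blank lines, comment lines start with `#`, and the word form is the second tab-separated column.

```python
def split_sentences(conllu_text):
    """Split CoNLL-U text into sentences (blocks separated by blank lines)."""
    sentences = []
    current = []
    for line in conllu_text.splitlines():
        if line.strip():
            current.append(line)
        elif current:
            sentences.append(current)
            current = []
    if current:
        sentences.append(current)
    return sentences


def sentence_chunks(conllu_text):
    """Return (ref, text) pairs, one per sentence.

    Assumption: refs are just sequential sentence numbers, because the
    original line structure of the document is not available here.
    """
    chunks = []
    for ref, lines in enumerate(split_sentences(conllu_text), start=1):
        forms = []
        for line in lines:
            if line.startswith("#"):
                continue  # sentence-level comment, not a token
            cols = line.split("\t")
            # Skip multiword token ranges (e.g. 1-2) and empty nodes (e.g. 1.1).
            if "-" in cols[0] or "." in cols[0]:
                continue
            forms.append(cols[1])
        chunks.append((str(ref), " ".join(forms)))
    return chunks
```

With this approach a sentence that spans two lines of the source text still becomes a single chunk, which is why the chunk boundaries can drift away from the line boundaries.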
This is not ideal for our purposes.
There is nearly a 1:1 relationship between sentences and lines, except for this sentence:
This is where a MISC annotation or comment in the UD file may be helpful. We'll explore this in the next commit.
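As a rough sketch of the MISC idea, the snippet below groups tokens by a reference stored in the MISC column (the tenth column, formatted as `Key=Value` pairs separated by `|`, or `_` when empty). The key name `Ref`, the helper names, and the rule that a token without the key inherits the most recent reference are all assumptions made for illustration, not something the pipeline or the UD file currently provides.

```python
def parse_misc(misc):
    """Parse a CoNLL-U MISC value such as 'Ref=1.1|SpaceAfter=No' into a dict."""
    if misc == "_":
        return {}
    pairs = (item.split("=", 1) for item in misc.split("|") if "=" in item)
    return {key: value for key, value in pairs}


def group_by_ref(token_lines, key="Ref"):
    """Group token forms by a hypothetical MISC key that marks the original line.

    Assumption: a token without the key belongs to the most recent ref seen.
    Tokens before any ref are grouped under None.
    """
    groups = {}
    current = None
    for line in token_lines:
        cols = line.split("\t")
        current = parse_misc(cols[9]).get(key, current)
        groups.setdefault(current, []).append(cols[1])
    return groups
```

Under these assumptions, a sentence that crosses a line boundary would only need the key on the first token of each line for the original line structure to be recovered.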