Fixed map from {\'{E}} to É rather than Έ#12
Fixed map from {\'{E}} to É rather than Έ#12fuhrmanator wants to merge 4 commits intopcooksey:masterfrom
Conversation
pcooksey
left a comment
There was a problem hiding this comment.
If you search the string there are actually two "\'{E}" and the first one already had the value "\u00C9". So I think removing the second one rather than changing it is better. I looked online and it seems the unicode \u0388 and \u00c9 both produce the same latex (https://www.johndcook.com/unicode_latex.html).
This reverts commit 58ef266.
Based on the translation for Ί U+038A to \'{}{I}, I added a set of empty
{} after the \' for several capital greek (with tonos) characters, such
as Ό u+038C and Έ u+0388. Note that these LaTeX commands won't produce
the true unicode character, but rather an apostrophe followed by the
capital letter. Not being an expert in Greek, I'm not sure how likely
these encodings would appear in a BibTeX file.
|
I changed some things regarding the duplicates. BibTeX encoding of accents technically is really limited. The unicode-latex translation site you provided seems ambitious, and likely out of the scope of BibTeX (the original encodings were from 1988!). If the goal is to take a raw BibTeX file and convert it to a web page, it might make sense to assume very little is going on in the BibTeX. Currently, JabRef converts BibTeX files in unicode to standard LaTeX encoding, so that might be a place to start. Lots of eyes have looked at https://github.com/JabRef/jabref/blob/master/src/main/java/org/jabref/logic/util/strings/HTMLUnicodeConversionMaps.java for example. However, I see it has mappings to things like |
|
JabRef recently switched to latex2unicode which covered more cases and handled exceptions better then our own conversion algorithm. Maybe the following map is helpful: https://github.com/tomtung/latex2unicode/blob/master/src/main/scala/com/github/tomtung/latex2unicode/helper/Escape.scala |
No description provided.