ctakes-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Jeffrey Miller <jeff...@gmail.com>
Subject DefaultJCasTermAnnotator behavior with period and semicolon in UMLS terms
Date Wed, 05 Feb 2020 17:24:55 GMT
Hi,

I've noticed that if a term contains a period or a semicolon, as an
example, from the sno_rx_16ab dictionary, "antibody ; toxoplasma", that
this will not be found if the semicolon is attached to the first word, but
will be found if it is either "antibody ; toxoplasma" or "antibody
;toxoplasma". There is similar behavior with a period in the same place. My
first instinct was that this had to do with the sentence splitter and
sentences being the default lookup window. I found an older discussion
about this in reference to periods in genes, but it was from a while back.
Just curious if anyone has dealt with this issue.

Thanks,
Jeff

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message