From commits-return-5127-apmail-ctakes-commits-archive=ctakes.apache.org@ctakes.apache.org Wed Oct 31 11:04:16 2018 Return-Path: X-Original-To: apmail-ctakes-commits-archive@www.apache.org Delivered-To: apmail-ctakes-commits-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 0D09A18BDD for ; Wed, 31 Oct 2018 11:04:16 +0000 (UTC) Received: (qmail 65554 invoked by uid 500); 31 Oct 2018 11:04:16 -0000 Delivered-To: apmail-ctakes-commits-archive@ctakes.apache.org Received: (qmail 65439 invoked by uid 500); 31 Oct 2018 11:04:15 -0000 Mailing-List: contact commits-help@ctakes.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@ctakes.apache.org Delivered-To: mailing list commits@ctakes.apache.org Received: (qmail 64775 invoked by uid 99); 31 Oct 2018 11:04:15 -0000 Received: from Unknown (HELO svn01-us-west.apache.org) (209.188.14.144) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 31 Oct 2018 11:04:15 +0000 Received: from svn01-us-west.apache.org (localhost [127.0.0.1]) by svn01-us-west.apache.org (ASF Mail Server at svn01-us-west.apache.org) with ESMTP id 09E673A26F8 for ; Wed, 31 Oct 2018 11:04:14 +0000 (UTC) Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: 8bit Subject: svn commit: r1845321 [31/42] - in /ctakes/trunk/ctakes-web-rest: ./ src/ src/main/ src/main/java/ src/main/java/org/ src/main/java/org/apache/ src/main/java/org/apache/ctakes/ src/main/java/org/apache/ctakes/rest/ src/main/java/org/apache/ctakes/rest/f... Date: Wed, 31 Oct 2018 11:04:01 -0000 To: commits@ctakes.apache.org From: gandhirajan@apache.org X-Mailer: svnmailer-1.0.9 Message-Id: <20181031110414.09E673A26F8@svn01-us-west.apache.org> Added: ctakes/trunk/ctakes-web-rest/src/main/resources/org/apache/ctakes/lvg/data/ReadMe.txt URL: http://svn.apache.org/viewvc/ctakes/trunk/ctakes-web-rest/src/main/resources/org/apache/ctakes/lvg/data/ReadMe.txt?rev=1845321&view=auto ============================================================================== --- ctakes/trunk/ctakes-web-rest/src/main/resources/org/apache/ctakes/lvg/data/ReadMe.txt (added) +++ ctakes/trunk/ctakes-web-rest/src/main/resources/org/apache/ctakes/lvg/data/ReadMe.txt Wed Oct 31 11:03:43 2018 @@ -0,0 +1,9 @@ +# +# This directory contains all data files that Lvg use. +# +- IDB: contains IDB property file. +- ReadMe: this is it. +- config: contains Lvg configuration files. +- misc: contains files for flat file system. +- rules: contains rule file for persistent trie to use. +- tables: contains files for Lvg database. Added: ctakes/trunk/ctakes-web-rest/src/main/resources/org/apache/ctakes/lvg/data/Unicode/diacriticMap.data URL: http://svn.apache.org/viewvc/ctakes/trunk/ctakes-web-rest/src/main/resources/org/apache/ctakes/lvg/data/Unicode/diacriticMap.data?rev=1845321&view=auto ============================================================================== --- ctakes/trunk/ctakes-web-rest/src/main/resources/org/apache/ctakes/lvg/data/Unicode/diacriticMap.data (added) +++ ctakes/trunk/ctakes-web-rest/src/main/resources/org/apache/ctakes/lvg/data/Unicode/diacriticMap.data Wed Oct 31 11:03:43 2018 @@ -0,0 +1,84 @@ +#---------------------------------------------------------- +# Strip diacritics mappint table +# Unicode | Mapped ASCII | Char | Unicode Name +# U+XXXX | ASCII char | (notes) | (notes) +#---------------------------------------------------------- +U+00D8|O|Ø|LATIN CAPITAL LETTER O WITH STROKE +U+00F8|o|ø|LATIN SMALL LETTER O WITH STROKE +U+0110|D|Đ|LATIN CAPITAL LETTER D WITH STROKE +U+0111|d|đ|LATIN SMALL LETTER D WITH STROKE +U+0126|H|Ħ|LATIN CAPITAL LETTER H WITH STROKE +U+0127|h|ħ|LATIN SMALL LETTER H WITH STROKE +U+0131|i|ı|LATIN SMALL LETTER DOTLESS I +U+013F|L|Ŀ|LATIN CAPITAL LETTER L WITH MIDDLE DOT +U+0140|l|ŀ|LATIN SMALL LETTER L WITH MIDDLE DOT +U+0141|L|Ł|LATIN CAPITAL LETTER L WITH STROKE +U+0142|l|ł|LATIN SMALL LETTER L WITH STROKE +U+0149|n|ʼn|LATIN SMALL LETTER N PRECEDED BY APOSTROPHE +U+0166|T|Ŧ|LATIN CAPITAL LETTER T WITH STROKE +U+0167|t|ŧ|LATIN SMALL LETTER T WITH STROKE +U+0180|b|ƀ|LATIN SMALL LETTER B WITH STROKE +U+0181|B|Ɓ|LATIN CAPITAL LETTER B WITH HOOK +U+0182|B|Ƃ|LATIN CAPITAL LETTER B WITH TOPBAR +U+0183|b|ƃ|LATIN SMALL LETTER B WITH TOPBAR +U+0186|O|Ɔ|LATIN CAPITAL LETTER OPEN O +U+0187|C|Ƈ|LATIN CAPITAL LETTER C WITH HOOK +U+0188|c|ƈ|LATIN SMALL LETTER C WITH HOOK +U+0189|D|Ɖ|LATIN CAPITAL LETTER AFRICAN D +U+018A|D|Ɗ|LATIN CAPITAL LETTER D WITH HOOK +U+018B|D|Ƌ|LATIN CAPITAL LETTER D WITH TOPBAR +U+018C|d|ƌ|LATIN SMALL LETTER D WITH TOPBAR +U+0190|E|Ɛ|LATIN CAPITAL LETTER OPEN E +U+0191|F|Ƒ|LATIN CAPITAL LETTER F WITH HOOK +U+0192|f|ƒ|LATIN SMALL LETTER F WITH HOOK +U+0193|G|Ɠ|LATIN CAPITAL LETTER G WITH HOOK +U+0197|I|Ɨ|LATIN CAPITAL LETTER I WITH STROKE +U+0198|K|Ƙ|LATIN CAPITAL LETTER K WITH HOOK +U+0199|k|ƙ|LATIN SMALL LETTER K WITH HOOK +U+019A|l|ƚ|LATIN SMALL LETTER L WITH BAR +U+019C|M|Ɯ|LATIN CAPITAL LETTER TURNED M +U+019D|N|Ɲ|LATIN CAPITAL LETTER N WITH LEFT HOOK +U+019E|n|ƞ|LATIN SMALL LETTER N WITH LONG RIGHT LEG +U+019F|O|Ɵ|LATIN CAPITAL LETTER O WITH MIDDLE TILDE +U+01A4|P|Ƥ|LATIN CAPITAL LETTER P WITH HOOK +U+01A5|p|ƥ|LATIN SMALL LETTER P WITH HOOK +U+01AB|t|ƫ|LATIN SMALL LETTER T WITH PALATAL HOOK +U+01AC|T|Ƭ|LATIN CAPITAL LETTER T WITH HOOK +U+01AD|t|ƭ|LATIN SMALL LETTER T WITH HOOK +U+01AE|T|Ʈ|LATIN CAPITAL LETTER T WITH RETROFLEX HOOK +U+01B2|V|Ʋ|LATIN CAPITAL LETTER V WITH HOOK +U+01B3|Y|Ƴ|LATIN CAPITAL LETTER Y WITH HOOK +U+01B4|y|ƴ|LATIN SMALL LETTER Y WITH HOOK +U+01B5|Z|Ƶ|LATIN CAPITAL LETTER Z WITH STROKE +U+01B6|z|ƶ|LATIN SMALL LETTER Z WITH STROKE +U+01DD|e|ǝ|LATIN SMALL LETTER TURNED E +U+01E4|G|Ǥ|LATIN CAPITAL LETTER G WITH STROKE +U+01E5|g|ǥ|LATIN SMALL LETTER G WITH STROKE +U+0220|N|Ƞ|LATIN CAPITAL LETTER N WITH LONG RIGHT LEG +U+0221|d|ȡ|LATIN SMALL LETTER D WITH CURL +U+0224|Z|Ȥ|LATIN CAPITAL LETTER Z WITH HOOK +U+0225|z|ȥ|LATIN SMALL LETTER Z WITH HOOK +U+0234|l|ȴ|LATIN SMALL LETTER L WITH CURL +U+0235|n|ȵ|LATIN SMALL LETTER N WITH CURL +U+0236|t|ȶ|LATIN SMALL LETTER T WITH CURL +U+0237|j|ȷ|LATIN SMALL LETTER DOTLESS J +U+023A|A|Ⱥ|LATIN CAPITAL LETTER A WITH STROKE +U+023B|C|Ȼ|LATIN CAPITAL LETTER C WITH STROKE +U+023C|c|ȼ|LATIN SMALL LETTER C WITH STROKE +U+023D|L|Ƚ|LATIN CAPITAL LETTER L WITH BAR +U+023E|T|Ⱦ|LATIN CAPITAL LETTER T WITH DIAGONAL STROKE +U+023F|s|ȿ|LATIN SMALL LETTER S WITH SWASH TAIL +U+0240|z|ɀ|LATIN SMALL LETTER Z WITH SWASH TAIL +U+0243|B|Ƀ|LATIN CAPITAL LETTER B WITH STROKE +U+0244|U|Ʉ|LATIN CAPITAL LETTER U BAR +U+0245|V|Ʌ|LATIN CAPITAL LETTER TURNED V +U+0246|E|Ɇ|LATIN CAPITAL LETTER E WITH STROKE +U+0247|e|ɇ|LATIN SMALL LETTER E WITH STROKE +U+0248|J|Ɉ|LATIN CAPITAL LETTER J WITH STROKE +U+0249|j|ɉ|LATIN SMALL LETTER J WITH STROKE +U+024A|Q|Ɋ|LATIN CAPITAL LETTER SMALL Q WITH HOOK TAIL +U+024B|q|ɋ|LATIN SMALL LETTER Q WITH HOOK TAIL +U+024C|R|Ɍ|LATIN CAPITAL LETTER R WITH STROKE +U+024D|r|ɍ|LATIN SMALL LETTER R WITH STROKE +U+024E|Y|Ɏ|LATIN CAPITAL LETTER Y WITH STROKE +U+024F|y|ɏ|LATIN SMALL LETTER Y WITH STROKE Added: ctakes/trunk/ctakes-web-rest/src/main/resources/org/apache/ctakes/lvg/data/Unicode/ligatureMap.data URL: http://svn.apache.org/viewvc/ctakes/trunk/ctakes-web-rest/src/main/resources/org/apache/ctakes/lvg/data/Unicode/ligatureMap.data?rev=1845321&view=auto ============================================================================== --- ctakes/trunk/ctakes-web-rest/src/main/resources/org/apache/ctakes/lvg/data/Unicode/ligatureMap.data (added) +++ ctakes/trunk/ctakes-web-rest/src/main/resources/org/apache/ctakes/lvg/data/Unicode/ligatureMap.data Wed Oct 31 11:03:43 2018 @@ -0,0 +1,27 @@ +#---------------------------------------------------------- +# Split ligature mapping table +# Unicode | Mapped ASCII | Char | Unicode Name +# U+XXXX | ASCII string | (notes) | (notes) +# The first line is needed due to defect in Unicode Normalization Algorithm +#---------------------------------------------------------- +U+00B5|µ|µ|MICRO SIGN +U+00C6|AE|Æ|LATIN CAPITAL LETTER AE +U+00E6|ae|æ|LATIN SMALL LETTER AE +U+0152|OE|Œ|LATIN CAPITAL LIGATURE OE +U+0153|oe|œ|LATIN SMALL LIGATURE OE +U+0195|hv|ƕ|LATIN SMALL LETTER HV +U+01A2|OI|Ƣ|LATIN CAPITAL LETTER OI +U+01A3|oi|ƣ|LATIN SMALL LETTER OI +U+01A6|YR|Ʀ|LATIN LETTER YR +U+0222|OU|Ȣ|LATIN CAPITAL LETTER OU +U+0223|ou|ȣ|LATIN SMALL LETTER OU +U+0238|db|ȸ|LATIN SMALL LETTER DB DIGRAPH +U+0239|qp|ȹ|LATIN SMALL LETTER QP DIGRAPH +U+0276|OE|ɶ|LATIN LETTER SMALL CAPITAL OE +U+02A3|DZ|ʣ|LATIN SMALL LETTER DZ DIGRAPH +U+02A6|ts|ʦ|LATIN SMALL LETTER TS DIGRAPH +U+02AA|ls|ʪ|LATIN SMALL LETTER LS DIGRAPH +U+02AB|lz|ʫ|LATIN SMALL LETTER LZ DIGRAPH +U+2120|℠|℠|SERVICE MARK +U+2121|℡|℡|TELEPHONE SIGN +U+2122|™|™|TRADE MARK SIGN Added: ctakes/trunk/ctakes-web-rest/src/main/resources/org/apache/ctakes/lvg/data/Unicode/nonStripMap.data URL: http://svn.apache.org/viewvc/ctakes/trunk/ctakes-web-rest/src/main/resources/org/apache/ctakes/lvg/data/Unicode/nonStripMap.data?rev=1845321&view=auto ============================================================================== --- ctakes/trunk/ctakes-web-rest/src/main/resources/org/apache/ctakes/lvg/data/Unicode/nonStripMap.data (added) +++ ctakes/trunk/ctakes-web-rest/src/main/resources/org/apache/ctakes/lvg/data/Unicode/nonStripMap.data Wed Oct 31 11:03:43 2018 @@ -0,0 +1,62 @@ +#---------------------------------------------------------- +# Non-ASCII strip or mapping table +# Unicode | Mapped ASCII | Char | Unicode Name +# U+XXXX | ASCII string | (notes) | (notes) +#---------------------------------------------------------- +U+00B5|u|µ|MICRO SIGN +U+00DF|beta|ß|LATIN SMALL LETTER SHARP S +U+0251|alpha|ɑ|LATIN SMALL LETTER ALPHA +U+0263|gamma|ɣ|LATIN SMALL LETTER GAMMA +U+0278|phi|ɸ|LATIN SMALL LETTER PHI +U+0283|esh|ʃ|LATIN SMALL LETTER ESH +U+028A|ezh|ʊ|LATIN SMALL LETTER UPSILON +U+0391|ALPHA|Α|GREEK CAPITAL LETTER ALPHA +U+0392|BETA|Β|GREEK CAPITAL LETTER BETA +U+0393|GAMMA|Γ|GREEK CAPITAL LETTER GAMMA +U+0394|DELTA|Δ|GREEK CAPITAL LETTER DELTA +U+0395|EPSILON|Ε|GREEK CAPITAL LETTER EPSILON +U+0396|ZETA|Ζ|GREEK CAPITAL LETTER ZETA +U+0397|ETA|Η|GREEK CAPITAL LETTER ETA +U+0398|THETA|Θ|GREEK CAPITAL LETTER THETA +U+0399|IOTA|Ι|GREEK CAPITAL LETTER IOTA +U+039A|KAPPA|Κ|GREEK CAPITAL LETTER KAPPA +U+039B|LAMDA|Λ|GREEK CAPITAL LETTER LAMDA +U+039C|MU|Μ|GREEK CAPITAL LETTER MU +U+039D|NU|Ν|GREEK CAPITAL LETTER NU +U+039E|XI|Ξ|GREEK CAPITAL LETTER XI +U+039F|OMICRON|Ο|GREEK CAPITAL LETTER OMICRON +U+03A0|PI|Π|GREEK CAPITAL LETTER PI +U+03A1|RHO|Ρ|GREEK CAPITAL LETTER RHO +U+03A3|SIGMA|Σ|GREEK CAPITAL LETTER SIGMA +U+03A4|TAU|Τ|GREEK CAPITAL LETTER TAU +U+03A5|UPSILON|Υ|GREEK CAPITAL LETTER UPSILON +U+03A6|PHI|Φ|GREEK CAPITAL LETTER PHI +U+03A7|CHI|Χ|GREEK CAPITAL LETTER CHI +U+03A8|PSI|Ψ|GREEK CAPITAL LETTER PSI +U+03A9|OMEGA|Ω|GREEK CAPITAL LETTER OMEGA +U+03B1|alpha|α|GREEK SMALL LETTER ALPHA +U+03B2|beta|β|GREEK SMALL LETTER BETA +U+03B3|gamma|γ|GREEK SMALL LETTER GAMMA +U+03B4|delta|δ|GREEK SMALL LETTER DELTA +U+03B5|epsilon|ε|GREEK SMALL LETTER EPSILON +U+03B6|zeta|ζ|GREEK SMALL LETTER ZETA +U+03B7|eta|η|GREEK SMALL LETTER ETA +U+03B8|theta|θ|GREEK SMALL LETTER THETA +U+03B9|iota|ι|GREEK SMALL LETTER IOTA +U+03BA|kappa|κ|GREEK SMALL LETTER KAPPA +U+03BB|lamda|λ|GREEK SMALL LETTER LAMDA +U+03BC|mu|μ|GREEK SMALL LETTER MU +U+03BD|nu|ν|GREEK SMALL LETTER NU +U+03BE|xi|ξ|GREEK SMALL LETTER XI +U+03BF|omicron|ο|GREEK SMALL LETTER OMICRON +U+03C0|pi|π|GREEK SMALL LETTER PI +U+03C1|rho|ρ|GREEK SMALL LETTER RHO +U+03C2|sigma|ς|GREEK SMALL LETTER FINAL SIGMA +U+03C3|sigma|σ|GREEK SMALL LETTER SIGMA +U+03C4|tau|τ|GREEK SMALL LETTER TAU +U+03C5|upsilon|υ|GREEK SMALL LETTER UPSILON +U+03C6|phi|φ|GREEK SMALL LETTER PHI +U+03C7|chi|χ|GREEK SMALL LETTER CHI +U+03C8|psi|ψ|GREEK SMALL LETTER PSI +U+03C9|omega|ω|GREEK SMALL LETTER OMEGA +U+237A|alpha|⍺|APL FUNCTIONAL SYMBOL ALPHA Added: ctakes/trunk/ctakes-web-rest/src/main/resources/org/apache/ctakes/lvg/data/Unicode/symbolMap.data URL: http://svn.apache.org/viewvc/ctakes/trunk/ctakes-web-rest/src/main/resources/org/apache/ctakes/lvg/data/Unicode/symbolMap.data?rev=1845321&view=auto ============================================================================== --- ctakes/trunk/ctakes-web-rest/src/main/resources/org/apache/ctakes/lvg/data/Unicode/symbolMap.data (added) +++ ctakes/trunk/ctakes-web-rest/src/main/resources/org/apache/ctakes/lvg/data/Unicode/symbolMap.data Wed Oct 31 11:03:43 2018 @@ -0,0 +1,99 @@ +#---------------------------------------------------------- +# Symbols and puntuaction mapping table +# Unicode | Mapped ASCII | Char | Unicode Name +# U+XXXX | ASCII string | (notes) | (notes) +#---------------------------------------------------------- +U+00AB|"|«|LEFT-POINTING DOUBLE ANGLE QUOTATION MARK +U+00AD|-|­|SOFT HYPHEN +U+00B4|'|´|ACUTE ACCENT +U+00BB|"|»|RIGHT-POINTING DOUBLE ANGLE QUOTATION MARK +U+00F7|/|÷|DIVISION SIGN +U+01C0|||ǀ|LATIN LETTER DENTAL CLICK +U+01C3|!|ǃ|LATIN LETTER RETROFLEX CLICK +U+02B9|'|ʹ|MODIFIER LETTER PRIME +U+02BA|"|ʺ|MODIFIER LETTER DOUBLE PRIME +U+02BC|'|ʼ|MODIFIER LETTER APOSTROPHE +U+02C4|^|˄|MODIFIER LETTER UP ARROWHEAD +U+02C6|^|ˆ|MODIFIER LETTER CIRCUMFLEX ACCENT +U+02C8|'|ˈ|MODIFIER LETTER VERTICAL LINE +U+02CB|`|ˋ|MODIFIER LETTER GRAVE ACCENT +U+02CD|_|ˍ|MODIFIER LETTER LOW MACRON +U+02DC|~|˜|SMALL TILDE +U+0300|`|̀|COMBINING GRAVE ACCENT +U+0301|'|́|COMBINING ACUTE ACCENT +U+0302|^|̂|COMBINING CIRCUMFLEX ACCENT +U+0303|~|̃|COMBINING TILDE +U+030B|"|̋|COMBINING DOUBLE ACUTE ACCENT +U+030E|"|̎|COMBINING DOUBLE VERTICAL LINE ABOVE +U+0331|_|̱|COMBINING MACRON BELOW +U+0332|_|̲|COMBINING LOW LINE +U+0338|/|̸|COMBINING LONG SOLIDUS OVERLAY +U+0589|:|։|ARMENIAN FULL STOP +U+05C0|||׀|HEBREW PUNCTUATION PASEQ +U+05C3|:|׃|HEBREW PUNCTUATION SOF PASUQ +U+066A|%|٪|ARABIC PERCENT SIGN +U+066D|*|٭|ARABIC FIVE POINTED STAR +U+200B| |​|ZERO WIDTH SPACE +U+2010|-|‐|HYPHEN +U+2011|-|‑|NON-BREAKING HYPHEN +U+2012|-|‒|FIGURE DASH +U+2013|-|–|EN DASH +U+2014|-|—|EM DASH +U+2015|--|―|HORIZONTAL BAR +U+2016||||‖|DOUBLE VERTICAL LINE +U+2017|_|‗|DOUBLE LOW LINE +U+2018|'|‘|LEFT SINGLE QUOTATION MARK +U+2019|'|’|RIGHT SINGLE QUOTATION MARK +U+201A|,|‚|SINGLE LOW-9 QUOTATION MARK +U+201B|'|‛|SINGLE HIGH-REVERSED-9 QUOTATION MARK +U+201C|"|“|LEFT DOUBLE QUOTATION MARK +U+201D|"|”|RIGHT DOUBLE QUOTATION MARK +U+201E|"|„|DOUBLE LOW-9 QUOTATION MARK +U+201F|"|‟|DOUBLE HIGH-REVERSED-9 QUOTATION MARK +U+2032|'|′|PRIME +U+2033|"|″|DOUBLE PRIME +U+2034|'''|‴|TRIPLE PRIME +U+2035|`|‵|REVERSED PRIME +U+2036|"|‶|REVERSED DOUBLE PRIME +U+2037|'''|‷|REVERSED TRIPLE PRIME +U+2038|^|‸|CARET +U+2039|<|‹|SINGLE LEFT-POINTING ANGLE QUOTATION MARK +U+203A|>|›|SINGLE RIGHT-POINTING ANGLE QUOTATION MARK +U+203D|?|‽|INTERROBANG +U+2044|/|⁄|FRACTION SLASH +U+204E|*|⁎|LOW ASTERISK +U+2052|%|⁒|COMMERCIAL MINUS SIGN +U+2053|~|⁓|SWUNG DASH +U+2060| |⁠|WORD JOINER +U+20E5|\|⃥|COMBINING REVERSE SOLIDUS OVERLAY +U+2212|-|−|MINUS SIGN +U+2215|/|∕|DIVISION SLASH +U+2216|\|∖|SET MINUS +U+2217|*|∗|ASTERISK OPERATOR +U+2223|||∣|DIVIDES +U+2236|:|∶|RATIO +U+223C|~|∼|TILDE OPERATOR +U+2264|<=|≤|LESS-THAN OR EQUAL TO +U+2265|>=|≥|GREATER-THAN OR EQUAL TO +U+2266|<=|≦|LESS-THAN OVER EQUAL TO +U+2267|>=|≧|GREATER-THAN OVER EQUAL TO +U+2303|^|⌃|UP ARROWHEAD +U+2329|<|〈|LEFT-POINTING ANGLE BRACKET +U+232A|>|〉|RIGHT-POINTING ANGLE BRACKET +U+266F|#|♯|MUSIC SHARP SIGN +U+2731|*|✱|HEAVY ASTERISK +U+2758|||❘|LIGHT VERTICAL BAR +U+2762|!|❢|HEAVY EXCLAMATION MARK ORNAMENT +U+27E6|[|⟦|MATHEMATICAL LEFT WHITE SQUARE BRACKET +U+27E8|<|⟨|MATHEMATICAL LEFT ANGLE BRACKET +U+27E9|>|⟩|MATHEMATICAL RIGHT ANGLE BRACKET +U+2983|{|⦃|LEFT WHITE CURLY BRACKET +U+2984|}|⦄|RIGHT WHITE CURLY BRACKET +U+3003|"|〃|DITTO MARK +U+3008|<|〈|LEFT ANGLE BRACKET +U+3009|>|〉|RIGHT ANGLE BRACKET +U+301B|]|〛|RIGHT WHITE SQUARE BRACKET +U+301C|~|〜|WAVE DASH +U+301D|"|〝|REVERSED DOUBLE PRIME QUOTATION MARK +U+301E|"|〞|DOUBLE PRIME QUOTATION MARK +U+FEFF| ||ZERO WIDTH NO-BREAK SPACE Added: ctakes/trunk/ctakes-web-rest/src/main/resources/org/apache/ctakes/lvg/data/Unicode/synonymMap.data URL: http://svn.apache.org/viewvc/ctakes/trunk/ctakes-web-rest/src/main/resources/org/apache/ctakes/lvg/data/Unicode/synonymMap.data?rev=1845321&view=auto ============================================================================== --- ctakes/trunk/ctakes-web-rest/src/main/resources/org/apache/ctakes/lvg/data/Unicode/synonymMap.data (added) +++ ctakes/trunk/ctakes-web-rest/src/main/resources/org/apache/ctakes/lvg/data/Unicode/synonymMap.data Wed Oct 31 11:03:43 2018 @@ -0,0 +1,6 @@ +#---------------------------------------------------------- +# Unicode synonym mapping table +# Unicode | Synonym Base | Char | Synonym Char | Unicode Name | Synonym Name +# U+XXXX | U+XXXX | (Notes) | (Notes) | (Notes) | (Notes) +#---------------------------------------------------------- +U+03BC|U+00B5|μ|µ|GREEK SMALL LETTER MU|MICRO SIGN Added: ctakes/trunk/ctakes-web-rest/src/main/resources/org/apache/ctakes/lvg/data/Unicode/unicodeMap.data URL: http://svn.apache.org/viewvc/ctakes/trunk/ctakes-web-rest/src/main/resources/org/apache/ctakes/lvg/data/Unicode/unicodeMap.data?rev=1845321&view=auto ============================================================================== --- ctakes/trunk/ctakes-web-rest/src/main/resources/org/apache/ctakes/lvg/data/Unicode/unicodeMap.data (added) +++ ctakes/trunk/ctakes-web-rest/src/main/resources/org/apache/ctakes/lvg/data/Unicode/unicodeMap.data Wed Oct 31 11:03:43 2018 @@ -0,0 +1,39 @@ +#---------------------------------------------------------- +# Unicode mapping table +# Unicode | Mapped ASCII | Char | Unicode Name +# U+XXXX | ASCII string | (notes) | (notes) +#---------------------------------------------------------- +U+00D7|*|×|MULTIPLICATION SIGN +U+0149|n|ʼn|LATIN SMALL LETTER N PRECEDED BY APOSTROPHE +U+0184|6|Ƅ|LATIN CAPITAL LETTER TONE SIX +U+0185|6|ƅ|LATIN SMALL LETTER TONE SIX +U+01A7|2|Ƨ|LATIN CAPITAL LETTER TONE TWO +U+01A8|2|ƨ|LATIN SMALL LETTER TONE TWO +U+01BB|2|ƻ|LATIN LETTER TWO WITH STROKE +U+01BC|5|Ƽ|LATIN CAPITAL LETTER TONE FIVE +U+01BD|5|ƽ|LATIN SMALL LETTER TONE FIVE +U+0261|g|ɡ|LATIN SMALL LETTER SCRIPT G +U+0406|I|І|CYRILLIC CAPITAL LETTER BYELORUSSIAN-UKRAINIAN I +U+04BB|h|һ|CYRILLIC SMALL LETTER SHHA +U+04C0|I|Ӏ|CYRILLIC LETTER PALOCHKA +U+1E9A|a|ẚ|LATIN SMALL LETTER A WITH RIGHT HALF RING +U+212E|e|℮|ESTIMATED SYMBOL +U+2153|1/3|⅓|VULGAR FRACTION ONE THIRD +U+2154|2/3|⅔|VULGAR FRACTION TWO THIRDS +U+2155|1/5|⅕|VULGAR FRACTION ONE FIFTH +U+2156|2/5|⅖|VULGAR FRACTION TWO FIFTHS +U+2157|3/5|⅗|VULGAR FRACTION THREE FIFTHS +U+2158|4/5|⅘|VULGAR FRACTION FOUR FIFTHS +U+2159|1/6|⅙|VULGAR FRACTION ONE SIXTH +U+215A|5/6|⅚|VULGAR FRACTION FIVE SIXTHS +U+215B|1/8|⅛|VULGAR FRACTION ONE EIGHTH +U+215C|3/8|⅜|VULGAR FRACTION THREE EIGHTHS +U+215D|5/8|⅝|VULGAR FRACTION FIVE EIGHTHS +U+215E|7/8|⅞|VULGAR FRACTION SEVEN EIGHTHS +U+215F|1/|⅟|FRACTION NUMERATOR ONE +U+2190|<-|←|LEFTWARDS ARROW +U+2192|->|→|RIGHTWARDS ARROW +U+2194|<->|↔|LEFT RIGHT ARROW +U+21D0|<=|⇐|LEFTWARDS DOUBLE ARROW +U+21D2|=>|⇒|RIGHTWARDS DOUBLE ARROW +U+21D4|<=>|⇔|LEFT RIGHT DOUBLE ARROW Added: ctakes/trunk/ctakes-web-rest/src/main/resources/org/apache/ctakes/lvg/data/config/lvg.properties URL: http://svn.apache.org/viewvc/ctakes/trunk/ctakes-web-rest/src/main/resources/org/apache/ctakes/lvg/data/config/lvg.properties?rev=1845321&view=auto ============================================================================== --- ctakes/trunk/ctakes-web-rest/src/main/resources/org/apache/ctakes/lvg/data/config/lvg.properties (added) +++ ctakes/trunk/ctakes-web-rest/src/main/resources/org/apache/ctakes/lvg/data/config/lvg.properties Wed Oct 31 11:03:43 2018 @@ -0,0 +1,98 @@ +# +# Licensed to the Apache Software Foundation (ASF) under one +# or more contributor license agreements. See the NOTICE file +# distributed with this work for additional information +# regarding copyright ownership. The ASF licenses this file +# to you under the Apache License, Version 2.0 (the +# "License"); you may not use this file except in compliance +# with the License. You may obtain a copy of the License at +# +# http://www.apache.org/licenses/LICENSE-2.0 +# +# Unless required by applicable law or agreed to in writing, +# software distributed under the License is distributed on an +# "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY +# KIND, either express or implied. See the License for the +# specific language governing permissions and limitations +# under the License. +# + +#------------------------------------------------------------------- +# Directory and files +# +# LVG_DIR: the absolute path of the lexical tool directory +# LVG_STOP_WORD_FILE: the relative path (to LVG_DIR) of stop word file. +# LVG_NONINFO_WORD_FILE: the relative path (to LVG_DIR) of non-info word file. +# LVG_CONJ_WORD_FILE: the relative path (to LVG_DIR) of conjuction file. +# LVG_REMOVE_S_FILE: the relative path (to LVG_DIR) of removeS file. +# +# LVG_DIACRITICS_FILE: the relative path (to LVG_DIR) of diacritics file. +# LVG_LIGATURES_FILE: the relative path (to LVG_DIR) of ligature file. +# LVG_UNICODE_SYNONYM_FILE: the relative path (to LVG_DIR) of Unicode synonym file +# LVG_UNICODE_SYMBOLS_FILE: the relative path (to LVG_DIR) of symbols map file +# LVG_UNICODE_FILE: the relative path (to LVG_DIR) of Unicode map file +# LVG_NON_STRIP_MAP_UNICODE_FILE: the relative path (to LVG_DIR) of non-strip Unicode map file +#------------------------------------------------------------------- + +# LVG_DIR=/some_path/ctakes-lvg/resources/ctakes-lvg/ +# LVG_DIR=AUTO_MODE results in using user's current working directory. +LVG_DIR=AUTO_MODE + +LVG_STOP_WORD_FILE=data/misc/stopWords.data +LVG_NONINFO_WORD_FILE=data/misc/nonInfoWords.data +LVG_CONJ_WORD_FILE=data/misc/conjunctionWord.data +LVG_REMOVE_S_FILE=data/misc/removeS.data +# +LVG_DIACRITICS_FILE=data/Unicode/diacriticMap.data +LVG_LIGATURES_FILE=data/Unicode/ligatureMap.data +LVG_UNICODE_SYNONYM_FILE=data/Unicode/synonymMap.data +LVG_UNICODE_SYMBOL_FILE=data/Unicode/symbolMap.data +LVG_UNICODE_FILE=data/Unicode/unicodeMap.data +LVG_NON_STRIP_MAP_UNICODE_FILE=data/Unicode/nonStripMap.data +#------------------------------------------------------------------- +# Database and JDBC driver +# +# DB_TYPE: HSQLDB, MYSQL, or OTHER +# DB_DRIVER: the JDBC driver +#------------------------------------------------------------------- +# DB_HOST: Hostname of database +# DB_NAME: database name for Lvg (default is lvg2008) +# DB_USERNAME: user name for using Lvg database (default is lvg) +# DB_PASSWORD: password for lvg user (default is lvg) +#------------------------------------------------------------------- +#DB_TYPE=HSQLDB +#DB_DRIVER=org.hsqldb.jdbcDriver +#DB_NAME=lvg2008 +#DB_HOST=null +#DB_USERNAME=sa +#DB_PASSWORD= +DB_TYPE=MYSQL +DB_DRIVER=com.mysql.jdbc.Driver +DB_NAME= +DB_HOST= +DB_USERNAME= +DB_PASSWORD= + +#------------------------------------------------------------------- +# Variables: +# +# MIN_TERM_LENGTH: min term length in trie. +# MAX_RULE_UNINFLECTED_TERMS: The max. number of rules based uninflections. +# MAX_METAPHONE: The max. number of characters in metaphone format. +# CGI_EOP: the string of the mark of the end of the set. +# NO_OUTPUT: the string for no output +# TRUNCATED_RESULTS: the maximum number of output LexiItems. +# DER_TRIE_STEM_LENGTH: minmum number of legal stem length in derivation trie +# START_TAG: start tag for the unicode symbol name. +# END_TAG: end tag for the unicode symbol name. +#------------------------------------------------------------------- +MIN_TERM_LENGTH=3 +MAX_RULE_UNINFLECTED_TERMS=10 +MAX_METAPHONE=6 +CGI_EOP=__THE_END__ +NO_OUTPUT=-No Output- +TRUNCATED_RESULTS=80 +LVG_PROMPT=DEFAULT +DIR_TRIE_STEM_LENGTH=0 +START_TAG=![ +END_TAG=]! Added: ctakes/trunk/ctakes-web-rest/src/main/resources/org/apache/ctakes/lvg/data/config/lvg.properties.TEMPLATE URL: http://svn.apache.org/viewvc/ctakes/trunk/ctakes-web-rest/src/main/resources/org/apache/ctakes/lvg/data/config/lvg.properties.TEMPLATE?rev=1845321&view=auto ============================================================================== --- ctakes/trunk/ctakes-web-rest/src/main/resources/org/apache/ctakes/lvg/data/config/lvg.properties.TEMPLATE (added) +++ ctakes/trunk/ctakes-web-rest/src/main/resources/org/apache/ctakes/lvg/data/config/lvg.properties.TEMPLATE Wed Oct 31 11:03:43 2018 @@ -0,0 +1,67 @@ +#------------------------------------------------------------------- +# Directory and files +# +# LVG_DIR: the absolute path of the lexical tool directory +# LVG_STOP_WORD_FILE: the relative path (to LVG_DIR) of stop word file. +# LVG_NONINFO_WORD_FILE: the relative path (to LVG_DIR) of non-info word file. +# LVG_CONJ_WORD_FILE: the relative path (to LVG_DIR) of conjuction file. +# LVG_REMOVE_S_FILE: the relative path (to LVG_DIR) of removeS file. +# +# LVG_DIACRITICS_FILE: the relative path (to LVG_DIR) of diacritics file. +# LVG_LIGATURES_FILE: the relative path (to LVG_DIR) of ligature file. +# LVG_UNICODE_SYNONYM_FILE: the relative path (to LVG_DIR) of Unicode synonym file +# LVG_UNICODE_SYMBOLS_FILE: the relative path (to LVG_DIR) of symbols map file +# LVG_UNICODE_FILE: the relative path (to LVG_DIR) of Unicode map file +# LVG_NON_STRIP_MAP_UNICODE_FILE: the relative path (to LVG_DIR) of non-strip Unicode map file +#------------------------------------------------------------------- +LVG_DIR=___LVG_DIR___ +LVG_STOP_WORD_FILE=data/misc/stopWords.data +LVG_NONINFO_WORD_FILE=data/misc/nonInfoWords.data +LVG_CONJ_WORD_FILE=data/misc/conjunctionWord.data +LVG_REMOVE_S_FILE=data/misc/removeS.data +# +LVG_DIACRITICS_FILE=data/Unicode/diacriticMap.data +LVG_LIGATURES_FILE=data/Unicode/ligatureMap.data +LVG_UNICODE_SYNONYM_FILE=data/Unicode/synonymMap.data +LVG_UNICODE_SYMBOL_FILE=data/Unicode/symbolMap.data +LVG_UNICODE_FILE=data/Unicode/unicodeMap.data +LVG_NON_STRIP_MAP_UNICODE_FILE=data/Unicode/nonStripMap.data +#------------------------------------------------------------------- +# Database and JDBC driver +# +# DB_TYPE: HSQLDB, MYSQL, or OTHER +# DB_DRIVER: the JDBC driver +# DB_HOST: Hostname of MySql database +# DB_NAME: database name for Lvg (default is lvg2008) +# DB_USERNAME: user name for using Lvg database (default is lvg) +# DB_PASSWORD: password for lvg user (default is lvg) +#------------------------------------------------------------------- +DB_TYPE=___DB_TYPE___ +DB_DRIVER=___DB_DRIVER___ +DB_NAME=___DB_NAME___ +DB_HOST=___DB_HOST___ +DB_USERNAME=___DB_USERNAME___ +DB_PASSWORD=___DB_PASSWORD___ +#------------------------------------------------------------------- +# Variables: +# +# MIN_TERM_LENGTH: min term length in trie. +# MAX_RULE_UNINFLECTED_TERMS: The max. number of rules based uninflections. +# MAX_METAPHONE: The max. number of characters in metaphone format. +# CGI_EOP: the string of the mark of the end of the set. +# NO_OUTPUT: the string for no output +# TRUNCATED_RESULTS: the maximum number of output LexiItems. +# DER_TRIE_STEM_LENGTH: minmum number of legal stem length in derivation trie +# START_TAG: start tag for the unicode symbol name. +# END_TAG: end tag for the unicode symbol name. +#------------------------------------------------------------------- +MIN_TERM_LENGTH=3 +MAX_RULE_UNINFLECTED_TERMS=10 +MAX_METAPHONE=6 +CGI_EOP=__THE_END__ +NO_OUTPUT=-No Output- +TRUNCATED_RESULTS=80 +LVG_PROMPT=DEFAULT +DIR_TRIE_STEM_LENGTH=3 +START_TAG=![ +END_TAG=]! Added: ctakes/trunk/ctakes-web-rest/src/main/resources/org/apache/ctakes/lvg/data/config/lvg.properties.hsql URL: http://svn.apache.org/viewvc/ctakes/trunk/ctakes-web-rest/src/main/resources/org/apache/ctakes/lvg/data/config/lvg.properties.hsql?rev=1845321&view=auto ============================================================================== --- ctakes/trunk/ctakes-web-rest/src/main/resources/org/apache/ctakes/lvg/data/config/lvg.properties.hsql (added) +++ ctakes/trunk/ctakes-web-rest/src/main/resources/org/apache/ctakes/lvg/data/config/lvg.properties.hsql Wed Oct 31 11:03:43 2018 @@ -0,0 +1,74 @@ +#------------------------------------------------------------------- +# Directory and files +# +# LVG_DIR: the absolute path of the lexical tool directory +# LVG_STOP_WORD_FILE: the relative path (to LVG_DIR) of stop word file. +# LVG_NONINFO_WORD_FILE: the relative path (to LVG_DIR) of non-info word file. +# LVG_CONJ_WORD_FILE: the relative path (to LVG_DIR) of conjuction file. +# LVG_REMOVE_S_FILE: the relative path (to LVG_DIR) of removeS file. +# +# LVG_DIACRITICS_FILE: the relative path (to LVG_DIR) of diacritics file. +# LVG_LIGATURES_FILE: the relative path (to LVG_DIR) of ligature file. +# LVG_UNICODE_SYNONYM_FILE: the relative path (to LVG_DIR) of Unicode synonym file +# LVG_UNICODE_SYMBOLS_FILE: the relative path (to LVG_DIR) of symbols map file +# LVG_UNICODE_FILE: the relative path (to LVG_DIR) of Unicode map file +# LVG_NON_STRIP_MAP_UNICODE_FILE: the relative path (to LVG_DIR) of non-strip Unicode map file +#------------------------------------------------------------------- +LVG_DIR=/Projects/lvg2008/ +LVG_STOP_WORD_FILE=data/misc/stopWords.data +LVG_NONINFO_WORD_FILE=data/misc/nonInfoWords.data +LVG_CONJ_WORD_FILE=data/misc/conjunctionWord.data +LVG_REMOVE_S_FILE=data/misc/removeS.data +# +LVG_DIACRITICS_FILE=data/Unicode/diacriticMap.data +LVG_LIGATURES_FILE=data/Unicode/ligatureMap.data +LVG_UNICODE_SYNONYM_FILE=data/Unicode/synonymMap.data +LVG_UNICODE_SYMBOL_FILE=data/Unicode/symbolMap.data +LVG_UNICODE_FILE=data/Unicode/unicodeMap.data +LVG_NON_STRIP_MAP_UNICODE_FILE=data/Unicode/nonStripMap.data +#------------------------------------------------------------------- +# Database and JDBC driver +# +# DB_TYPE: HSQLDB, MYSQL, or OTHER +# DB_DRIVER: the JDBC driver +# DB_HOST: Hostname of MySql database +# DB_NAME: database name for Lvg (default is lvg2008) +# DB_USERNAME: user name for using Lvg database (default is lvg) +# DB_PASSWORD: password for lvg user (default is lvg) +#------------------------------------------------------------------- +DB_TYPE=HSQLDB +DB_DRIVER=org.hsqldb.jdbcDriver +DB_NAME=lvg2008 +DB_HOST=null +DB_USERNAME=sa +DB_PASSWORD= +#---------------------------------------------- +#DB_TYPE=MYSQL +#DB_DRIVER=com.mysql.jdbc.Driver +#DB_NAME=lvg2008 +#DB_HOST=localhost +#DB_USERNAME=lvg +#DB_PASSWORD=lvg +#------------------------------------------------------------------- +# Variables: +# +# MIN_TERM_LENGTH: min term length in trie. +# MAX_RULE_UNINFLECTED_TERMS: The max. number of rules based uninflections. +# MAX_METAPHONE: The max. number of characters in metaphone format. +# CGI_EOP: the string of the mark of the end of the set. +# NO_OUTPUT: the string for no output +# TRUNCATED_RESULTS: the maximum number of output LexiItems. +# DER_TRIE_STEM_LENGTH: minmum number of legal stem length in derivation trie +# START_TAG: start tag for the unicode symbol name. +# END_TAG: end tag for the unicode symbol name. +#------------------------------------------------------------------- +MIN_TERM_LENGTH=3 +MAX_RULE_UNINFLECTED_TERMS=10 +MAX_METAPHONE=6 +CGI_EOP=__THE_END__ +NO_OUTPUT=-No Output- +TRUNCATED_RESULTS=80 +LVG_PROMPT=DEFAULT +DIR_TRIE_STEM_LENGTH=3 +START_TAG=![ +END_TAG=]! Added: ctakes/trunk/ctakes-web-rest/src/main/resources/org/apache/ctakes/lvg/data/misc/conjunctionWord.data URL: http://svn.apache.org/viewvc/ctakes/trunk/ctakes-web-rest/src/main/resources/org/apache/ctakes/lvg/data/misc/conjunctionWord.data?rev=1845321&view=auto ============================================================================== --- ctakes/trunk/ctakes-web-rest/src/main/resources/org/apache/ctakes/lvg/data/misc/conjunctionWord.data (added) +++ ctakes/trunk/ctakes-web-rest/src/main/resources/org/apache/ctakes/lvg/data/misc/conjunctionWord.data Wed Oct 31 11:03:43 2018 @@ -0,0 +1,107 @@ +# +# This file contains all conjuction words +# +aboard +about +across +after +against +aka +albeit +along +alongside +although +amid +amidst +among +amongst +and +around +as +astride +at +atop +bar +because +before +beneath +beside +besides +between +but +by +circa +despite +down +during +except +excluding +failing +following +for +from +given +if +in +inside +into +less +lest +like +mid +minus +near +nearby +nor +notwithstanding +of +off +on +once +only +onto +or +out +past +pending +per +plus +provided +providing +regarding +respecting +round +since +so +suppose +supposing +than +though +throughout +to +toward +towards +under +underneath +unless +unlike +until +unto +upon +versus +vs +wanting +when +whenever +where +whereas +wherein +wherever +whether +while +whilst +with +within +without +worth +yet Added: ctakes/trunk/ctakes-web-rest/src/main/resources/org/apache/ctakes/lvg/data/misc/nonInfoWords.data URL: http://svn.apache.org/viewvc/ctakes/trunk/ctakes-web-rest/src/main/resources/org/apache/ctakes/lvg/data/misc/nonInfoWords.data?rev=1845321&view=auto ============================================================================== --- ctakes/trunk/ctakes-web-rest/src/main/resources/org/apache/ctakes/lvg/data/misc/nonInfoWords.data (added) +++ ctakes/trunk/ctakes-web-rest/src/main/resources/org/apache/ctakes/lvg/data/misc/nonInfoWords.data Wed Oct 31 11:03:43 2018 @@ -0,0 +1,9 @@ +# +# This file contains all non-information wrods +# +NOT OTHERWISE SPECIFIED +NEC AND NOS +NOS +(NOS) +/NOS +NEC Added: ctakes/trunk/ctakes-web-rest/src/main/resources/org/apache/ctakes/lvg/data/misc/removeS.data URL: http://svn.apache.org/viewvc/ctakes/trunk/ctakes-web-rest/src/main/resources/org/apache/ctakes/lvg/data/misc/removeS.data?rev=1845321&view=auto ============================================================================== --- ctakes/trunk/ctakes-web-rest/src/main/resources/org/apache/ctakes/lvg/data/misc/removeS.data (added) +++ ctakes/trunk/ctakes-web-rest/src/main/resources/org/apache/ctakes/lvg/data/misc/removeS.data Wed Oct 31 11:03:43 2018 @@ -0,0 +1,36 @@ +#----------------------------------------------------------------------- +# This file defines rules for removing (s) in a term +# +#----------------------------------------------------------------------- +D$ +P$ +^$ +PC$ +SC$ +^C$ +SCC$ +^CC$ +pp$ +alpha$ +beta$ +gamma$ +#epsilon$ +#zeta$ +#eta$ +#theta$ +#iota$ +#kappa$ +#lamda$ +#mu$ +#nu$ +#xi$ +#omikron$ +#pi$ +#rho$ +#sigma$ +#tau$ +#upsilon$ +#phi$ +#chi$ +#psi$ +#omega$ Added: ctakes/trunk/ctakes-web-rest/src/main/resources/org/apache/ctakes/lvg/data/misc/stopWords.data URL: http://svn.apache.org/viewvc/ctakes/trunk/ctakes-web-rest/src/main/resources/org/apache/ctakes/lvg/data/misc/stopWords.data?rev=1845321&view=auto ============================================================================== --- ctakes/trunk/ctakes-web-rest/src/main/resources/org/apache/ctakes/lvg/data/misc/stopWords.data (added) +++ ctakes/trunk/ctakes-web-rest/src/main/resources/org/apache/ctakes/lvg/data/misc/stopWords.data Wed Oct 31 11:03:43 2018 @@ -0,0 +1,14 @@ +# +# This file contain all the stop words +# +of +and +with +for +nos +to +in +by +on +the +(non mesh) Added: ctakes/trunk/ctakes-web-rest/src/main/resources/org/apache/ctakes/lvg/data/misc/symbolSynonyms.data URL: http://svn.apache.org/viewvc/ctakes/trunk/ctakes-web-rest/src/main/resources/org/apache/ctakes/lvg/data/misc/symbolSynonyms.data?rev=1845321&view=auto ============================================================================== --- ctakes/trunk/ctakes-web-rest/src/main/resources/org/apache/ctakes/lvg/data/misc/symbolSynonyms.data (added) +++ ctakes/trunk/ctakes-web-rest/src/main/resources/org/apache/ctakes/lvg/data/misc/symbolSynonyms.data Wed Oct 31 11:03:43 2018 @@ -0,0 +1,4 @@ +#---------------------------------------------------------- +# Numeric Entity | Synonym Numeric Entity +#---------------------------------------------------------- +956|181 Added: ctakes/trunk/ctakes-web-rest/src/main/resources/org/apache/ctakes/lvg/data/rules/dm.rul URL: http://svn.apache.org/viewvc/ctakes/trunk/ctakes-web-rest/src/main/resources/org/apache/ctakes/lvg/data/rules/dm.rul?rev=1845321&view=auto ============================================================================== --- ctakes/trunk/ctakes-web-rest/src/main/resources/org/apache/ctakes/lvg/data/rules/dm.rul (added) +++ ctakes/trunk/ctakes-web-rest/src/main/resources/org/apache/ctakes/lvg/data/rules/dm.rul Wed Oct 31 11:03:43 2018 @@ -0,0 +1,1036 @@ +# File: dm.rul +# +# @(#)dm.rul 1.1 24 Feb 1995 +# +# This file contains the suffix rules for generating derivational +# variants. +# +# All lines starting with a '#' are comment lines. +# Whitespace is ignored. +# +# Format of non-comment lines is: +# +# in_suffix|in_cat_|in_infl|out_suffix|out_cat|out_infl +# +# where, +# +# in_suffix: is the matching suffix of input term +# in_cat: is the part of speech of the input term +# in_infl: is the inflection of the input term +# out_suffix: is the generated output suffix +# out_cat: is the part of speech of the output term +# out_infl: is the inflection of the output term +# +# Part of speech can be one of: +# +# adj: adjective +# adv: adverb +# noun: noun +# verb: verb +# +# For example, ability|noun|able|adj +# +# Says that a noun ending in "ability" generates an adjective form with "able" +# replacing "ability", e.g., readability -> readable +# +# +# e.g. readability|readable +RULE: ability$|noun|base|able$|adj|base +# +# e.g. washable|wash +RULE: able$|adj|base|$|noun|base +EXCEPTION: available|avail; +EXCEPTION: ballotable|ballot; +EXCEPTION: capable|cap; +EXCEPTION: finable|fin; +EXCEPTION: habitable|habit; +EXCEPTION: potable|pot; +EXCEPTION: sable|s; +EXCEPTION: suitable|suit; +EXCEPTION: tractable|tract; +EXCEPTION: trainable|train; +EXCEPTION: treatable|treat; +EXCEPTION: able|; +EXCEPTION: amenable|Amen; +EXCEPTION: amenable|amen; +EXCEPTION: bearable|bear; +EXCEPTION: datable|DAT; +EXCEPTION: reusable|reus; +# +# e.g. washable|wash +RULE: able$|adj|base|$|verb|base +EXCEPTION: ballotable|ballot; +EXCEPTION: capable|cap; +EXCEPTION: disagreeable|disagree; +EXCEPTION: incurable|incur; +EXCEPTION: able|; +EXCEPTION: bearable|bear; +EXCEPTION: potable|pot; +# +# e.g. irritable|irritate +RULE: able$|adj|base|ate$|verb|base +EXCEPTION: probable|probate; +EXCEPTION: sable|sate; +EXCEPTION: stable|state; +EXCEPTION: tractable|tractate; +# +# e.g. improvable|improve +RULE: able$|adj|base|e$|verb|base +EXCEPTION: educable|educe; +EXCEPTION: liable|lie; +EXCEPTION: viable|vie; +# +# e.g. passably|pass +RULE: ably$|adv|base|$|verb|base +# +# e.g. adequacy|adequate, celibacy|celibate +RULE: acy$|noun|base|ate$|adj|base +# +# e.g. assemblage|assemble +RULE: age$|noun|base|e$|verb|base +EXCEPTION: borage|bore; +EXCEPTION: salvage|salve; +EXCEPTION: savage|save; +EXCEPTION: mirage|mire; +# +# e.g. functional|function +RULE: al$|adj|base|$|noun|base +EXCEPTION: ammoniacal|ammoniac; +EXCEPTION: banal|ban; +EXCEPTION: carpal|carp; +EXCEPTION: choreal|chore; +EXCEPTION: costal|cost; +EXCEPTION: dental|dent; +EXCEPTION: fatal|fat; +EXCEPTION: final|fin; +EXCEPTION: fundal|fund; +EXCEPTION: genal|gen; +EXCEPTION: genual|genu; +EXCEPTION: hemal|hem; +EXCEPTION: humoral|humor; +EXCEPTION: incidental|incident; +EXCEPTION: jugal|jug; +EXCEPTION: legal|leg; +EXCEPTION: limbal|limb; +EXCEPTION: literal|liter; +EXCEPTION: meatal|meat; +EXCEPTION: nodal|nod; +EXCEPTION: papal|pap; +EXCEPTION: partal|part; +EXCEPTION: penal|pen; +EXCEPTION: proal|pro; +EXCEPTION: pupal|pup; +EXCEPTION: ramal|ram; +EXCEPTION: rational|ration; +EXCEPTION: rimal|rim; +EXCEPTION: sinal|sin; +EXCEPTION: spinal|spin; +EXCEPTION: sternal|stern; +EXCEPTION: total|tot; +EXCEPTION: tubal|tub; +EXCEPTION: internal|intern; +EXCEPTION: lateral|later; +EXCEPTION: mineral|miner; +EXCEPTION: oral|or; +EXCEPTION: mal|m; +EXCEPTION: al|; +EXCEPTION: amoral|amor; +EXCEPTION: basal|bas; +EXCEPTION: bursal|burs; +EXCEPTION: cemental|cement; +EXCEPTION: cereal|cere; +EXCEPTION: dorsal|dors; +EXCEPTION: modal|mod; +EXCEPTION: mortal|mort; +EXCEPTION: renal|ren; +EXCEPTION: lienal|lien; +EXCEPTION: royal|roy; +# +# e.g. universal|universe +RULE: al$|adj|base|e$|noun|base +EXCEPTION: acral|acre; +EXCEPTION: banal|bane; +EXCEPTION: caval|cave; +EXCEPTION: conjunctival|conjunctive; +EXCEPTION: coral|core; +EXCEPTION: dual|due; +EXCEPTION: fetal|fete; +EXCEPTION: final|fine; +EXCEPTION: formal|forme; +EXCEPTION: genal|gene; +EXCEPTION: hemal|heme; +EXCEPTION: internal|interne; +EXCEPTION: moral|more; +EXCEPTION: oral|ore; +EXCEPTION: pial|pie; +EXCEPTION: primal|prime; +EXCEPTION: prodromal|prodrome; +EXCEPTION: special|specie; +EXCEPTION: vasal|vase; +EXCEPTION: morsal|morse; +EXCEPTION: callosal|callose; +EXCEPTION: teal|tee; +EXCEPTION: rimal|rime; +EXCEPTION: sinal|sine; +# +# e.g. protozoal|protozoa, vaginal|vagina, visceral|viscera +RULE: al$|adj|base|a$|noun|base +EXCEPTION: aural|aura; +EXCEPTION: ideal|idea; +EXCEPTION: material|materia; +EXCEPTION: medial|media; +EXCEPTION: oral|ora; +EXCEPTION: prodromal|prodroma; +EXCEPTION: spinal|spina; +EXCEPTION: venal|vena; +EXCEPTION: plasmal|plasma; +EXCEPTION: anorectal|anorecta; +EXCEPTION: antebrachial|antebrachia; +EXCEPTION: antral|antra; +EXCEPTION: astral|astra; +EXCEPTION: atlantal|atlanta; +EXCEPTION: atrial|atria; +EXCEPTION: bigeminal|bigemina; +EXCEPTION: brachial|brachia; +EXCEPTION: bronchial|bronchia; +EXCEPTION: cacuminal|cacumina; +EXCEPTION: calcaneal|calcanea; +EXCEPTION: callosal|callosa; +EXCEPTION: capital|capita; +EXCEPTION: cecal|ceca; +EXCEPTION: central|centra; +EXCEPTION: cerebral|cerebra; +EXCEPTION: corporal|corpora; +EXCEPTION: dorsal|dorsa; +EXCEPTION: ilial|ilia; +EXCEPTION: jugal|juga; +EXCEPTION: marsupial|marsupia; +EXCEPTION: menstrual|menstrua; +EXCEPTION: pectoral|pectora; +EXCEPTION: primordial|primordia; +EXCEPTION: ramal|rama; +EXCEPTION: tergal|terga; +EXCEPTION: vaccinal|vaccina; +EXCEPTION: angelical|angelica; +EXCEPTION: tubal|tuba; +EXCEPTION: vital|vita; +EXCEPTION: zonal|zona; +EXCEPTION: moral|mora; +EXCEPTION: real|rea; +EXCEPTION: mesal|mesa; +EXCEPTION: molal|mola; +EXCEPTION: papal|papa; +EXCEPTION: teal|tea; +# +# e.g. bacterial|bacterium +RULE: al$|adj|base|um$|noun|base +EXCEPTION: oval|ovum; +EXCEPTION: radial|radium; +EXCEPTION: seral|serum; +EXCEPTION: caval|cavum; +EXCEPTION: aural|aurum; +EXCEPTION: central|centrum; +EXCEPTION: jugal|jugum; +EXCEPTION: cordial|cordium; +# +# e.g. governance|govern +RULE: ance$|noun|base|$|verb|base +EXCEPTION: importance|import; +EXCEPTION: penance|pen; +EXCEPTION: balance|BAL; +EXCEPTION: balance|bal; +EXCEPTION: Formance|form; +# +# e.g. defiance|defiant, conformance|conformant +RULE: ance$|noun|base|ant$|adj|base +EXCEPTION: ambulance|ambulant; +# +# e.g. insurance|insure +RULE: ance$|noun|base|e$|verb|base +EXCEPTION: balance|bale; +EXCEPTION: finance|fine; +# +# e.g. relaxant|relax +RULE: ant$|adj|base|$|verb|base +EXCEPTION: errant|err; +EXCEPTION: gallant|gall; +EXCEPTION: important|import; +EXCEPTION: rampant|ramp; +EXCEPTION: trenchant|trench; +EXCEPTION: Protestant|protest; +EXCEPTION: idant|id; +EXCEPTION: ant|; +# +# e.g. triumphant|triumph +RULE: ant$|noun|base|$|verb|base +EXCEPTION: formant|form; +EXCEPTION: gallant|gall; +EXCEPTION: spinant|spin; +EXCEPTION: ant|; +EXCEPTION: Protestant|protest; +EXCEPTION: idant|id; +# +# e.g. radiant|radiate +RULE: ant$|adj|base|ate$|verb|base +EXCEPTION: arrogant|arrogate; +EXCEPTION: pellant|pellate; +# +# e.g. stimulant|stimulate +RULE: ant$|noun|base|ate$|verb|base +EXCEPTION: grant|grate; +EXCEPTION: plant|plate; +EXCEPTION: rant|rate; +EXCEPTION: slant|slate; +# +# e.g. fumigant|fumigation +RULE: ant$|adj|base|ation$|noun|base +EXCEPTION: conversant|conversation; +EXCEPTION: dilatant|dilatation; +EXCEPTION: important|importation; +EXCEPTION: pellant|pellation; +EXCEPTION: vacant|vacation; +EXCEPTION: Protestant|protestation; +# +# e.g. servant|serve +RULE: ant$|noun|base|e$|verb|base +EXCEPTION: fusant|fuse; +EXCEPTION: maturant|mature; +EXCEPTION: mutant|mute; +EXCEPTION: truant|true; +EXCEPTION: savant|save; +# +# e.g. Cambodian|Cambodia +RULE: an$|adj|base|a$|noun|base +EXCEPTION: median|media; +EXCEPTION: span|spa; +EXCEPTION: Dirian|diria; +EXCEPTION: Dominican|dominica; +EXCEPTION: Raman|rama; +EXCEPTION: lean|lea; +EXCEPTION: nonan|nona; +# +# e.g. amphibian|amphibia +RULE: an$|noun|base|a$|noun|base +EXCEPTION: loan|loa; +EXCEPTION: median|media; +EXCEPTION: span|spa; +EXCEPTION: Dirian|diria; +EXCEPTION: Dominican|dominica; +EXCEPTION: Duran|dura; +EXCEPTION: Milian|milia; +EXCEPTION: ban|ba; +EXCEPTION: bran|bra; +EXCEPTION: lean|lea; +EXCEPTION: pean|pea; +EXCEPTION: aran|ara; +EXCEPTION: evan|eva; +EXCEPTION: ratan|rata; +EXCEPTION: nonan|nona; +EXCEPTION: nyhan|nyha; +EXCEPTION: saran|sara; +# +# e.g. inflationary|inflation, cautionary|caution +RULE: ary$|adj|base|$|noun|base +EXCEPTION: binary|bin; +EXCEPTION: literary|liter; +EXCEPTION: rotary|rot; +EXCEPTION: cavitary|cavit; +# +# e.g. insular|insulate, modular|modulate +RULE: ar$|adj|base|ate$|verb|base +EXCEPTION: far|fate; +EXCEPTION: insular|insulate; +# +# e.g. fibular|fibula +RULE: ar$|adj|base|a$|noun|base +EXCEPTION: amygdalar|amygdala; +EXCEPTION: columnar|columna; +EXCEPTION: pilar|pila; +EXCEPTION: velar|vela; +EXCEPTION: sear|sea; +EXCEPTION: lunar|luna; +EXCEPTION: molar|mola; +EXCEPTION: pylar|pyla; +# +# e.g. modular|module +RULE: ar$|adj|base|e$|noun|base +EXCEPTION: alar|ale; +EXCEPTION: hoar|hoe; +EXCEPTION: pilar|pile; +EXCEPTION: solar|sole; +EXCEPTION: volar|vole; +EXCEPTION: lunar|lune; +EXCEPTION: malar|male; +EXCEPTION: talar|tale; +EXCEPTION: lazar|laze; +EXCEPTION: mular|mule; +# +# e.g. alveolar|alveolus +RULE: ar$|adj|base|us$|noun|base +EXCEPTION: popular|populus; +# +# e.g. fantasia|fantastic +RULE: asia$|noun|base|astic$|adj|base +# +# e.g. flirtation|flirt +RULE: ation$|noun|base|$|verb|base +EXCEPTION: designation|design; +EXCEPTION: donation|don; +EXCEPTION: revelation|revel; +EXCEPTION: rotation|rot; +EXCEPTION: situation|situ; +EXCEPTION: oration|or; +EXCEPTION: conation|con; +EXCEPTION: fellation|fell; +# +# e.g. alienation|alienate +RULE: ation$|noun|base|ate$|verb|base +EXCEPTION: ration|rate; +EXCEPTION: station|state; +EXCEPTION: creation|create; +EXCEPTION: aspiration|aspirate; +EXCEPTION: predation|predate; +EXCEPTION: recreation|recreate; +EXCEPTION: vacation|vacate; +# +# e.g. admiration|admire +RULE: ation$|noun|base|e$|verb|base +EXCEPTION: education|educe; +EXCEPTION: machination|machine; +EXCEPTION: probation|probe; +EXCEPTION: salvation|salve; +EXCEPTION: aspiration|aspire; +EXCEPTION: conation|cone; +EXCEPTION: mutation|mute; +# +# e.g. narrative|narrate +RULE: ative$|noun|base|ate$|verb|base +# +# e.g. pigmented|pigment +RULE: ed$|adj|base|$|noun|base +EXCEPTION: abed|ab; +EXCEPTION: armed|arm; +EXCEPTION: bated|bat; +EXCEPTION: collected|collect; +EXCEPTION: fated|fat; +EXCEPTION: lobed|lob; +EXCEPTION: striped|strip; +EXCEPTION: united|unit; +EXCEPTION: wicked|wick; +EXCEPTION: cared|car; +EXCEPTION: rated|rat; +EXCEPTION: ed|; +EXCEPTION: armoured|Armour; +EXCEPTION: spined|spin; +EXCEPTION: tubed|tub; +EXCEPTION: wooded|Wood; +EXCEPTION: naked|nak; +# +# e.g. glued|glue +RULE: ed$|adj|base|e$|noun|base +EXCEPTION: closed|close; +EXCEPTION: jaded|jade; +EXCEPTION: incensed|incense; +EXCEPTION: tired|tire; +EXCEPTION: pied|pie; +# +# e.g. fluency|fluent cogency|cogent +RULE: ency$|noun|base|ent$|adj|base +EXCEPTION: emergency|emergent; +# +# e.g. absorbency|absorbent +RULE: ency$|noun|base|ent$|noun|base +# +# e.g. deferential|deference +RULE: ential$|adj|base|ence$|noun|base +# +# e.g. corpulent|corpulence +RULE: ent$|adj|base|ence$|noun|base +EXCEPTION: influent|influence; +EXCEPTION: eminent|eminence; +EXCEPTION: bent|bence; +# +# e.g. machinery|machine +RULE: ery$|noun|base|e$|noun|base +EXCEPTION: forgery|forge; +EXCEPTION: surgery|surge; +EXCEPTION: misery|mise; +# +# e.g. employer|employ +RULE: er$|noun|base|$|verb|base +EXCEPTION: buffer|buff; +EXCEPTION: bunker|bunk; +EXCEPTION: butter|butt; +EXCEPTION: canter|cant; +EXCEPTION: caper|cap; +EXCEPTION: career|care; +EXCEPTION: charter|chart; +EXCEPTION: corner|corn; +EXCEPTION: dresser|dress; +EXCEPTION: flicker|flick; +EXCEPTION: flower|flow; +EXCEPTION: rider|rid; +EXCEPTION: sewer|sew; +EXCEPTION: shower|show; +EXCEPTION: skewer|skew; +EXCEPTION: sweater|sweat; +EXCEPTION: taper|tap; +EXCEPTION: tender|tend; +EXCEPTION: tower|tow; +EXCEPTION: trimer|trim; +EXCEPTION: wager|wag; +EXCEPTION: whisker|whisk; +EXCEPTION: fiber|fib; +EXCEPTION: venter|vent; +EXCEPTION: wither|with; +EXCEPTION: er|; +EXCEPTION: Bender|bend; +EXCEPTION: Cooper|coop; +EXCEPTION: Crocker|crock; +EXCEPTION: Fisher|fish; +EXCEPTION: Fowler|fowl; +EXCEPTION: Gruber|grub; +EXCEPTION: Horner|horn; +EXCEPTION: Hunter|hunt; +EXCEPTION: Hurler|hurl; +EXCEPTION: Letterer|letter; +EXCEPTION: Lister|list; +EXCEPTION: Palmer|palm; +EXCEPTION: Parker|park; +EXCEPTION: Ringer|ring; +EXCEPTION: Senter|sent; +EXCEPTION: Turner|turn; +EXCEPTION: Walker|walk; +EXCEPTION: Wheeler|wheel; +EXCEPTION: adder|add; +EXCEPTION: banter|bant; +EXCEPTION: dimer|dim; +EXCEPTION: former|form; +EXCEPTION: gruber|grub; +EXCEPTION: hunger|hung; +EXCEPTION: liter|lit; +EXCEPTION: meter|met; +EXCEPTION: number|numb; +EXCEPTION: plumber|plumb; +EXCEPTION: porter|port; +EXCEPTION: rooster|roost; +EXCEPTION: roter|rot; +EXCEPTION: scraper|scrap; +EXCEPTION: snicker|snick; +EXCEPTION: solder|sold; +EXCEPTION: splinter|splint; +EXCEPTION: luster|lust; +EXCEPTION: liner|lin; +EXCEPTION: muller|mull; +# +# e.g. pursuer|pursue +RULE: er$|noun|base|e$|verb|base +EXCEPTION: crater|crate; +EXCEPTION: liver|live; +EXCEPTION: river|rive; +EXCEPTION: singer|singe; +EXCEPTION: baker|bake; +EXCEPTION: piper|pipe; +EXCEPTION: shaker|shake; +EXCEPTION: analyser|analyse; +EXCEPTION: meter|mete; +EXCEPTION: wager|wage; +# +# e.g. hypothetic|hypothesis +RULE: etic$|adj|base|esis$|noun|base +EXCEPTION: genetic|genesis; +# +# e.g. maniac|mania +RULE: iac$|noun|base|ia$|noun|base +EXCEPTION: ammoniac|ammonia; +# +# e.g. colonial|colony +RULE: ial$|adj|base|y$|noun|base +EXCEPTION: partial|party; +# +# e.g. reliance|reliant +RULE: iance$|noun|base|iant$|adj|base +# +# e.g. Algerian|Algeria +RULE: ian$|adj|base|ia$|noun|base +EXCEPTION: median|media; +EXCEPTION: Dirian|diria; +# +# e.g. Syrian|Syria +RULE: ian$|noun|base|ia$|noun|base +EXCEPTION: median|media; +EXCEPTION: Dirian|diria; +EXCEPTION: Milian|milia; +# +# e.g. trombiculiasis|trombicula +RULE: iasis$|noun|base|a$|noun|base +EXCEPTION: auriasis|aura; +EXCEPTION: loiasis|loa; +# +# e.g. critical|critic +RULE: ical$|adj|base|ic$|noun|base +# +# e.g. anatomical|anatomy +RULE: ical$|adj|base|y$|noun|base +# +# e.g. elasticity|elastic +RULE: icity$|noun|base|ic$|adj|base +# +# e.g. alcoholic|alcohol +RULE: ic$|adj|base|$|noun|base +EXCEPTION: actinic|actin; +EXCEPTION: classic|class; +EXCEPTION: conic|con; +EXCEPTION: formic|form; +EXCEPTION: genic|gen; +EXCEPTION: pediatric|pediatr; +EXCEPTION: rustic|rust; +EXCEPTION: sonic|son; +EXCEPTION: tonic|ton; +EXCEPTION: manic|man; +EXCEPTION: genetic|genet; +EXCEPTION: genetic|Genet; +EXCEPTION: limbic|limb; +EXCEPTION: ic|; +EXCEPTION: Gallic|gall; +EXCEPTION: Homeric|homer; +EXCEPTION: Masonic|mason; +EXCEPTION: abasic|abas; +EXCEPTION: adenic|aden; +EXCEPTION: allelic|allel; +EXCEPTION: anisic|anis; +EXCEPTION: arsonic|arson; +EXCEPTION: ataraxic|Atarax; +EXCEPTION: balanic|balan; +EXCEPTION: basic|bas; +EXCEPTION: basilic|basil; +EXCEPTION: biotic|Biot; +EXCEPTION: caloric|calor; +EXCEPTION: choreic|chore; +EXCEPTION: crotonic|Croton; +EXCEPTION: cryptic|crypt; +EXCEPTION: cubic|cub; +EXCEPTION: enzymic|enzym; +EXCEPTION: folic|fol; +EXCEPTION: fundic|fund; +EXCEPTION: funic|fun; +EXCEPTION: gallic|gall; +EXCEPTION: genesic|genes; +EXCEPTION: hemic|hem; +EXCEPTION: ironic|iron; +EXCEPTION: lipasic|lipas; +EXCEPTION: lyssic|lyss; +EXCEPTION: maleic|male; +EXCEPTION: phasic|phas; +EXCEPTION: plumbic|plumb; +EXCEPTION: tabic|tab; +EXCEPTION: tactic|tact; +EXCEPTION: toric|tor; +EXCEPTION: biotic|biot; +EXCEPTION: malic|mal; +EXCEPTION: rabic|rab; +# +# e.g. amnesic|amnesia; and academic|academia +RULE: ia$|noun|base|ic$|adj|base +EXCEPTION: acidophilia|acidophilic; +EXCEPTION: media|medic; +EXCEPTION: olympia|Olympic; +EXCEPTION: Eugenia|eugenic; +# +# e.g. barbaric|barbarism +RULE: ic$|adj|base|ism$|noun|base +EXCEPTION: bimetallic|bimetallism; +EXCEPTION: boric|borism; +EXCEPTION: bromic|bromism; +EXCEPTION: camphoric|camphorism; +EXCEPTION: telluric|tellurism; +EXCEPTION: cubic|Cubism; +# +# e.g. demagogic|demagogy +RULE: ic$|adj|base|y$|noun|base +EXCEPTION: autonomic|autonomy; +EXCEPTION: colonic|colony; +EXCEPTION: ferric|ferry; +EXCEPTION: Germanic|germany; +EXCEPTION: caffeic|caffey; +EXCEPTION: italic|Italy; +EXCEPTION: academic|academy; +EXCEPTION: sympathic|sympathy; +# +# e.g. deficient|deficiency +RULE: ient$|adj|base|iency$|noun|base +# +# e.g. easily|easy +RULE: ily$|adv|base|y$|adj|base +# +# e.g. ambition|ambitious religion|religious +RULE: ion$|noun|base|ious$|adj|base +EXCEPTION: fraction|fractious; +EXCEPTION: pion|pious; +# +# e.g. glorious|glory +RULE: ious$|adj|base|y$|noun|base +EXCEPTION: copious|copy; +# +# e.g. capitalism|capital +RULE: ism$|noun|base|$|noun|base +EXCEPTION: actinism|actin; +EXCEPTION: humanism|human; +EXCEPTION: ism|; +EXCEPTION: Catholicism|catholic; +EXCEPTION: Expressionism|expression; +EXCEPTION: Impressionism|impression; +EXCEPTION: cubism|cub; +# +# e.g. anachronism|anachronistic atavism|atavistic +RULE: ism$|noun|base|istic$|adj|base +EXCEPTION: ballism|ballistic; +# +# e.g. artist|art +RULE: ist$|noun|base|$|noun|base +EXCEPTION: dentist|dent; +EXCEPTION: grist|gr; +EXCEPTION: list|l; +EXCEPTION: methodist|method; +EXCEPTION: humanist|human; +EXCEPTION: mist|m; +# +# e.g. fatalist|fatalism; and atheist|atheism +RULE: ist$|noun|base|ism$|noun|base +EXCEPTION: agonist|agonism; +EXCEPTION: Baptist|baptism; +# +# e.g. abnormality|abnormal +RULE: ity$|noun|base|$|adj|base +EXCEPTION: dimity|dim; +EXCEPTION: dexterity|dexter; +EXCEPTION: hilarity|hilar; +EXCEPTION: majority|major; +EXCEPTION: density|dens; +EXCEPTION: severity|sever; +# +# e.g. adversity|adverse +RULE: ity$|noun|base|e$|adj|base +EXCEPTION: relativity|relative; +EXCEPTION: cooperativity|Cooperative; +EXCEPTION: gentility|Gentile; +EXCEPTION: adversity|adverse; +# +# e.g. frivolity|frivolous +RULE: ity$|noun|base|ous$|adj|base +EXCEPTION: ingenuity|ingenuous; +# +# e.g. explosive|explosion +RULE: ive$|noun|base|ion$|noun|base +EXCEPTION: accusative|accusation; +EXCEPTION: defective|defection; +EXCEPTION: determinative|determination; +EXCEPTION: native|nation; +EXCEPTION: objective|objection; +EXCEPTION: passive|passion; +EXCEPTION: positive|position; +EXCEPTION: subjunctive|subjunction; +EXCEPTION: reflexive|reflexion; +EXCEPTION: conservative|conservation; +EXCEPTION: cooperative|cooperation; +# +# e.g. civilization|civilize +RULE: ization$|noun|base|ize$|verb|base +# +# e.g. actively|active +RULE: ly$|adv|base|$|adj|base +EXCEPTION: hardly|hard; +EXCEPTION: lately|late; +EXCEPTION: early|ear; +EXCEPTION: singly|sing; +EXCEPTION: ly|; +EXCEPTION: awfully|awful; +EXCEPTION: badly|bad; +EXCEPTION: barely|bare; +EXCEPTION: deeply|deep; +EXCEPTION: openly|open; +EXCEPTION: poorly|poor; +EXCEPTION: really|real; +EXCEPTION: sharply|sharp; +EXCEPTION: shortly|short; +EXCEPTION: wildly|wild; +# +# e.g. invariably|invariable +RULE: ly$|adv|base|le$|adj|base +EXCEPTION: supply|supple; +EXCEPTION: apply|apple; +# +# e.g. editor|edit +RULE: or$|noun|base|$|verb|base +EXCEPTION: armor|arm; +EXCEPTION: demeanor|demean; +EXCEPTION: donor|don; +EXCEPTION: door|do; +EXCEPTION: or|; +EXCEPTION: humor|hum; +EXCEPTION: meteor|mete; +EXCEPTION: rigor|rig; +EXCEPTION: tailor|tail; +EXCEPTION: Castor|cast; +EXCEPTION: Tendor|tend; +EXCEPTION: cantor|cant; +EXCEPTION: castor|cast; +EXCEPTION: clamor|clam; +EXCEPTION: furor|fur; +EXCEPTION: rotor|rot; +EXCEPTION: tumor|tum; +EXCEPTION: rubor|rub; +# +# e.g. conductor|conduct +RULE: or$|noun|base|e$|verb|base +EXCEPTION: dolor|dole; +EXCEPTION: equator|equate; +EXCEPTION: honor|hone; +EXCEPTION: livor|live; +EXCEPTION: minor|mine; +EXCEPTION: razor|raze; +EXCEPTION: stridor|stride; +EXCEPTION: primor|prime; +EXCEPTION: timor|time; +EXCEPTION: dicor|dice; +EXCEPTION: donor|done; +EXCEPTION: fetor|fete; +EXCEPTION: pavor|pave; +EXCEPTION: savor|save; +EXCEPTION: tensor|tense; +# +# e.g. diagnosis|diagnose +RULE: osis$|noun|base|ose$|verb|base +# +# e.g. porosity|porous +RULE: osity$|noun|base|ous$|adj|base +# +# e.g. ichthyotic|ichthyosis +RULE: otic$|adj|base|osis$|noun|base +# +# e.g. cancerous|cancer +RULE: ous$|adj|base|$|noun|base +EXCEPTION: bullous|bull; +EXCEPTION: hideous|hide; +EXCEPTION: parous|par; +EXCEPTION: ous|; +EXCEPTION: amorous|amor; +EXCEPTION: androgenous|androgen; +EXCEPTION: caseous|case; +EXCEPTION: circuitous|circuit; +EXCEPTION: genetous|Genet; +EXCEPTION: momentous|moment; +EXCEPTION: spinous|spin; +EXCEPTION: sanous|san; +EXCEPTION: torous|tor; +# +# e.g. analogous|analogy +RULE: ous$|adj|base|y$|noun|base +EXCEPTION: bullous|bully; +EXCEPTION: ferrous|ferry; +# +# e.g. diffusion|diffuse +RULE: sion$|noun|base|se$|verb|base +EXCEPTION: basion|base; +EXCEPTION: conversion|converse; +# +# e.g. diversion|divert +RULE: sion$|noun|base|t$|verb|base +EXCEPTION: basion|bat; +EXCEPTION: lesion|let; +EXCEPTION: mission|mist; +EXCEPTION: scansion|scant; +EXCEPTION: version|vert; +# +# e.g. obsession|obsess +RULE: ssion$|noun|base|ss$|verb|base +EXCEPTION: compassion|compass; +EXCEPTION: mission|miss; +EXCEPTION: passion|pass; +# +# e.g. aseptic|asepsis +RULE: tic$|adj|base|sis$|noun|base +EXCEPTION: genetic|genesis; +# +# e.g. tabular|table +RULE: ular$|adj|base|le$|noun|base +EXCEPTION: articular|article; +EXCEPTION: particular|particle; +# +# e.g. nebular|nebula +RULE: ular$|adj|base|ula$|noun|base +# +# e.g. bushy|bush +RULE: y$|adj|base|$|noun|base +EXCEPTION: army|arm; +EXCEPTION: bandy|band; +EXCEPTION: bully|bull; +EXCEPTION: busy|bus; +EXCEPTION: dilatory|dilator; +EXCEPTION: dumpy|dump; +EXCEPTION: handy|hand; +EXCEPTION: husky|husk; +EXCEPTION: jaunty|jaunt; +EXCEPTION: lacy|lac; +EXCEPTION: naughty|naught; +EXCEPTION: pasty|past; +EXCEPTION: perky|perk; +EXCEPTION: ready|read; +EXCEPTION: ruby|rub; +EXCEPTION: scary|scar; +EXCEPTION: seedy|seed; +EXCEPTION: shiny|shin; +EXCEPTION: silly|sill; +EXCEPTION: steady|stead; +EXCEPTION: sticky|stick; +EXCEPTION: slippery|slipper; +EXCEPTION: stingy|sting; +EXCEPTION: stuffy|stuff; +EXCEPTION: testy|test; +EXCEPTION: tiny|tin; +EXCEPTION: wary|war; +EXCEPTION: weary|wear; +EXCEPTION: early|earl; +EXCEPTION: hearty|heart; +EXCEPTION: study|stud; +EXCEPTION: mousey|mouse; +EXCEPTION: y|; +EXCEPTION: astray|astra; +EXCEPTION: bally|ball; +EXCEPTION: butoxy|Butox; +EXCEPTION: downy|Down; +EXCEPTION: earthy|Earth; +EXCEPTION: hilly|Hill; +EXCEPTION: lazy|Laz; +EXCEPTION: malay|mala; +EXCEPTION: okay|Oka; +EXCEPTION: okay|oka; +EXCEPTION: potty|Pott; +EXCEPTION: potty|pott; +EXCEPTION: silvery|Silver; +EXCEPTION: thorny|Thorn; +EXCEPTION: bay|ba; +EXCEPTION: flaky|flak; +EXCEPTION: shady|shad; +EXCEPTION: lazy|laz; +EXCEPTION: massy|mass; +EXCEPTION: spiny|spin; +# +# e.g. bristly|bristle +RULE: y$|adj|base|e$|noun|base +EXCEPTION: fey|fee; +EXCEPTION: heavy|heave; +EXCEPTION: holy|hole; +EXCEPTION: nosy|nose; +EXCEPTION: poky|poke; +EXCEPTION: racy|race; +EXCEPTION: tidy|tide; +EXCEPTION: wary|ware; +EXCEPTION: mousy|mouse; +EXCEPTION: steely|steele; +EXCEPTION: airy|aire; +EXCEPTION: foy|foe; +EXCEPTION: ley|lee; +EXCEPTION: may|mae; +EXCEPTION: roy|roe; +EXCEPTION: tiny|tine; +# +# e.g. abundancy|abundance +RULE: y$|noun|base|e$|noun|base +EXCEPTION: copy|cope; +EXCEPTION: gravy|grave; +EXCEPTION: heavy|heave; +EXCEPTION: policy|police; +EXCEPTION: poly|pole; +EXCEPTION: story|store; +EXCEPTION: tidy|tide; +EXCEPTION: toy|toe; +EXCEPTION: mousy|mouse; +EXCEPTION: steely|steele; +EXCEPTION: foy|foe; +EXCEPTION: ley|lee; +EXCEPTION: may|mae; +EXCEPTION: roy|roe; +# +# some neo-classical terminal rules added: 24-Sep-92 +# +#e.g. thermograph|thermogram +RULE: graph$|noun|base|gram$|noun|base +EXCEPTION: diagraph|diagram; +EXCEPTION: graph|gram; +EXCEPTION: hysterograph|hysterogram; +EXCEPTION: micrograph|microgram; +# +#e.g. thermogram|thermography +RULE: gram$|noun|base|graphy$|noun|base +EXCEPTION: histogram|histography; +EXCEPTION: microgram|micrography; +EXCEPTION: osteogram|osteography; +# +#e.g. photograph|photography +RULE: graph$|noun|base|graphy$|noun|base +EXCEPTION: urethrograph|urethrography; +# +#e.g. thermogram|thermographic +RULE: gram$|noun|base|graphic$|adj|base +EXCEPTION: gram|graphic; +# +#e.g. +RULE: meter$|noun|base|metry$|noun|base +EXCEPTION: auxometer|auxometry; +EXCEPTION: biometer|biometry; +EXCEPTION: oscillometer|oscillometry; +EXCEPTION: pathometer|pathometry; +EXCEPTION: skiameter|skiametry; +EXCEPTION: urometer|urometry; +# +#e.g. +RULE: physis$|noun|base|physeal$|adj|base +# +#NOMINALIZATION Rules: +# +# This is an extreemly productive nominalization formation rule! +# reflected in the "nominalization=" and "nominalization_of=" slots +# in the lexicon. Aronoff (1976) discusses the productivity of "ness" +# versus "ity". +# e.g. aptness|apt +RULE: ness$|noun|base|$|adj|base +EXCEPTION: chilliness|chilli; +EXCEPTION: likeness|like; +EXCEPTION: witness|wit; +EXCEPTION: wilderness|wilder; +# +# e.g. retire|retirement +RULE: ment$|noun|base|$|verb|base +EXCEPTION: Enlightenment|enlighten; +EXCEPTION: Government|govern; +EXCEPTION: abutment|abut; +EXCEPTION: ailment|ail; +EXCEPTION: apartment|apart; +EXCEPTION: basement|base; +EXCEPTION: catchment|catch; +EXCEPTION: department|depart; +EXCEPTION: deportment|deport; +EXCEPTION: environment|environ; +EXCEPTION: filament|fila; +EXCEPTION: foment|fo; +EXCEPTION: lament|la; +EXCEPTION: moment|mo; +EXCEPTION: parchment|parch; +EXCEPTION: pavement|pave; +EXCEPTION: pigment|pig; +EXCEPTION: supplement|supple; +EXCEPTION: temperament|tempera; +EXCEPTION: wonderment|wonder; +# +#e.g. +RULE: y$|verb|base|ication$|noun|base +EXCEPTION: comply|complication; +EXCEPTION: fry|frication; +EXCEPTION: reply|replication; +EXCEPTION: rusty|rustication; +EXCEPTION: supply|supplication; +# +# e.g. provide|provision +RULE: de$|verb|base|sion$|noun|base +EXCEPTION: bade|basion; Added: ctakes/trunk/ctakes-web-rest/src/main/resources/org/apache/ctakes/lvg/data/rules/exceptionD.data URL: http://svn.apache.org/viewvc/ctakes/trunk/ctakes-web-rest/src/main/resources/org/apache/ctakes/lvg/data/rules/exceptionD.data?rev=1845321&view=auto ============================================================================== Binary file - no diff available. Propchange: ctakes/trunk/ctakes-web-rest/src/main/resources/org/apache/ctakes/lvg/data/rules/exceptionD.data ------------------------------------------------------------------------------ svn:mime-type = application/octet-stream Added: ctakes/trunk/ctakes-web-rest/src/main/resources/org/apache/ctakes/lvg/data/rules/exceptionI.data URL: http://svn.apache.org/viewvc/ctakes/trunk/ctakes-web-rest/src/main/resources/org/apache/ctakes/lvg/data/rules/exceptionI.data?rev=1845321&view=auto ============================================================================== Binary file - no diff available. Propchange: ctakes/trunk/ctakes-web-rest/src/main/resources/org/apache/ctakes/lvg/data/rules/exceptionI.data ------------------------------------------------------------------------------ svn:mime-type = application/octet-stream Added: ctakes/trunk/ctakes-web-rest/src/main/resources/org/apache/ctakes/lvg/data/rules/im.rul URL: http://svn.apache.org/viewvc/ctakes/trunk/ctakes-web-rest/src/main/resources/org/apache/ctakes/lvg/data/rules/im.rul?rev=1845321&view=auto ============================================================================== --- ctakes/trunk/ctakes-web-rest/src/main/resources/org/apache/ctakes/lvg/data/rules/im.rul (added) +++ ctakes/trunk/ctakes-web-rest/src/main/resources/org/apache/ctakes/lvg/data/rules/im.rul Wed Oct 31 11:03:43 2018 @@ -0,0 +1,151 @@ +# All comments start with a '#' character in col 1. +# Empty lines and lines with only whitespace are ignored. +# +# @(#)im.rul 1.1 24 Feb 1995 +# +# This file describes the rules used to generate inflections. +# +# The format is as follows: +# +# ||||| +# +# where: +# +# is an input pattern to match as a suffix. +# Lowercase characters are matched as literals. Uppercase letters +# have the following meaning: D=digit [0-9], L=any letter, ^=start +# $=end, [AEIOU]=any vowel, [BCFGHJKMNPQRSTVWXYZ]=any consonant. +# The same uppercase letter used in the output pattern unifies with +# the actual character that matched the input. +# is the input category and must be one of: adj, adv, noun or verb. +# is the input inflection and must be one of: base, singular, +# positive, infinitive, plural, comparative, superlative, present, +# ing or past. +# refers to output category. +# refers to output inflection. +# +# The rules are stated to mean that if the input matches the pattern and is +# of the appropriate category and inflection, then the stated rule generates +# the output inflection. Note that rules only need be stated in one direction. +# The reverse rules are automatically generated. Duplicate identical rules +# are unique'ed out. +# +# Identity rules +RULE: $|verb|base|$|verb|base +RULE: $|noun|base|$|noun|base +RULE: $|adj|base|$|adj|base +RULE: $|adv|base|$|adv|base +# +# Adjective rules +RULE: Cy$|adj|positive|Cier$|adj|comparative +RULE: Cy$|adj|positive|Ciest$|adj|superlative +# +RULE: CVC$|adj|positive|CVSCer$|adj|comparative +RULE: CVC$|adj|positive|CVSCest$|adj|superlative +# +RULE: e$|adj|positive|er$|adj|comparative +EXCEPTION: inhale|inhaler; +RULE: e$|adj|positive|est$|adj|superlative +EXCEPTION: e|est; +# +RULE: $|adj|positive|er$|adj|comparative +EXCEPTION: |er; +EXCEPTION: inhal|inhaler; +# +RULE: $|adj|positive|est$|adj|superlative +EXCEPTION: |est; +# +# Adverb rules +RULE: Cy$|adv|positive|Cier$|adv|comparative +RULE: Cy$|adv|positive|Ciest$|adv|superlative +# +RULE: CVC$|adv|positive|CVSCer$|adv|comparative +RULE: CVC$|adv|positive|CVSCest$|adv|superlative +# +RULE: e$|adv|positive|er$|adv|comparative +EXCEPTION: inhale|inhaler; +# +RULE: e$|adv|positive|est$|adv|superlative +EXCEPTION: e|est; +# +RULE: $|adv|positive|er$|adv|comparative +EXCEPTION: |er; +EXCEPTION: inhal|inhaler; +# +RULE: $|adv|positive|est$|adv|superlative +EXCEPTION: |est; +# +# Noun rules (reg) +RULE: ^L$|noun|singular|^L's$|noun|plural +RULE: D$|noun|singular|D's$|noun|plural +RULE: Cy$|noun|singular|Cies$|noun|plural +RULE: Co$|noun|singular|Cos$|noun|plural +RULE: Co$|noun|singular|Coes$|noun|plural +RULE: ch$|noun|singular|ches$|noun|plural +RULE: sh$|noun|singular|shes$|noun|plural +RULE: s$|noun|singular|ses$|noun|plural +RULE: x$|noun|singular|xes$|noun|plural +RULE: z$|noun|singular|zes$|noun|plural +RULE: $|noun|singular|s$|noun|plural +EXCEPTION: |s; +# +# Noun rules (glreg) +RULE: us$|noun|singular|i$|noun|plural +EXCEPTION: antus|anti; +# +RULE: ma$|noun|singular|mata$|noun|plural +RULE: a$|noun|singular|ae$|noun|plural +RULE: um$|noun|singular|a$|noun|plural +RULE: on$|noun|singular|a$|noun|plural +RULE: sis$|noun|singular|ses$|noun|plural +RULE: is$|noun|singular|ides$|noun|plural +RULE: men$|noun|singular|mina$|noun|plural +RULE: ex$|noun|singular|ices$|noun|plural +RULE: x$|noun|singular|ces$|noun|plural +# +### plural rules include irregular +# noun inflection rules. (ACB) (1-Oct-92) +FILE: plural.rul +# +# Verb rules (present) +RULE: Cy$|verb|infinitive|Cies$|verb|pres +RULE: s$|verb|infinitive|ses$|verb|pres +RULE: x$|verb|infinitive|xes$|verb|pres +RULE: z$|verb|infinitive|zes$|verb|pres +RULE: ch$|verb|infinitive|ches$|verb|pres +RULE: sh$|verb|infinitive|shes$|verb|pres +RULE: Co$|verb|infinitive|Cos$|verb|pres +EXCEPTION: do|dos; +RULE: Co$|verb|infinitive|Coes$|verb|pres +RULE: CVC$|verb|infinitive|CVCs$|verb|pres +RULE: $|verb|infinitive|s$|verb|pres +EXCEPTION: |s; +EXCEPTION: do|dos; +# +# Verb rules (past) +RULE: Cy$|verb|infinitive|Cied$|verb|past +RULE: CVC$|verb|infinitive|CVSCed$|verb|past +RULE: e$|verb|infinitive|ed$|verb|past +EXCEPTION: e|ed; +EXCEPTION: fre|fred; +EXCEPTION: free|freed; +# +RULE: $|verb|infinitive|ed$|verb|past +EXCEPTION: |ed; +EXCEPTION: do|doed; +EXCEPTION: fr|fred; +EXCEPTION: free|freed; +# +# Verb rules (ing) +RULE: ie$|verb|infinitive|ying$|verb|presPart +RULE: ee$|verb|infinitive|eeing$|verb|presPart +RULE: oe$|verb|infinitive|oeing$|verb|presPart +RULE: ye$|verb|infinitive|yeing$|verb|presPart +RULE: CVC$|verb|infinitive|CVSCing$|verb|presPart +RULE: e$|verb|infinitive|ing$|verb|presPart +EXCEPTION: e|ing; +RULE: $|verb|infinitive|ing$|verb|presPart +EXCEPTION: |ing; +# +### Irregular VERB inflection rules (2-Oct-92) +FILE: verbinfl.rul Added: ctakes/trunk/ctakes-web-rest/src/main/resources/org/apache/ctakes/lvg/data/rules/plural.rul URL: http://svn.apache.org/viewvc/ctakes/trunk/ctakes-web-rest/src/main/resources/org/apache/ctakes/lvg/data/rules/plural.rul?rev=1845321&view=auto ============================================================================== --- ctakes/trunk/ctakes-web-rest/src/main/resources/org/apache/ctakes/lvg/data/rules/plural.rul (added) +++ ctakes/trunk/ctakes-web-rest/src/main/resources/org/apache/ctakes/lvg/data/rules/plural.rul Wed Oct 31 11:03:43 2018 @@ -0,0 +1,36 @@ +# Inflection Rules For Irregular Nouns: +# +# OF MICE AND MEN (and TEETH and WIVES): (1-Oct-92) +# +RULE: man$|noun|singular|men$|noun|plural +# +# It is not clear from the dictionaries what the plurals of the non-rodential +# terms: "peritoneal mouse", "pleural mouse" and "joint mouse" is. I assume +# that analogy is strong enough to support "mice" plurals for these. +# +RULE: mouse$|noun|singular|mice$|noun|plural +# +# tooth teeth +# +RULE: tooth$|noun|singular|teeth$|noun|plural +# +# f(e) -> ves (1-Oct-92) (ACB) +# +RULE: fe$|noun|singular|ves$|noun|plural +EXCEPTION: rife|rives; +EXCEPTION: safe|saves; +EXCEPTION: strife|strives; +# +# "belief" and "believes" are of course morphologically related by an +# "f|noun|ve|verb" derivational rule too unproductive to make it into +# dm.rul. +# +RULE: f$|noun|singular|ves$|noun|plural +EXCEPTION: belief|believes; +EXCEPTION: dif|dives; +EXCEPTION: disbelief|disbelieves; +EXCEPTION: grief|grieves; +EXCEPTION: motif|motives; +EXCEPTION: pelf|pelves; +EXCEPTION: relief|relieves; +EXCEPTION: serf|serves; Added: ctakes/trunk/ctakes-web-rest/src/main/resources/org/apache/ctakes/lvg/data/rules/ruleD.data URL: http://svn.apache.org/viewvc/ctakes/trunk/ctakes-web-rest/src/main/resources/org/apache/ctakes/lvg/data/rules/ruleD.data?rev=1845321&view=auto ============================================================================== Binary file - no diff available. Propchange: ctakes/trunk/ctakes-web-rest/src/main/resources/org/apache/ctakes/lvg/data/rules/ruleD.data ------------------------------------------------------------------------------ svn:mime-type = application/octet-stream Added: ctakes/trunk/ctakes-web-rest/src/main/resources/org/apache/ctakes/lvg/data/rules/ruleI.data URL: http://svn.apache.org/viewvc/ctakes/trunk/ctakes-web-rest/src/main/resources/org/apache/ctakes/lvg/data/rules/ruleI.data?rev=1845321&view=auto ============================================================================== Binary file - no diff available. Propchange: ctakes/trunk/ctakes-web-rest/src/main/resources/org/apache/ctakes/lvg/data/rules/ruleI.data ------------------------------------------------------------------------------ svn:mime-type = application/octet-stream Added: ctakes/trunk/ctakes-web-rest/src/main/resources/org/apache/ctakes/lvg/data/rules/trieD.data URL: http://svn.apache.org/viewvc/ctakes/trunk/ctakes-web-rest/src/main/resources/org/apache/ctakes/lvg/data/rules/trieD.data?rev=1845321&view=auto ============================================================================== Binary file - no diff available. Propchange: ctakes/trunk/ctakes-web-rest/src/main/resources/org/apache/ctakes/lvg/data/rules/trieD.data ------------------------------------------------------------------------------ svn:mime-type = application/octet-stream Added: ctakes/trunk/ctakes-web-rest/src/main/resources/org/apache/ctakes/lvg/data/rules/trieI.data URL: http://svn.apache.org/viewvc/ctakes/trunk/ctakes-web-rest/src/main/resources/org/apache/ctakes/lvg/data/rules/trieI.data?rev=1845321&view=auto ============================================================================== Binary file - no diff available. Propchange: ctakes/trunk/ctakes-web-rest/src/main/resources/org/apache/ctakes/lvg/data/rules/trieI.data ------------------------------------------------------------------------------ svn:mime-type = application/octet-stream Added: ctakes/trunk/ctakes-web-rest/src/main/resources/org/apache/ctakes/lvg/data/rules/verbinfl.rul URL: http://svn.apache.org/viewvc/ctakes/trunk/ctakes-web-rest/src/main/resources/org/apache/ctakes/lvg/data/rules/verbinfl.rul?rev=1845321&view=auto ============================================================================== --- ctakes/trunk/ctakes-web-rest/src/main/resources/org/apache/ctakes/lvg/data/rules/verbinfl.rul (added) +++ ctakes/trunk/ctakes-web-rest/src/main/resources/org/apache/ctakes/lvg/data/rules/verbinfl.rul Wed Oct 31 11:03:43 2018 @@ -0,0 +1,79 @@ +# VERB INFLECTION RULES: +# +# 0 => k /c _{ing,ed} +# +# These rules would likely be considered part +# of regular English inflection were they not +# of such limited scope. Only 13 verbs in English end in "c" +# of those 12 are covered by these rules. +# "floc" as a verb appears in Webster's Medical Dictinary. +# with the paradigm: flocs flocced flocced floccing (regd) +# "floc" is apparently a back formation from "floculate". Which +# may account for it's deviant regularity. (2-Oct-92) +# +RULE: c$|verb|infinitive|cking$|verb|presPart +EXCEPTION: floc|flocking; +# +RULE: c$|verb|infinitive|cked$|verb|past +EXCEPTION: floc|flocked; +# +RULE: c$|verb|infinitive|cked$|verb|pastPart +EXCEPTION: floc|flocked; +# +# The past part ing/ung rule applies to more verbs than +# the past ing/ung rule. "ring" has "rang" in the past, +# and "wring" has "wrang", sing according to Websters has +# either "sing" or "sung" in the past. +# in the realm of the exotic coolung is a crazy crane, and gunnung is +# a type of swamp mahogony. (and a pung is a kind of sleigh) +# +RULE: ing$|verb|infinitive|ung$|verb|pastPart +EXCEPTION: cooling|coolung; +EXCEPTION: ding|dung; +EXCEPTION: gunning|gunnung; +EXCEPTION: ing|ung; +EXCEPTION: ling|lung; +EXCEPTION: ping|pung; +# +RULE: ing$|verb|infinitive|ung$|verb|past +EXCEPTION: cooling|coolung; +EXCEPTION: ding|dung; +EXCEPTION: gunning|gunnung; +EXCEPTION: ing|ung; +EXCEPTION: ling|lung; +EXCEPTION: ping|pung; +EXCEPTION: ring|rung; +EXCEPTION: wring|wrung; +# +# e.g. feed|fed speed|sped bleed|bled ..... +# +RULE: eed$|verb|infinitive|ed$|verb|past +EXCEPTION: eed|ed; +EXCEPTION: freed|fred; +EXCEPTION: reed|red; +EXCEPTION: weed|wed; +# +RULE: eed$|verb|infinitive|ed$|verb|pastPart +EXCEPTION: eed|ed; +EXCEPTION: freed|fred; +EXCEPTION: reed|red; +EXCEPTION: weed|wed; +# +# ay~aid for both past and past part +#e.g pay|paid +# +RULE: ay$|verb|infinitive|aid$|verb|past +EXCEPTION: bray|braid; +EXCEPTION: May|maid; +EXCEPTION: may|maid; +EXCEPTION: play|plaid; +EXCEPTION: ray|raid; +EXCEPTION: stay|staid; +# +RULE: ay$|verb|infinitive|aid$|verb|pastPart +EXCEPTION: bray|braid; +EXCEPTION: May|maid; +EXCEPTION: may|maid; +EXCEPTION: play|plaid; +EXCEPTION: ray|raid; +EXCEPTION: stay|staid; Added: ctakes/trunk/ctakes-web-rest/src/main/resources/org/apache/ctakes/necontexts/desc/NegationAnnotator.xml URL: http://svn.apache.org/viewvc/ctakes/trunk/ctakes-web-rest/src/main/resources/org/apache/ctakes/necontexts/desc/NegationAnnotator.xml?rev=1845321&view=auto ============================================================================== --- ctakes/trunk/ctakes-web-rest/src/main/resources/org/apache/ctakes/necontexts/desc/NegationAnnotator.xml (added) +++ ctakes/trunk/ctakes-web-rest/src/main/resources/org/apache/ctakes/necontexts/desc/NegationAnnotator.xml Wed Oct 31 11:03:43 2018 @@ -0,0 +1,160 @@ + + + +org.apache.uima.java +true +org.apache.ctakes.necontexts.ContextAnnotator + +NegationAnnotator + + +MaxLeftScopeSize + +Integer +false +true + + +MaxRightScopeSize + +Integer +false +true + + +ScopeOrder + +String +true +true + + +ContextAnalyzerClass +String +false +true + + +ContextHitConsumerClass + +String +false +true + + +WindowAnnotationClass +String +false +true + + +FocusAnnotationClass +String +false +true + + +ContextAnnotationClass +String +false +true + + + + +MaxLeftScopeSize + +7 + + + +MaxRightScopeSize + +7 + + + +ScopeOrder + + +LEFT +RIGHT + + + + +ContextHitConsumerClass + +org.apache.ctakes.necontexts.negation.NegationContextHitConsumer + + + +WindowAnnotationClass + +org.apache.ctakes.typesystem.type.textspan.Sentence + + + +FocusAnnotationClass + +org.apache.ctakes.typesystem.type.textsem.IdentifiedAnnotation + + + +ContextAnnotationClass + +org.apache.ctakes.typesystem.type.syntax.BaseToken + + + +ContextAnalyzerClass + +org.apache.ctakes.necontexts.negation.NegationContextAnalyzer + + + + + + + + + + + + +org.apache.ctakes.typesystem.type.textsem.IdentifiedAnnotation +org.apache.ctakes.typesystem.type.syntax.BaseToken + + +org.apache.ctakes.necontexts.type.ContextAnnotation + + + + + +true +true +false + + + + Added: ctakes/trunk/ctakes-web-rest/src/main/resources/org/apache/ctakes/padtermspotter/lookup/radiology/ExamTitleWords.txt URL: http://svn.apache.org/viewvc/ctakes/trunk/ctakes-web-rest/src/main/resources/org/apache/ctakes/padtermspotter/lookup/radiology/ExamTitleWords.txt?rev=1845321&view=auto ============================================================================== --- ctakes/trunk/ctakes-web-rest/src/main/resources/org/apache/ctakes/padtermspotter/lookup/radiology/ExamTitleWords.txt (added) +++ ctakes/trunk/ctakes-web-rest/src/main/resources/org/apache/ctakes/padtermspotter/lookup/radiology/ExamTitleWords.txt Wed Oct 31 11:03:43 2018 @@ -0,0 +1,79 @@ +US,US_EXAM,5,30 +ultrasound,US_EXAM,0,30 +Lower Ext,LOWER_EXT,5,30 +Lower Ext,LOWER_EXT,15, +LtoR,LOWER_EXT,5, +lower extremity,LOWER_EXT,5,60 +lower extremities,LOWER_EXT,5,60 +LOWER EXT,LOWER_EXT,5, +Lower Extrem,LOWER_EXT,5, +LOWER EXTREM,LOWER_EXT,5, +Ext Lwr,LOWER_EXT,5, +Lower Extrem Art-Graft Cmpl,LOWER_EXT,0, +leg pain,LOWER_EXT,5, +knee down,LOWER_EXT,5, +knees,LOWER_EXT,5,30 +L MRA LOWER,US_LOWER_SOLO,5, +LEFT MRA LOWER,US_LOWER_SOLO,5, +R MRA LOWER,US_LOWER_SOLO,5, +RIGHT MRA LOWER,US_LOWER_SOLO,5, +R US,US_LOWER_SOLO,5, +RT LOWER EXTREMITY,US_LOWER_SOLO,0, +RT LOWER EXTREM,US_LOWER_SOLO,0, +RT LW,US_LOWER_SOLO,0, +L US,US_LOWER_SOLO,0, +LT LOWER EXTREMITY,US_LOWER_SOLO,0, +LT LOWER EXTREM,US_LOWER_SOLO,0, +LT LW,US_LOWER_SOLO,0, +RIGHT US,US_LOWER_SOLO,0, +RIGHT LOWER EXTREMITY,US_LOWER_SOLO,0, +RIGHT LOWER EXTREM,US_LOWER_SOLO,0, +RIGHT LW,US_LOWER_SOLO,0, +LEFT US,US_LOWER_SOLO,0, +LEFT LOWER EXTREMITY,US_LOWER_SOLO,0, +LEFT LOWER EXTREM,US_LOWER_SOLO,0, +LEFT LW,US_LOWER_SOLO,0, +left tibia,US_LOWER_SOLO,5,30 +right tibia,US_LOWER_SOLO,5,30 +US Lower Extrem,US_LOWER,0, +RT LOWER EXTREMITY,US_LOWER_SOLO,5, +RT LOWER EXTREM,US_LOWER_SOLO,5, +RT LW,US_LOWER_SOLO,5, +L US,US_LOWER_SOLO,5, +LT LOWER EXTREMITY,US_LOWER_SOLO,5, +LT LOWER EXTREM,US_LOWER_SOLO,5, +LT LW,US_LOWER_SOLO,5 +RIGHT US,US_LOWER_SOLO,5, +RIGHT LOWER EXTREMITY,US_LOWER_SOLO,5, +RIGHT LOWER EXTREM,US_LOWER_SOLO,5, +RIGHT LW,US_LOWER_SOLO,5, +LEFT US,US_LOWER_SOLO,5, +LEFT LOWER EXTREMITY,US_LOWER_SOLO,5, +LEFT LOWER EXTREM,US_LOWER_SOLO,5, +LEFT LW,US_LOWER_SOLO,5, +Pelvis,US_LOWER_SOLO,5,30 +EXAM: L ,US_LOWER_SOLO,0, +EXAM: R ,US_LOWER_SOLO,0, +EXAM: Left,US_LOWER_SOLO,0, +EXAM: Right,US_LOWER_SOLO,0, +EXAM: LEFT ,US_LOWER_SOLO,0, +EXAM: RIGHT ,US_LOWER_SOLO,0, +EXAM: LT,US_LOWER_SOLO,0, +EXAM: RT,US_LOWER_SOLO,0, +EXAM: V&IRAD,V_RAD,0, +EXAM: r tibia,US_LOWER_SOLO,0, +EXAM: l tibia,US_LOWER_SOLO,0, +EXAM: right tibia,US_LOWER_SOLO,0, +EXAM: left tibia,US_LOWER_SOLO,0, +hip,US_LOWER,0, +l hip,US_LOWER_SOLO,0, +r hip,US_LOWER_SOLO,0, +left hip,US_LOWER_SOLO,0, +right hip,US_LOWER_SOLO,0, +ankle: R ,US_LOWER_SOLO,0, +ankle: L ,US_LOWER_SOLO,0, +ankle: Right ,US_LOWER_SOLO,0, +ankle: Left ,US_LOWER_SOLO,0, +CT,CT_EXAM,5, +CTA,CT_EXAM,5, +V&IRAD,V_IRAD,5, \ No newline at end of file