transliteration
Solr, Special Chars, and Latin to Cyrillic char conversion
I am trying to setup a search engine using Solr (or Lucene) which could have text in both Latin with special chars, (special chars would include Ö or Ç as an example) or Cyrilic chars (examples incl[详细]
2023-04-10 16:10 分类:问答Emacs codepage problems: Terminus font, utf-8 and cyrillic-translit input
I love the cyrillic-translit input method for Emacs. However, after I set the wonderful Terminus as my default font, the Russian characters appear in Arial or something (in any case it\'s not Terminus[详细]
2023-04-10 04:07 分类:问答Efficient data structure/algorithm for transliteration based word lookup
I\'m looking for a efficient data structure/algorithm for storing and searching transliteration based word lookup (like google do: http://www.google.com/transliterate/ but I\'m not trying to use googl[详细]
2023-04-06 22:41 分类:问答Cyrillic transliteration in PHP
How to transliterate cyrillic characters into latin letters? E.g. Главная страница -> 开发者_JAVA技巧Glavnaja stranica[详细]
2023-04-05 21:20 分类:问答Transliteration on Unicode LATIN LETTERS "WITH STROKE"
Feeding the rule \"NFD; [:Nonspacing Mark:] Remove; NFC\" into the ICU Transliterator demo, the character Ø (\\u00d8 == LATIN CAPITAL LETTER O WITH STROKE) remains as-is (i.e. the STROKE is not strip[详细]
2023-03-23 19:49 分类:问答replace special characters by its phoenetic similar character (in php - utf8)
you know that there are many characters like è or é. There are many more, like ö,ä,ì,á,ù,... i want to replace those characters with its \"phoenetic partner\"-chara开发者_如何转开发cter, but i[详细]
2023-03-02 21:01 分类:问答icu4j cyrillic to latin
I\'m trying to get Cyrillic words to be in latin so I can have them in urls. I use icu4j transliterator, but it still gives weird characters like this: Vilʹândimaa. It should be more like viljandima[详细]
2023-03-01 20:50 分类:问答What's the limit of google transliteration?
I\'ve use开发者_运维技巧d google transliteration API experimentally. It\'s working fine and I\'ve noticed that it allows only five words at a time. Is there any method to send more words? and is there[详细]
2023-02-12 18:14 分类:问答How to use Google transliteration API in my java web application?
How to use Google Transliteration API in my Java application. If i give a String (either in English or Arabic) as input, the Google Transliterator API then it should translate it into the correspond[详细]
2023-01-26 22:30 分类:问答Python and character normalization
Hello I retrieve text based utf8 data from a foreign source which contains special chars such as u\"ıöüç\" while I want to normalize the开发者_Python百科m to English such as \"ıöüç\" -> \"iouc[详细]
2023-01-24 12:36 分类:问答