• <tfoot id='ggZud'></tfoot>
      <bdo id='ggZud'></bdo><ul id='ggZud'></ul>
  • <i id='ggZud'><tr id='ggZud'><dt id='ggZud'><q id='ggZud'><span id='ggZud'><b id='ggZud'><form id='ggZud'><ins id='ggZud'></ins><ul id='ggZud'></ul><sub id='ggZud'></sub></form><legend id='ggZud'></legend><bdo id='ggZud'><pre id='ggZud'><center id='ggZud'></center></pre></bdo></b><th id='ggZud'></th></span></q></dt></tr></i><div id='ggZud'><tfoot id='ggZud'></tfoot><dl id='ggZud'><fieldset id='ggZud'></fieldset></dl></div>

    <small id='ggZud'></small><noframes id='ggZud'>

        <legend id='ggZud'><style id='ggZud'><dir id='ggZud'><q id='ggZud'></q></dir></style></legend>

        Java:如何检查字符是否属于特定的 unicode 块?

        时间:2023-07-27

        <legend id='xFygR'><style id='xFygR'><dir id='xFygR'><q id='xFygR'></q></dir></style></legend>

            <tfoot id='xFygR'></tfoot>

            <small id='xFygR'></small><noframes id='xFygR'>

                <bdo id='xFygR'></bdo><ul id='xFygR'></ul>

                <i id='xFygR'><tr id='xFygR'><dt id='xFygR'><q id='xFygR'><span id='xFygR'><b id='xFygR'><form id='xFygR'><ins id='xFygR'></ins><ul id='xFygR'></ul><sub id='xFygR'></sub></form><legend id='xFygR'></legend><bdo id='xFygR'><pre id='xFygR'><center id='xFygR'></center></pre></bdo></b><th id='xFygR'></th></span></q></dt></tr></i><div id='xFygR'><tfoot id='xFygR'></tfoot><dl id='xFygR'><fieldset id='xFygR'></fieldset></dl></div>
                  <tbody id='xFygR'></tbody>
                  本文介绍了Java:如何检查字符是否属于特定的 unicode 块?的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

                  问题描述

                  我需要确定我的输入属于哪种自然语言.目标是区分混合输入中的 阿拉伯语英语 单词,其中输入是 Unicode 并从 XML 文本节点中提取.我注意到类 Character.UnicodeBlock.和我的问题有关吗?我怎样才能让它工作?

                  I need to identify what natural language my input belongs to. The goal is to distinguish between Arabic and English words in a mixed input, where the input is Unicode and is extracted from XML text nodes. I have noticed the class Character.UnicodeBlock. Is it related to my problem? How can I get it to work?

                  Character.UnicodeBlock 方法对阿拉伯语很有用,但显然不适用于英语(或其他欧洲语言),因为 BASIC_LATIN Unicode 块涵盖符号和不可打印字符和字母.所以现在我使用 String 对象的 matches() 方法和正则表达式 "[A-Za-z]+" 代替.我可以忍受它,但也许有人可以提出更好/更快的方法.

                  The Character.UnicodeBlock approach was useful for Arabic, but apparently doesn't do it for English (or other European languages) because the BASIC_LATIN Unicode block covers symbols and non-printable characters as well as letters. So now I am using the matches() method of the String object with the regex expression "[A-Za-z]+" instead. I can live with it, but perhaps someone can suggest a nicer/faster way.

                  推荐答案

                  是的,你可以简单地使用 Character.UnicodeBlock.of(char)

                  Yes, you can simply use Character.UnicodeBlock.of(char)

                  这篇关于Java:如何检查字符是否属于特定的 unicode 块?的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持html5模板网!

                  上一篇:如何在 Java 中将整数添加到 char 中? 下一篇:转换字符(铸造与 .getNumericValue)

                  相关文章

                  最新文章

                  <tfoot id='iom5f'></tfoot>

                • <legend id='iom5f'><style id='iom5f'><dir id='iom5f'><q id='iom5f'></q></dir></style></legend>

                    1. <small id='iom5f'></small><noframes id='iom5f'>

                        <bdo id='iom5f'></bdo><ul id='iom5f'></ul>

                      <i id='iom5f'><tr id='iom5f'><dt id='iom5f'><q id='iom5f'><span id='iom5f'><b id='iom5f'><form id='iom5f'><ins id='iom5f'></ins><ul id='iom5f'></ul><sub id='iom5f'></sub></form><legend id='iom5f'></legend><bdo id='iom5f'><pre id='iom5f'><center id='iom5f'></center></pre></bdo></b><th id='iom5f'></th></span></q></dt></tr></i><div id='iom5f'><tfoot id='iom5f'></tfoot><dl id='iom5f'><fieldset id='iom5f'></fieldset></dl></div>