<legend id='eboaQ'><style id='eboaQ'><dir id='eboaQ'><q id='eboaQ'></q></dir></style></legend>
  • <i id='eboaQ'><tr id='eboaQ'><dt id='eboaQ'><q id='eboaQ'><span id='eboaQ'><b id='eboaQ'><form id='eboaQ'><ins id='eboaQ'></ins><ul id='eboaQ'></ul><sub id='eboaQ'></sub></form><legend id='eboaQ'></legend><bdo id='eboaQ'><pre id='eboaQ'><center id='eboaQ'></center></pre></bdo></b><th id='eboaQ'></th></span></q></dt></tr></i><div id='eboaQ'><tfoot id='eboaQ'></tfoot><dl id='eboaQ'><fieldset id='eboaQ'></fieldset></dl></div>

      <bdo id='eboaQ'></bdo><ul id='eboaQ'></ul>

      <small id='eboaQ'></small><noframes id='eboaQ'>

        <tfoot id='eboaQ'></tfoot>

        如何对用 lucene 索引的文档进行分类

        时间:2023-09-29
          <tbody id='8b4OW'></tbody>

            <bdo id='8b4OW'></bdo><ul id='8b4OW'></ul>
            • <legend id='8b4OW'><style id='8b4OW'><dir id='8b4OW'><q id='8b4OW'></q></dir></style></legend>

              <i id='8b4OW'><tr id='8b4OW'><dt id='8b4OW'><q id='8b4OW'><span id='8b4OW'><b id='8b4OW'><form id='8b4OW'><ins id='8b4OW'></ins><ul id='8b4OW'></ul><sub id='8b4OW'></sub></form><legend id='8b4OW'></legend><bdo id='8b4OW'><pre id='8b4OW'><center id='8b4OW'></center></pre></bdo></b><th id='8b4OW'></th></span></q></dt></tr></i><div id='8b4OW'><tfoot id='8b4OW'></tfoot><dl id='8b4OW'><fieldset id='8b4OW'></fieldset></dl></div>

              <small id='8b4OW'></small><noframes id='8b4OW'>

                1. <tfoot id='8b4OW'></tfoot>
                2. 本文介绍了如何对用 lucene 索引的文档进行分类的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

                  问题描述

                  我用 Lucene 对一组文档进行了分类(字段:内容、类别).每个文档都有自己的类别,但其中一些被标记为未分类.有没有什么方法可以在java中轻松分类这些文档?

                  I have classified a set of documents with Lucene (fields: content, category). Each document has it's own category, but some of them are labeled as uncategorized. Is there any way to classify these documents easily in java?

                  推荐答案

                  从 Lucene 5.2.1 开始,您可以使用 索引文档以对新文档进行分类.开箱即用,Lucene 提供了一个朴素贝叶斯分类器,一个 k-最近邻分类器(基于 MoreLikeThis 类)和基于感知器的分类器.

                  As of Lucene 5.2.1, you can use indexed documents to classify new documents. Out of the box, Lucene offers a naive Bayes classifier, a k-Nearest Neighbor classifier (based on the MoreLikeThis class) and a Perceptron based classifier.

                  缺点是所有这些类都标有实验性警告,并附有维基百科的链接.

                  The drawback is that all of these classes are marked with experimental warnings and documented with links to Wikipedia.

                  这篇关于如何对用 lucene 索引的文档进行分类的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持html5模板网!

                  上一篇:Solr:结合 EdgeNGramFilterFactory 和 NGramFilterFactory 下一篇:在包含 1 亿个字符串的大型文本文件中进行高效

                  相关文章

                  最新文章

                  <tfoot id='1rnur'></tfoot>
                3. <i id='1rnur'><tr id='1rnur'><dt id='1rnur'><q id='1rnur'><span id='1rnur'><b id='1rnur'><form id='1rnur'><ins id='1rnur'></ins><ul id='1rnur'></ul><sub id='1rnur'></sub></form><legend id='1rnur'></legend><bdo id='1rnur'><pre id='1rnur'><center id='1rnur'></center></pre></bdo></b><th id='1rnur'></th></span></q></dt></tr></i><div id='1rnur'><tfoot id='1rnur'></tfoot><dl id='1rnur'><fieldset id='1rnur'></fieldset></dl></div>
                    <bdo id='1rnur'></bdo><ul id='1rnur'></ul>
                  <legend id='1rnur'><style id='1rnur'><dir id='1rnur'><q id='1rnur'></q></dir></style></legend>

                  <small id='1rnur'></small><noframes id='1rnur'>