Lucene-获取停用词位置

我正在使用Lucene StopAnalyzer分析带有停用词The Brown Fox的字符串the,因此将其分析为两个术语[brown,fox]。我能够知道brown(4,9)和fox(10,13)的偏移量,但是如何获取排除项the的偏移量?

final Offsetattribute attribute = tokenStream.addAttribute(Offsetattribute.class);
            tokenStream.reset();
            final List<String> analyzedTerms = Lists.newArrayList();
            final StringBuilder stringBuilder = new StringBuilder();
            while (tokenStream.incrementToken()) {
                final int startOffset = attribute.startOffset();
                final int endOffset = attribute.endOffset();
                final String original = text.substring(startOffset,endOffset);
                System.out.println(original);
            }
wang2005727 回答:Lucene-获取停用词位置

暂时没有好的解决方案,如果你有好的解决方案,请发邮件至:iooj@foxmail.com
本文链接:https://www.f2er.com/3136090.html

大家都在问