java - Java Lucene: how do I find the classes TermFreqVector and TermPositionVector?
Problem description
My IDE cannot resolve these interfaces, and I don't understand how to use them.
(Windows + NetBeans + Lucene 7.4.0)
My pom.xml:
<project xmlns="http://maven.apache.org/POM/4.0.0" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
xsi:schemaLocation="http://maven.apache.org/POM/4.0.0 http://maven.apache.org/xsd/maven-4.0.0.xsd">
<modelVersion>4.0.0</modelVersion>
<groupId>com.javacodegeeks</groupId>
<artifactId>jstringsearch</artifactId>
<version>1.0.0</version>
<packaging>jar</packaging>
<name>jstringsearch</name>
<url>http://maven.apache.org</url>
<properties>
<project.build.sourceEncoding>UTF-8</project.build.sourceEncoding>
<maven.compiler.source>1.6</maven.compiler.source>
<maven.compiler.target>1.6</maven.compiler.target>
<java.version>1.8</java.version>
<lucene.version>7.4.0</lucene.version>
</properties>
<!-- Build plugins -->
<build>
<plugins>
<plugin>
<groupId>org.apache.maven.plugins</groupId>
<artifactId>maven-dependency-plugin</artifactId>
<executions>
<execution>
<id>copy-dependencies</id>
<phase>prepare-package</phase>
<goals>
<goal>copy-dependencies</goal>
</goals>
<configuration>
<outputDirectory>${project.build.directory}/lib</outputDirectory>
<overWriteReleases>false</overWriteReleases>
<overWriteSnapshots>false</overWriteSnapshots>
<overWriteIfNewer>true</overWriteIfNewer>
</configuration>
</execution>
</executions>
</plugin>
<plugin>
<groupId>org.apache.maven.plugins</groupId>
<artifactId>maven-jar-plugin</artifactId>
<version>3.2.0</version>
<configuration>
<archive>
<manifest>
<addClasspath>true</addClasspath>
<classpathPrefix>lib/</classpathPrefix>
<mainClass>com.javacodegeeks.jstringsearch.Main2</mainClass>
<addDefaultImplementationEntries>true</addDefaultImplementationEntries>
<addDefaultSpecificationEntries>true</addDefaultSpecificationEntries>
</manifest>
</archive>
<source>1.5</source>
<target>1.5</target>
</configuration>
</plugin>
<plugin>
<groupId>org.apache.maven.plugins</groupId>
<artifactId>maven-source-plugin</artifactId>
<version>3.2.1</version>
<executions>
<execution>
<id>attach-sources</id>
<goals>
<goal>jar</goal>
</goals>
</execution>
</executions>
</plugin>
<plugin>
<groupId>org.apache.maven.plugins</groupId>
<artifactId>maven-assembly-plugin</artifactId>
<version>3.2.0</version>
<configuration>
<archive>
<manifest>
<mainClass>com.javacodegeeks.jstringsearch.Main2</mainClass>
</manifest>
</archive>
<descriptorRefs>
<descriptorRef>jar-with-dependencies</descriptorRef>
</descriptorRefs>
</configuration>
<executions>
<execution>
<id>make-assembly</id> <!-- this is used for inheritance merges -->
<phase>package</phase> <!-- bind to the packaging phase -->
<goals>
<goal>single</goal>
</goals>
</execution>
</executions>
</plugin>
</plugins>
</build>
<dependencies>
<!-- Full text search -->
<dependency>
<groupId>org.apache.lucene</groupId>
<artifactId>lucene-core</artifactId>
<version>${lucene.version}</version>
<type>jar</type>
</dependency>
<dependency>
<groupId>org.apache.lucene</groupId>
<artifactId>lucene-analyzers-common</artifactId>
<version>${lucene.version}</version>
</dependency>
<dependency>
<groupId>org.apache.lucene</groupId>
<artifactId>lucene-queryparser</artifactId>
<version>${lucene.version}</version>
<type>jar</type>
</dependency>
<!-- To include highlight support -->
<dependency>
<groupId>org.apache.lucene</groupId>
<artifactId>lucene-highlighter</artifactId>
<version>${lucene.version}</version>
</dependency>
<dependency>
<groupId>com.googlecode.json-simple</groupId>
<artifactId>json-simple</artifactId>
<version>1.1.1</version>
</dependency>
<!-- Detect languages of text -->
<dependency>
<groupId>io.github.kju2.languagedetector</groupId>
<artifactId>language-detector</artifactId>
<version>1.0.6-SNAPSHOT</version>
<type>jar</type>
</dependency>
<!-- Helps to create test units -->
<dependency>
<groupId>junit</groupId>
<artifactId>junit</artifactId>
<version>3.8.1</version>
<scope>test</scope>
</dependency>
</dependencies>
</project>
Part of the code:
Directory directory = new RAMDirectory();
IndexWriterConfig indexWriterConfig = new IndexWriterConfig(new StandardAnalyzer());
IndexWriter writer = new IndexWriter(directory, indexWriterConfig);
Document doc = new Document();
// Field.Store.NO, Field.Index.ANALYZED, Field.TermVector.YES
FieldType type = new FieldType();
type.setStoreTermVectors(true);
type.setStoreTermVectorPositions(true);
type.setStoreTermVectorOffsets(true);
type.setStored(true);
type.setIndexOptions(IndexOptions.DOCS_AND_FREQS_AND_POSITIONS_AND_OFFSETS);
Field fieldStore = new Field("tags", "Kite good world.", type);
doc.add(fieldStore);
writer.addDocument(doc);
writer.close();
DirectoryReader reader = DirectoryReader.open(directory);
IndexSearcher searcher = new IndexSearcher(reader);
QueryParser queryParser = new QueryParser("tags", new StandardAnalyzer());
Query query = queryParser.parse("\"Kite World\"~1");
TopDocs results = searcher.search(query, 1);
for ( ScoreDoc scoreDoc : results.scoreDocs) {
// The IDE marks the following lines in red (unresolved classes)
IndexReader re = DirectoryReader.open(directory);
TermFreqVector tfvector = re.getTermFreqVector(scoreDoc.doc, "tags");
TermPositionVector tpvector = (TermPositionVector) tfvector;
int index = termPositionVector.indexOf("the");
In tutorials, I have seen people simply write:
import org.apache.lucene.index.TermFreqVector;
import org.apache.lucene.index.TermPositionVector;
But when I do the same, my IDE cannot find these classes and shows an error.
What do you make of this? How do I find the classes TermFreqVector and TermPositionVector?
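Solution
TermFreqVector and TermPositionVector are very old Lucene code that existed only up to the 3.x releases. They were removed in Lucene 4.0, where term vectors are read through Terms and TermsEnum instead.
The Lucene version in your POM is much newer (your pom.xml declares 7.4.0), so these classes are not on your classpath and you have to adapt your code.
I recommend moving to 8.x and fixing your code.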
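As a rough illustration of what "fixing your code" could look like, here is a minimal sketch that reads a term vector with the API that replaced TermFreqVector and TermPositionVector. It assumes Lucene 7.4.0 as declared in the pom.xml above (lucene-core is enough, since StandardAnalyzer lives there in 7.x). The class name TermVectorSketch and the helper termPositions are made up for this example, and document ID 0 is used only because the index holds a single document; in the question's loop you would pass scoreDoc.doc instead.

```java
import java.io.IOException;
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

import org.apache.lucene.analysis.standard.StandardAnalyzer;
import org.apache.lucene.document.Document;
import org.apache.lucene.document.Field;
import org.apache.lucene.document.FieldType;
import org.apache.lucene.document.TextField;
import org.apache.lucene.index.DirectoryReader;
import org.apache.lucene.index.IndexReader;
import org.apache.lucene.index.IndexWriter;
import org.apache.lucene.index.IndexWriterConfig;
import org.apache.lucene.index.PostingsEnum;
import org.apache.lucene.index.Terms;
import org.apache.lucene.index.TermsEnum;
import org.apache.lucene.store.Directory;
import org.apache.lucene.store.RAMDirectory;
import org.apache.lucene.util.BytesRef;

// Hypothetical class name, used only for this illustration.
public class TermVectorSketch {

    /** Indexes one document and maps each term of the field to its token positions. */
    static Map<String, List<Integer>> termPositions(String fieldName, String text) throws IOException {
        Directory directory = new RAMDirectory();

        // Start from TextField's settings (indexed, tokenized, stored) and switch on term vectors.
        FieldType type = new FieldType(TextField.TYPE_STORED);
        type.setStoreTermVectors(true);
        type.setStoreTermVectorPositions(true);
        type.setStoreTermVectorOffsets(true);
        type.freeze();

        try (IndexWriter writer = new IndexWriter(directory, new IndexWriterConfig(new StandardAnalyzer()))) {
            Document doc = new Document();
            doc.add(new Field(fieldName, text, type));
            writer.addDocument(doc);
        }

        Map<String, List<Integer>> result = new LinkedHashMap<>();
        try (IndexReader reader = DirectoryReader.open(directory)) {
            // Takes the place of getTermFreqVector(docId, field); returns null if no vector was stored.
            Terms vector = reader.getTermVector(0, fieldName);
            if (vector == null) {
                return result;
            }
            TermsEnum termsEnum = vector.iterator();
            PostingsEnum postings = null;
            BytesRef term;
            while ((term = termsEnum.next()) != null) {
                // Ask for positions; this plays the role of TermPositionVector.
                postings = termsEnum.postings(postings, PostingsEnum.POSITIONS);
                postings.nextDoc(); // a term vector covers exactly one document
                List<Integer> positions = new ArrayList<>();
                int freq = postings.freq();
                for (int i = 0; i < freq; i++) {
                    positions.add(postings.nextPosition());
                }
                result.put(term.utf8ToString(), positions);
            }
        }
        return result;
    }

    public static void main(String[] args) throws IOException {
        System.out.println(termPositions("tags", "Kite good world."));
    }
}
```

Note that StandardAnalyzer lowercases tokens, so the vector contains "kite" rather than "Kite". To look up a single term instead of iterating, termsEnum.seekExact(new BytesRef("kite")) can be called before postings(...). On 8.x the same calls should still compile, although RAMDirectory is deprecated there.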