lucene搜索返回内容的一个有关问题-Java教程-爱易网页

lucene搜索返回内容的一个有关问题

日期：2014-05-18　浏览次数：20782 次

lucene搜索返回内容的一个问题
我用lucene实现搜索，现在处于最基础的阶段，即只是实现对文档内容的搜索，返回结果数。
建立索引程序代码如下：

package com.langhua;



import java.io.File;

import java.io.FileReader;

import java.io.IOException;

import java.util.Date;



import org.apache.lucene.analysis.standard.StandardAnalyzer;

import org.apache.lucene.document.DateTools;

import org.apache.lucene.document.Document;

import org.apache.lucene.document.Field;

import org.apache.lucene.index.IndexWriter;

import org.apache.lucene.store.Directory;

import org.apache.lucene.store.SimpleFSDirectory;

import org.apache.lucene.util.Version;

/**

 * 创建索引 Lucene 3.0+

 * @author Administrator

 *

 */

public class Indexer {



	/**

	 * @param args

	 * @throws IOException 

	 */

	public static void main(String[] args) throws IOException {

		//保存索引文件的地方

		String indexDir = "F:\\testroomtxt\\index";

		//将要搜索TXT文件的地方

		String dateDir = "F:\\testroomtxt\\testroom";

		IndexWriter indexWriter = null;

		//创建Directory对象，目录，磁盘,将索引存放在磁盘上

		Directory dir = new SimpleFSDirectory(new File(indexDir));

		//创建IndexWriter对象,第一个参数是Directory,第二个是分词器,第三个表示是否是创建,如果为false为在此基础上面修改,第四表示表示分词的最大值，比如说new MaxFieldLength(2)，就表示两个字一分，一般用IndexWriter.MaxFieldLength.LIMITED 

		indexWriter = new IndexWriter(dir,new StandardAnalyzer(Version.LUCENE_30),true,IndexWriter.MaxFieldLength.LIMITED);

		File[] files = new File(dateDir).listFiles();   //返回一个抽象路径名数组，这些路径名表示此抽象路径名所表示目录中的文件。

		for (int i = 0; i < files.length; i++) {

			//建立一个document对象，Document，相当于一个要进行索引的单元，任何可以想要被索引的文件都必须转化为Document对象才能进行索引。用户提供的源是一条条记录，它们可以是文本文件、字符串或者数据库表的一条记录等等。

			Document doc = new Document();

			//创建Field对象，并放入doc对象中，一个Document可以包含多个信息域，例如一篇文章可以包含“标题”、“正文”、“最后修改时间”等信息域，这些信息域就是通过Field在Document中存储的。 

			doc.add(new Field("contents", new FileReader(files[i]))); //用于读取字符流

			doc.add(new Field("filename", files[i].getName(), 

								Field.Store.YES, Field.Index.NOT_ANALYZED));

			doc.add(new Field("indexDate",DateTools.dateToString(new Date(), DateTools.Resolution.DAY),Field.Store.YES,Field.Index.NOT_ANALYZED));

			//写入IndexWriter

			indexWriter.addDocument(doc);

		}

		//查看IndexWriter里面有多少个索引

		System.out.println("numDocs"+indexWriter.numDocs());

		indexWriter.close();

		

	}



}

搜索程序代码如下

package com.langhua;



import java.io.BufferedReader;

import java.io.File;

import java.io.FileReader;

import java.io.IOException;

import java.io.InputStreamReader;



import org.apache.lucene.analysis.standard.StandardAnalyzer;

import org.apache.lucene.document.Document;

import org.apache.lucene.queryParser.ParseException;

import org.apache.lucene.queryParser.QueryParser;

import org.apache.lucene.search.IndexSearcher;

import org.apache.lucene.search.Query;

import org.apache.lucene.search.ScoreDoc;

import or

免责声明： 本文仅代表作者个人观点，与爱易网无关。其原创性以及文中陈述文字和内容未经本站证实，对本文以及其中全部或者部分内容、文字的真实性、完整性、及时性本站不作任何保证或承诺，请读者仅作参考，并请自行核实相关内容。

lucene搜索返回内容的一个有关问题

相关资料更多>

推荐阅读更多>