1673-159X

CN 51-1686/N

基于改进模糊集合方法的用户查询词扩展的信息检索

Information Retrieval by Using Query Term Extension Approach Based on an Improved Fuzzy Set Method

  • 摘要: 基于模糊集合方法和Wordnet的查询扩展技术, 提出了一种用户查询词扩展的信息检索方法。先用Wordnet找出查询词的同义词, 再利用广义Jaccard系数来计算2个同义词之间的相似性, 选取相似性较大的同义词进行查询词扩展后实现信息检索。此方法不仅保留了模糊集合方法对查询词处理简单且容易理解的特性, 还很好地解决了模糊集合方法不能对文档进行精确排序的问题。最后, 提出了基于矩阵的布尔式向析取范式转化的算法, 该算法转换简单快速, 解决了模糊集合方法中随着查询词数量的增加使得布尔表达式转化成析取范式变得很复杂的问题。

     

    Abstract: A query term extension approach was used based on fuzzy set method and Wordnet for information retrieval in this paper.Wordnet was employed to find the synonyms for a query word, and the generalized Jaccard coefficient was used to calculate the similarities between the two synonyms.Then, the synonym with more similarities was selected to expand the query words for information retrieval.This approach not only kept the query word processing features simple and easy to understand for fuzzy set method, but also gave a good solution to the problem of fuzzy set method that can not accurately sort the documents.At last, the algorithm was proposed to quickly transform a Boolean expression into disjunctive normal form.The algorithm was simple, fast and solved the complicated problem for the transformation of Boolean expression into disjunctive normal form when inquires increased in the number of words.

     

/

返回文章
返回