Page 108 - 《中国图书馆学报》2013年第1期

P. 108

研究的重要基础。目前，在查询意图分类领域还没
有权威的数据集.各学者在进行研究时往往根据自 5 结语
身需要选取不同的数据集。根据数据集来源情况，
笔者将其分为三类，并对各类数据集的优缺点做了本文详细介绍了 Web查询的类目体系、分类
简要对比分析（见表 2）。特征、分类方法、数据集及评价方法。从上文可以
目前，查询意图研究领域没有特定的评价方看出，经过多年的努力，查询意图研究取得了很大
法，常用的评价方法通常来自文本分类.包括正确进展。但是，查询意图领域依然存在如下问题和挑
率、召回率、精确度、错误率和 F值等指标。需要注战：缺乏权威的评测标准；各种分类方法在大规模
意的是，正确率、召回率和 F值都只针对单个类别查询集合上的性能还不确定；有效提取或者获得查
进行评价，代表局部性能。因此，需要引人能够在询特征的方法仍然值得深入；查询意图分类体系的
全局（所有类别）上对分类器效果进行评价的方完备性和类目间的独立不相关性尚不确定等。此
法。目前主要有宏平均（macro averaging）和微平均外，当前查询意图分类研究大多只考虑把一个查询
（micro averaging）两种评价方法来度量在所有类别分到一个类目的情况，如何将分面分类应用到查询
上的分类性能.微平均评价指标容易受到大类分类意图研究中也是一个值得探索的问题。而如何将
性能的影响，而宏平均评价指标容易受到小类分类这些识别出的意图应用在搜索引擎优化中，将是今
性能的影响。后查询意图研究的一大趋势。

参考文献：
[ 1 ] Kang I,Kim C. Query type classification for Web document retrieval[ C]// Proceedings of the 26th Annual International
ACM SICIR Conference on Research and Development in Information Retrieval, 2003: 64 -71.
[ 2 ] Broder A. Ataxonomy of Web search[J]. SICIR Forum, 2002, 36(2) : 3 -10.
[ 3 ] Rose D E,Levinson D. Understanding user goals in Web search[ Cl// WWW 2004: Proceedings of the 13th Internation-
al Conference on World Wide Web, 2004: 13 -19.
[ 4 ] Marchionini C. Exploratorysearch: From finding to understanding[J]. Communications of the ACM, 2006, 49(4) : 41
- 46.
[ 5 1 Lee U,Liu Z,Cho .I. Automatic identification of user goals in Web search [C]// WWW 2005: Proceedings of the 14th
International Conference on World Wide Web, 2005: 391 -401.
[ 6 ] Mendoza M,Ricardo Baeza-Yates. A Web search analysis considering the intention behind queries[ C ]//LA-WEB 2008:
Proceedings of the Latin American Web Conference, 2008: 66 -74.
[ 7 ] Waller V. Not just information: Who searches for what on the search engine Coogle?[J]. Journal of the American Socie-
ty for Information Science and Technology, 2011, 62(4) : 761 -775.
[ 8 ] 吴晓晖，宋萍萍，张荣欣.有无查询意图的分类与实现架构模型研究[J].情报科学，2009，27（12）：1830一l833.
（Wu Xiaohui，Song Pingping，Zhang Rongxin.Research on implementation framework model and cl8ssific丑【ion baBed on
query intention and non—queIy intention[ J ] .Information Science，2009，Z7（12）：1830 —1833.）
[ 9] 杨艺，周元.基于用户查询意图识别的 Web搜索优化模型[J].计算机科学，2012，39 （1 ）：264 —267.（YangYi ，
Zhou Yuan.Web retrieval optimizHtion model based on user'8 queIy intention identification[ J ].Computer Science，
2012，39 （1 ）：264 —267.）
[10] Lux M,Kofler C,Marques 0. A classification scheme for user intentions in image search[ C ]//Proceedings of the 28th

103 104 105 106 107 108 109 110 111 112 113