吴超,郑彦宁,化柏林.数值信息抽取研究进展综述[J].中国图书馆学报,2014,40(2):107~119
Numerical Information Extraction:A Review of Research
数值信息抽取研究进展综述
Received:May 28, 2013  Revised:September 21, 2013
DOI:
Key words:Numerical information  Numeric knowledge element  Numerical information extraction  Named entity recognition
中文关键词:  数值信息  数值知识元  数值信息抽取  命名实体识别
基金项目:
Author NameAffiliationE-mail
Wu Chao 中国科学技术信息研究所,北京 北京 100038  
Zheng Yanning 中国科学技术信息研究所,北京 北京 100038  
Hua Bolin 北京大学信息管理系,北京 北京 100871 huabolin@istic.ac.cn 
Hits: 4359
Download times: 2796
Abstract:
This paper first makes a quantitative analysis on the documents of numerical information extraction from three aspects:document type, subject area and high frequency keywords. Then the research context is summarized from four aspects:data source type, object for extraction, extraction method and technique, result evaluation and corresponding application. Our findings are as follows:news corpus and web pages are the main data sources; cardinal numbers and quantitative phrases are the main objects for extraction; extraction method and technique are mainly rule-based and the result evaluation indicators are relatively simple but have a wide scope for application. 4figs. 3tabs. 56refs.
中文摘要:
      通过对数值信息抽取文献的调研,先从文献类型、学科领域、高频
View Full Text   View/Add Comment  Download reader