Article Abstract

Wu Chao, Zheng Yanning, Hua Bolin. Numerical Information Extraction: A Review of Research [J]. Journal of Library Science in China, 2014, 40(2): 107-119
Numerical Information Extraction: A Review of Research
Received: 2013-05-28  Revised: 2013-09-21
DOI:
Keywords: Numerical information; Numeric knowledge element; Numerical information extraction; Named entity recognition
Funding:
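Author  Affiliation  E-mail
Wu Chao  Institute of Scientific and Technical Information of China, Beijing 100038
Zheng Yanning  Institute of Scientific and Technical Information of China, Beijing 100038
Hua Bolin  Department of Information Management, Peking University, Beijing 100871  huabolin@istic.ac.cn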
Abstract:
This paper first presents a quantitative analysis of the literature on numerical information extraction from three aspects: document type, subject area, and high-frequency keywords. It then summarizes the research landscape from four aspects: data source type, extraction object, extraction methods and techniques, and result evaluation and applications. The findings are as follows: news corpora and web pages are the main data sources; cardinal numbers and quantitative phrases are the main extraction objects; extraction methods and techniques are mainly rule-based; and the evaluation indicators are relatively simple, while the scope of application is wide. 4 figs. 3 tabs. 56 refs.