吴超,郑彦宁,化柏林.数值信息抽取研究进展综述[J].中国图书馆学报,2014,40(2):107~119 |
数值信息抽取研究进展综述 |
Numerical Information Extraction:A Review of Research |
投稿时间:2013-05-28 修订日期:2013-09-21 |
DOI: |
中文关键词: 数值信息 数值知识元 数值信息抽取 命名实体识别 |
英文关键词: Numerical information Numeric knowledge element Numerical information extraction Named entity recognition |
基金项目: |
|
摘要点击次数: 4353 |
全文下载次数: 2789 |
中文摘要: |
通过对数值信息抽取文献的调研,先从文献类型、学科领域、高频 |
英文摘要: |
This paper first makes a quantitative analysis on the documents of numerical information extraction from three aspects:document type, subject area and high frequency keywords. Then the research context is summarized from four aspects:data source type, object for extraction, extraction method and technique, result evaluation and corresponding application. Our findings are as follows:news corpus and web pages are the main data sources; cardinal numbers and quantitative phrases are the main objects for extraction; extraction method and technique are mainly rule-based and the result evaluation indicators are relatively simple but have a wide scope for application. 4figs. 3tabs. 56refs. |
查看全文
查看/发表评论 下载PDF阅读器 |