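WANG Xiaoguang, WENG Mengjuan, HOU Xilong, LEI Jueying. The Knowledge Representation and Semantic Modeling of Ancient Books Commentaries [J]. Journal of Library Science in China, 2023, 49(3): 75-91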
The Knowledge Representation and Semantic Modeling of Ancient Books Commentaries
Received: June 23, 2022  Revised: August 24, 2022
Key words:Ancient books commentaries  Hermeneutics ontology  Knowledge representation  Nanopublication  Collation of ancient books
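Author Name  Affiliation
WANG Xiaoguang  School of Information Management, Wuhan University, Wuhan, Hubei 430072
WENG Mengjuan  School of Information Management, Wuhan University, Wuhan, Hubei 430072
HOU Xilong  School of Communication, Qufu Normal University, Rizhao, Shandong 276800
LEI Jueying  School of Information Management, Wuhan University, Wuhan, Hubei 430072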
Abstract:
This paper proposes a technical path for the semantic representation of commentary knowledge, based on ontology and nanopublication. The path aims to publish commentary knowledge in minimal units by revealing the internal semantic relationships of ancient book commentaries, while ensuring that the main responsible entities involved in republishing activities are traceable and that the published content is credible. The path has five steps: first, identify the minimal knowledge units in the ancient book commentaries; second, extract resources and attributes; third, annotate the semantic relationships between resources and attributes; fourth, fill the nanopublication template; and finally, generate trusty URIs for the nanopublications.
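The last two steps of the path (filling a nanopublication template and deriving a content-based identifier) can be illustrated with a minimal Python sketch. Everything named here is an assumption made for illustration: the `ex:` predicates, the `fill_template` helper, the `http://example.org/np/` base, and the simple sorted-line serialization are hypothetical and are not taken from the paper. The three-part layout (assertion, provenance, publication info) follows the general nanopublication model, and MD5 is used only because the abstract reports it; the digest here is not the published trusty URI format, which defines its own normalization and encoding.

```python
import hashlib
from dataclasses import dataclass
from typing import List, Tuple

Triple = Tuple[str, str, str]  # (subject, predicate, object)


@dataclass
class Nanopublication:
    """A nanopublication as three small graphs of triples."""
    assertion: List[Triple]   # the knowledge claim itself
    provenance: List[Triple]  # where the claim comes from
    pubinfo: List[Triple]     # metadata about the publication act

    def serialize(self) -> str:
        # Sort triples so the same content always yields the same text.
        parts = []
        for name in ("assertion", "provenance", "pubinfo"):
            lines = [f"{s} {p} {o} ." for s, p, o in sorted(getattr(self, name))]
            parts.append(name + " {\n" + "\n".join(lines) + "\n}")
        return "\n".join(parts)

    def uri(self, base: str = "http://example.org/np/") -> str:
        # Hypothetical scheme: base URI plus the MD5 hex digest of the content.
        digest = hashlib.md5(self.serialize().encode("utf-8")).hexdigest()
        return base + digest


def fill_template(unit: str, passage: str, note: str,
                  commentator: str, book: str) -> Nanopublication:
    """Fill the template for one interpretation unit (hypothetical predicates)."""
    return Nanopublication(
        assertion=[(unit, "ex:interprets", passage),
                   (unit, "ex:hasContent", note)],
        provenance=[(unit, "ex:attributedTo", commentator),
                    (unit, "ex:derivedFrom", book)],
        pubinfo=[(unit, "ex:generatedBy", "ex:demo-pipeline")],
    )


np_example = fill_template("ex:unit1", "ex:passage1", "ex:note1",
                           "ex:commentatorA", "ex:bookX")
print(np_example.uri())
```

Because the identifier is computed from the serialized content, any change to a triple produces a different URI, which is what lets a reader check that a retrieved nanopublication matches the one that was cited.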
To illustrate the feasibility and practicability of this path, we first identify four types of knowledge units in ancient book commentaries: interpretation units, citation units, provenance units, and alignment units. We then construct a hermeneutics ontology to describe these knowledge units; the ontology includes SWRL rules that infer author-level and book-level reference relations from sentence-level reference relations. Next, using excerpts from several ancient book commentaries as a corpus, we realize semantic representation with the nanopublication as an independent publishing unit, together with citation relationship inference. Finally, we use the MD5 algorithm to generate trusty URIs for the nanopublications. The experiments show that the hermeneutics ontology can serve as a data model for structuring commentaries within a single discourse and for associating commentaries across discourses, and that nanopublication can serve as a method for ensuring traceability and credibility.
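The idea behind the reference-inference rules can be sketched in plain Python. This is a minimal illustration, not the paper's SWRL rules: the function `lift_references` and all identifiers (`s1`, `commentator_A`, `book_X`, and so on) are hypothetical. The assumed rule shape is that if sentence s1 references sentence s2, and each sentence is linked to an author (or a book), then the author (or book) of s1 references the author (or book) of s2.

```python
from typing import Dict, Set, Tuple


def lift_references(sentence_refs: Set[Tuple[str, str]],
                    owner_of: Dict[str, str]) -> Set[Tuple[str, str]]:
    """Lift sentence-level references to the level given by owner_of.

    owner_of maps a sentence id to its author or to its book, so the same
    function yields author-level or book-level reference relations.
    """
    lifted = set()
    for citing, cited in sentence_refs:
        source = owner_of.get(citing)
        target = owner_of.get(cited)
        # Skip sentences whose author or book is unknown.
        if source is not None and target is not None:
            lifted.add((source, target))
    return lifted


# Hypothetical data: two sentences in a commentary reference earlier sentences.
sentence_refs = {("s1", "s2"), ("s3", "s2")}
author_of = {"s1": "commentator_A", "s2": "commentator_B", "s3": "commentator_A"}
book_of = {"s1": "book_X", "s2": "book_Y", "s3": "book_X"}

print(lift_references(sentence_refs, author_of))
print(lift_references(sentence_refs, book_of))
```

In an ontology the same pattern would be expressed declaratively as a rule over properties and evaluated by a reasoner; the loop above only makes the inference step concrete.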
The semantic representation path for commentary knowledge can provide a reference for the construction of smart data, semantic publishing, and the digital reconstruction of ancient books.
The innovation of this research is that, for the first time, nanopublication and ontology are combined for the semantic representation of ancient book commentaries. In addition, we design the ontology for ancient book commentaries from the perspective of interpretation, which enables extensive semantic association between the commentaries and external literature. 6 figs. 5 tabs. 41 refs.
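Chinese abstract (in translation):
Commentaries (zhushu) are texts formed by annotating, and re-annotating, ancient classics. They reflect the commentators' understanding of the ancient texts and are an important basis on which later generations understand, inherit, and transmit ideas and culture. Applying semantic technologies such as ontology and nanopublication to the knowledge representation and semantic modeling of commentary texts can reveal the semantic relationships among the knowledge contained in commentary literature and enable its semantic publishing and reconstruction. To verify the feasibility and practicability of this approach, this paper designs a hermeneutics ontology that includes citation relations and, using a selection of commentary texts as a corpus, realizes the semantic representation of commentaries with the nanopublication as an independent publishing unit, together with citation relationship inference. The experiments show that the hermeneutics ontology can serve as a data model for structuring commentary knowledge units within a single discourse and for linking them across discourses, supporting the datafication of commentary literature and adding to its value. The semantic representation path for commentary knowledge can provide a reference for the construction of ancient book knowledge bases, semantic publishing, and digital reconstruction. 6 figures. 5 tables. 41 references.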