JIA Junzhi. Type and Implementation of Vocabulary Reuse in Resource Description[J]. Journal of Library Science in China, 2021, 47(4): 76-85
Type and Implementation of Vocabulary Reuse in Resource Description
Received:December 07, 2020  Revised:January 07, 2021
DOI:
Key words:Resource description  Vocabulary resue  Vocabulary  Datesets
中文关键词:  资源描述  词表重用  词表  数据集
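Funding: This article is one of the research outputs of the National Social Science Fund of China General Project "Research on Vocabulary Reuse in the Open Data Environment" (No. 19BTQ023).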
Author Name: JIA Junzhi
Affiliation: School of Information Resource Management, Renmin University of China, Beijing 100872
Abstract:
The purpose of resource description is to add meaningful data to the described objects so that machine and human users can effectively identify, find, and discover various types of resources. To improve the normalization and standardization of resource description and to enhance interoperability between resources, vocabularies of various types are continually created and used to describe resources and to express, as fully as possible, the semantic content that resources contain. Vocabulary reuse means selecting appropriate classes and attributes from existing vocabularies to define the internal and external characteristics of the described objects, and using shared terms to define a common model for information exchange. This enables accurate description and formal representation of datasets, improves interoperability between datasets, and avoids ambiguous or conflicting expressions. In data modeling, whether a dataset can be described accurately, whether connections between datasets can be established clearly, and whether an environment readable and comprehensible to both humans and computers can be provided all depend largely on the reuse of vocabularies. The paper analyzes the basic structure of resource description and divides its processing into two phases: data modeling and resource annotation. By function, it further distinguishes a vocabulary layer, a schema layer, and a data layer. It then discusses the types and implementation methods of vocabulary reuse, taking the bibliographic data of the British Library as an example, and shows how the methods differ and the environments to which each applies. This guides those who reuse vocabularies in accurately describing the classes, attributes, and value ranges of datasets with existing vocabularies, which improves dataset quality and provides a basis for further application. There are two types of vocabulary reuse, vocabulary-layer reuse and conceptual-layer reuse; both occur at the modeling stage and differ in what the user focuses on. Vocabulary-layer reuse focuses on the selection of vocabularies, whereas conceptual-layer reuse focuses on how well classes and attributes match. Current users evidently prefer vocabulary-layer reuse to conceptual-layer reuse. As vocabularies become more open and their ecosystem continues to improve, conceptual-layer reuse will be used more widely. Taking the classes and attributes in the data model as the objects of division, the implementation of vocabulary reuse falls mainly into six types: class or property reuse; reuse as a superclass/subclass or superproperty/subproperty; domain or range; class operations; inverse properties; and value restrictions. The choice among these depends on how well the target vocabulary matches the classes and attributes in the data model. Different ways of reusing the same vocabulary influence one another, and reusing multiple vocabularies raises issues of coordinating classes and attributes and resolving conflicts between them. 2 figs. 1 tab. 16 refs.
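Chinese abstract (translated):
      To improve the normalization and standardization of resource description and to enhance interoperability between resources, vocabularies of various types are continually being created and used, and vocabulary reuse has become a key issue in resource description. Starting from the basic structure of resource description, this paper analyzes the vocabulary layer, schema layer, and data layer in detail on the basis of two phases, data modeling and resource annotation. It examines two types of reuse, vocabulary-layer and conceptual-layer, and argues that users currently pay more attention to vocabulary-layer reuse, while conceptual-layer reuse will develop further as the vocabulary ecosystem improves. Taking the classes and attributes in the data model as the objects of division and starting from the form of RDF triples, it studies in depth how vocabulary reuse is implemented. This helps clarify how the methods differ and the environments to which each applies, and can effectively guide users in applying existing vocabularies to describe accurately the classes, attributes, and value ranges of datasets, thereby improving dataset quality. 2 figs. 1 tab. 16 refs.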
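The six implementation types named in the abstract can be pictured as RDF triples. The sketch below is only an illustration under stated assumptions: the `ex:` namespace and every `ex:` term are hypothetical, the triples are not taken from the paper or from the British Library data, and the pairing of each type with a particular RDFS or OWL construct is an assumed reading of the abstract. The external terms (`dcterms:`, `foaf:`, `bibo:`) come from real, widely used vocabularies.

```python
import re

# Prefix table. "ex" is a hypothetical local data model; the rest are real vocabularies.
PREFIXES = {
    "ex": "http://example.org/model#",
    "rdf": "http://www.w3.org/1999/02/22-rdf-syntax-ns#",
    "rdfs": "http://www.w3.org/2000/01/rdf-schema#",
    "owl": "http://www.w3.org/2002/07/owl#",
    "dcterms": "http://purl.org/dc/terms/",
    "foaf": "http://xmlns.com/foaf/0.1/",
    "bibo": "http://purl.org/ontology/bibo/",
}
MODELING_PREFIXES = {"rdf", "rdfs", "owl"}  # modeling languages, not reused domain vocabularies
LOCAL_PREFIX = "ex"

# One illustrative set of triples per reuse type (assumed mapping, see lead-in).
REUSE_PATTERNS = {
    "class or property reuse": [
        ("ex:book1", "rdf:type", "bibo:Book"),
        ("ex:book1", "dcterms:title", '"An Example Title"'),
    ],
    "superclass/subclass or superproperty/subproperty": [
        ("ex:Thesis", "rdfs:subClassOf", "bibo:Document"),
        ("ex:translator", "rdfs:subPropertyOf", "dcterms:contributor"),
    ],
    "domain or range": [
        ("ex:writtenBy", "rdfs:domain", "bibo:Document"),
        ("ex:writtenBy", "rdfs:range", "foaf:Agent"),
    ],
    "class operations": [
        ("ex:Creator", "owl:unionOf", "( foaf:Person foaf:Organization )"),
    ],
    "inverse properties": [
        ("ex:isCreatorOf", "owl:inverseOf", "dcterms:creator"),
    ],
    "value restrictions": [
        ("ex:AuthoredWork", "rdfs:subClassOf", "_:r"),
        ("_:r", "rdf:type", "owl:Restriction"),
        ("_:r", "owl:onProperty", "dcterms:creator"),
        ("_:r", "owl:allValuesFrom", "foaf:Agent"),
    ],
}

# Matches prefixed names such as foaf:Person; skips literals and blank nodes (_:r).
_PREFIXED_NAME = re.compile(r"\b([A-Za-z]+):[A-Za-z]\w*")


def prefixes_used(triples):
    """Return every prefix that appears in any term of the given triples."""
    found = set()
    for triple in triples:
        for term in triple:
            found.update(_PREFIXED_NAME.findall(term))
    return found


def reused_vocabularies(triples):
    """Return the external vocabularies a set of triples draws on."""
    return prefixes_used(triples) - MODELING_PREFIXES - {LOCAL_PREFIX}


def to_turtle(triples):
    """Render triples as Turtle-style statements, one per line."""
    return "\n".join(f"{s} {p} {o} ." for s, p, o in triples)


if __name__ == "__main__":
    for name, triples in REUSE_PATTERNS.items():
        print(f"# {name} (reuses: {', '.join(sorted(reused_vocabularies(triples)))})")
        print(to_turtle(triples))
        print()
```

Plain tuples are used instead of an RDF library so the sketch has no dependencies; in practice a toolkit such as rdflib would handle parsing and serialization.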