马亚雪,毛进,李纲.面向科学社会计算的数据组织与建模方法[J].中国图书馆学报,2021,47(1):76~87
Data Organization and Modeling for the Social Computing of Science
面向科学社会计算的数据组织与建模方法
Received:July 28, 2020  
DOI:
Key words:Scientific society  Social computing of science  Data organization  Data modeling  Data fusion
中文关键词:  科学社会  科学社会计算  数据组织  数据建模  数据融合
基金项目:本文系国家自然科学基金创新研究群体项目“信息资源管理” (编号:71921002)和国家自然科学基金青年项目“基于学术异质网络表示学习的知识群落发现”(编号:71804135)的研究成果之一
Author NameAffiliation
MA Yaxue 武汉大学信息管理学院 湖北 武汉430072 
MAO Jin 武汉大学信息管理学院 湖北 武汉430072 
LI Gang 武汉大学信息管理学院 湖北 武汉430072 
Hits: 1100
Download times: 564
Abstract:
Science is a complex,dynamic and open system,which enhances the technical barriers to organize the massive and multi source data for revealing the operation mechanism of the science system. Extending the definition of the science system in the sociology of science,this study emphasizes the role of scientific knowledge. The science system is regarded as a scientific society where the scientific community works as the main participants,and the diffusion of existing knowledge is the driving force to facilitate scientific creation and production. Based on the idea,the concept of “social computing of science” is proposed to generalize the research on the theories and methods that focus on the complex,changeable phenomena and issues in the scientific society.
To assist the social computing of science,the general framework of data organization and modeling is generated. First,we propose a ternary link model to profile the operating process of the scientific society. The model consists of four types of entities,ie,the participants,the locations,the activities,and the knowledge that connected the other three components. Through the connection among the entities,the operation mode and potential laws of the scientific society can be more comprehensively reflected and can assist in modeling the scientific society from multiple perspectives. Then,based on the ternary link model,we analyze the path of data organization by summarizing the related data and attributes of the four entities. The extraction of the entities attributes is regarded as a classification task,based on which the data fusion method of the entities is developed. Finally,from the perspective of the multi layer network,we explore the relationship among different types of entities,and propose the relationship integration model for the scientific society. Taking the cases on the interpersonal networks analysis and the cross community knowledge diffusion as examples,the paper illustrates the process and advantages of social computing of science that is based on the fusion of multi source data.
The originality of this study mainly consists of:1) A ternary link model is proposed to conceptualize the components and their relationships in the scientific society,which emphasizes the influence of knowledge on other elements. 2) The two research scenarios visualize the concept of social computing of science and provide ideas for the specific application of the proposed data organization and modeling method. 3) This study can build a bridge between the underlying data and computing technology,and provide directional guidance for using multi source data to comprehensively analyze the composition of scientific society. 4 figs. 1 tab. 40 refs.
中文摘要:
      科学系统的动态性与开放性使得针对系统内部海量多源数据的组织与建模难度不断增强,进而提升了科学系统研究的技术壁垒。为延伸科学社会学对科学系统的认知,本文强调科学知识对科学系统运行的影响,将科学视作一个以科学共同体为主体、知识流动为动力的社会系统,探索性地提出科学社会计算的概念,并探究面向科学社会计算的数据组织与建模方法。首先,通过构建三元链路模型,以厘清科学社会内部实体与数据构成,实现对科学社会的抽象化表示;然后,采用多源数据对科学社会中的实体进行数据化表征,并根据三元链路模型进行“实体—属性—关系—数据”的关联,实现科学社会数据建模;最后,以科学社会人际网络和知识扩散研究为例,阐释基于多源数据开展科学社会计算的过程与研究优势。本研究旨在在底层数据与计算技术之间搭建桥梁,为利用多源数据全景化剖析科学系统构成以及开展科学社会计算研究提供方向性的指导。图4。表1。参考文献40。
View Full Text   View/Add Comment  Download reader