Page 168 - Journal of Library Science in China, Vol.47, 2021
P. 168

NIU Li, GAO Chenxiang, ZHANG Yufeng, YAN Shi, XU Yongjun & LI Anrunze / Discovering, reorganizing   167
                                        and storytelling: paths and methods of archives research on the perspective of digital humanities

























                              Figure 5. Technical system of archival data research from DH perspective


               3.1 Archival data processing technology from the perspective of value preservation

               Archival data processing technology from the perspective of value preservation is used to support
               the “discovering” session in archival data research methodology. In the extraction of archival data
               elements, this session is divided into three main parts: “metadata annotation”, “object detection
               and extraction” and “context recognition”. At the level of “metadata annotation”, consideration
               should be given to embedding new electronic record metadata standards such as “signature”,
               “confirmation” and “format permanence” in the traditional archival metadata classification
                     [33]
               system , so that when digital archives are transformed into archival data from DH perspective,
               their source relationships and evidentiary characteristics are retained in the process of granular
               segmentation and entity reorganization, and they are treated as attributes of relevant archival
               resources.
                 The “Object Detection and Extraction” session focuses on the application of natural language
               processing and image recognition framework based on deep learning, extracting key entities such
               as characters, buildings, and time from standardized texts, images, and videos, and using objective
               algorithms combined with the form of manual recognition to extract entities and their relationships
               in archival resources, so as to supplement the semantic connections between entities on the basis of
               avoiding subjective bias and objective technical defects.
                 “Context Recognition” emphasizes the identification and association of elements such as
               archival creators, users, element structures, archival functions and activities, business scenarios
               and institutional functions. One of the essential differences between archival resources and other
               information resources is retaining context [34] . At present, the context identification process of
               archival resources still relies on the basic methods and skills of humanities research, which requires
   163   164   165   166   167   168   169   170   171   172   173