Page 12 - JOURNAL OF LIBRARY SCIENCE IN CHINA 2018 Vol. 42
P. 12

SU Xinning / Opportunities and challenges faced by digital libraries in the big data era  011


               with other resources like literature resources and target resources, it will become a highly efficient
               resource of knowledge service.


               2.2  Application of digital library technology


               In the big data era, digital library technology has migrated from local data processing to data
               processing of a wider field. From the perspective of the technology system of digital libraries, it
               includes data acquisition, data processing, architecture construction, knowledge mining, analysis,
               prediction, result presentation, service technology, etc. How to apply and incorporate big data
               related technology into digital library field, is now a problem faced by the digital library field.
                 (1) Semantic technology
                 The big data environment needs semantic technology to help connect the large quantity of
               complicated data. Digital libraries should think about how to automatically fuse the semantics
               defined in dictionaries (subject headings, classification charts, etc.) into the related information of
               digital literatures. Of course it is not practical to label manually the semantic relations of such a
               large amount of data. With the aid of dictionaries, we must adopt artificial intelligence technology,
               ontology, semantic analysis technology, automatically label the semantic relations of data and
               make the data meaningful so as to promote the knowledge expansion and knowledge exploration
               of the collected resources.
                 The application of semantic technology in the field of book intelligence is not uncommon.
               Tools like book classification and Chinese Thesauri were created based on semantic relations. But
               these tools only help us define the semantic relations between literatures rather than the semantic
               relations among the knowledge points inside the literature or information which is required in a
               big data environment. The establishment of the semantic relations among knowledge points can
               facilitate users’ access to knowledge, and open a channel for them to obtain the useful knowledge
               through keywords and semantic relations.
                 (2) Data clustering technology
                 Clustering is a process which puts similar and related information or data together. It is one of
               the effective means of efficient utilization of multifarious and a huge amount of data. In digital
               libraries, clustered information resources can play a more significant role in information service,
               information analysis and information utilization. Clustering technology can not only gather
               literature information resources according to certain attributes or characteristics, but also be
               applied to user requirements, user search behavior processing and analysis.
                 In a big data environment, only a small portion of data is utilized. Clustering technology
               provides a good way of full usage of this small portion of data. For instance, the clustering of data
               resources can separate them into small data sets of close relations and similar themes, and these
               small data sets are perfect for problem oriented demand, as well as users’ retrieval, selection and
   7   8   9   10   11   12   13   14   15   16   17