Current location - Training Enrollment Network - Books and materials - What changes have big data brought to digital libraries?
What changes have big data brought to digital libraries?
Digital library faces challenges.

"Various types of data are growing rapidly and are developing towards massive data. The National Digital Library faces many challenges such as long-term preservation of digital resources, resource integration, information security and service innovation. " Wei Dawei said that by the end of 20 13, the total digital resources of the National Digital Library had reached 874.5TB, including 737.9TB of self-built digital resources, 45.7TB of network information collection, 273 outsourced Chinese and foreign databases and 290 million pieces of metadata collected by Jinwen Search. With the expansion of reader service to computer, digital TV, mobile phone, handheld reader, tablet computer, electronic touch screen and other service terminals, the service volume is increasing, and various business systems generate a lot of log data every day, which contains a lot of user behavior information. For example, the average daily log data generated by Aleph system is about 20GB, and the average daily log data generated by Jinwen search system is above 300G g g.

A very large metadata warehouse will be built.

Wei Dawei pointed out that in the face of the new environment and background, the National Library takes resource integration as the starting point to achieve a high degree of integration of traditional business and digital library business and maximize the service efficiency of the National Library.

He further emphasized that the integration of digital resources must be combined with the characteristics of big data and the status quo of resources, be user-oriented, learn from others' strengths, highlight characteristics, and be implemented in stages and in a planned way. Building a super-large metadata warehouse is one of the ideas of integrating resources in digital libraries in the future, so as to realize the unified aggregation and one-stop retrieval of resources, combine cloud services with related data, realize the organization and aggregation of digital collections, and build a "resource-user" relationship model. However, resource integration is also facing challenges in terms of capital, talents and technology.