SUN Tan, ZHANG Zhixiong, ZHOU Lihong, WANG Dongbo, ZHANG Hai, LI Baiyang, YONG Suhua, ZUO Wangmeng, YANG Guanglei
Accepted: 2024-01-20
" AI for Science " (AI4S) is a new scientific research paradigm that deeply integrates AI technology with scientific research to promote the discovery of new knowledge and the solution of scientific problems. As the application of AI4S in the natural sciences and humanities and social sciences advances, its development line, opportunities and challenges, needs and tasks, and ways of realization deserve further discussion. In order to advance AI4S research, promote scientific and technological (S&T) innovation and progress, and facilitate the effective strengthening of the discipline of information resources management, our journal has invited seven experts to organize this academic conversation on AI4S. 1) Supporting knowledge services for AI4S: In the current landscape of intelligent knowledge services, the requirements for supporting AI4S have increased, including the need for multi-level knowledge discovery and acquisition, cross-disciplinary research and innovation, and user-friendly participatory services. In addition, knowledge service scenarios are moving towards diversification, complexity, depth, specialization, and personalization in ubiquitous knowledge discovery, generative content services, and multi-round interactive service exploration. In response, professional science and technology information organizations need to reassess the role of knowledge services in the AI4Science environment and their significance in comprehensively supporting the S&T innovation process. This involves establishing a broad literature perspective, deepening full-text knowledge elements, balancing universal and specialized depth, autonomously developing core products, and deeply engaging with professional fields to support interdisciplinary innovation. 2) As a knowledge base for AI4S: In the development of AI4S, S&T literature serves as a high-quality corpus of great importance and utility. The Documentation and Intelligence Center of the Chinese Academy of Sciences has developed the concept and general framework for an AI4S knowledge base utilizing S&T literature. It is dedicated to building four types of knowledge bases to support intelligent services such as evidence-based retrieval, situational awareness, inference prediction, and insight generation required for AI4S applications. In addition, to advance the AI for Science knowledge base, it is essential to actively promote the construction of an intelligent data system, develop an AI engine for technical literature knowledge, conduct key technology research on in-depth mining and intelligent analysis of S&T literature, and promote collaboration with scientific research units across various fields, leading AI companies, and teams of field scientists. This approach aims to fully exploit the innovative and developmental value of the discipline of information resource management. 3) Powering AI4S with scientific data: Effective aggregation of scientific data is the foundation for unleashing the powerful capabilities of AI4S. This is essential for libraries to adapt their roles and functions in the AI era and is a crucial prerequisite for catalyzing the transformation of scientific research services, deepening scientific research support, and accelerating S&T innovation. Currently, libraries face various macro and meso challenges in effectively aggregating valuable scientific data to provide support for AI4S. To address these challenges, the following ways can be pursued: defining the roles and functions of libraries in scientific data management; promoting a conducive environment for scientific data management; establishing a collaborative network for scientific data management; and enhancing the service capacity of scientific data management. 4) AI4S and intelligent language modeling for classical literature: AI4S technology can be used to analyze documents and texts, enabling a faster and more comprehensive understanding of a vast amount of historical documents and cultural materials. The development of intelligent language modeling for classical literature represents a significant breakthrough in the field of ancient literature research, bringing new opportunities and challenges. With the increasing popularity of multimodal and generative GPT models in the context of AI4S, the intelligent language modeling of classical literature will focus on integrating diverse information, enhancing adaptability, improving knowledge representation, and addressing a wider range of application scenarios. 5) Library Digital Scholarly Services for AI4S: The concept of using LLM-based AI4S and AIGC to drive the development of smart libraries is consistent with the vision for digital scholarly services in libraries, and presents both opportunities and challenges. Given the trends towards AI4S platformization and the characteristics of "middle-end" digital scholarly service, as well as the longstanding tradition of libraries in serving scholarly research, the reengineering path for the library's digital scholarly services platform includes three approaches: building an AI4S service platform independently, purchasing and utilizing third-party AI4S platforms, and promoting embedded knowledge services as a component of scientific intelligence. This innovative approach addresses the dilemmas of financial resources, human resources, cognitive and practical gaps, and emphasizes the importance of user needs in the AI4S environment. It also focuses on knowledge organization and service delivery to meet user needs in the AI4S landscape. 6) Historical evolution and logical structure of the scientific intelligence paradigm (AI4S): AI4S is a scientific paradigm change dominated by the full application of AI technology to various disciplines, and its logical structure includes "data+model"-driven, knowledge ecology created by machine conjecture, and application scenarios expanded by algorithmic thinking. In the era of digital civilization, AI4S-driven scientific progress and social development must carry forward the value of science and technology for the good, effectively select the theoretical arguments and proposals for extending AI4S to the field of social sciences and humanities, and improve the series of mechanisms for integrating human decision-making and machine intelligence. 7) Development opportunities and prospects of AI4S in the era of generative AI: With the advances in generative AI, pre-training algorithms and large-scale pre-trained models have provided significant opportunities for AI4S in various disciplinary domains. These technologies have shown immense potential and value for applications in diverse fields such as industrial inspection, robotics, and medicine. Additionally, it is crucial to emphasize the importance of key factors such as the constraints of technical implementation conditions for large pre-trained models, the sustainability of data/computing resources, and the transparency, fairness, and accessibility of the technology.