What is the core technology of artificial intelligence?
Whether artificial intelligence products can be applied successfully in everyday life depends on the underlying technology. The field is generally described as comprising seven key technologies: machine learning, knowledge graphs, natural language processing, human-computer interaction, computer vision, biometrics and virtual reality/augmented reality (VR/AR).

First, machine learning.

Machine learning is an interdisciplinary field that draws on statistics, system identification, approximation theory, neural networks, optimization theory, computer science, brain science and other disciplines. It studies how computers can simulate or realize human learning behavior in order to acquire new knowledge or skills, reorganize existing knowledge structures and continuously improve their own performance, and it is the core of artificial intelligence technology. Data-based machine learning is one of the important methods of modern intelligent technology: it starts from observed data (samples), looks for regularities, and uses those regularities to predict future or unobservable data. Machine learning can be classified in different ways, according to learning mode, learning method or algorithm.

According to the learning mode, machine learning can be divided into supervised learning, unsupervised learning and reinforcement learning.

According to learning methods, machine learning can be divided into traditional machine learning and deep learning.
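The idea of finding a regularity in observed samples and using it to predict unseen data can be illustrated with a very small supervised learning sketch. The sample values below are made up for illustration, and ordinary least squares is only one of many possible methods; the function names `fit_line` and `predict` are hypothetical.

```python
# Hypothetical observed samples: (input, output) pairs that follow y = 2x + 1.
samples = [(0.0, 1.0), (1.0, 3.0), (2.0, 5.0), (3.0, 7.0)]

def fit_line(points):
    """Learn y = slope * x + intercept from samples by ordinary least squares."""
    n = len(points)
    mean_x = sum(x for x, _ in points) / n
    mean_y = sum(y for _, y in points) / n
    cov_xy = sum((x - mean_x) * (y - mean_y) for x, y in points)
    var_x = sum((x - mean_x) ** 2 for x, _ in points)
    slope = cov_xy / var_x
    intercept = mean_y - slope * mean_x
    return slope, intercept

def predict(model, x):
    """Apply the learned regularity to an input that was never observed."""
    slope, intercept = model
    return slope * x + intercept

model = fit_line(samples)          # the "law" found in the data
prediction = predict(model, 10.0)  # a prediction for unobserved data
print(model, prediction)
```

Because each sample carries a known output, this counts as supervised learning; an unsupervised method would receive only the inputs.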

Second, knowledge graphs.

A knowledge graph is essentially a structured semantic knowledge base: a graph data structure composed of nodes and edges. It describes concepts in the physical world and the relationships among them in symbolic form. Its basic units are "entity-relationship-entity" triples and the "attribute-value" pairs attached to entities. Entities are connected to one another through relationships, forming a networked knowledge structure. Each node represents an entity in the real world, and each edge represents a relationship between entities. Put simply, a knowledge graph is a relational network that links many different kinds of information and makes it possible to analyze problems from the perspective of relationships.
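A minimal sketch of this structure, assuming a tiny in-memory graph: the entities, relationship names and attribute values are invented for illustration, and real systems typically use a dedicated graph store rather than Python lists.

```python
from collections import defaultdict

# Hypothetical "entity-relationship-entity" triples.
triples = [
    ("Alice", "works_at", "Acme"),
    ("Bob", "works_at", "Acme"),
    ("Acme", "located_in", "Berlin"),
]

# Hypothetical "attribute-value" pairs attached to entities.
attributes = {
    "Alice": {"age": 30},
    "Acme": {"founded": 1999},
}

# Index edges by head entity so relationships can be followed from a node.
outgoing = defaultdict(list)
for head, relation, tail in triples:
    outgoing[head].append((relation, tail))

def neighbors(entity, relation):
    """Return the entities reached from `entity` along `relation` edges."""
    return [tail for rel, tail in outgoing.get(entity, []) if rel == relation]

def heads_of(relation, tail):
    """Return the entities that point at `tail` along `relation` edges."""
    return [h for h, rel, t in triples if rel == relation and t == tail]

# A two-hop question answered purely by following relationships:
# in which city is Alice's employer located?
cities = [city
          for org in neighbors("Alice", "works_at")
          for city in neighbors(org, "located_in")]
colleagues = heads_of("works_at", "Acme")
print(cities, colleagues)
```

The two-hop query shows what analyzing a problem from the perspective of relationships means in practice: the answer is not stored on any single node but is found by traversing edges.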

Knowledge graphs can be used in public security applications such as anti-fraud, inconsistency verification and the detection of group fraud, which rely on data mining methods such as anomaly analysis, static analysis and dynamic analysis. They also offer clear advantages in search engines, visualization and precision marketing, and have become a popular tool in industry. Their development still faces major challenges, however, such as noisy data, meaning data that is itself wrong or redundant. As knowledge graphs are applied more deeply, a number of key technologies still await breakthroughs.

Third, natural language processing.

Natural language processing is an important direction within computer science and artificial intelligence. It studies the theories and methods that allow people and computers to communicate effectively in natural language. It spans many areas, chiefly machine translation, machine reading comprehension (treated here as semantic understanding) and question answering systems.

Machine translation

Machine translation refers to using computer technology to translate one natural language into another. Statistical machine translation overcame the limitations of earlier rule-based and example-based methods and greatly improved translation quality. Machine translation based on deep neural networks has been applied successfully in some scenarios, such as everyday spoken language, and has shown great potential. As context representation and knowledge-based logical reasoning improve and natural language knowledge graphs continue to expand, machine translation is expected to make further progress in multi-turn dialogue translation and the translation of full texts.

Semantic understanding

Semantic understanding refers to using computer technology to understand a text and answer questions about it. It emphasizes understanding of context and control over the accuracy of answers. After the release of the MCTest dataset, semantic understanding attracted more attention and developed rapidly, and related datasets and neural network models appeared one after another. The technology is expected to play an important role in areas such as intelligent customer service and automatic product question answering, and to further improve the accuracy of question answering and dialogue systems.

Question answering system

Question answering systems are divided into open-domain dialogue systems and domain-specific question answering systems. Question answering technology lets computers communicate with people in natural language, as people do with one another. A user submits a question expressed in natural language, and the system returns highly relevant answers. Many question answering products already exist, but most are deployed in practical information service systems and smartphone assistants, and the robustness of question answering systems remains a challenge.

Natural language processing faces four challenges:

First, there are uncertainties at different levels such as morphology, syntax, semantics, pragmatics and pronunciation;

Second, new words, new terms, new semantics and new grammar lead to the unpredictability of unknown language phenomena;

Third, data resources are insufficient to cover complex language phenomena;

Fourth, the fuzziness and complexity of semantic knowledge are difficult to describe with simple mathematical models, and semantic computation requires nonlinear computation over a very large number of parameters.

Fourth, human-computer interaction.

Human-computer interaction mainly studies the exchange of information between people and computers in both directions, from person to computer and from computer to person. It is an important peripheral technology in the field of artificial intelligence, and a comprehensive discipline closely related to cognitive psychology, ergonomics, multimedia technology and virtual reality technology. Traditionally, information exchange between people and computers has relied on interactive devices: input devices such as keyboards, mice, joysticks, data suits, eye trackers, position trackers, data gloves and pressure-sensitive pens, and output devices such as printers, plotters, monitors, head-mounted displays and speakers. Beyond traditional basic interaction and graphical interaction, human-computer interaction technology also includes voice interaction, emotional interaction, somatosensory interaction and brain-computer interaction.

Fifth, computer vision.

Computer vision is the science of using computers to imitate the human visual system, giving computers a human-like ability to extract, process, understand and analyze images and image sequences. Autonomous driving, robotics, intelligent healthcare and other fields all need computer vision to extract information from visual signals and process it. In recent years, with the development of deep learning, preprocessing, feature extraction and algorithmic processing have gradually merged into end-to-end artificial intelligence algorithms. According to the problem addressed, computer vision can be divided into five categories: computational imaging, image understanding, three-dimensional vision, dynamic vision, and video encoding and decoding.
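For contrast with end-to-end learning, the three classical stages can be sketched as separate hand-written steps. This is only a toy illustration: the 4x4 image, the gradient feature, the threshold and the function names are all assumptions, and a deep learning model would instead learn these stages jointly from data.

```python
# Hypothetical 4x4 grayscale image with pixel values in 0..255:
# dark on the left, bright on the right, so it contains a vertical edge.
image = [
    [0, 0, 255, 255],
    [0, 0, 255, 255],
    [0, 0, 255, 255],
    [0, 0, 255, 255],
]

def preprocess(img):
    """Stage 1, preprocessing: scale pixel values to the range 0..1."""
    return [[p / 255 for p in row] for row in img]

def extract_feature(img):
    """Stage 2, feature extraction: mean absolute difference between
    horizontally adjacent pixels (a crude measure of edge strength)."""
    diffs = [abs(row[i + 1] - row[i])
             for row in img
             for i in range(len(row) - 1)]
    return sum(diffs) / len(diffs)

def classify(feature, threshold=0.2):
    """Stage 3, decision: a hand-set rule; the threshold is an assumption."""
    return "edge" if feature > threshold else "flat"

label = classify(extract_feature(preprocess(image)))
print(label)
```

Each stage here is designed by hand and tuned separately, which is the part an end-to-end approach replaces with a single trained model.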

At present, computer vision technology is developing rapidly and has begun to reach industrial scale. Its future development mainly faces the following challenges:

First, how to integrate better with other technologies in different application fields. Computer vision can draw on large amounts of data to solve certain problems, where it has gradually matured and can even exceed human performance, but on other problems it still cannot reach high accuracy;

Second, how to reduce the development time and labor cost of computer vision algorithms. At present these algorithms need large amounts of data and manual annotation, and a long development cycle is needed to reach the accuracy and speed that an application field requires;

Third, how to speed up the design and development of new algorithms. As new imaging hardware and artificial intelligence chips emerge, designing and developing computer vision algorithms for different chips and data acquisition devices is a further challenge.

Sixth, biometrics.

Biometric identification refers to identifying and authenticating individuals through their physiological or behavioral characteristics. In terms of workflow, it is usually divided into two stages: registration and identification. In the registration stage, sensors collect biometric information from the human body: image sensors capture optical information such as fingerprints and faces, and microphones capture acoustic information such as voice. The collected data is then preprocessed and passed through feature extraction, and the resulting features are stored.

In the identification stage, information is collected in the same way as during registration, the data is preprocessed, and features are extracted for the person to be identified. The extracted features are then compared with the stored features to complete identification. In terms of application tasks, biometrics is generally divided into identification and confirmation (verification). Identification is the process of determining a person's identity by searching the stored database, which is a one-to-many problem. Confirmation is the process of comparing the person's information with that of one specific person in the database to decide whether the identities match, which is a one-to-one problem.
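The two stages and the two tasks can be sketched as follows, assuming that feature extraction has already turned each sample into a numeric vector. The names, vectors, cosine-similarity measure and the 0.9 threshold are all illustrative assumptions, not a description of any particular product.

```python
import math

def cosine_similarity(a, b):
    """Similarity between two feature vectors (1.0 means same direction)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

# Registration stage: store one feature vector (template) per person.
templates = {}

def register(person_id, features):
    templates[person_id] = features

def verify(person_id, features, threshold=0.9):
    """Confirmation (one-to-one): compare against one claimed identity."""
    return cosine_similarity(templates[person_id], features) >= threshold

def identify(features, threshold=0.9):
    """Identification (one-to-many): search every stored template and
    return the best match above the threshold, or None if there is none."""
    best_id, best_score = None, threshold
    for person_id, stored in templates.items():
        score = cosine_similarity(stored, features)
        if score >= best_score:
            best_id, best_score = person_id, score
    return best_id

register("alice", [0.9, 0.1, 0.3])
register("bob", [0.1, 0.8, 0.5])

probe = [0.88, 0.12, 0.31]  # a fresh sample that resembles alice's template
print(identify(probe), verify("alice", probe), verify("bob", probe))
```

Identification has to scan the whole database, so its cost and error rate grow with the number of enrolled people, whereas confirmation involves a single comparison.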

Biometrics covers a wide range of traits, including fingerprints, palm prints, faces, irises, finger veins, voiceprints and gait, and the recognition process involves technologies such as image processing, computer vision, speech recognition and machine learning. As an important intelligent identity authentication technology, biometric identification is now widely used in finance, public security, education, transportation and other fields.

Seventh, virtual reality/augmented reality.

Virtual reality (VR) and augmented reality (AR) are new audio-visual technologies with the computer at their core. Combined with related science and technology, they generate, within a certain range, a digital environment that closely resembles the real environment in sight, sound and touch. Users interact with objects in the digital environment through the necessary equipment and obtain sensations and experiences close to those of the real environment. This is achieved through display devices, tracking and positioning devices, haptic interaction devices, data acquisition devices, special-purpose chips and so on.

Virtual reality/augmented reality can be divided into five areas according to processing stage: acquisition and modeling technology, analysis and utilization technology, exchange and distribution technology, display and interaction technology, and technical standards and evaluation systems. Acquisition and modeling technology studies how to digitize and model the physical world or human creativity; its difficulty lies in digitizing and modeling the three-dimensional physical world. Analysis and utilization technology focuses on methods for analyzing, understanding, searching and adding intelligence to digital content; its difficulty lies in the semantic representation and analysis of content. Exchange and distribution technology emphasizes the circulation, conversion and integration of large-scale digital content across network environments, together with personalized services for different end users; its core is open content exchange and copyright management. Display and interaction technology focuses on display technologies and interaction methods that suit human habits, so as to improve people's ability to comprehend complex information; its difficulty lies in building a natural and harmonious human-computer interaction environment. The standards and evaluation system focuses on standards for basic virtual reality/augmented reality resources, content cataloging and source coding, and on the corresponding evaluation techniques.

At present, the challenges facing virtual reality/augmented reality fall mainly into four areas: intelligent acquisition, ubiquitous devices, free interaction and perceptual integration. A series of scientific and technical problems remain in hardware platforms and devices, core chips and components, software platforms and tools, and related standards and specifications. Overall, virtual reality/augmented reality is trending toward intelligent virtual reality systems, seamless integration of virtual and real objects and environments, and natural, comfortable interaction.