Introduction

Artificial intelligence (AI) has been called a revolutionary tool for science1,2 and it has been predicted to play a creative role in research in the future3. In the context of theoretical chemistry, for example, it is believed that AI can help solve problems “in a way such that the human cannot distinguish between this [AI] and communicating with a human expert”4. However, this excitement has not been shared by all scientists. Some have questioned whether advanced computational approaches can go beyond ‘numerics’5,6,7,8,9 and contribute at a fundamental level to gaining new scientific understanding10,11,12.

In this Perspective, we discuss how advanced computational systems, and AI in particular, can contribute to scientific understanding: we overview what is currently possible and what might lie ahead. In addition to the review of the literature, we surveyed dozens of scientists working at the interface of biology, chemistry or physics on the one hand, and AI and advanced computational methods on the other. These personal narratives (see Supplementary Information) focus on the concrete discovery process of ideas and are a vital augmentation of the scientific literature. We discuss the literature overview and personal accounts in the context of the philosophical theory of scientific understanding recently developed by Dennis Dieks and Henk de Regt12,13. We then identify three fundamental dimensions for AI contributing to new scientific understanding (Fig. 1). (We encapsulate all advanced artificial computational systems under the term AI, independent of their working principles. In this way, we are focusing on the operational objective rather than the methodology.) First, AI can act as an instrument revealing properties of a physical system that are otherwise difficult or even impossible to probe. Humans then lift these insights to scientific understanding. Second, AI can act as a source of inspiration for new concepts and ideas that are subsequently understood and generalized by human scientists. Third, AI can act as an agent of understanding, reaching new scientific insights and, importantly, transferring them to human researchers. Although there have not yet been any examples of AI acting as a true ‘agent of understanding’ in science, we outline important characteristics of such a system and discuss possible ways to achieve it.

In the first two dimensions, the AI enables humans to gain new scientific understanding, whereas in the last, the machine gains understanding itself. Distinguishing between these classes allows us to map out a vibrant and mostly unexplored field of research, and will hopefully help guide future AI developments in the natural sciences.

Fig. 1: The three dimensions of computer-assisted scientific understanding.

The current state-of-the-art computational microscopes could be developed further with more complex systems, which could be simulated thanks to advances in algorithms and hardware, and with more advanced data representations (left-hand panel). As resources of inspiration, computational systems can help the human scientist by identifying surprises in data (a), identifying surprises in the scientific literature (b), finding surprising concepts by inspecting models (c), probing the behaviour of artificial agents (d) or by extracting new concepts from interpretable solutions (e). The scientific understanding test discussed in the main text is illustrated in the right-hand panel.

The focus of this Perspective is how advanced computational systems, and AI specifically, can contribute to new scientific understanding. There are many related, interesting topics that we cannot cover here. For example, we will not discuss the relationship between scientific understanding and cognitive science, but refer the reader to a good overview14. Furthermore, we will only discuss ‘understanding’ in the context of the natural sciences, in which we can use concrete criteria from the philosophy of science and, therefore, will not touch on ‘understanding’ in a broader context (such as understanding by babies and animals, language understanding in AI and related topics). Many other works contribute to related questions and should be mentioned here. One important field of research in AI is explainable AI, which aims to interpret and explain how advanced AI algorithms come up with their solutions; see, for instance, refs.15,16,17,18. Although interpreting the internal workings of an AI is neither necessary nor, we believe, sufficient for gaining new scientific understanding, many of these tools and techniques can be very useful. We will briefly explain them below with concrete examples in the natural sciences. AI pioneer Donald Michie classified machine learning (ML) into three classes: weak, strong and ultrastrong, in which ultrastrong requires the machine to teach the human19. Ultrastrong ML is related to the idea of an agent of understanding, which we define and discuss in more detail below. A very useful and comprehensible collection of computational and AI methods for science can be found in ref.20. Different levels of automation in the design of molecules are described in ref.21, with a final step involving a computer that chooses the initial ideas. Other works have investigated what full automation might look like based on specific scientific methodologies, leading to the idea of the ‘Nobel Turing Challenge’22, the development of an AI system capable of making Nobel-prize-level scientific discoveries. We note that our take purposefully does not depend on any specific scientific method (to avoid problems at a foundational level23). Rather, we focus on ‘scientific understanding’, how scientists can attain it and how advanced AI can help humans gain new scientific understanding.

Scientific understanding

Imagine an oracle providing non-trivial predictions that are always true. Although such a hypothetical system would have a significant scientific impact, scientists would not be satisfied. They would want “to be able to grasp how the predictions are generated, and to develop a feeling for the consequences in concrete situations”13. Colloquially, we refer to this goal as ‘understanding’, but what does this really mean? To find criteria for scientific understanding, we seek guidance from the philosophy of science. Although hardly any scientist would argue against ‘understanding’ as a fundamental aim of science (along with explanation, description and prediction24), this view has not always been accepted by philosophers. Carl Hempel, who made foundational contributions clarifying the meaning of ‘scientific explanation’, argued that ‘understanding’ is subjective and merely a psychological by-product of scientific activity and is, therefore, not relevant for the philosophy of science25. Numerous philosophers criticized this conclusion and tried to formalize what ‘scientific understanding’ actually means. These proposals connect ‘understanding’ to the ability to build causal models (for example, Lord Kelvin said “It seems to me that the test of ‘Do we or not understand a particular subject in physics?’ is, ‘Can we make a mechanical model of it?’”13), to the ability to provide visualizations (or Anschaulichkeit, as its strong proponent Erwin Schrödinger called it26,27) or to the unification of ideas28,29.

More recently, Henk de Regt and Dennis Dieks have developed a new theory of scientific understanding that is both contextual and pragmatic12,13,24. They found that techniques such as visualization or unification are ‘tools for understanding’, thereby connecting the previous ideas in one general framework. Their theory is agnostic to the specific ‘tool’ being used, making it particularly useful for application in a variety of scientific disciplines. de Regt and Dieks extended Werner Heisenberg’s insights30; rather than merely introducing theoretical or hypothetical ideas, their main motivation is that a “satisfactory conception of scientific understanding should reflect the actual (contemporary and historical) practice of Science”. Simply put, they argue that: “A phenomenon P can be understood if there exists an intelligible theory T of P such that scientists can recognise qualitatively characteristic consequences of T without performing exact calculations”12,13. de Regt and Dieks defined two interlinked criteria:

  • Criterion of understanding phenomena: a phenomenon P can be understood if a theory T of P exists that is intelligible.

  • Criterion for the intelligibility of theories: a scientific theory T is intelligible for scientists (in context C) if they can recognise qualitatively characteristic consequences of T without performing exact calculations.

We use this specific theory because it allows us to ‘experimentally’ evaluate whether scientists have ‘understood’ new concepts or ideas: rather than inspecting their methodology, we can simply look at the scientific outcomes and their consequences. This approach also coincides with Angela Potochnik’s argument that “understanding requires successful mastery, in some sense, of the target of understanding”11.

Scientific discovery versus scientific understanding

Scientific understanding and scientific discovery are both important aims in science. The two are distinct in the sense that scientific discovery is possible without new scientific understanding (we use the precise terminology in refs.12,13).

Let us examine three examples. First, to design new efficient molecules for organic laser diodes, a search space of 1.6 million molecules was explored using ML and quantum chemistry insights31. The top candidates were experimentally synthesized and investigated. Thereby, the authors of this study discovered new molecules with very high quantum efficiency. Whereas these discoveries could have important technological consequences, the results do not provide new scientific understanding: from the results per se, one cannot derive qualitative consequences without performing further detailed computations. Second, the recent ML-enabled breakthrough in protein folding32,33 will undoubtedly change the landscape of biochemistry. However, so far, AlphaFold has been a black box: an oracle and, as such, it does not directly provide new scientific understanding in the sense of de Regt and Dieks. Third, many discoveries in physics occur before (sometimes long before) a theory or explanation that provides scientific understanding is uncovered. Examples include the discovery of superconductivity (and its high-temperature version), the discovery of the cosmic microwave background, neutrino oscillations and the discovery of a zoo of particles before the invention of the quark model.

These examples show that scientific discoveries can lead to scientific and technological disruption without directly contributing to scientific understanding11,24.

Over the past few years, scientists working at the interface between AI and the natural sciences have been trying to rediscover physical laws or concepts with machines. Examples include the heliocentric world view34, the arrow of time35 or mechanical equations of motion36,37. These applications are good benchmarks to show that the algorithms work in principle. The question remains, however, whether an AI that can rediscover physical laws and concepts would also be capable of contributing to new scientific understanding. We believe that this is not guaranteed. The human creators of these AI systems know what they are looking for in these case studies. Therefore, it is unclear how both conscious and unconscious biases (in the broadest sense, for example, by choosing particular representations) in the code or the data can be prevented. Consequently, even if an algorithm can rediscover interesting physical phenomena, it is unclear whether and how it can be used to advance science by uncovering new scientific ideas. To go beyond rediscovery tasks, one needs to focus explicitly on the question of how to get ‘new’ scientific understanding.

Loosely speaking, gaining new understanding from advanced computational systems means uncovering new ideas, principles, concepts or even theories that scientists can apply and use in different situations without (complete) computations. In the next sections, we will outline how this could be done, what previous approaches have achieved and how we can go further.

Three dimensions of computer-assisted understanding

As alluded to in the introduction, we surveyed the scientific literature and collected personal anecdotes from dozens of scientists. Within the framework provided by the philosophy of science, we then introduce a new classification of AI’s contributions to scientific understanding that helps map out different directions of future investigation. We call these classes ‘dimensions’, as they are independent and non-exclusive.

An AI system can contribute to new scientific understanding in three ways. First, as a ‘computational microscope’, it can provide information not (yet) attainable through experimental means. Second, as a ‘resource of inspiration’ or artificial muse, it can expand the scope of human imagination and creativity. In these two dimensions, the human scientist is essential: they identify and refine the new insight or inspiration and develop it into full understanding. In the former, the machine creates new data (potentially represented in advanced ways) from which the human scientist extracts new understanding. In the latter, the machine explicitly looks for surprising or interesting new ideas or unexpected connections, which it presents to the human scientist, who uses them to reach new scientific understanding. These two dimensions could exist even without advanced computational systems or AI; however, AI can significantly boost and extend them.

The third dimension is that of AI as an ‘agent of understanding’, replacing the human in generalizing observations and transferring these new scientific concepts to different phenomena, and — importantly — conveying these insights to human scientists.

As the focus of this Perspective is AI-assisted new scientific understanding, we stress that such an agent of understanding must be able to transfer its insights to humans. We will briefly discuss the situation of an agent that cannot convey its insights.

The three dimensions described above should not be understood dogmatically but, rather, provide a framework to guide future directions. In the following sections, based on concrete examples, we discuss each dimension in more detail and propose avenues for pushing the boundaries of what is currently possible.

Computational microscope

Microscopes are perhaps the best known type of instrument that allows the investigation of objects and phenomena invisible to the naked eye. Similarly, computational microscopes enable the investigation of objects or processes that cannot be visualized or probed in any other way, for example, biological, chemical or physical processes that happen at length and time scales not accessible in experiments.
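To make this concrete, the following is a minimal sketch of the simulation engine at the heart of such computational microscopes: velocity-Verlet molecular dynamics for two Lennard-Jones particles in one dimension, in reduced units. All choices here (particle count, time step, initial conditions) are illustrative; production codes add realistic force fields, thermostats, neighbour lists and massive parallelism.

```python
# Minimal 'computational microscope': velocity-Verlet molecular dynamics of
# two Lennard-Jones particles in 1D, reduced units (epsilon = sigma = mass = 1).
import numpy as np

def lj_force(r):
    """Lennard-Jones force between two particles at separation r > 0."""
    return 24.0 * (2.0 / r**13 - 1.0 / r**7)

dt = 1e-3                         # illustrative time step
x = np.array([0.0, 1.5])          # positions: start away from equilibrium
v = np.array([0.0, 0.0])          # velocities

for _ in range(10_000):
    f = lj_force(x[1] - x[0])
    a = np.array([-f, f])         # equal and opposite forces, unit masses
    v += 0.5 * dt * a             # velocity Verlet: first half kick
    x += dt * v                   # drift
    f = lj_force(x[1] - x[0])
    a = np.array([-f, f])
    v += 0.5 * dt * a             # second half kick

# The pair oscillates around the potential minimum at r = 2**(1/6) ~ 1.12.
print("final separation:", x[1] - x[0])
```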

In the context of ‘understanding’, the new data generated by a computational microscope need to be generalized to other contexts without complete computation13. We illustrate this with two concrete examples.

The first example concerns molecular dynamics simulations of SARS-CoV-2. The authors of ref.38 uncovered new biological functions of glycans, which show different behaviours in the open and closed conformations of the spike protein. This explanation changed the view of glycans in biological systems and inspired new ways to analyse such systems without the need to perform full computations.

In the second example, the authors of ref.39 described how molecular dynamics simulations helped them uncover fundamental patterns called glycoblocks. Glycoblocks can be used systematically both to understand the sequence–structure–property relationships of biomolecules and to inform the design of synthetic structures with desired functions, without the need to simulate the entire system.

Can computational microscopes be improved further? We discuss two vibrant directions. First, more advanced computational systems will allow the analysis of increasingly complex physical systems. Second, representing the information in more interpretable ways will facilitate the extraction of scientific insight.

Increasingly complex systems

An obvious, but nevertheless important, research direction is increasing the complexity and the accuracy of computer simulations40. For example, increasing the size of the simulated systems, the timescale of the simulations and the number of interactions that can be modelled will significantly enhance the ability to study complex dynamic systems. In general, such advances can be achieved by algorithmic or hardware improvements, or both. In that regard, we expect that AI technologies together with advanced hardware such as GPUs, TPUs or even OPUs41,42 will have an enormous impact. Furthermore, the progress in experimental quantum computing for quantum chemistry43 and physics44,45,46 promises that entirely new algorithms, based on quantum mechanics, will play an important role in this area47,48. Algorithmic improvements could involve adaptive and intelligent resolution during the simulation and advanced visualization methods13, which directly leads to the second direction of future development.

Advanced data representation

In the first dimension of computer-assisted understanding, the human scientist is supposed to generalize the new data from the computational microscope. Therefore, we believe that advances in data representation could significantly help humans to grasp the underlying structures and facilitate new scientific understanding. Scientists currently analyse data predominantly in (potentially animated) 2D graphical representations. We believe that genuinely 3D environments (realized via virtual or augmented reality glasses or holography) will significantly aid the understanding of complex systems or complex data. Initial advances in that regard have been demonstrated in chemistry49,50,51 and astrophysics52, and we expect this to become a standard tool for scientists. Furthermore, the time dimension could be used to represent even more structured data, for example, via (3D) videos. Alternatively, sound could be used as an additional data dimension, as the human auditory sense is excellent at detecting structure or symmetries in (periodic) time-dependent data. This opportunity has been explored in high-energy physics53 and in dozens of projects in astronomy54. A powerful algorithm might be able to identify symmetries in the underlying data and project them into 3D video with sound, which might help the human to recognize and subsequently understand new properties within the data generated by the computational microscope.
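As a minimal illustration of the sonification idea, the sketch below maps a slowly varying signal onto audible pitch and writes a WAV file using only NumPy and the Python standard library. The signal and the frequency mapping are invented for illustration; real projects, such as those in astronomy54, use far richer mappings.

```python
# Minimal data sonification: map a time series onto pitch so that
# periodicities in the data become audible as periodic pitch changes.
import wave
import numpy as np

rate = 44100
t = np.linspace(0.0, 4.0, 4 * rate)            # 4 seconds of audio
signal = np.sin(2 * np.pi * 0.5 * t)           # stand-in 'scientific' signal
freq = 440.0 + 220.0 * signal                  # map values to 220-660 Hz
phase = 2 * np.pi * np.cumsum(freq) / rate     # integrate frequency to phase
audio = (0.3 * np.sin(phase) * 32767).astype(np.int16)

with wave.open("sonified.wav", "wb") as f:     # mono, 16-bit WAV output
    f.setnchannels(1)
    f.setsampwidth(2)
    f.setframerate(rate)
    f.writeframes(audio.tobytes())
```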

Resource of inspiration

Surprising and creative ideas are the foundation of advances in science. Computer algorithms can inspire such ideas systematically, thereby significantly accelerating scientific and technological progress. Already 70 years ago, Alan Turing noted that “Machines take me by surprise with great frequency”55. A more recent study56 collected stories from dozens of researchers of artificial life and evolution. These anecdotes showcase how computer algorithms can “produce surprising and creative solutions”. Accordingly, we believe that AI could act as an artificial muse of science in a metaphorical sense. Next, we outline a number of ways in which computer algorithms can provide a source of inspiration for new scientific ideas.

Identifying surprises in data

Exceptional data points or unexpected regularities obtained from experiments or simulations can trigger new ideas and concepts. Our survey shows that these exceptional points are usually identified by humans, such as in the following two examples, which use high-throughput computations in chemistry57 and quantum optics58,59.

The first example deals with an unexpected phase of crystal structures in high-pressure physics. In ref.60, the authors found an unexpected stable configuration of alternating NH2 and NH4 layers, rather than a dense NH3 phase. They conceptualized this phenomenon as spontaneous ionization, a common process in acid–base chemistry; it is now a widely accepted feature of the high-pressure phase diagram of NH3. Spontaneous ionization in the high-pressure behaviour of matter has become a more general principle that can be used without performing any simulations.

In the second example, a search for new quantum experiments uncovered a solution with considerably larger quantum entanglement than expected58. The authors of the study understood the underlying principles and thereby discovered a new concept of entanglement generation61,62. The principle can be used without any computation and, for example, now acts as a new representation in more advanced AI systems for quantum physics63, demonstrating the application of the computer-inspired idea in different, more general contexts.

In contrast to these examples, data anomalies can manifest themselves in a more involved combination of variables, which might be very difficult for humans to grasp. Accordingly, applying advanced statistical methods and ML algorithms (for example, ref.64) to this type of problem will be an important future research direction. Exciting work on autonomous anomaly detection has been applied to scientific data from the Large Hadron Collider (LHC) at CERN65,66,67 (see a recent review on this topic68). Such techniques have the potential to identify new physics signatures, which can then be conceptualized and understood by human physicists69,70. An interesting technique for detecting outliers, previously used for the discovery of quantum phases, is leaving out one (or a number) of training examples and observing the consequences for the neural network71. The introduction of influence functions, via the computation of Hessians, is one computationally efficient way to identify the impact of individual training examples72. Neural networks that autonomously discover symmetries could become an efficient discovery tool for outliers in scientific data in which the underlying rules might not be known beforehand73,74.
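As a minimal sketch of the reconstruction-error idea behind several of the LHC studies cited above, the following trains a small autoencoder on ‘background’ data and flags events it reconstructs poorly. The data, network size and threshold are purely illustrative stand-ins.

```python
# Minimal anomaly detection: an autoencoder trained only on 'background'
# data reconstructs anomalous points poorly, so reconstruction error serves
# as an anomaly score. Synthetic data; architecture and cut are illustrative.
import numpy as np
import torch
import torch.nn as nn

rng = np.random.default_rng(0)
background = rng.normal(0.0, 1.0, size=(5000, 8)).astype(np.float32)
anomalies = rng.normal(4.0, 1.0, size=(50, 8)).astype(np.float32)

model = nn.Sequential(nn.Linear(8, 2), nn.Tanh(), nn.Linear(2, 8))  # bottleneck
opt = torch.optim.Adam(model.parameters(), lr=1e-3)
x = torch.from_numpy(background)
for _ in range(200):                      # train to reconstruct background only
    opt.zero_grad()
    loss = ((model(x) - x) ** 2).mean()
    loss.backward()
    opt.step()

with torch.no_grad():                     # score background samples + anomalies
    test = torch.from_numpy(np.vstack([background[:100], anomalies]))
    err = ((model(test) - test) ** 2).mean(dim=1)

threshold = err[:100].mean() + 5 * err[:100].std()  # crude background-based cut
print("flagged as anomalous:", int((err > threshold).sum()))
```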

Estimating the confidence of predictions is another method to directly search for anomalies in data75. The ability to uncover hidden regularities was demonstrated in mathematics, in which an AI hinted at relations between previously unconnected invariants in knot theory, allowing mathematicians to conjecture and prove new theorems76. Alternatively, an AI capable of constructing new scientific hypotheses could uncover outliers or unexpected patterns that are not discernible with standard statistical methods.

It would be truly exciting to see an AI uncover hidden patterns or irregularities in scientific data previously overlooked by humans, which could lead to new ideas and, ultimately, to new conceptual understanding. As of now, we are not aware of such cases.

Finally, we note that the data points for these analyses could be obtained from computational methods (including those described in the previous section), with exciting opportunities for mathematics or theoretical physics77. Alternatively, the data could be obtained directly from experiments. Here, we can imagine a closed-loop approach, in which an algorithm explores the environment and steers the exploration into unexpected regions. If the data source is an experiment, a future AI system will require access to complex lab automation with large parameter spaces to explore, as demonstrated in biology78, chemistry79,80,81,82,83,84 or physics85,86.

Identifying surprises in the scientific literature

The number of scientific papers in essentially every scientific domain is growing enormously87,88. Consequently, researchers have to specialize in narrow subdisciplines, which makes finding new interdisciplinary ideas difficult. In the future, we believe that computers will be able to use the scientific literature in an automated way89,90,91,92 and identify exceptional and surprising phenomena for further investigation. Although the large-scale automated analysis of the scientific literature has, to our knowledge, not yet induced new scientific understanding, there is significant progress in the field. One promising approach towards this goal is unsupervised word embedding of a large corpus of scientific papers. In this technique, the content of the scientific literature is transformed into a high-dimensional vector space. The approach has been used in the domain of materials science93, where it rediscovered central scientific concepts such as the periodic table of the elements. Additionally, the results suggested the existence of previously undiscovered structure–property relationships, including new candidates for thermoelectric materials. Moreover, several other advanced computational techniques are being developed in materials science to extract knowledge from the scientific literature and to investigate it systematically with AI technologies94; these can lead to complex scientific conclusions, as demonstrated, for instance, for the relations between different crystal structures95.
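The core of the word-embedding approach can be sketched in a few lines, assuming a gensim-style word2vec model; the three mock ‘abstracts’ below stand in for the millions of real abstracts used in ref.93, and all tokens and hyperparameters are illustrative.

```python
# Minimal sketch of unsupervised word embeddings over paper abstracts.
# Materials whose vectors land near 'thermoelectric' become candidate
# thermoelectrics, even if no paper states the connection explicitly.
from gensim.models import Word2Vec

abstracts = [
    "bi2te3 is a promising thermoelectric material with low thermal conductivity",
    "we report the band gap and carrier mobility of the chalcogenide bi2te3",
    "pbte shows a high thermoelectric figure of merit at elevated temperature",
]
corpus = [a.split() for a in abstracts]    # naive tokenization, for the sketch

model = Word2Vec(corpus, vector_size=64, window=5, min_count=1, epochs=50, seed=0)
print(model.wv.most_similar("thermoelectric", topn=5))
```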

An alternative approach aims to build semantic knowledge networks from large bodies of scientific literature. In these networks, scientific concepts are nodes and edges carry relational information; in the simplest case, an edge indicates that two scientific concepts are mentioned in the same scientific paper96,97. Thus, scientific knowledge is represented as an evolving network, which can be used to identify both islands and unexplored regions of the scientific literature. This type of network was used in biochemistry to identify efficient global research strategies96 and in quantum physics to predict and suggest future research directions97. Advances in AI technology could improve this type of system significantly. For example, natural language processing architectures such as BERT98 or GPT-3 (ref.99) could help extract more scientific knowledge from research papers, and large graph-based neural networks could improve the prediction of new research topics from semantic networks100.
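A minimal sketch of such a semantic network, with invented papers and concepts, might look as follows: concepts become nodes, co-occurrence within a paper becomes a weighted edge, and concept pairs without an edge mark candidate unexplored regions of the literature.

```python
# Minimal semantic knowledge network: nodes are scientific concepts,
# edges count how many (invented) papers mention both concepts together.
import itertools
import networkx as nx

papers = [
    {"entanglement", "photon", "interferometer"},
    {"entanglement", "teleportation"},
    {"photon", "single-photon source"},
]

G = nx.Graph()
for concepts in papers:
    for u, v in itertools.combinations(sorted(concepts), 2):
        if G.has_edge(u, v):
            G[u][v]["weight"] += 1
        else:
            G.add_edge(u, v, weight=1)

# Concept pairs that never co-occur are candidate 'unexplored regions'.
unexplored = [(u, v) for u, v in itertools.combinations(sorted(G.nodes), 2)
              if not G.has_edge(u, v)]
print(unexplored)
```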

Surprising concepts by inspecting models

We also expect considerable progress from rationalizing what AI algorithms have learned in order to solve a specific problem, in other words, explainable or interpretable AI101,102,103. One idea towards this goal is inspired by DeepDreaming, a method first used in computer vision104,105. Put simply, the idea is to invert a neural network and probe its behaviour. This approach has been applied to rediscover thermodynamical properties35 and design principles for functional molecules106. An alternative and remarkable application is the ‘disentanglement of variables’ in neural networks107. The goal is to understand the internal representation that the neural network has learned. Astronomical data, represented in geocentric coordinates, were used to train a neural network, and the disentanglement of variables enabled the rediscovery of heliocentric coordinates from the internal representation of the model34. Symmetries, or their conserved quantities, can also be autonomously extracted using a pair of neural networks108,109. The pair is trained to identify whether two different physical situations are equivalent under some unknown symmetry. The final layer is an information bottleneck consisting of only a single neuron. In that way, the neural networks learn to identify conservation properties and compress the entire information into the output of one neuron. The output can then be easily compared and, moreover, readily interpreted by the human researcher. The authors of ref.109 show how this idea can be used to rediscover conserved properties in classical mechanics (energy and angular momentum) or electromagnetism (such as the Lorentz invariants). In a related study using gradient boosting with decision trees, feature importance was used to explain properties of molecules and quantum optics circuits110. Relatedly, the internal representation of an unsupervised deep generative model for quantum experiments has been inspected to understand the model’s internal worldview111. In chemistry, counterfactual explanations for ML models have been shown to produce the rationale behind a model’s prediction. Counterfactual explanations illustrate what differences in an event or instance would generate a change in an outcome. Reference112 showed how this can be achieved in a model-independent way (demonstrated for random forests, sequence models and graph neural networks), indicating great future potential for opening the black box of AI in science. Although not in science, a study has investigated what the chess-playing AI AlphaZero has learned about chess and how human-like knowledge is encoded in its internal representation113. The concepts rediscovered in all of these works were not new; thus, the most important challenge for the future is to learn how to extract previously unknown concepts.
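As a minimal sketch of the ‘dreaming’ inversion just described, assume a trained neural-network property predictor (here an untrained stand-in): one freezes its weights and performs gradient ascent on the input instead, then inspects where the input drifts.

```python
# Minimal 'DeepDreaming' for inverse design (cf. refs.35,106): optimize the
# *input* of a frozen property predictor to maximize the predicted property.
# The predictor is an untrained stand-in; real studies train it on data first.
import torch
import torch.nn as nn

predictor = nn.Sequential(nn.Linear(8, 32), nn.ReLU(), nn.Linear(32, 1))
for p in predictor.parameters():
    p.requires_grad_(False)            # freeze weights: only the input moves

x = torch.zeros(1, 8, requires_grad=True)   # initial 'design' vector
opt = torch.optim.Adam([x], lr=0.05)
for _ in range(100):
    opt.zero_grad()
    loss = -predictor(x).sum()         # ascend the predicted property
    loss.backward()
    opt.step()

# How x drifts reveals what the model 'believes' drives the property, which
# is the raw material for extracting a human-readable design rule.
print(x.detach())
```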

New concepts from interpretable solutions

Rather than drawing inspiration from the internals of AI algorithms, scientists can also be surprised by the solutions these algorithms provide. When solutions are represented in an interpretable way, they can lead to new ideas and concepts. One example of an interpretable representation is a mathematical formula: scientists can directly inspect formulae derived by computer algorithms to solve mathematical problems and derive more general solution strategies from them. Several studies have demonstrated the extraction of symbolic models from experimental data of mechanical systems36,114 and quantum systems115, and in astronomy116; see also ref.117. It will be exciting to see how these approaches, for example, combined with methods such as causal inference118, can be improved to propose reasonable physical models of unknown systems that advance scientific understanding. Altogether, exciting advances have been achieved in the field of mathematics119,120, and we foresee similar approaches making a significant impact in the physical sciences.

One recent, concrete example in astronomy is the rediscovery of Newton’s law of gravitation from real-world observational data of planets and moons in our Solar System collected over the last 30 years (ref.37). The application of graph neural networks allowed for high-quality predictions of the objects’ motion. Furthermore, a symbolic regression technique called PySR (introduced in ref.116) was able to extract reasonable mathematical expressions for the learned behaviour. Interestingly, besides the equations of motion, the method simultaneously predicted the masses of the planetary objects correctly. The technique required the assumption of several symmetries and other physical laws. It remains to be seen whether these prerequisites can be reduced further and how related approaches can be applied to questions in modern physics.
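To give a flavour of such a symbolic regression run, here is a hedged, minimal sketch using the PySR package, recovering an inverse-square law from synthetic, noiseless data. The hyperparameters are illustrative, and the actual planetary-motion study37 required learned accelerations, assumed symmetries and far more care.

```python
# Minimal symbolic regression with PySR: recover F(r) ~ 5 / r**2 from
# synthetic data. Operator set and iteration count are illustrative.
import numpy as np
from pysr import PySRRegressor

rng = np.random.default_rng(0)
r = rng.uniform(1.0, 10.0, size=(500, 1))   # separations
F = 5.0 / r[:, 0] ** 2                      # 'measured' force, with G*m1*m2 = 5

model = PySRRegressor(
    niterations=40,
    binary_operators=["+", "-", "*", "/"],
    unary_operators=["square"],
)
model.fit(r, F)                             # evolutionary search over formulae
print(model.get_best())                     # expect an expression like 5 / r**2
```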

Another example of this methodology has been showcased in the field of quantum optics63. There, an AI algorithm with a graph-theoretical representation of quantum optical setups designed configurations for previously unknown quantum systems. The final solutions were represented in a physically interpretable, graph-theoretical form, from which human scientists could quickly infer the underlying reasons why the solutions work and apply them in other contexts without further computation. Accordingly, developing interpretable representations and methods to extract underlying concepts in other domains will be an important future research direction.

Probing the behaviour of artificial agents

Another rarely explored opportunity is interpreting the behaviour of machines tasked with solving a scientific problem121. Algorithms that take actions, such as genetic algorithms or reinforcement learning agents, adopt policies to navigate the problem space, and human scientists can observe how they do so. Instead of following a strict external reward, for example, maximizing a specific property of a physical system, intrinsic rewards such as artificial curiosity can be implemented122,123. Rather than directly maximizing some objective function, a curious agent tries to learn and predict the behaviour of its environment. It then chooses actions that lead to situations it cannot yet predict well, thus maximizing its own understanding of the environment. It has been shown, using curious agents in simulated virtual universes124 and robot agents in real laboratories84, that curiosity is an efficient exploration strategy. Alternative intrinsic rewards for artificial agents are ‘computational creativity’125,126 and ‘surprise’127. These intrinsic rewards can produce exceptional and unexpected solutions and agent behaviour, ultimately inspiring human scientists.
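Below is a minimal sketch of such a curiosity-style intrinsic reward, assuming the common formulation in which the reward is the prediction error of the agent’s own forward model (cf. refs.122,123); the environment, dimensions and transitions are invented.

```python
# Minimal artificial curiosity: the intrinsic reward is the forward model's
# prediction error, drawing the agent to transitions it cannot yet predict.
import torch
import torch.nn as nn

forward_model = nn.Sequential(      # predicts next state from (state, action)
    nn.Linear(4 + 1, 32), nn.ReLU(), nn.Linear(32, 4)
)
opt = torch.optim.Adam(forward_model.parameters(), lr=1e-3)

def intrinsic_reward(state, action, next_state):
    """Curiosity reward = prediction error; the model is trained as it goes,
    so familiar regions of the environment stop being rewarding over time."""
    pred = forward_model(torch.cat([state, action], dim=-1))
    error = ((pred - next_state) ** 2).mean()
    opt.zero_grad()
    error.backward()
    opt.step()
    return float(error)

# Toy usage with a random transition; a real agent would choose actions that
# it expects to yield a high curiosity reward.
s, a, s2 = torch.randn(4), torch.randn(1), torch.randn(4)
print(intrinsic_reward(s, a, s2))
```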

Agent of understanding

The third and final dimension we consider is algorithms that can autonomously acquire new scientific understanding and, ultimately, explain these insights to humans. This feat has been described neither by the respondents of our survey nor in the scientific literature. Therefore, we approach this class by listing the requirements for such agents, proposing tests to detect their successful realization and speculating about what such systems could look like.

The idea of a machine that translates insights to humans was discussed in Donald Michie’s pioneering 1988 work ‘Machine learning in the next five years’19,128,129. Michie classified AI algorithms into three classes: weak ML, strong ML and ultrastrong ML. Weak ML achieves improved prediction quality with a larger amount of training data. Arguably, most ML approaches today fall into this category, in which the algorithm is treated as a black box. The class of strong ML approaches requires providing a symbolic representation of their hypothesis, for example, via Boolean (logical) expressions or mathematical equations. Finally, ultrastrong ML approaches require that the algorithm teaches the human operator such that the human’s performance improves compared with learning from the data alone. Interestingly, experiments on certain logical tasks have shown that ultrastrong ML algorithms might already exist, thereby demonstrating the “existence of a class of relational concepts which are hard to acquire for humans, though easy to understand given an abstract explanation”129. The idea of ultrastrong ML is related to our third dimension, the agent of understanding: both require that the machine gains new insights and teaches them to the human. Besides our restriction to the natural sciences, there are other differences: whereas Michie requires that the insights are transferred in a symbolic way, the agent of understanding allows for any teaching method, for example, discussions in natural language using systems such as GPT-3. Furthermore, the agent needs to provide new scientific understanding (in the strict sense of de Regt), rather than just explanations or interpretations. Therefore, the agent of understanding is more flexible regarding the technical implementation, but stricter regarding what it teaches the human. We go into more detail below.

First, it is important to realize that finding ‘new’ scientific understanding is context-dependent. What is new depends on whether we consider an individual scientist and their field of expertise, a scientific domain, the whole scientific community or even the entire scientific endeavour throughout history. Hence, true agents of understanding must be able to evaluate whether an insight is new, at least in the context of a specific scientific domain, which requires access to the knowledge of that field.

Second, de Regt emphasized the importance of underlying scientific theories that allow us to recognize qualitatively characteristic consequences12. It is not enough to simply interpolate data points or predict new ones using advanced statistical methods such as ML. Thus, even though such methods can approximate complex and expensive computations, naïve applications of neural networks cannot be agents of understanding: scientific understanding requires more than mere calculation. To illustrate this point further, let us consider one concrete example in quantum physics from the literature: a computational method solved an open question about the generation of important resource states for quantum computing and extracted the conceptual core of the solution, in the form of a new quantum interference effect, in such a fashion that human scientists could both understand the results and apply the acquired understanding in different contexts63. Even if the computer itself were able to apply the conceptual core to other situations, it would not be a priori clear whether the computer truly acquired scientific understanding. What is still missing is an explanation of the discovered technique in the context of a scientific theory. In this particular example, the AI and the human scientist would need to recognize the underlying quantum interference in the context of the theory of quantum physics. Thus, we can propose the first sufficient condition for agents of understanding:

Condition for scientific understanding I. An AI gained scientific understanding if it can recognize qualitatively characteristic consequences of a theory without performing exact computations and use them in a new context.

This condition closely follows the ideas of de Regt and Dieks13. Let us go one step further and imagine that there is an AI capable of explaining discoveries in the context of scientific theories. How could human scientists recognize that the machine acquired new scientific understanding? We argue that they would do so in exactly the same way they recognize that other human scientists have acquired new scientific understanding: by letting them convey the newly acquired understanding to others. This suggests the second sufficient condition for agents of understanding:

Condition for scientific understanding II. An AI gained scientific understanding if it can transfer its understanding to a human expert.

We argue that one can only indirectly recognize whether a computer (or human) has gained scientific understanding. Finally, therefore, we propose a test in the spirit of the Turing test55 or the Feigenbaum test130 (or adaptations thereof in the natural sciences, such as the chemical Turing test or the Feynman test4).

The scientific understanding test

A human (the student) interacts with a teacher, either a human or an artificial scientist. The teacher’s goal is to explain a scientific theory and its qualitative, characteristic consequences to the student. Another human (the referee) tests both the student and the teacher independently. If the referee cannot distinguish between the qualities of their non-trivial explanations in various contexts, we argue that the teacher has scientific understanding.

In principle, there is no reason why the student, or the referee, cannot be an AI. However, to keep the test as simple as possible, we restrict the number of possible variations.

The formulation of the test implies that humans need to understand the new concepts the AI devised. If a machine truly understands something, it should be able to explain it and transfer the understanding to someone else. (We leave aside the question of whether the AI’s explanation is true or false; it has been argued that even false theories can lead to genuine understanding131.) We believe that this should always be possible, even if the understanding is far beyond what human experts know at this point. Note that, as the gap between humans’ and artificial scientists’ capabilities grows, it will become increasingly challenging and time-consuming for the AI to transfer its understanding to humans. We envision that computers will use advanced human–computer interaction techniques together with the tools we described for the next-level computational microscopes.

Additionally, scientific discussions between a human and a computer could be realized using advanced queries in natural language processing tools such as BERT98 or GPT-3 (ref.99). That way, the scientist could probe the computer with scientific questions. If the scientist gains new scientific understanding by communicating with the algorithm, as judged by our scientific understanding test, then they can confirm that the computer truly acquired understanding. We point out that our test, like the ones originated by Turing and Feigenbaum, is not clear-cut, leaving room for situations that do not allow a clear judgement. We are optimistic that more efforts will be directed at developing the necessary technologies, which will lead to ever more convincing demonstrations of artificial scientists acting as true agents of understanding in the future.

Conclusion

Undoubtedly, advanced computational methods in general, and AI specifically, will further revolutionize how scientists investigate the secrets of our world. We have outlined how these new methods can directly contribute to acquiring new scientific understanding. We suspect that significant future progress in the use of AI to acquire scientific understanding will require multidisciplinary collaborations between natural scientists, computer scientists and philosophers of science. Thus, we firmly believe that these research efforts can, within our lifetimes, transform AI into true agents of understanding that directly contribute to one of the main goals of science, namely, scientific understanding.