Category Archives: uncertainty

Credible predictions for regulatory decision-making

Regulators are charged with ensuring that manufactured products, from aircraft and nuclear power stations to cosmetics and vaccines, are safe. The general public seeks certainty that these devices, and the materials and chemicals they are made from, will not harm them or the environment. Technologists who design and manufacture these products know that absolute certainty is unattainable and near-certainty is unaffordable. Hence, they attempt to deliver the service or product that society desires while ensuring that the risks are As Low As Reasonably Practicable (ALARP). The role of regulators is to independently assess the risks, make a judgment on their acceptability and thus decide whether the operation of a power station or distribution of a vaccine can go ahead. These are difficult decisions with huge potential consequences: just think of the more than three hundred people killed in the two crashes of Boeing 737 Max airplanes or the 10,000 or so people affected by birth defects caused by the drug thalidomide. Evidence presented to support applications for regulatory approval is largely based on physical tests, for example fatigue tests on an aircraft structure or toxicological tests using animals. In some cases, the physical tests might not be entirely representative of the real-life situation, which can make it difficult to make decisions using the data; for instance, a ground test on an airplane is not the same as a flight test and, in many respects, the animals used in toxicity testing are physiologically different to humans. In addition, physical tests are expensive and time-consuming, which both drives up the cost of seeking regulatory approval and slows down the translation of innovative products to the market. The almost ubiquitous use of computer-based simulations to support the research, development and design of manufactured products inevitably leads to their use in supporting regulatory applications.
This creates challenges for regulators, who must judge the trustworthiness of predictions from these simulations [see ‘Fake facts & untrustworthy predictions’ on December 4th, 2019]. It is standard practice for modellers to demonstrate the validity of their models; however, validation does not automatically lead to acceptance of predictions by decision-makers. Acceptance is more closely related to scientific credibility. I have been working across a number of disciplines on the scientific credibility of models, including in engineering where multi-physics phenomena are important, such as hypersonic flight and fusion energy [see ‘Thought leadership in fusion energy’ on October 9th, 2019], and in computational biology and toxicology [see ‘Hierarchical modelling in engineering and biology’ on March 14th, 2018]. Together with my collaborators in these disciplines, I have developed a common set of factors that underpin scientific credibility; they are based on principles drawn from the literature on the philosophy of science and are designed to be both discipline-independent and method-agnostic [Patterson & Whelan, 2019; Patterson et al., 2021]. We hope that our cross-disciplinary approach will break down the subject silos that have become established as different scientific communities have developed their own frameworks for validating models. As mentioned above, the process of validation tends to be undertaken by model developers and, in some sense, belongs to them; whereas credibility is not exclusive to the developer but is a trust that needs to be shared with a decision-maker who seeks to use the predictions to inform their decision [see ‘Credibility is in the eye of the beholder’ on April 20th, 2016]. Trust requires a common knowledge base and understanding that is usually built through interactions.
We hope the credibility factors will provide a framework for these interactions as well as a structure for building a portfolio of evidence that demonstrates the reliability of a model.

References:

Patterson EA & Whelan MP, On the validation of variable fidelity multi-physics simulations, J. Sound & Vibration, 448:247-258, 2019.

Patterson EA, Whelan MP & Worth A, The role of validation in establishing the scientific credibility of predictive toxicology approaches intended for regulatory application, Computational Toxicology, 17:100144, 2021.

Image: Extract from abstract by Zahrah Resh.

Puzzles and mysteries

Puzzles and mysteries are a pair of words that have taken on a whole new meaning for me since reading John Kay and Mervyn King’s book ‘Radical uncertainty: decision-making for an unknowable future’ during the summer vacation [see ‘Where is AI on the hype curve?’ on August 12th, 2020]. They describe puzzles as well-defined problems with knowable solutions, whereas mysteries are ill-defined problems that have no objectively correct solution and are imbued with vagueness and indeterminacy. I have written before about engineers being creative problem-solvers [see ‘Learning problem-solving skills’ on October 24th, 2018], which leads to the question of whether we specialise in solving puzzles or mysteries, or perhaps both types of problem. The problems that I set for students to solve for homework to refine and evaluate their knowledge of thermodynamics [see ‘Problem-solving in thermodynamics’ on May 6th, 2015] clearly fall into the puzzle category because they are well-defined and there is a worked solution available. Although for many students these problems might appear to be mysteries, the intention is that with greater knowledge and understanding the mysteries will be transformed into mere puzzles. It is also true that many real-world mysteries can be transformed into puzzles by research that advances the collective knowledge and understanding of society. Part of the purpose of an engineering education is to equip students with the skills to make this transformation from mysteries to puzzles. At undergraduate level, we use problems that are mysteries only to the students so that success is achievable; however, at postgraduate level, we use problems that are perceived as mysteries by both the student and the professor, with the intention that the professor can guide the student towards a solution.
Of course, some mysteries are intractable, often because we do not know enough to define the problem well enough to even start thinking about possible solutions. These are tricky to tackle because it is unreasonable to expect a research student to solve them in a limited timeframe, and it is risky to offer to solve them in exchange for a research grant because you are likely to damage your reputation and prospects of future funding when you fail. On the other hand, they are what makes research interesting and exciting.

Image: Extract from abstract by Zahrah Resh.

Forecasts and chimpanzees throwing darts

During the coronavirus pandemic, politicians have taken to telling us that their decisions are based on the advice of their experts while the news media have bombarded us with predictions from experts. With the benefit of hindsight, many of these decisions and predictions appear to have been ill-advised or inaccurate, which is likely to lead to a loss of trust in both politicians and experts. However, this should not be surprising: the reliability of experts, particularly those willing to make public pronouncements, is well-known to be dubious. Professor Philip E. Tetlock of the University of Pennsylvania has assessed the accuracy of forecasts made by purported experts over two decades and found that they were little better than a chimpanzee throwing darts. Moreover, the better-known experts seemed to be worse at forecasting [Tetlock & Gardner, 2016]. In other words, we should assign less credibility to those experts whose advice is more frequently sought by politicians or quoted in the media. Tetlock’s research has found that the best forecasters are better at inductive reasoning and pattern detection, and show greater cognitive flexibility and open-mindedness [Mellers et al., 2015]. People with these attributes will tend not to express unambiguous opinions but instead will attempt to balance all factors in reaching a view that embraces many uncertainties. Politicians and the media believe that we want to hear a simple message unadorned by the complications of describing reality; hence, they avoid the best forecasters and prefer those who provide a clear but usually inaccurate message. Perhaps that’s why engineers are rarely interviewed by the media or quoted in the press: they tend to be good at inductive reasoning and pattern detection, cognitively flexible and open-minded [see ‘Einstein and public engagement’ on August 8th, 2018].
Of course, this was well-known to the Chinese philosopher Lao Tzu, who is reported to have said: ‘Those who have knowledge, don’t predict. Those who predict, don’t have knowledge.’

References:

Mellers, B., Stone, E., Atanasov, P., Rohrbaugh, N., Metz, S.E., Ungar, L., Bishop, M.M., Horowitz, M., Merkle, E. and Tetlock, P., 2015. The psychology of intelligence analysis: Drivers of prediction accuracy in world politics. Journal of Experimental Psychology: Applied, 21(1):1-14.

Tetlock, P.E. and Gardner, D., 2016. Superforecasting: The art and science of prediction. London: Penguin Random House.

Where is AI on the hype curve?

I suspect that artificial intelligence is somewhere near the top of the ‘Hype Curve’ [see ‘Hype cycle’ on September 23rd, 2015]. At the beginning of the year, I read Max Tegmark’s book, ‘Life 3.0: being human in the age of artificial intelligence’, in which he discusses the prospects for artificial general intelligence and its likely impact on life for humans. Artificial intelligence means non-biological intelligence, and artificial general intelligence is the ability to accomplish any cognitive task at least as well as humans. Predictions vary about when we might develop artificial general intelligence, but developments in machine learning and robotics have energised people in both science and the arts. Machine learning consists of algorithms that use training data to build a mathematical model and make predictions or decisions without being explicitly programmed for the task. Three of the books that I read while on vacation last month featured or discussed artificial intelligence, which stimulated my opening remark about its position on the hype curve. Jeanette Winterson in her novel, ‘Frankissstein’, foresees a world in which humanoid robots can be bought by mail order; while Ian McEwan in his novel, ‘Machines Like Me’, goes back to the early 1980s and describes a world in which robots with a level of consciousness close to or equal to that of humans are just being introduced to the marketplace. However, John Kay and Mervyn King in their recently published book, ‘Radical Uncertainty: decision-making beyond the numbers’, suggest that artificial intelligence will only ever enhance rather than replace human intelligence because it will not be able to handle non-stationary, ill-defined problems, i.e. problems for which there is no objectively correct solution and that change with time. I think I am with Kay & King, and that we will shortly slide down into the trough of the hype curve before we start to see the true potential of artificial general intelligence implemented in robots.

The picture shows our holiday bookshelf.

Realize Engineering

An engineering commentary for everyone on the first Wednesday of every month