Quantifying Risk in Clinical Anesthesia
Department of Anesthesiology and Pain Management, The University of Texas Southwestern Medical Center, Dallas, Texas, USA
Abstract
This article discusses the risk and uncertainty of general anesthesia, taking into account unknown and random variables and probabilities as well as known and possibly controllable variables as they apply to anesthesia. It shows the difficulty of quantifying risk in an individual case and the somewhat arbitrary, and at times incorrect and naïve, assignment of risk in individual patient care management. Effective and honest communication remains at the core of the physician-patient relationship in discussing, evaluating, and managing the individual case for optimum outcome, as well as for patient and family satisfaction and acceptance of the risks inherent in the administration of general anesthesia in humans.
Risk is an integral part of life, brought about by natural forces as well as human activity. Though it is reasonable to assume that many people have pondered the nature of risk, one can, somewhat arbitrarily, trace the beginnings of its study to the 16th century Italian mathematician and physician Gerolamo Cardano, who was more interested in risk related to wagering and potential monetary gain than in outcomes of medical practice [1]. The assessment of the inherent risk of general anesthesia became possible only after the introduction of the anesthesia record over 100 years ago, which allowed more substantiated and reproducible comparisons [2]. Once risk is quantified, it can and should be used to guide the decision process through a meaningful narrative. It is common to express risk as a probability or a probability distribution. That method, even in simple models, forces us to make certain assumptions and often tends to obscure the difference between uncertainty about the sensitivity and specificity of the collected data and uncertainty about the accuracy and significance of the results.
In this review we attempt to broadly define risk in anesthesia and discuss the relation between risk and uncertainty. We also want to bring to light some imperfections of the human mind that are relevant in addressing risk. The decision process, studied by cognitive psychologists, has innate flaws that, even when the data are obvious, limit our ability to recognize, address, and properly react to issues related to risk.
Citation
Hill GE (2014) Quantifying Risk in Clinical Anesthesia. Int J Clin Anesthesiol 2(1): 1020.
RISK MEASURES OFTEN USED IN MEDICINE
In medicine, the most common practice is the estimation of probability or, equivalently, the relative frequency of some event. There are several measures that can be further derived and we summarize them below.
Absolute risk reduction is a subtraction. It answers the question: by how much does the risk increase or decrease as a result of treatment? In more general terms: how much does being a member of one group change the risk when compared to a different group or population?
Another measure often used in medicine is the number needed to treat. It is the reciprocal of the absolute risk reduction and, as the term implies, it measures the number of patients that need to be treated to achieve one target outcome. Obviously, the smaller the number, the better.
Example. Let’s assume that the rate of myocardial infarction in the general male population is 0.0217 and in a male population receiving a small dose of aspirin is 0.0126. The difference is 0.0091. The number needed to treat to avoid 1 case of infarction is 1/0.0091, or about 110 patients.
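The arithmetic of this example can be sketched in a few lines of Python. The function names are ours and purely illustrative; the two rates are the ones quoted above.

```python
import math


def absolute_risk_reduction(control_rate: float, treated_rate: float) -> float:
    """Difference in event rates between the untreated and the treated group."""
    return control_rate - treated_rate


def number_needed_to_treat(control_rate: float, treated_rate: float) -> float:
    """Reciprocal of the absolute risk reduction."""
    arr = absolute_risk_reduction(control_rate, treated_rate)
    if arr <= 0:
        raise ValueError("treatment does not reduce risk; NNT is undefined")
    return 1.0 / arr


# Rates from the aspirin example in the text.
control = 0.0217
aspirin = 0.0126

arr = absolute_risk_reduction(control, aspirin)
nnt = number_needed_to_treat(control, aspirin)
print(f"absolute risk reduction = {arr:.4f}")
print(f"number needed to treat  = {nnt:.1f} (rounded up: {math.ceil(nnt)})")
```

Rounding up is a convention we chose here, since a fraction of a patient cannot be treated.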
Relative risk is a division, or a ratio: how many times more prevalent is the outcome or characteristic in one group compared to another? Since it is often reported without the base rates of the particular variables, it tends to exaggerate small differences. It is calculated over time to a defined endpoint.
The term hazard ratio is derived from survival analysis. It is a ratio of the rates at which an outcome occurs in two groups over a period of time. One can think of relative risk as a cumulative form of the hazard ratio. Alternatively, the hazard ratio is an instantaneous measure of relative risk, determined before the endpoint of the study.
Using the above example of infarction and aspirin, we can determine that the relative risk reduction due to aspirin is 0.0091/0.0217 = 0.42, or 42%.
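A short sketch of the same calculation follows. It also adds a hypothetical second population, with a base rate ten times lower than in the aspirin example, to show that an identical relative reduction can correspond to a much smaller absolute one. That second population is our assumption and does not come from any cited study.

```python
def relative_risk(treated_rate: float, control_rate: float) -> float:
    """Ratio of the event rate with treatment to the rate without it."""
    return treated_rate / control_rate


def relative_risk_reduction(treated_rate: float, control_rate: float) -> float:
    """Absolute risk reduction expressed as a fraction of the control rate."""
    return (control_rate - treated_rate) / control_rate


# Rates from the aspirin example in the text.
control, aspirin = 0.0217, 0.0126
rr = relative_risk(aspirin, control)
rrr = relative_risk_reduction(aspirin, control)
print(f"relative risk = {rr:.2f}, relative risk reduction = {rrr:.0%}")

# Hypothetical population with a tenfold lower base rate (an assumption).
low_control, low_treated = control / 10, aspirin / 10
low_rrr = relative_risk_reduction(low_treated, low_control)
print(f"relative reduction: {rrr:.0%} vs {low_rrr:.0%}")
print(f"absolute reduction: {control - aspirin:.4f} vs {low_control - low_treated:.5f}")
```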
We included those rudimentary calculations here to show that when dealing with relative measures it helps to refer to absolute values or to the so-called base rate. Otherwise, exclusive reliance on relative differences may be misleading. The following two examples should make this clear.
When the Department of Justice reports [3] that Hispanic registered voters were at least 46.5% more likely to lack the necessary ID, one may arrive at the conclusion that requiring the ID amounts to discrimination. Fortunately, the DoJ also cites two additional measures: the percentage of voters lacking ID is 6.3% among Hispanics and 4.3% among non-Hispanics. The difference of 2 percentage points indeed amounts to a relative difference of 46.5% (2/4.3).
When it was reported that daily consumption of processed meat increases the risk of death by 13% [4], it did not mean that after 10 years of such a diet one faces certain death. The analysis by David Spiegelhalter translates the relative risk into an absolute one and clearly shows that a person eating processed meat daily may live 1 year less than a person who moderates the diet (79 years vs 80 years) [5].
Once the risk is known, and a very large body of such measures has indeed been compiled, we face an immediate obstacle: how to present it clearly.
Even when we, physicians, develop a fairly good knowledge and command of risk assessment, we face another very important threshold: how to communicate it clearly to patients. The difficulty arises from the fact that patients constitute a group with diverse literacy and cognitive abilities, most often affected by some degree of anxiety dictated by their circumstances. In addition, language introduces imprecise terms such as “likely” or “probable”. Even if such statements are supported by more exact numerical information, the information may be misinterpreted by a large fraction of patients [6]. Nevertheless, good risk communication should be an integral part of our strategy.
There are several methods aimed at helping physicians and patients discuss the subject of risk. One such method is putting things in perspective. If we know the chances of winning any prize in a lottery like Powerball (1:32), dying of any cause during the next year (1:100), or being struck by lightning (1:280,000), we can communicate the estimated risk of anesthesia in reference to those recognizable events [7].
THE IMPORTANCE OF DISTRIBUTION
Risk may be viewed as acceptable or unacceptable.
When we try to determine the risk of vomiting after laparoscopic cholecystectomy or of bradycardia during colonoscopy, we have to start from counting such events. We may quickly realize that there are two major factors that one needs to consider: prevalence and severity. The most frequently occurring events carry small consequences. On the other hand, infrequent events often have large consequences. The graph below illustrates that concept: events in increasing order of severity are paired with their corresponding frequency (Figure 1). It shows that the events carrying the largest consequences are infrequent. It also helps to visualize that events carrying relatively small consequences may be unacceptable if they happen often.
The concept of viewing risk as acceptable or unacceptable may refer to different categories of risk, for example, the risk of anesthesia machine failure [8] and, unrelated, the risk of nausea after cholecystectomy [9]. The consequences in each category will form a range from nearly inconsequential to very severe. For example, the consequences of nausea and vomiting may range from a nuisance (frequent) to a medical emergency in the form of esophageal rupture (rare) [10]. Another example of how visualizing a distribution can enhance the understanding of risk comes from the relation between heart rate and myocardial oxygen demand as described by Slogoff [11]. The graph below is an idealized relationship between heart rate and the volume of ischemic left ventricular wall (Figure 2). As the heart rate increases above a certain level, so may the volume of ischemic myocardium. It is possible that two different individuals will experience a different magnitude of ischemia at the same heart rate, thus calling for individualized control of the heart rate for each patient [12]. In fact, there will be a whole distribution of results for each heart rate, as shown on the graph below.
The main determinant of the difficulty of risk analysis is the underlying process and its distribution of outcomes. As the complexity of the process increases, its results become more uncertain. Indeed, the term uncertainty as it applies to anesthesia and as it limits our ability to estimate risk will be discussed below.
In terms more general than medicine, one has to realize that the term “risk” encompasses questions ranging from relatively simple events, like the outcome of a die throw, to more complicated ones, like the risk of arrhythmia as a function of serum potassium concentration, to very complicated ones, like Brownian motion. Statistical analysis of a process may reveal that in some cases we are dealing with an easy problem that has a known probability distribution and applicable, tested mathematical methods. On the other side of the spectrum we may encounter limits: the estimates of the distribution parameters are only rough approximations carrying sizable error, making risk analysis exceedingly difficult [13]. Because different types of distributions call for slightly different methods to calculate parameters useful in assessing risk, it is prudent to determine what kind of probability distribution one is dealing with.
The expected, i.e. calculated, occurrence of an adverse event requires knowledge (or an assumption) of the distribution. We do not want to apply the methodology developed for the normal distribution if we are dealing with another type of distribution, because it may lead to underestimation of the impact of rare events (contained in the tails of a distribution).
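To illustrate how strongly the assumed distribution affects the estimated frequency of rare events, the sketch below compares the upper tail of a normal distribution with that of a Laplace distribution scaled to the same mean and standard deviation. The Laplace distribution is our choice of a simple heavier-tailed alternative with a closed-form tail; it is not a distribution discussed in the cited literature.

```python
import math
from statistics import NormalDist


def normal_tail(k: float) -> float:
    """P(X > k) for a standard normal variable (mean 0, SD 1)."""
    return 1.0 - NormalDist().cdf(k)


def laplace_tail(k: float) -> float:
    """P(X > k), k >= 0, for a Laplace variable scaled to mean 0 and SD 1."""
    scale = 1.0 / math.sqrt(2.0)  # Laplace variance equals 2 * scale**2
    return 0.5 * math.exp(-k / scale)


for k in (2, 3, 4):
    n, h = normal_tail(k), laplace_tail(k)
    print(f"beyond {k} SD: normal {n:.2e}, heavier tail {h:.2e}, ratio {h / n:.1f}")
```

The ratio grows with the distance from the mean: the further out the event, the more the normal assumption understates its frequency when the true tail is heavier.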
There is, however, a subtle paradox in our ability to determine what exactly the distribution of outcomes is for any process [14]. In order to appropriately determine the type of distribution we need to collect enough data. How much? It depends on the type of distribution that will adequately model the studied process, i.e. the very quality that we want to discover. To break out of such a circular argument we accept some necessary assumptions supported by collateral knowledge about the process. The downside of this necessary method is heightened uncertainty (see below).
It is widely accepted that in biology and medicine the majority of processes under investigation are normally distributed. The most often used standard probability distributions are the Poisson distribution, assumed for infrequently occurring discrete events, and the normal distribution, assumed to approximate frequently occurring events. Alternatively, the data may be mathematically transformed, most often by taking logarithms, to “force” a fit to the normal distribution [15]. The importance of analyzing the distribution is illustrated by the study published by Riou and coworkers [16]. The authors analyzed a number of cases of blunt trauma victims. They modeled the patients’ a priori probability distribution of survival, showing that it is bimodal. Such a finding is important for the assumptions used in subsequent hypothesis testing, like inclusion criteria for further studies. The authors hypothesize that lumping all trauma victims into one study cohort and disregarding the true distribution of survival may have been responsible for the negative results of several earlier studies and trials.
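As a minimal sketch of the Poisson assumption for infrequent discrete events, the code below computes how likely a given count of events is when the expected count is known. The event rate and the case volume are hypothetical values chosen only for illustration.

```python
import math


def poisson_pmf(k: int, expected: float) -> float:
    """Probability of observing exactly k events when `expected` are expected."""
    return math.exp(-expected) * expected**k / math.factorial(k)


def poisson_at_least(k: int, expected: float) -> float:
    """Probability of observing k or more events."""
    return 1.0 - sum(poisson_pmf(i, expected) for i in range(k))


# Hypothetical numbers: a complication occurring once in 5,000 cases,
# in a department that performs 10,000 cases per year.
rate_per_case = 1 / 5000
cases_per_year = 10_000
expected_events = rate_per_case * cases_per_year

print(f"expected events per year: {expected_events:.1f}")
print(f"P(no events)        = {poisson_pmf(0, expected_events):.3f}")
print(f"P(5 or more events) = {poisson_at_least(5, expected_events):.3f}")
```

Under these assumptions, a year with no events at all and a year with a cluster of five are both plausible without any change in the underlying rate.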
RISK AND UNCERTAINTY
What is simply denoted as risk of anesthesia is in reality a composite of both risk and uncertainty. The meaning of the term “uncertainty” may be a source of confusion. For the purpose of this discussion we will consider two different meanings of this term.
The most common use refers to stochastic uncertainty, as in “I am uncertain of a value or a measurement because of a small error”. It means that we cannot assign an exact value to a parameter in the statistical model used for analysis; therefore the parameter is reported with a measure of uncertainty, like confidence limits, standard deviation, etc. In general, the uncertainty here refers to the model and not to reality.
The second meaning is epistemic uncertainty, i.e. uncertainty about reality itself, its exact state.
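One way to report a parameter together with its stochastic uncertainty is a confidence interval around an observed proportion. The sketch below uses the Wilson score interval, which is our choice of method, and hypothetical event counts; neither comes from the studies cited here.

```python
import math
from statistics import NormalDist


def wilson_interval(events: int, n: int, confidence: float = 0.95):
    """Wilson score confidence interval for a proportion events / n."""
    if n <= 0 or not 0 <= events <= n:
        raise ValueError("need n > 0 and 0 <= events <= n")
    z = NormalDist().inv_cdf(0.5 + confidence / 2)
    p = events / n
    denom = 1 + z**2 / n
    centre = (p + z**2 / (2 * n)) / denom
    half_width = z * math.sqrt(p * (1 - p) / n + z**2 / (4 * n**2)) / denom
    return centre - half_width, centre + half_width


# Hypothetical observations with the same point estimate of 0.5%.
for events, n in ((5, 1_000), (50, 10_000)):
    low, high = wilson_interval(events, n)
    print(f"{events}/{n}: estimate {events / n:.3%}, 95% CI {low:.3%} to {high:.3%}")
```

Both samples give the same point estimate, but the larger one narrows the interval, which is the sense in which this kind of uncertainty belongs to the model and the data rather than to reality.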
The term refers to an unknown or undetermined part of the risk, to our inability to determine “the state of the world” or in other terms our lack of knowledge. While it is possible to calculate risk, it is impossible to calculate epistemic uncertainty [17]. While risk may be known prior to each event, in the purely uncertain situation the true risks are being discovered as that situation unfolds. The difference between the risk and epistemic uncertainty can be summarized as follows:
Risk
The outcomes of the process are governed by a known probability distribution and there are known tools to analyze it (moments of a distribution: mean, variance, skewness, etc.)
Uncertainty
There is not enough knowledge about the process, or the distribution of outcomes is unknown, or, in the extreme case, the distribution is probably known but the tools to determine the risk are very limited (as in fat-tailed distributions).
Both the risk and uncertainty mesh in almost everything we do as physicians.
Uncertainty, as applied to medicine and bracketing both categories, can be appreciated in light of very widespread errors in reported research and a low overall probability of any results being actually correct [18]. Such findings should increase our skepticism; they should serve as a reminder that uncertainty is a big part of what we think we know.
A suitable example of uncertainty is a burn of the corner of the mouth during tonsillectomy. It may happen when the surgeon uses a long non-insulated cautery. Since there is no reliable method to determine the true probability of such an event, it is relegated to uncertainty. The same applies to the possibility of an explosion in a contemporary anesthesia machine, due either to a chemical reaction [19,20] or to a malfunction of an electronic part [8]. An instructive example of uncertainty in anesthesia is the brief history of the perioperative use of beta blocking agents. In the early 1970s the prevailing opinion was that beta blocking agents should be avoided in the perioperative period due to their negative inotropism. That point of view changed in the late 1970s and early 1980s, when it was observed that sudden cessation of beta blocking agents may lead to an increased perioperative incidence of ischemia and heart attacks. It culminated in a series of guidelines formulated by the ACC/AHA advocating the use of beta blocking agents [21], which were later incorporated by the Centers for Medicare & Medicaid Services into its Surgical Care Improvement Project (SCIP). In 2008 the results of the large Perioperative Ischemic Evaluation (POISE) study revealed that the incidence of myocardial infarction was indeed lower in the group treated with metoprolol, but the mortality in the treated group was higher: 3.1% versus 2.3% [22,23]. Metoprolol prevented myocardial infarction in 1.5% of the patients, but at the same time it caused excess deaths in 0.8% and stroke in 0.5%. Since POISE studied only acute perioperative treatment with metoprolol, the conclusions do not necessarily apply to chronic treatment.
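The POISE percentages quoted above can be restated as the number of patients treated per one event avoided or caused, using the same reciprocal as the number needed to treat. This is only a sketch of the arithmetic on the figures given in the text; it is not a reanalysis of the trial, and the function name is ours.

```python
def patients_per_event(absolute_difference_percent: float) -> float:
    """Patients treated per one extra (or one avoided) event."""
    if absolute_difference_percent <= 0:
        raise ValueError("difference must be positive")
    return 100.0 / absolute_difference_percent


# Percentages quoted in the text for POISE.
mortality_metoprolol = 3.1
mortality_control = 2.3
mi_prevented = 1.5
excess_stroke = 0.5

excess_death = mortality_metoprolol - mortality_control
nnt_mi = patients_per_event(mi_prevented)
nnh_death = patients_per_event(excess_death)
nnh_stroke = patients_per_event(excess_stroke)

print(f"treated per infarction prevented: {nnt_mi:.0f}")
print(f"treated per excess death:         {nnh_death:.0f}")
print(f"treated per excess stroke:        {nnh_stroke:.0f}")
print(f"per 1,000 treated: {mi_prevented * 10:.0f} infarctions prevented, "
      f"{excess_death * 10:.0f} excess deaths, {excess_stroke * 10:.0f} excess strokes")
```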
A similar discussion, currently taking place in the anesthesia literature, concerns long term outcomes as a function of the cumulative duration of deep hypnotic time as measured by BIS. A number of studies seem to support the association between the cumulative time of deep anesthesia and mortality up to 2 years later. At the same time, a number of studies failed to demonstrate that association (for brief reviews see [24,25]). Other examples of considerable uncertainty in anesthesia include regional anesthesia in the presence of neurological disease [26], diseases linked to malignant hyperthermia [27], prediction of difficult intubation [28,29], the long term effect of anesthesia on the immune system [30], the platelet count as a restrictive factor for neuraxial block in obstetrics [31,32], and the hemoglobin levels that would trigger a decision to transfuse [33,34].
Uncertainty about information or a measure also plays a role in anesthesia practice, but under different circumstances. Consider the case of a 45-year-old male who is being prepared for cholecystectomy. It is reasonable to suspect that he has some degree of coronary atherosclerosis, since the lifetime risk of coronary artery disease events at that age is approximately 40% [35]. Because the presence of the disease carries a potential impact on the outcome, an anesthesiologist may not be content with the known probability but may wish to increase his confidence in the absence or presence of the disease in this particular case. The frequency of an adverse event may be approximated from a priori large scale epidemiological studies and from personal experience. An anesthesiologist has to determine the probability of an adverse event for a given patient, i.e. assign a numeric value or a linguistic equivalent. It is usually a guess, and thus there is a degree of uncertainty about its value. Based on the interview, tests, and his own experience, the anesthesiologist adjusts the confidence that the probability has a certain value. In other words, as information is gathered the uncertainty decreases. As the uncertainty decreases, the confidence about the probability of an event changes. None of this changes the true probability of any given event occurring. We often rely on a published mean occurrence or magnitude of an event of interest. We may apply it as a risk measure in a particular situation involving our patient. What is worth remembering about such practice is that our patient may not be a typical member of the cohort used for the original study. Depending on how atypical he is, the usefulness of the mean value varies. As the case progresses (the evolution of the time series), the initial conclusions are constantly reassessed. The perceived probability may suddenly change during the case.
This may occur due to an unknown factor that revealed itself during the case and it constitutes the epistemic uncertainty discussed above. Alternatively, it may be attributed to the process of anesthesia and progression of the surgical procedure.
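The adjustment of confidence as test results arrive can be sketched with Bayes’ theorem. In the example below the 40% figure from the text is used as a prior probability of disease purely for illustration (the text quotes it as a lifetime risk of events, not as prevalence), and the sensitivity and specificity belong to a hypothetical test.

```python
def posterior_probability(prior: float, sensitivity: float,
                          specificity: float, test_positive: bool) -> float:
    """Probability of disease after one test result, by Bayes' theorem."""
    if test_positive:
        true_pos = sensitivity * prior
        false_pos = (1 - specificity) * (1 - prior)
        return true_pos / (true_pos + false_pos)
    false_neg = (1 - sensitivity) * prior
    true_neg = specificity * (1 - prior)
    return false_neg / (false_neg + true_neg)


prior = 0.40        # illustrative prior, borrowed from the 40% quoted in the text
sensitivity = 0.70  # hypothetical test characteristic
specificity = 0.80  # hypothetical test characteristic

after_positive = posterior_probability(prior, sensitivity, specificity, True)
after_negative = posterior_probability(prior, sensitivity, specificity, False)
print(f"prior {prior:.0%}, after a positive test {after_positive:.0%}, "
      f"after a negative test {after_negative:.0%}")
```

With these assumed values neither result settles the question: the estimate moves, but a substantial probability of being wrong remains after either answer.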
COMPOUNDING PROBLEMS
Multiple co-morbidities and other variables attributable to the patient (age, weight, sex, genetic factors, etc.) significantly complicate the task of gauging risk. The assessment of the probability of a single event is an oversimplified problem. Reality challenges us to consider more complicated situations in the form of conditional probabilities. Such empirical data are available in the form of different indices.
A conditional probability can be defined as follows: what is the probability of A given the presence (or absence) of condition B? A lot of epidemiological data based on this question has accumulated over time. Given the dynamic nature of medicine, where practice undergoes slow but continuous change, and the dynamic nature of societal factors relevant to health, the accumulated data “age” over time and serve as estimates only. One may say that the probability estimation is thus conditional on the knowledge accumulated at the time of the study. Real life situations pose even more challenging tasks in the form of multiple nested conditional and joint probabilities.
Other factors complicating the assessment of risk include the fact that the more information we seek, the more likely it is that some of it will be erroneous. Each test has inherent limitations summarized as its sensitivity and specificity. As information is gathered, its predictive value reaches a plateau. The individual gathering the information will not necessarily gain any new insight or, more importantly, more information will not help him to correct a possibly wrong initial conclusion. Indeed, as numerous psychological experiments show, more information improves only self-confidence, even if the conclusion is incorrect [36,37]. Finally, the process of obtaining additional information intended to reduce risk may have the opposite result: perioperative consultations seem to increase mortality [38]. Variability among the individuals administering anesthesia, surgeons, nurses, implemented systems like infection control and medication checks, the frequency with which any given case is done in a given hospital, and so on, all add further layers of nested probabilities. All of it impacts the risk.
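The claim that seeking more information makes an erroneous result more likely can be illustrated with a simple calculation. The sketch assumes a patient who is free of every condition tested for, tests that are independent of one another, and a hypothetical specificity of 95% for each test; all three are simplifying assumptions of ours.

```python
def prob_any_false_positive(specificity: float, n_tests: int) -> float:
    """Chance that at least one of n independent tests is falsely positive."""
    if not 0 <= specificity <= 1 or n_tests < 0:
        raise ValueError("specificity must be in [0, 1] and n_tests >= 0")
    return 1.0 - specificity**n_tests


specificity = 0.95  # hypothetical value applied to every test
for n_tests in (1, 5, 10, 20):
    p = prob_any_false_positive(specificity, n_tests)
    print(f"{n_tests:2d} tests: P(at least one false positive) = {p:.0%}")
```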
The psychological constraints on assessing risk and decision making warrant a little more attention. There are several innate mechanisms that severely limit our ability to make rational decisions and to recognize, address, and react properly to issues related to risk. Those limitations have been studied by cognitive psychologists and apply to all of us. Indeed, a recent study estimated the frequency of cognitive errors among anesthesiologists [39] and reported 14 types of cognitive errors, such as anchoring (focusing on one issue at the expense of understanding the whole situation) or premature closure (accepting a diagnosis prematurely). Seven of the 14 errors were made with a frequency higher than 50%. Cognitive and decision errors are made even when the underlying probabilities of an event are known. Kahneman determined the decision weights people apply when they have to make a decision in a situation where the probability of the outcome is known up front. There is a strong propensity to overweight small probabilities and underweight high ones. When the probability of an event is 1%, the corresponding decision weight is 5.5; when the probability is 5%, the corresponding weight is 13.2. The opposite is true at high probabilities: when the probability is 80% the corresponding decision weight is 60.1, when it is 90% the weight is only 71.2, and when it is 99% the weight is only 91.2. It shows that there is a tendency to deliberate over and emphasize unlikely outcomes but hope for the best when the outcome is most likely unfavorable [40]. Such innate psychological constraints, which may or may not be modified by training, should be recognized as an additional risk-generating factor.
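The distortion described by the decision weights is easier to see when each weight is divided by the probability it corresponds to. The sketch below does this for the five pairs quoted in the text; the table and variable names are ours.

```python
# Probability (%) -> decision weight, as quoted in the text from Kahneman [40].
decision_weights = {1: 5.5, 5: 13.2, 80: 60.1, 90: 71.2, 99: 91.2}

for probability, weight in decision_weights.items():
    ratio = weight / probability
    label = "overweighted" if weight > probability else "underweighted"
    print(f"probability {probability:2d}%: weight {weight:5.1f}, "
          f"ratio {ratio:.2f} ({label})")
```

A ratio above 1 means the outcome receives more attention than its probability warrants, and a ratio below 1 means it receives less.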
CONCLUSION
The very complicated system of interdependencies that we sketched here implies that risk assessment contains “elements of craft-like judgment”. Those craft-like elements are heuristics and professional judgment. In a somewhat narrow sense, the term professional judgment refers to one’s ability to make appropriate decisions under mixed conditions of risk and uncertainty. We acquire that ability by repetitive performance of the same task and by the continuous intellectual challenge encountered during postgraduate training. Such repetition allows us to gather empirical and theoretical evidence about our work, and to develop some ability to apply accumulated experience with a good outcome. It also decreases the variance of outcome, that is, it allows us to develop techniques to ensure as uniform an outcome as possible.
Professional judgment embodies the intuitive understanding of possible outcomes without underestimation of the uncertainty. It is in essence our ability to develop a good sense of posterior probabilities related to each individual case. In that sense it is what is defined as subjective probability: our educated guess about how likely the occurrence of a particular event is. There may even be a stark difference between, on the one hand, the ability of an individual to correctly solve an exercise based on Bayes’ theorem and, on the other, the correct implementation of that theorem in everyday practice. We can rarely if ever grasp all the elements relevant to risk estimation in an individual who is about to undergo a particular surgical procedure. Open and honest dialog with our patients, based on published literature, accepted practice standards, and our own clinical acumen, remains the foundation of our ability to communicate risk to our patients.
REFERENCES
3. Perez T. Voters ID. Legal opinion. 12 March 2012
5. Spiegelhalter D. What does a 13% increased risk of death mean? 21 March 2012.
8. Schulte TE, Tinker JH. Narkomed 6400 anesthesia machine failure. Anesth Analg. 2008; 106: 1018-1019.
15. Zar J. Data transformation. In: Biostatistical analysis. Prentice-Hall; 1984. p. 236-43.
18. Ioannidis JP. Why most published research findings are false. PLoS Med. 2005; 2: e124.