- Order:
- Duration: 10:21
- Published: 2010-01-27
- Uploaded: 2011-02-18
- Author: StudyWithSubstanceP
- http://wn.com/Sensitivity,_Specificity,_Odds_Ratio,_Relative_Risk__USMLEntertainment
- Email this video
- Sms this video
:
where qx = 1 − px. An odds ratio of 1 indicates that the condition or event under study is equally likely to occur in both groups. An odds ratio greater than 1 indicates that the condition or event is more likely to occur in the first group. And an odds ratio less than 1 indicates that the condition or event is less likely to occur in the first group. The odds ratio must be greater than or equal to zero if it is defined. It is undefined if p2q1 equals zero.
where p11, p10, p01 and p00 are non-negative "cell probabilities" that sum to one. The odds for Y within the two subpopulations defined by X = 1 and X = 0 are defined in terms of the conditional probabilities given X:
Thus the odds ratio is
:
The simple expression on the right, above, is easy to remember as the product of the probabilities of the "concordant cells" (X = Y) divided by the product of the probabilities of the "discordant cells" (X ≠ Y). However note that in some applications the labeling of categories as zero and one is arbitrary, so there is nothing special about concordant versus discordant values in these applications.
we would have gotten the same result
:
Other measures of effect size for binary data such as the relative risk do not have this symmetry property.
In this case, the odds ratio equals one, and conversely the odds ratio can only equal one if the joint probabilities can be factored in this way. Thus the odds ratio equals one if and only if X and Y are independent.
:
where p1 = p11 + p10, p1 = p11 + p01, and
:
In the case where R = 1, we have independence, so p11 = p1p1.
Once we have p11, the other three cell probabilities can easily be recovered from the marginal probabilities.
Suppose that in a sample of 100 men, 90 have drunk wine in the previous week, while in a sample of 100 women only 20 have drunk wine in the same period. The odds of a man drinking wine are 90 to 10, or 9:1, while the odds of a woman drinking wine are only 20 to 80, or 1:4 = 0.25:1. The odds ratio is thus 9/0.25, or 36, showing that men are much more likely to drink wine than women. The detailed calculation is:
:
This example also shows how odds ratios are sometimes sensitive in stating relative positions: in this sample men are 90/20 = 4.5 times more likely to have drunk wine than women, but have 36 times the odds. The logarithm of the odds ratio, the difference of the logits of the probabilities, tempers this effect, and also makes the measure symmetric with respect to the ordering of groups. For example, using natural logarithms, an odds ratio of 36/1 maps to 3.584, and an odds ratio of 1/36 maps to −3.584.
One approach to inference uses large sample approximations to the sampling distribution of the log odds ratio (the natural logarithm of the odds ratio). If we use the joint probability notation defined above, the population log odds ratio is
:
If we observe data in the form of a contingency table
then the probabilities in the joint distribution can be estimated as
where p̂ = nij / n, with n = n11 + n10 + n01 + n00 being the sum of all four cell counts. The sample log odds ratio is
:.
The distribution of the log odds ratio is approximately normal with:
The standard error for the log odds ratio is approximately
:.
This is an asymptotic approximation, and will not give a meaningful result if any of the cell counts are very small. If L is the sample log odds ratio, an approximate 95% confidence interval for the population log odds ratio is L ± 1.96SE. This can be mapped to exp(L − 1.96SE), exp(L + 1.96SE) to obtain a 95% confidence interval for the odds ratio. If we wish to test the hypothesis that the population odds ratio equals one, the two-sided p-value is 2P(Z< −|L|/SE), where P denotes a probability, and Z denotes a standard normal random variable.
An alternative approach to inference for odds ratios looks at the distribution of the data conditionally on the marginal frequencies of X and Y. An advantage of this approach is that the sampling distribution of the odds ratio can be expressed exactly.
:
so is an estimate of this conditional odds ratio. The interpretation of is as an estimate of the odds ratio between Y and X when the values of Z1, ..., Zp are held fixed.
The odds ratio p11p00 / p01p10 for this distribution does not depend on the value of f. This shows that the odds ratio (and consequently the log odds ratio) is invariant to non-random sampling based on one of the variables being studied. Note however that the standard error of the log odds ratio does depend on the value of f. This fact is exploited in two important situations:
* Suppose it is inconvenient or impractical to obtain a population sample, but it is practical to obtain a convenience sample of units with different X values, such that within the X = 0 and X = 1 subsamples the Y values are representative of the population (i.e. they follow the correct conditional probabilities).
* Suppose the marginal distribution of one variable, say X, is very skewed. For example, if we are studying the relationship between high alcohol consumption and pancreatic cancer in the general population, the incidence of pancreatic cancer would be very low, so it would require a very large population sample to get a modest number of pancreatic cancer cases. However we could use data from hospitals to contact most or all of their pancreatic cancer patients, and then randomly sample an equal number of subjects without pancreatic cancer (this is called a "case-control study").
In both these settings, the odds ratio can be calculated from the selected sample, without biasing the results relative to what would have been obtained for a population sample.
If the absolute risk in the control group is available, conversion between the two is calculated by:. Another alternative estimator is the Mantel-Haenszel estimator.
{| cellpadding="5" cellspacing="0" align="center" |- ! rowspan=2 | ! style="background:#efefef;border-left:1px solid black;border-top:1px solid black;" colspan=2 | OR = 1, LOR = 0 ! style="background:#efefef;border-left:1px solid black;border-top:1px solid black;" colspan=2 | OR = 1, LOR = 0 ! style="background:#efefef;border-left:1px solid black;border-top:1px solid black;" colspan=2 | OR = 4, LOR = 1.39 ! style="background:#efefef;border-left:1px solid black;border-top:1px solid black;border-right:1px solid black;" colspan=2 | OR = 0.25, LOR = −1.39 |- ! style="background:#ffdead;border-left:1px solid black;" | Y = 1 ! style="background:#ffdead;" | Y = 0 ! style="background:#ffdead;border-left:1px solid black;" | Y = 1 ! style="background:#ffdead;" | Y = 0 ! style="background:#ffdead;border-left:1px solid black;" | Y = 1 ! style="background:#ffdead;" | Y = 0 ! style="background:#ffdead;border-left:1px solid black;" | Y = 1 ! style="background:#ffdead;border-right:1px solid black;" | Y = 0 |- ! style="background:#ffdead;border-left:1px solid black;border-top:1px solid black;" | X = 1 ! style="border-left:1px solid black;" | 10 ! 10 ! style="border-left:1px solid black;" | 100 ! 100 ! style="border-left:1px solid black;" | 20 ! 10 ! style="border-left:1px solid black;" | 10 ! style="border-right:1px solid black;" |20 |- ! style="background:#ffdead;border-bottom:1px solid black;border-left:1px solid black;" | X = 0 ! style="border-bottom:1px solid black;border-left:1px solid black;" | 5 ! style="border-bottom:1px solid black;" | 5 ! style="border-left:1px solid black;border-bottom:1px solid black;" | 50 ! style="border-bottom:1px solid black;" | 50 ! style="border-left:1px solid black;border-bottom:1px solid black;" | 10 ! style="border-bottom:1px solid black;" | 20 ! style="border-left:1px solid black;border-bottom:1px solid black;" | 20 ! style="border-right:1px solid black;border-bottom:1px solid black;" | 10 |}
The following joint probability distributions contain the population cell probabilities, along with the corresponding population odds ratio (OR) and population log odds ratio (LOR):
{| cellpadding="5" cellspacing="0" align="center" |- ! rowspan=2 | ! style="background:#efefef;border-left:1px solid black;border-top:1px solid black;" colspan=2 | OR = 1, LOR = 0 ! style="background:#efefef;border-left:1px solid black;border-top:1px solid black;" colspan=2 | OR = 1, LOR = 0 ! style="background:#efefef;border-left:1px solid black;border-top:1px solid black;" colspan=2 | OR = 16, LOR = 2.77 ! style="background:#efefef;border-left:1px solid black;border-top:1px solid black;border-right:1px solid black;" colspan=2 | OR = 0.67, LOR = −0.41 |- ! style="background:#ffdead;border-left:1px solid black;" | Y = 1 ! style="background:#ffdead;" | Y = 0 ! style="background:#ffdead;border-left:1px solid black;" | Y = 1 ! style="background:#ffdead;" | Y = 0 ! style="background:#ffdead;border-left:1px solid black;" | Y = 1 ! style="background:#ffdead;" | Y = 0 ! style="background:#ffdead;border-left:1px solid black;" | Y = 1 ! style="background:#ffdead;border-right:1px solid black;" | Y = 0 |- ! style="background:#ffdead;border-left:1px solid black;border-top:1px solid black;" | X = 1 ! style="border-left:1px solid black;" | 0.2 ! 0.2 ! style="border-left:1px solid black;" | 0.4 ! 0.4 ! style="border-left:1px solid black;" | 0.4 ! 0.1 ! style="border-left:1px solid black;" | 0.1 ! style="border-right:1px solid black;" | 0.3 |- ! style="background:#ffdead;border-bottom:1px solid black;border-left:1px solid black;" | X = 0 ! style="border-bottom:1px solid black;border-left:1px solid black;" | 0.3 ! style="border-bottom:1px solid black;" | 0.3 ! style="border-left:1px solid black;border-bottom:1px solid black;" | 0.1 ! style="border-bottom:1px solid black;" | 0.1 ! style="border-left:1px solid black;border-bottom:1px solid black;" | 0.1 ! style="border-bottom:1px solid black;" | 0.4 ! style="border-left:1px solid black;border-bottom:1px solid black;" | 0.2 ! style="border-right:1px solid black;border-bottom:1px solid black;" | 0.4 |}
Category:Epidemiology Category:Medical statistics Category:Statistical terminology Category:Bayesian statistics
This text is licensed under the Creative Commons CC-BY-SA License. This text was originally published on Wikipedia and was developed by the Wikipedia community.