{"id":17,"date":"2019-04-10T13:28:00","date_gmt":"2019-04-10T13:28:00","guid":{"rendered":"https:\/\/wp.media.unc.edu\/biostatistics\/?page_id=17"},"modified":"2020-06-18T12:35:21","modified_gmt":"2020-06-18T12:35:21","slug":"lesson-3-inferential-statistics","status":"publish","type":"page","link":"https:\/\/wp.media.unc.edu\/biostatistics\/lesson-3-inferential-statistics\/","title":{"rendered":"Lesson 3: Inferential Statistics"},"content":{"rendered":"<div class=\"\"><ul class=\"nav nav-tabs\" id=\"oscitas-tabs-0\"><li class=\"active\"><a class=\"\" href=\"#pane-0-0\" data-toggle=\"tab\">Introduction<\/a><\/li><li class=\"\"><a class=\"\" href=\"#pane-0-1\" data-toggle=\"tab\">Terminology<\/a><\/li><li class=\"\"><a class=\"\" href=\"#pane-0-2\" data-toggle=\"tab\">Basic Concepts<\/a><\/li><li class=\"\"><a class=\"\" href=\"#pane-0-3\" data-toggle=\"tab\">Reported Results<\/a><\/li><li class=\"\"><a class=\"\" href=\"#pane-0-4\" data-toggle=\"tab\">Assessment<\/a><\/li><\/ul><div class=\"tab-content\"><div class=\"tab-pane active\" id=\"pane-0-0\"><div id=\"quicktabs-tabpage-lesson_3_-0\" class=\"quicktabs-tabpage \">\n<article id=\"node-84\" class=\"node node-book clearfix\">\n<header><\/header>\n<div class=\"field field-name-body field-type-text-with-summary field-label-hidden\">\n<div class=\"field-items\">\n<div class=\"field-item even\">\n<div class=\"tex2jax\">\n<div class=\"intro\">\n<div class=\"introtopper\">This lesson reviews inferential statistics. By the end of this lesson, you will be able to:<\/div>\n<ol>\n<li>Describe the basic steps of hypothesis testing:\n<ol type=\"a\">\n<li style=\"margin-left: 40px;\">Define null and alternative hypothesis<\/li>\n<li style=\"margin-left: 40px;\">Define alpha<\/li>\n<li style=\"margin-left: 40px;\">Select appropriate test<\/li>\n<li style=\"margin-left: 40px;\">Run analysis<\/li>\n<li style=\"margin-left: 40px;\">Draw conclusion<\/li>\n<\/ol>\n<\/li>\n<li>Define type I and type II error<\/li>\n<li>Define power and it&#8217;s typical value (0.80)<\/li>\n<li>Define power analysis<\/li>\n<li>Describe the importance of sample size<\/li>\n<\/ol>\n<\/div>\n<\/div>\n<\/div>\n<\/div>\n<\/div>\n<\/article>\n<\/div><\/div><div class=\"tab-pane \" id=\"pane-0-1\"><div id=\"quicktabs-tabpage-lesson_3_-1\" class=\"quicktabs-tabpage quicktabs-hide\">\n<article id=\"node-85\" class=\"node node-book clearfix\">\n<header><\/header>\n<div class=\"field field-name-body field-type-text-with-summary field-label-hidden\">\n<div class=\"field-items\">\n<div class=\"field-item even\">\n<div class=\"tex2jax\">\n<div class=\"terminology\">\n<p>Terms that appear frequently throughout this lesson are defined below:<\/p>\n<table class=\"table\">\n<tbody>\n<tr>\n<td class=\"termdef\"><strong>Term<\/strong><\/td>\n<td class=\"termdef\"><strong>Definition<\/strong><\/td>\n<\/tr>\n<tr>\n<td style=\"vertical-align: top;\"><strong>Null hypothesis<\/strong><\/td>\n<td style=\"vertical-align: top;\">The opposite of the hypothesis proposed, typically that there is <strong>no<\/strong> difference, relationship, or effect<\/td>\n<\/tr>\n<tr>\n<td style=\"vertical-align: top;\"><strong>Alternative hypothesis<\/strong><\/td>\n<td style=\"vertical-align: top;\">The proposed hypothesis or idea, typically that there <strong>is<\/strong> a difference, relationship, or effect<\/td>\n<\/tr>\n<tr>\n<td class=\"tableseparator\"><strong>Alpha<\/strong><\/td>\n<td class=\"tableseparator\">Probability of incorrectly rejecting the null hypothesis<\/td>\n<\/tr>\n<tr>\n<td style=\"vertical-align: top;\">\n<p style=\"margin-left: 40px;\"><strong>Type I error<\/strong><\/p>\n<\/td>\n<td style=\"vertical-align: top;\">Detecting a difference when there isn&#8217;t one<\/td>\n<\/tr>\n<tr>\n<td style=\"vertical-align: top;\">\n<p style=\"margin-left: 40px;\"><strong>P-value<\/strong><\/p>\n<\/td>\n<td style=\"vertical-align: top;\">Probability that the observed statistic occurred by chance if the null hypothesis is true<\/td>\n<\/tr>\n<tr>\n<td class=\"tableseparator\"><strong>Beta<\/strong><\/td>\n<td class=\"tableseparator\">Probability of incorrectly failing to reject the null hypothesis<\/td>\n<\/tr>\n<tr>\n<td style=\"vertical-align: top;\">\n<p style=\"margin-left: 40px;\"><strong>Type II error<\/strong><\/p>\n<\/td>\n<td style=\"vertical-align: top;\">Failing to detect a difference when there is one<\/td>\n<\/tr>\n<tr>\n<td class=\"tableseparator\"><strong>Power<\/strong><\/td>\n<td class=\"tableseparator\">Ability of a test to detect a difference when a difference exists<\/td>\n<\/tr>\n<tr>\n<td style=\"vertical-align: top;\">\n<p style=\"margin-left: 40px;\"><strong>Power analysis<\/strong><\/p>\n<\/td>\n<td style=\"vertical-align: top;\">Method for determining how large a sample size must be to detect a difference if in fact a difference exists<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<\/div>\n<\/div>\n<\/div>\n<\/div>\n<\/div>\n<\/article>\n<\/div><\/div><div class=\"tab-pane \" id=\"pane-0-2\"><div id=\"quicktabs-tabpage-lesson_3_-2\" class=\"quicktabs-tabpage quicktabs-hide\">\n<article id=\"node-86\" class=\"node node-book clearfix\">\n<header><\/header>\n<div class=\"field field-name-body field-type-text-with-summary field-label-hidden\">\n<div class=\"field-items\">\n<div class=\"field-item even\">\n<div class=\"tex2jax\">\n<p><strong>Inferential statistics<\/strong> allows us to make inferences about the population based on the sample that we have studied. Inferential statistics relies on the use of hypothesis testing, which follows these basic steps:<\/p>\n<h2>Hypothesis testing<\/h2>\n<ol>\n<li>Define the hypotheses. The <strong>null hypothesis <\/strong>is the opposite of the hypothesis proposed, typically the idea that a difference does not exist. The <strong>alternative hypothesis <\/strong>is the idea being researched, typically the idea that a difference does exist.<em> Rejecting the null <\/em>means that your statistical test has found a significant difference in the groups being researched.<\/li>\n<li>Define <strong>alpha,<\/strong> or the probability of incorrectly rejecting the null hypothesis. Most statistical analysis uses an alpha-level of 0.05, which means that there is a 1 in 20 chance that you will find a difference when there isn&#8217;t one. Detecting a difference when there isn&#8217;t one is referred to as <strong>Type 1 error<\/strong>. Failing to detect a difference when there is one is referred to as <strong>Type II error<\/strong>. This is based on the beta-level, or the probability of incorrectly failing to reject the null.<\/li>\n<li>Select a test. Based on the measurement scale of the data and other research design considerations, an appropriate statistical test should be selected. Basic statistical tests will be reviewed later in this module.<\/li>\n<li>Run the analysis. The test will provide a<strong> p-value<\/strong>, or the probability that the observed statistic occurred by chance.<\/li>\n<li>Draw a conclusion. The p-value is compared to alpha. A <em>statistically significant relationship <\/em>(p-value =&lt; alpha) means it was unlikely that the difference detected by the statistical test occurred by chance, enabling us to conclude that the independent variable is related to the dependent variable. Failure to find a statistical significance (p-value &gt; alpha) means that our observed data can be explained by chance.<\/li>\n<\/ol>\n<table border=\"1\" width=\"561\" cellspacing=\"1\" cellpadding=\"1\">\n<tbody>\n<tr>\n<td>&nbsp;<\/td>\n<td>\n<p style=\"text-align: center;\">Decision:<br \/>\n<em>Accept Null Hypothesis<\/em><\/p>\n<\/td>\n<td>\n<p style=\"text-align: center;\">Decision:<br \/>\n<em>Reject Hull Hypothesis<\/em><\/p>\n<\/td>\n<\/tr>\n<tr>\n<td>Null Hypothesis = True<\/td>\n<td><strong>CORRECT!<\/strong><\/td>\n<td>type 1 error<\/td>\n<\/tr>\n<tr>\n<td>Null Hypothesis = False<\/td>\n<td>type 2 error<\/td>\n<td><strong>CORRECT!<\/strong><\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<h2>Power to detect a difference<\/h2>\n<p><strong>Power <\/strong>is the probability that a statistical test will correctly reject the null\/correctly accept the alternative hypothesis. In other words, power is the ability of a test to detect a difference when a difference actually exists. Most researchers set power at 0.8, which means that 80% of the time, we will find a statistical significance if a difference actually exists.<\/p>\n<p>Power is directly related to sample size and a <strong>power analysis<\/strong> can be used to determine how large a sample must be to detect a difference if in fact a difference exists.<\/p>\n<p style=\"text-align: center; font-size: 2.5em; color: #4b9cd3;\"><em>n<\/em> = 2\u03b4<sup>2<\/sup>(Z<sub>\u03b2<\/sub>+Z<sub>\u03b1\/2<\/sub>)<sup>2<\/sup> \/ difference<sup>2<\/sup><\/p>\n<p style=\"text-align: center; font-size: 1.2em;\">where<br \/>\n<em>n<\/em> = <strong>sample size<\/strong> in each group (assumes equal-sized groups)<br \/>\n\u03b4 = <strong>standard deviation<\/strong> of the outcome variable<br \/>\nZ<sub>\u03b2<\/sub> represents the <strong>desired power<\/strong> (typically .84 for 80% power)<br \/>\nZ<sub>\u03b1<\/sub> represents the <strong>level of statistical significance<\/strong> (typically 1.96)<br \/>\ndifference = <strong>effect size<\/strong> (the difference in means)<\/p>\n<div class=\"alert alert-warning\"><span class=\"glyphicon glyphicon-exclamation-sign red\">&nbsp;<\/span><strong> Beware!&nbsp;<\/strong> <span style=\"color: #ff0000;\"><strong>Small sample! <\/strong><\/span>In a small sample, it is harder to detect a statistical difference; no difference may simply mean that there was not enough power in the study to show a relationship even if there was one.<\/div>\n<div class=\"alert alert-warning\"><span class=\"glyphicon glyphicon-exclamation-sign red\">&nbsp;<\/span><strong> Beware!<\/strong> <span style=\"color: #ff0000;\"><strong>Large sample! <\/strong><\/span>It is easier to detect a statistical difference with a large sample, but there may be no practical or clinical significance in the difference. As seen in the table below, a difference of 10 with a sample size of 4 is just as statistically significant as a difference of 0.2 for a sample size of 10,000.<\/div>\n<div class=\"table-responsive\">\n<table class=\"table numbers\">\n<thead>\n<tr>\n<th style=\"text-align: center;\" scope=\"col\">Sample Size<\/th>\n<th style=\"text-align: center;\" scope=\"col\">Sample Mean<\/th>\n<th style=\"text-align: center;\" scope=\"col\">Population Mean<\/th>\n<th style=\"text-align: center;\" scope=\"col\">P value<\/th>\n<\/tr>\n<\/thead>\n<tbody>\n<tr>\n<td style=\"text-align: center;\">4<\/td>\n<td style=\"text-align: center;\">110.0<\/td>\n<td style=\"text-align: center;\">100.0<\/td>\n<td style=\"text-align: center;\">0.05<\/td>\n<\/tr>\n<tr>\n<td style=\"text-align: center;\">25<\/td>\n<td style=\"text-align: center;\">104.0<\/td>\n<td style=\"text-align: center;\">100.0<\/td>\n<td style=\"text-align: center;\">0.05<\/td>\n<\/tr>\n<tr>\n<td style=\"text-align: center;\">64<\/td>\n<td style=\"text-align: center;\">102.5<\/td>\n<td style=\"text-align: center;\">100.0<\/td>\n<td style=\"text-align: center;\">0.05<\/td>\n<\/tr>\n<tr>\n<td style=\"text-align: center;\">100<\/td>\n<td style=\"text-align: center;\">102.0<\/td>\n<td style=\"text-align: center;\">100.0<\/td>\n<td style=\"text-align: center;\">0.05<\/td>\n<\/tr>\n<tr>\n<td style=\"text-align: center;\">400<\/td>\n<td style=\"text-align: center;\">101.0<\/td>\n<td style=\"text-align: center;\">100.0<\/td>\n<td style=\"text-align: center;\">0.05<\/td>\n<\/tr>\n<tr>\n<td style=\"text-align: center;\">2,500<\/td>\n<td style=\"text-align: center;\">100.4<\/td>\n<td style=\"text-align: center;\">100.0<\/td>\n<td style=\"text-align: center;\">0.05<\/td>\n<\/tr>\n<tr>\n<td style=\"text-align: center;\">10,000<\/td>\n<td style=\"text-align: center;\">100.2<\/td>\n<td style=\"text-align: center;\">100.0<\/td>\n<td style=\"text-align: center;\">0.05<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<\/div>\n<p class=\"citation\">Table adapted from Norman GR, Streiner DL. <cite>PDQ Statistics.<\/cite> BC Decker, INC: Philadelphia; 1986.<\/p>\n<div class=\"alert alert-warning\"><span class=\"glyphicon glyphicon-exclamation-sign red\">&nbsp;<\/span><strong> Beware! <span style=\"color: #ff0000;\">Very small p-values!<\/span><\/strong> Smaller p-values do not indicate the <em>importance<\/em> or <em>magnitude<\/em> of significance. In other words p-value = 0.0001 is NOT more significant than p-value = 0.048. Small p-values only suggest that the observed statistic was less likely to have happened by chance. Statistics calculated from <strong>large sample sizes<\/strong> are more resistant to chance, as there are more people in the sample from the population you are interested in understanding. With more people in the sample from the population you are interested in, it is less likely that any difference detected was by chance, resulting in smaller p-values.<\/div>\n<p>Again, a smaller p-value does not mean the detected difference is bigger or more significant.<\/p>\n<\/div>\n<\/div>\n<\/div>\n<\/div>\n<\/article>\n<\/div><\/div><div class=\"tab-pane \" id=\"pane-0-3\"><div id=\"quicktabs-tabpage-lesson_3_-3\" class=\"quicktabs-tabpage quicktabs-hide\">\n<article id=\"node-87\" class=\"node node-book clearfix\">\n<header><\/header>\n<div class=\"field field-name-body field-type-text-with-summary field-label-hidden\">\n<div class=\"field-items\">\n<div class=\"field-item even\">\n<div class=\"tex2jax\">\n<h2 class=\"example\">Hypotheses<\/h2>\n<p>In the literature, the null hypothesis is often implied rather than clearly stated. Any table with p-values and any results written with a claim of significance are associated with a hypothesis, even if that hypothesis is not stated. When hypotheses are stated, they are sometimes accompanied by symbols <strong>H<sub>0<\/sub><\/strong> for the null and <strong>H<sub>A <\/sub><\/strong>for the alternative.<\/p>\n<h3>Example 1<\/h3>\n<p>Classical hypothesis testing was used to determine whether the period-2 values were significantly different from those from period 1 by testing whether the study mean Ln(R) was significantly different from 0. The hypotheses were:<\/p>\n<p>H<sub>0<\/sub>: \u00b5<sub>Ln(R)<\/sub> = 0; alpha = 0.05<br \/>\nH<sub>A<\/sub>: \u00b5<sub>Ln(R)<\/sub> not equal to 0&nbsp;&nbsp; <a href=\"http:\/\/onlinelibrary.wiley.com\/doi\/10.1046\/j.1365-2710.2002.00382.x\/abstract\" target=\"_blank\" rel=\"noopener noreferrer\">[Miyazawa, et al., 2002]<\/a><\/p>\n<h3>Example 2<\/h3>\n<p>Given the lack of empirical evidence linking student perceptions to actual outcomes in the general health sciences literature, we operated under a null hypothesis of no relationship (correlation) between any student perception indicators and the actual outcomes as measured by PCOA scores. <a href=\"http:\/\/www.ncbi.nlm.nih.gov\/pmc\/articles\/PMC3355283\/pdf\/ajpe76463.pdf\" target=\"_blank\" rel=\"noop ener noreferrer noopener\">[Naughton and Friesner, 2012]<\/a><\/p>\n<h2>Power<\/h2>\n<p>The following statements demonstrate the use of a power calculation to determine the sample size needed to detect an effect or difference:<\/p>\n<h3>Example 1<\/h3>\n<p>We needed 45 patients to have 95% power to reject the null hypothesis that the mean serum digoxin concentration was within 10% of the mean predicted digoxin concentration. Patients were recruited from two general practices and had been taking digoxin for at least four months. Exclusion criteria were dementia, low adherence to digoxin, and use of other medications known to interact to a clinically important extent with digoxin.<a href=\"http:\/\/onlinelibrary.wiley.com\/doi\/10.1111\/j.1365-2710.2005.00650.x\/full\" target=\"_blank\" rel=\"noopener noreferrer\"> [Kroese, et al., 2005]<\/a><\/p>\n<h3>Example 2<\/h3>\n<p>Power analysis calculation with an alpha of 0.05 and beta of 0.8, indicated that a minimum sample size of 85 was necessary to find significance associated with a 10-point difference in examination performance. Ninety-five students consented to participate in the study. <a href=\"http:\/\/www.ncbi.nlm.nih.gov\/pmc\/articles\/PMC4386745\/\" target=\"_blank\" rel=\"noopener noreferrer\">[McLaughlin, et al., 2014]<\/a><\/p>\n<\/div>\n<\/div>\n<\/div>\n<\/div>\n<\/article>\n<\/div><\/div><div class=\"tab-pane \" id=\"pane-0-4\"><div class=\"h5p-iframe-wrapper\"><iframe id=\"h5p-iframe-3\" class=\"h5p-iframe\" data-content-id=\"3\" style=\"height:1px\" src=\"about:blank\" frameBorder=\"0\" scrolling=\"no\" title=\"Assessment (For Lesson 3)\"><\/iframe><\/div><\/div><\/div><\/div>\n","protected":false},"excerpt":{"rendered":"","protected":false},"author":1,"featured_media":0,"parent":0,"menu_order":0,"comment_status":"closed","ping_status":"closed","template":"","meta":{"footnotes":""},"_links":{"self":[{"href":"https:\/\/wp.media.unc.edu\/biostatistics\/wp-json\/wp\/v2\/pages\/17"}],"collection":[{"href":"https:\/\/wp.media.unc.edu\/biostatistics\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/wp.media.unc.edu\/biostatistics\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/wp.media.unc.edu\/biostatistics\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/wp.media.unc.edu\/biostatistics\/wp-json\/wp\/v2\/comments?post=17"}],"version-history":[{"count":42,"href":"https:\/\/wp.media.unc.edu\/biostatistics\/wp-json\/wp\/v2\/pages\/17\/revisions"}],"predecessor-version":[{"id":1228,"href":"https:\/\/wp.media.unc.edu\/biostatistics\/wp-json\/wp\/v2\/pages\/17\/revisions\/1228"}],"wp:attachment":[{"href":"https:\/\/wp.media.unc.edu\/biostatistics\/wp-json\/wp\/v2\/media?parent=17"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}