<?xml version="1.0" encoding="UTF-8"?><rss xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:atom="http://www.w3.org/2005/Atom" version="2.0" xmlns:itunes="http://www.itunes.com/dtds/podcast-1.0.dtd" xmlns:googleplay="http://www.google.com/schemas/play-podcasts/1.0"><channel><title><![CDATA[Irrational Actor]]></title><description><![CDATA[Wrestling with ideas in economics, politics, and philosophy.]]></description><link>https://www.irrationalactor.com</link><image><url>https://substackcdn.com/image/fetch/$s_!wxuU!,w_256,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F8a5ab2a4-35b8-4909-af93-61cdc7f2b487_1000x1000.png</url><title>Irrational Actor</title><link>https://www.irrationalactor.com</link></image><generator>Substack</generator><lastBuildDate>Sun, 03 May 2026 23:56:48 GMT</lastBuildDate><atom:link href="https://www.irrationalactor.com/feed" rel="self" type="application/rss+xml"/><copyright><![CDATA[Sam Carlen]]></copyright><language><![CDATA[en]]></language><webMaster><![CDATA[irrationalactor@substack.com]]></webMaster><itunes:owner><itunes:email><![CDATA[irrationalactor@substack.com]]></itunes:email><itunes:name><![CDATA[Sam Carlen]]></itunes:name></itunes:owner><itunes:author><![CDATA[Sam Carlen]]></itunes:author><googleplay:owner><![CDATA[irrationalactor@substack.com]]></googleplay:owner><googleplay:email><![CDATA[irrationalactor@substack.com]]></googleplay:email><googleplay:author><![CDATA[Sam Carlen]]></googleplay:author><itunes:block><![CDATA[Yes]]></itunes:block><item><title><![CDATA[JFK was whacked]]></title><description><![CDATA[Imagine if...]]></description><link>https://www.irrationalactor.com/p/jfk-was-whacked</link><guid isPermaLink="false">https://www.irrationalactor.com/p/jfk-was-whacked</guid><dc:creator><![CDATA[Sam Carlen]]></dc:creator><pubDate>Tue, 14 Sep 2021 20:44:24 GMT</pubDate><enclosure url="https://bucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com/public/images/2a3a936f-11c5-40e5-8585-b806765d7aa9_300x379.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Imagine if, hypothetically, Donald Trump had created a new intelligence agency during his presidency, the &#8220;Central Q Agency&#8221; (CQA). This agency was charged with deciphering the mysterious missives of &#8220;Q&#8221; (the internet poster at the center of the QAnon conspiracy theory) and gathering intelligence in support of this mission and general national security.</p><p>Imagine if President Joe Biden, under advisement from the CQA, made a series of foreign policy blunders so grave they almost lead to nuclear war. Imagine if Biden, due to his reservations about the CQA&#8217;s activities, withheld full air support for a botched CQA military operation to overthrow the socialist Maduro regime in Venezuela, fostering anger among CQA agents.</p><p>The CQA&#8217;s anger only deepened after learning Biden was considering cutting their agency&#8217;s budget by 20%.</p><p>Imagine if the mission&#8217;s failure and Biden&#8217;s subsequent diplomatic overtures to the Maduro government (aimed at easing tensions) created resentment among anti-Maduro Venezuelan refugees, who longed to topple the Maduro government, install a US-backed liberal democracy, and return to their home country. In fact, some refugees even took part in the CQA&#8217;s failed mission, and continued to work closely with the agency.</p><p>Imagine if Maduro, following his rise to power, banned gambling and shut down Mob-run casinos, angering many an East Cost wise guy. Imagine if the CQA had long relied on the Mob for its failed assassination attempts on Maduro.</p><p>Now, imagine if President Biden was assassinated. The Shooter acted alone and was motivated by pro-Maduro and anti-American sentiments (or so you are told).</p><p>Imagine if, two days later, The Shooter was killed. The Second Shooter claims to have been motivated by his love for Biden and anger at The Shooter.</p><p>The Shooter&#8217;s sympathies for Maduro are evident in his defection to Venezuela years earlier, founding of a pro-Maduro organization, visits to Cuban and Venezuelan embassies, and socialist beliefs (or so you are told).</p><p>You later discover The Shooter returned to the U.S. following his supposed defection to Venezuela and even received a government loan upon his return. You later discover that the address listed on the socialist pamphlets The Shooter distributed on behalf of his pro-Maduro organization was that of a restaurant that served as a front for CQA and anti-Maduro Venezuelan refugee operations. You later discover The Shooter was the only member of this organization. You later come across a mysterious photo depicting The Shooter standing aside a top CQA official.</p><p>You later discover The Shooter never visited the Cuban and Venezuelan embassies at all: rather, someone impersonating The Shooter made these visits. The security cameras were mysteriously offline as well.</p><p>You later discover The Second Shooter had ties to the Mob and anti-Maduro Venezuelan militants, and that his financial troubles mysteriously lifted in the weeks preceding Biden&#8217;s assassination. You learn The Second Shooter personally witnessed the assassination (and, that morning, asked an undercover government agent if he &#8220;wanted to see the fireworks&#8221;), and left immediately afterwards to go to the hospital where Biden died.</p><p>You later discover one of the doctors who treated The Second Shooter in jail and diagnosed his psychosis (as The Second Shooter&#8217;s defense rested upon a &#8220;temporary insanity&#8221; argument) was a central participant in the CQA&#8217;s mind-control program and even used some of that program&#8217;s techniques on The Second Shooter. The Second Shooter was never the same after these visits.</p><p>You later discover that in the months before Biden&#8217;s assassination, another plot to murder the president was uncovered, one organized by anti-Maduro militant groups. You later discover the Secret Service agent who discovered this plan was arrested for mysterious &#8220;misconduct&#8221; charges and tied up in court during the entirety of the Pelosi Commission&#8217;s investigation into Biden&#8217;s murder.</p><p>You later discover the CQA handler of its anti-Maduro and Mafia partners was in charge of the flow of information between the CQA and the Pelosi Commission. But this was concealed by the handler&#8217;s use of an assumed name and only disclosed years later. You later encounter declassified documents confirming the CQA withheld information from the Pelosi Commission.</p><p>You watch in horror as several high-ranking Mob bosses are mysteriously gunned down in their homes days before each was slated to testify to the Pelosi Commission.</p><p>You hear news of the Pelosi Commission&#8217;s finding that The Shooter acted alone. You later discover several members of the Committee voiced opposition to this conclusion, and that the Committee&#8217;s meager funding limited the scope of its inquiry.</p><p>A subsequent Congressional investigation concludes Biden was murdered as the result of a conspiracy. You hear of forensic audio evidence on which this conclusion rested indicating the presence of a second shooter.</p><div><hr></div><p>Now, read the above again, but replace &#8220;Biden&#8221; with &#8220;John F. Kennedy,&#8221; &#8220;Central Q Agency&#8221; with &#8220;Central Intelligence Agency,&#8221; &#8220;Maduro&#8221; with &#8220;Castro,&#8221; &#8220;Venezuela&#8221; with &#8220;Cuba,&#8221; &#8220;Lee Harvey Oswald&#8221; with &#8220;The Shooter,&#8221; &#8220;Jack Ruby&#8221; with &#8220;The Second Shooter,&#8221; and &#8220;Pelosi Commission&#8221; with &#8220;Warren Commission.&#8221; And imagine if the above, concerning JFK and Oswald and co., was all factually true - because it is.</p><p>Still believe the official narrative that Lee Harvey Oswald acted alone when he shot JFK?</p>]]></content:encoded></item><item><title><![CDATA[p-hacking 101]]></title><description><![CDATA[A common misuse of statistics, explained]]></description><link>https://www.irrationalactor.com/p/p-hacking-101</link><guid isPermaLink="false">https://www.irrationalactor.com/p/p-hacking-101</guid><dc:creator><![CDATA[Sam Carlen]]></dc:creator><pubDate>Fri, 09 Apr 2021 23:19:39 GMT</pubDate><enclosure url="https://cdn.substack.com/image/fetch/h_600,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F2829a320-cb60-41a5-b59e-0bc1f15a10b6_877x741.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<blockquote><p><em>&#8220;If you torture the data long enough, it will confess to anything&#8221;</em></p><p>- <em>Ronald Coase</em></p></blockquote><p>In my <a href="https://irrationalactor.substack.com/p/coming-soon">last post</a>, I briefly mentioned p-hacking in my discussion of how many social science papers are unreliable, and how terrible research practices partially underlie the sorry state of the literature. In this post, I want to describe &#8220;p-hacking&#8221; in greater detail and illustrate the concept by doing some p-hacking myself.</p><div><hr></div><h2>p-values, explained</h2><p>If you haven&#8217;t taken a statistics course or it&#8217;s been a while, you might have forgotten what a p-value is in the first place (or never learned, which is also fine!). Here is a quick refresher.</p><p>The p-value of a statistical test is the <strong>probability of obtaining a </strong><em><strong>false positive</strong></em><strong> result</strong> (i.e., a result that <em>seems</em> significant, but isn&#8217;t). To illustrate the concept, imagine you are a researcher analyzing <a href="https://www.kaggle.com/andrewmvd/okcupid-profiles">this dataset</a> of ~60 thousand OkCupid profiles.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a> Let&#8217;s assume this is a representative sample of the population of all OkCupid users:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Jy5k!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fde351b15-227a-4479-87df-22e7672c7b15_960x720.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Jy5k!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fde351b15-227a-4479-87df-22e7672c7b15_960x720.png 424w, https://substackcdn.com/image/fetch/$s_!Jy5k!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fde351b15-227a-4479-87df-22e7672c7b15_960x720.png 848w, https://substackcdn.com/image/fetch/$s_!Jy5k!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fde351b15-227a-4479-87df-22e7672c7b15_960x720.png 1272w, https://substackcdn.com/image/fetch/$s_!Jy5k!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fde351b15-227a-4479-87df-22e7672c7b15_960x720.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Jy5k!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fde351b15-227a-4479-87df-22e7672c7b15_960x720.png" width="506" height="379.5" data-attrs="{&quot;src&quot;:&quot;https://bucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com/public/images/de351b15-227a-4479-87df-22e7672c7b15_960x720.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:720,&quot;width&quot;:960,&quot;resizeWidth&quot;:506,&quot;bytes&quot;:46810,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!Jy5k!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fde351b15-227a-4479-87df-22e7672c7b15_960x720.png 424w, https://substackcdn.com/image/fetch/$s_!Jy5k!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fde351b15-227a-4479-87df-22e7672c7b15_960x720.png 848w, https://substackcdn.com/image/fetch/$s_!Jy5k!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fde351b15-227a-4479-87df-22e7672c7b15_960x720.png 1272w, https://substackcdn.com/image/fetch/$s_!Jy5k!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fde351b15-227a-4479-87df-22e7672c7b15_960x720.png 1456w" sizes="100vw" loading="lazy" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Sample (our <a href="https://www.kaggle.com/andrewmvd/okcupid-profiles">dataset</a>) vs. population of all OkCupid profiles.</figcaption></figure></div><p>Suppose you want to determine whether the average age of male and female OkCupid users is meaningfully different. </p><p>Your first instinct might be to simply&#8230; find the average age by gender and compare. And huzzah! The numbers are different:</p><pre><code>  sex   mean_age
  &lt;chr&gt;    &lt;dbl&gt;
1 f         32.8
2 m         32.0</code></pre><p>Unfortunately, it&#8217;s not that simple. Top-line statistics can be numerically different without indicating a <em>statistically significant</em> difference due to random variation in the specifics of a given sample. To illustrate, suppose I create 1,000 sub-samples from the initial 60 thousand-large dataset (each consisting of 1,000 male profiles selected at random). Here is the distribution of <em>average </em>male age across the samples:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!U2ia!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F2829a320-cb60-41a5-b59e-0bc1f15a10b6_877x741.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!U2ia!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F2829a320-cb60-41a5-b59e-0bc1f15a10b6_877x741.png 424w, https://substackcdn.com/image/fetch/$s_!U2ia!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F2829a320-cb60-41a5-b59e-0bc1f15a10b6_877x741.png 848w, https://substackcdn.com/image/fetch/$s_!U2ia!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F2829a320-cb60-41a5-b59e-0bc1f15a10b6_877x741.png 1272w, https://substackcdn.com/image/fetch/$s_!U2ia!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F2829a320-cb60-41a5-b59e-0bc1f15a10b6_877x741.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!U2ia!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F2829a320-cb60-41a5-b59e-0bc1f15a10b6_877x741.png" width="570" height="481.6077537058153" data-attrs="{&quot;src&quot;:&quot;https://bucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com/public/images/2829a320-cb60-41a5-b59e-0bc1f15a10b6_877x741.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:741,&quot;width&quot;:877,&quot;resizeWidth&quot;:570,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!U2ia!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F2829a320-cb60-41a5-b59e-0bc1f15a10b6_877x741.png 424w, https://substackcdn.com/image/fetch/$s_!U2ia!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F2829a320-cb60-41a5-b59e-0bc1f15a10b6_877x741.png 848w, https://substackcdn.com/image/fetch/$s_!U2ia!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F2829a320-cb60-41a5-b59e-0bc1f15a10b6_877x741.png 1272w, https://substackcdn.com/image/fetch/$s_!U2ia!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F2829a320-cb60-41a5-b59e-0bc1f15a10b6_877x741.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>As you can see, each particular sample has a slightly different <strong>sample mean</strong> (i.e., the average (of age in this case) for a sample drawn from a larger population).</p><p>The <em>theoretical </em>distribution of potential sample means is called the <strong>sampling distribution</strong>, and in the case of the mean for a normally-distributed population, it looks an awful lot like what I created above (i.e. bell-shaped).<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-2" href="#footnote-2" target="_self">2</a></p><p>Before returning to p-values in particular, I want to explain another topic that will frame the p-value discussion: <strong>hypothesis testing</strong>.</p><h3>Hypothesis testing</h3><p>Let&#8217;s return to the initial research question: whether male OkCupid users are (on average) older or younger than female users (i.e., whether there is a statistically meaningful difference in the average age of male and female users).</p><p>For a hypothesis test in statistics, one establishes a &#8220;null hypothesis&#8221; that says there is <em>not</em> a significant difference/effect. With hypothesis testing, you <strong>assume</strong> that there is no such difference between the means (or, in the case of a regression coefficient, that the coefficient is not meaningfully different from 0). In this example, the null hypothesis H_0 would be:</p><pre><code><code>H_0: mean_age_male = mean_age_female</code></code></pre><p>With hypothesis testing, you start by assuming the null hypothesis is <em>true</em>. So in this case, you would assume that the average age of male OkCupid users <strong>comes from the same sampling distribution </strong>as the mean age of female users.</p><p>We then &#8220;test&#8221; this hypothesis by seeing how likely it is that both means do in fact come from the same sampling distribution. If this seems unlikely, we reject the null hypothesis and accept the alternative (which is the logical complement of the null). In this case, the alternative hypothesis H_1 would be that the mean age of female OkCupid users is <em>not</em> equal to the mean age of male users):</p><pre><code>H_1: mean_age_male &#8800; mean_age_female</code></pre><p>To recap: we can think of the sample mean as falling somewhere on a theoretical distribution of <em>all potential sample means </em>called the <strong>sampling distribution</strong>. In case it helps, here&#8217;s a diagram I made (with the m-looking greek letter mu the sample mean for a given sample i (so, mu_1 corresponds to potential sample 1, mu_2 corresponds to potential sample 2, etc.).</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!vZdc!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fe2a0aebb-6755-484e-8190-af1fbc5874aa_960x720.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!vZdc!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fe2a0aebb-6755-484e-8190-af1fbc5874aa_960x720.png 424w, https://substackcdn.com/image/fetch/$s_!vZdc!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fe2a0aebb-6755-484e-8190-af1fbc5874aa_960x720.png 848w, https://substackcdn.com/image/fetch/$s_!vZdc!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fe2a0aebb-6755-484e-8190-af1fbc5874aa_960x720.png 1272w, https://substackcdn.com/image/fetch/$s_!vZdc!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fe2a0aebb-6755-484e-8190-af1fbc5874aa_960x720.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!vZdc!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fe2a0aebb-6755-484e-8190-af1fbc5874aa_960x720.png" width="960" height="720" data-attrs="{&quot;src&quot;:&quot;https://bucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com/public/images/e2a0aebb-6755-484e-8190-af1fbc5874aa_960x720.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:720,&quot;width&quot;:960,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:89827,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!vZdc!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fe2a0aebb-6755-484e-8190-af1fbc5874aa_960x720.png 424w, https://substackcdn.com/image/fetch/$s_!vZdc!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fe2a0aebb-6755-484e-8190-af1fbc5874aa_960x720.png 848w, https://substackcdn.com/image/fetch/$s_!vZdc!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fe2a0aebb-6755-484e-8190-af1fbc5874aa_960x720.png 1272w, https://substackcdn.com/image/fetch/$s_!vZdc!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fe2a0aebb-6755-484e-8190-af1fbc5874aa_960x720.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>To determine whether to reject the null hypothesis, first imagine the average age of females is situated on the male sampling distribution (which is what we are saying with the null hypothesis). We then find the number of standard deviations the average age of females is from the center of the distribution.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-3" href="#footnote-3" target="_self">3</a> Here is where the mean age of females falls on the male sampling distribution:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!WvhJ!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fedb6e8e2-51a0-46b6-8b79-b7bc09b144bf_877x741.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!WvhJ!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fedb6e8e2-51a0-46b6-8b79-b7bc09b144bf_877x741.png 424w, https://substackcdn.com/image/fetch/$s_!WvhJ!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fedb6e8e2-51a0-46b6-8b79-b7bc09b144bf_877x741.png 848w, https://substackcdn.com/image/fetch/$s_!WvhJ!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fedb6e8e2-51a0-46b6-8b79-b7bc09b144bf_877x741.png 1272w, https://substackcdn.com/image/fetch/$s_!WvhJ!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fedb6e8e2-51a0-46b6-8b79-b7bc09b144bf_877x741.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!WvhJ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fedb6e8e2-51a0-46b6-8b79-b7bc09b144bf_877x741.png" width="684" height="577.9293044469783" data-attrs="{&quot;src&quot;:&quot;https://bucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com/public/images/edb6e8e2-51a0-46b6-8b79-b7bc09b144bf_877x741.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:741,&quot;width&quot;:877,&quot;resizeWidth&quot;:684,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!WvhJ!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fedb6e8e2-51a0-46b6-8b79-b7bc09b144bf_877x741.png 424w, https://substackcdn.com/image/fetch/$s_!WvhJ!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fedb6e8e2-51a0-46b6-8b79-b7bc09b144bf_877x741.png 848w, https://substackcdn.com/image/fetch/$s_!WvhJ!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fedb6e8e2-51a0-46b6-8b79-b7bc09b144bf_877x741.png 1272w, https://substackcdn.com/image/fetch/$s_!WvhJ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fedb6e8e2-51a0-46b6-8b79-b7bc09b144bf_877x741.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Here is the information we need to calculate the number of standard deviations the mean age of females is from the mean age of men (which is our <strong>test statistic, t</strong>):</p><pre><code>  sex   mean_age sd_age     n
  &lt;chr&gt;    &lt;dbl&gt;  &lt;dbl&gt; &lt;int&gt;
1 f         32.8  10.0  24117
2 m         32.0   9.03 35829
</code></pre><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!-HJs!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F4650f168-2a89-4b17-abca-b316bd52cbda_25x10.svg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!-HJs!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F4650f168-2a89-4b17-abca-b316bd52cbda_25x10.svg 424w, https://substackcdn.com/image/fetch/$s_!-HJs!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F4650f168-2a89-4b17-abca-b316bd52cbda_25x10.svg 848w, https://substackcdn.com/image/fetch/$s_!-HJs!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F4650f168-2a89-4b17-abca-b316bd52cbda_25x10.svg 1272w, https://substackcdn.com/image/fetch/$s_!-HJs!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F4650f168-2a89-4b17-abca-b316bd52cbda_25x10.svg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!-HJs!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F4650f168-2a89-4b17-abca-b316bd52cbda_25x10.svg" width="1456" height="588" data-attrs="{&quot;src&quot;:&quot;https://bucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com/public/images/4650f168-2a89-4b17-abca-b316bd52cbda_25x10.svg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:588,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;t\\quad =\\quad {\\;\\overline {X}_{1}-\\overline {X}_{2}\\; \\over {\\sqrt  {\\;{s_{1}^{2} \\over N_{1}}\\;+\\;{s_{2}^{2} \\over N_{2}}\\quad }}}\\,&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="t\quad =\quad {\;\overline {X}_{1}-\overline {X}_{2}\; \over {\sqrt  {\;{s_{1}^{2} \over N_{1}}\;+\;{s_{2}^{2} \over N_{2}}\quad }}}\," title="t\quad =\quad {\;\overline {X}_{1}-\overline {X}_{2}\; \over {\sqrt  {\;{s_{1}^{2} \over N_{1}}\;+\;{s_{2}^{2} \over N_{2}}\quad }}}\," srcset="https://substackcdn.com/image/fetch/$s_!-HJs!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F4650f168-2a89-4b17-abca-b316bd52cbda_25x10.svg 424w, https://substackcdn.com/image/fetch/$s_!-HJs!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F4650f168-2a89-4b17-abca-b316bd52cbda_25x10.svg 848w, https://substackcdn.com/image/fetch/$s_!-HJs!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F4650f168-2a89-4b17-abca-b316bd52cbda_25x10.svg 1272w, https://substackcdn.com/image/fetch/$s_!-HJs!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F4650f168-2a89-4b17-abca-b316bd52cbda_25x10.svg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">The formula for an unequal variances t-test</figcaption></figure></div><p>We find that t is approximately 10. Ten standard deviations out on the distribution! As a result, the p-value (the probability we obtained a false positive result, i.e. the probability our test statistic actually does come from the null distribution) is minute (&lt;2.2e-16), and well below the 0.05 threshold typically used to define &#8220;statistical significance.&#8221; So we reject the null hypothesis in favor of the alternative hypothesis, that the mean age of female OkCupid users is meaningfully different from the mean age of male OkCupid users.</p><h2>p-hacking, explained</h2><p>P-values provide valuable information about the statistical significance of a difference in means (such as in the example above) or the effect of an independent variable on a dependent variable in the case of regression analysis. Unfortunately, p-values are easily-abused because they are calculated with the assumption that statistical tests are conducted one by one rather than repeatedly until the researcher obtains a &#8220;significant&#8221; result.</p><p>Many researchers compare numerous variables simultaneously in the hopes that one or more relationships are significant. And they do so <strong>without correcting for multiple comparisons </strong>(which is required due to the single-test assumption). Researchers also sometimes try to expand their sample until they reach significance. From <a href="https://www.r-bloggers.com/2018/11/gazing-into-the-abyss-of-p-hacking-harking-vs-optional-stopping/">this </a>overview of p-hacking:</p><blockquote><p>A lot of pressure rests on researchers to produce statistically significant results. For many researchers, statistical significance is the cornerstone of their academic career&#8230;</p><p>Now, what does a researcher do confronted with messy, non-significant results? According to several much-cited studies (for example&nbsp;<a href="https://szociologia.tk.mta.hu/uploads/files/archive/john_et_al_2012.pdf">John et al., 2012</a>;&nbsp;<a href="https://journals.sagepub.com/doi/pdf/10.1177/0956797611417632">Simmons et al., 2011</a>), a common reaction is to start sampling again (and again, and again, &#8230;) in the hope that a somewhat larger sample size can boost significance. Another reaction is to wildly conduct hypothesis tests on the existing sample until at least one of them becomes significant (see for example:&nbsp;<a href="http://journals.sagepub.com/doi/pdf/10.1177/0956797611417632">Simmons et al., 2011</a>;&nbsp;<a href="http://www2.psych.ubc.ca/~schaller/528Readings/Kerr1998.pdf">Kerr, 1998&nbsp;</a>). These practices, along with some others, are commonly known as&nbsp;<em>p-hacking</em>, because they are designed to drag the famous&nbsp;<em>p</em>-value right below the mark of .05 which usually indicates statistical significance. Undisputedly,&nbsp;<em>p</em>-hacking works (for a demonstration try out the&nbsp;<a href="https://www.nicebread.de/introducing-p-hacker/">p-hacker app</a>).</p><p>&#8230;</p><p>As a showcase, we want to introduce two researchers: The&nbsp;<em>HARKer</em>&nbsp;takes existing data and conducts multiple independent hypothesis tests (based on multiple uncorrelated variables in the data set) with the goal to publish the ones that become significant. For example, the HARKer tests for each possible correlation in a large data set whether it differs significantly from zero. On the other hand, the&nbsp;<em>Accumulator</em>&nbsp;uses optional stopping. This means that he collects data for a single research question test in a sequential manner until either statistical significance or a maximum sample size is reached.</p><p>&#8230;</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!iqXq!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fc1a97013-e626-4a24-a423-22502b85f653_1204x731.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!iqXq!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fc1a97013-e626-4a24-a423-22502b85f653_1204x731.jpeg 424w, https://substackcdn.com/image/fetch/$s_!iqXq!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fc1a97013-e626-4a24-a423-22502b85f653_1204x731.jpeg 848w, https://substackcdn.com/image/fetch/$s_!iqXq!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fc1a97013-e626-4a24-a423-22502b85f653_1204x731.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!iqXq!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fc1a97013-e626-4a24-a423-22502b85f653_1204x731.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!iqXq!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fc1a97013-e626-4a24-a423-22502b85f653_1204x731.jpeg" width="1204" height="731" data-attrs="{&quot;src&quot;:&quot;https://bucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com/public/images/c1a97013-e626-4a24-a423-22502b85f653_1204x731.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:731,&quot;width&quot;:1204,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!iqXq!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fc1a97013-e626-4a24-a423-22502b85f653_1204x731.jpeg 424w, https://substackcdn.com/image/fetch/$s_!iqXq!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fc1a97013-e626-4a24-a423-22502b85f653_1204x731.jpeg 848w, https://substackcdn.com/image/fetch/$s_!iqXq!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fc1a97013-e626-4a24-a423-22502b85f653_1204x731.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!iqXq!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fc1a97013-e626-4a24-a423-22502b85f653_1204x731.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>We can see that HARKing produces higher false positive rates than optional stopping with the same number of tests. This can be explained through the dependency on the first sample in the case of optional stopping: Given that the null hypothesis is true, this sample is not very likely to show extreme effects in any direction (however, there is a small probability that it does). Every extension of this sample has to &#8220;overcome&#8221; this property not only by being extreme in itself but also by being extreme enough to shift the test on the overall sample from non-significance to significance. In contrast, every sample in the multiple testing case only needs to be extreme in itself.</p></blockquote><p>To illustrate the ease with which one can obtain a fake result, I will <em>randomly generate </em>data and find significant relationship between my made-up variables. If you want to try this for yourself, I recommend checking out <a href="http://shinyapps.org/apps/p-hacker/">this p-hacking demo</a>.</p><p>For this example, suppose we are studying the predictors of cognitive ability, and imagine our dependent variables y1, y2, and y3 (which are meant to measure cognitive ability, the outcome of interest) are IQ, SAT score, and GRE score. Suppose we have a sample size of 100 and randomly generate data for y1, y2, and y3 as follows:</p><pre><code># y (IQ)
# normally distributed with a mean of 100 and a standard deviation of # 15
y1 = rnorm(100,mean=100,sd=15)

# SAT
# normally distributed with a mean of 1051 and a standard deviation 
# of 211
y2 = rnorm(100,mean=1051,sd=211)

# GRE
# normally distributed with a mean of 150 and a standard deviation of # 9
y3 = rnorm(100,mean=150,sd=9)</code></pre><p>And suppose we randomly generate data for ten regressors, x1, x2, &#8230;, x10:</p><pre><code># Regressors:
x1 &lt;- rbinom(100,size=1,prob=0.5)
x2 &lt;- rbinom(100,size=1,prob=0.2)
x3 &lt;- rbinom(100,size=5,prob=0.3)
x4 &lt;- rnorm(100,mean=0,sd=1)
x5 &lt;- rnorm(100,mean=10,sd=2)
x6 &lt;-rnorm(100,mean=100,sd=10)
x7 &lt;- rnorm(100,mean=0,sd=1)
x8 &lt;- rpois(100,lambda=150)
x9 &lt;- rlnorm(100,meanlog=0,sd=1)
x10 &lt;- rbernoulli(100,p=0.1)</code></pre><p>Lets first regress y1 (IQ) against all 10 regressors:</p><pre><code>&gt; m1 &lt;- lm(y1~.,data=df)
&gt; summary(m1)

Call:
lm(formula = y1 ~ ., data = df)

Residuals:
    Min      1Q  Median      3Q     Max 
-30.812  -9.426  -2.442   9.792  41.549 

Coefficients:
              Estimate Std. Error t value Pr(&gt;|t|)  
(Intercept) 127.854450  49.017901   2.608   0.0107 *
y2            0.005603   0.007924   0.707   0.4814  
y3            0.110255   0.238585   0.462   0.6451  
x1           -3.540796   3.404144  -1.040   0.3012  
x2           -2.959543   4.639410  -0.638   0.5252  
x3           -4.319908   1.646945  -2.623   0.0103 *
x4            0.623863   2.026799   0.308   0.7590  
x5            1.345545   0.912235   1.475   0.1438  
x6           -0.101083   0.170585  -0.593   0.5550  
x7           -0.754545   1.735537  -0.435   0.6648  
x8           -0.277310   0.140113  -1.979   0.0510 .
x9           -0.776470   0.842022  -0.922   0.3590  
x10          -2.239609   5.221288  -0.429   0.6690  
---
Signif. codes:  0 &#8216;***&#8217; 0.001 &#8216;**&#8217; 0.01 &#8216;*&#8217; 0.05 &#8216;.&#8217; 0.1 &#8216; &#8217; 1

Residual standard error: 15.79 on 87 degrees of freedom
Multiple R-squared:  0.151,&#9;Adjusted R-squared:  0.03395 
F-statistic:  1.29 on 12 and 87 DF,  p-value: 0.239
</code></pre><p>And voila! We have found a statistically significant relationship (under p&lt;0.05 criteria) between x3 and y1 (IQ), with a respectable p-value<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-4" href="#footnote-4" target="_self">4</a> of 0.0103. And we also found a just barely-insignificant relationship between x8 and y1. </p><p>This isn&#8217;t surprising: under p &lt; 0.05 significance criteria, each time an independent variable is regressed against the dependent variable there is a <strong>5% chance of obtaining a spurious relationship </strong>(1-0.95=0.05)<strong>. </strong>So in the case of ten dependent variables, the probability of obtaining at least one false positive is about 40% (1-(0.95)^10  = 0.401).</p><p>Yet many seemingly-upstanding researchers would take the above results and write about the relationship between x3 and y1 without mentioning the insignificant p-values for the other regressors. Indeed, the studies cited in the quote above (John et al. (2012) and Simmons et al. (2011)) testify to the stunning prevalence of p-hacking techniques and other shady research methods. In their survey of nearly 6,000 academic psychologists, <a href="https://szociologia.tk.hu/uploads/files/archive/john_et_al_2012.pdf">John et al. (2012)</a> find that, for the &#8220;Bayesian-truth-serum (BTS) condition&#8221; (one portion of respondents had their answers run through a scoring algorithm &#8220;used to provide incentives for truth telling&#8221;):<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-5" href="#footnote-5" target="_self">5</a></p><blockquote><p>[N]early 1 in 10 research psychologists has introduced false data into the scientific record and&#8230;the majority of research psychologists have engaged in practices such as selective reporting of studies, not reporting all dependent measures, collecting more data after determining whether the results were significant, reporting unexpected findings as having been predicted, and excluding data post hoc.</p></blockquote><div><hr></div><p>This article is getting to be a bit long, so I&#8217;ll save further discussion of p-hacking and other questionably research practices for another post. I hope this article gave you a better understanding of p-values, why p-hacking is so disturbingly easy, and the frightening ubiquity of p-hacking techniques (and thus unreliable research!).</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.irrationalactor.com/p/p-hacking-101/comments&quot;,&quot;text&quot;:&quot;Leave a comment&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.irrationalactor.com/p/p-hacking-101/comments"><span>Leave a comment</span></a></p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.irrationalactor.com/?utm_source=substack&amp;utm_medium=email&amp;utm_content=share&amp;action=share&quot;,&quot;text&quot;:&quot;Share Irrational Actor&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.irrationalactor.com/?utm_source=substack&amp;utm_medium=email&amp;utm_content=share&amp;action=share"><span>Share Irrational Actor</span></a></p><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p>For R users that want to follow along, the code I used in this post can be found <a href="https://github.com/samcarlen/p-hacking101/blob/main/p-hacking%20demo.R">here</a>.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-2" href="#footnote-anchor-2" class="footnote-number" contenteditable="false" target="_self">2</a><div class="footnote-content"><p>The sampling distribution is bell-shaped but <strong>not </strong>normally-distributed. Rather it is <strong><a href="https://en.wikipedia.org/wiki/Student%27s_t-distribution">t-distributed</a>. </strong>The t-distribution is similar to the normal distribution but has &#8220;fatter&#8221; tails:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!2wxA!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F9958b863-400e-4d87-8c6a-8db41af78a7e_2235x1057.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!2wxA!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F9958b863-400e-4d87-8c6a-8db41af78a7e_2235x1057.jpeg 424w, https://substackcdn.com/image/fetch/$s_!2wxA!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F9958b863-400e-4d87-8c6a-8db41af78a7e_2235x1057.jpeg 848w, https://substackcdn.com/image/fetch/$s_!2wxA!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F9958b863-400e-4d87-8c6a-8db41af78a7e_2235x1057.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!2wxA!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F9958b863-400e-4d87-8c6a-8db41af78a7e_2235x1057.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!2wxA!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F9958b863-400e-4d87-8c6a-8db41af78a7e_2235x1057.jpeg" width="1456" height="689" data-attrs="{&quot;src&quot;:&quot;https://bucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com/public/images/9958b863-400e-4d87-8c6a-8db41af78a7e_2235x1057.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:689,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;t-distribution-vs-normal-distribution&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="t-distribution-vs-normal-distribution" title="t-distribution-vs-normal-distribution" srcset="https://substackcdn.com/image/fetch/$s_!2wxA!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F9958b863-400e-4d87-8c6a-8db41af78a7e_2235x1057.jpeg 424w, https://substackcdn.com/image/fetch/$s_!2wxA!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F9958b863-400e-4d87-8c6a-8db41af78a7e_2235x1057.jpeg 848w, https://substackcdn.com/image/fetch/$s_!2wxA!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F9958b863-400e-4d87-8c6a-8db41af78a7e_2235x1057.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!2wxA!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F9958b863-400e-4d87-8c6a-8db41af78a7e_2235x1057.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption"><a href="https://analystprep.com/cfa-level-1-exam/quantitative-methods/t-distribution/">Source</a></figcaption></figure></div></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-3" href="#footnote-anchor-3" class="footnote-number" contenteditable="false" target="_self">3</a><div class="footnote-content"><p>Since the standard deviation for age of women is different from that of men, we have to use a <a href="https://en.wikipedia.org/wiki/Welch%27s_t-test">unequal variance t-test</a>.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-4" href="#footnote-anchor-4" class="footnote-number" contenteditable="false" target="_self">4</a><div class="footnote-content"><p>In the regression output above, p-values are listed in the right-hand column (under &#8220;Pr(&gt;|t|)&#8221;, and the dots and stars indicate the presence and magnitude of statistical significance (the &#8220;Signif. codes&#8221; row shows what the stars and dots indicate).</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-5" href="#footnote-anchor-5" class="footnote-number" contenteditable="false" target="_self">5</a><div class="footnote-content"><p>&#8220;[R]espondents were told that we would make a donation to a charity of their choice, selected from five options, and that the size of this donation would depend on the truthfulness of their responses, as determined by the BTS scoring system&#8221; (John et al. 2012, at p. 526). Note: the quoted passage below this footnote has been lightly edited for clarity.</p></div></div>]]></content:encoded></item><item><title><![CDATA[Social science is hard; blogging is good]]></title><description><![CDATA[Welcome to Irrational Actor!]]></description><link>https://www.irrationalactor.com/p/coming-soon</link><guid isPermaLink="false">https://www.irrationalactor.com/p/coming-soon</guid><dc:creator><![CDATA[Sam Carlen]]></dc:creator><pubDate>Mon, 15 Mar 2021 00:54:00 GMT</pubDate><enclosure url="https://bucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com/public/images/214ee419-5f32-4aa7-b1aa-6435f3b353c7_1500x1500.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Social science is <em>hard</em>. One motivation for starting this blog is to have a place where I can sift through various social science theories/studies/facts and figure out which are credible and which should be disregarded. I&#8217;m hoping that converting my thoughts about various social science theories (and the public policy and moral frameworks they inform) into words will help me clarify my thinking and expose half-baked ideas and intuitions. Indeed, writing something down forces a minimum level of clarity, logic, and persuasiveness.</p><p>My focus on social science isn&#8217;t just because it&#8217;s difficult, since STEM fields are plenty challenging in their own right. There is also my personal interest in social science (especially economics, which was one of my majors in college), and the fact that findings in social science are highly relevant to public policy debates (many of which I am personally invested in).</p><p>But my emphasis on social science and desire to interrogate its theories in these posts is also due to a recognition that social science is actually harder than other kinds of academic inquiry. Unlike researchers in the natural sciences, those in social science are studying largely invisible social phenomena that are imprecisely defined and measured. They usually resort to using observational data (data gathered passively rather than for the express purpose of academic research) and can rarely run the tightly-controlled experiments commonplace in some natural science fields.</p><p>Here is how <em>Nature</em> editors described some of the challenges facing social scientists in <a href="https://www.nature.com/articles/487271a">a piece responding to</a> Congressional Republicans&#8217; <a href="https://www.washingtonpost.com/blogs/wonkblog/post/jeff-flakes-plan-to-politicize-the-national-science-foundation/2012/05/12/gIQAVuddKU_blog.html?tid=usw_passupdatepg">2012 push</a> to end NSF funding for political science research (emphasis mine):</p><blockquote><p> The social sciences are an easy target for this type of attack because they are less cluttered with technical terminology and so seem easier for the layperson to assess. &#8230; </p><p>Part of the blame must lie with the practice of labelling the social sciences as soft, which too readily translates as meaning woolly or soft-headed. <strong>Because they deal with systems that are highly complex, adaptive and not rigorously rule-bound, the social sciences are among the most difficult of disciplines, both methodologically and intellectually. </strong>They suffer because their findings do sometimes seem obvious. Yet, equally, the common-sense answer can prove to be false when subjected to scrutiny. There are countless examples of this, from economics to traffic planning.</p></blockquote><p>And here is Sociologist <a href="https://renebekkers.wordpress.com/2016/11/27/five-reasons-why-social-science-is-so-hard/">Rene Bekkers</a> listing the reasons he thinks social science is especially challenging:</p><blockquote><p>1. No Laws</p><p>All we have is probabilities.</p><p>2. All Experts</p><p>The knowledge we have is continuously contested. The objects of study think they know why they do what they do.</p><p>3. Zillions of Variables</p><p>Everything is connected, and potentially a cause &#8211; like a bowl of well-tossed spaghetti.</p><p>4. Many Levels of Action</p><p>Nations, organizations, networks, individuals, time all have different dynamics.</p><p>5. Imprecise Measures</p><p>Few instruments have near perfect validity and reliability.<br><strong><br>Conclusion</strong></p><p>Social science is not as easy as rocket science. It is way more complicated.</p></blockquote><p>Now, that social science is difficult is not the only reason it&#8217;s worth expending considerable effort scrutinizing its findings and theories (rather than taking them at face value). There is also the specter of shoddy methodology and, far too often, something akin to outright fraud.</p><p>In other words, large swaths of social science research are unreliable due to <strong>avoidable failings</strong> on the part of the researchers. FantasticAnachronism describes the situation <a href="https://fantasticanachronism.com/2020/09/11/whats-wrong-with-social-science-and-how-to-fix-it/#Things-Are-Not-Getting-Better">in grim detail</a>:</p><blockquote><p>There's a popular belief that weak studies are the result of unconscious biases leading researchers down a "garden of forking paths". Given enough "researcher degrees of freedom" even the most punctilious investigator can be misled.</p><p>I find this belief impossible to accept. The brain is a credulous piece of meat&nbsp;but there are limits to self-delusion. Most of them have to know. It's understandable to be led down the garden of forking paths while producing the research, but when the paper is done and you give it a final read-over you will surely notice that all you have is a n=23, p=0.049 three-way interaction effect (one of dozens you tested, and with no multiple testing adjustments of course). At that point it takes more than a&nbsp;<em>subtle unconscious bias</em>&nbsp;to believe you have found something real. And even if the authors really are misled by the forking paths, what are the editors and reviewers doing? Are we supposed to believe they are all gullible rubes?</p><p>&#8230;</p><p>Even when they do accuse someone of wrongdoing they use terms like "Questionable Research Practices" (QRP). How about Questionable Euphemism Practices?</p><ul><li><p>When they measure a dozen things and only pick their outcome variable at the end, that's not the garden of forking paths but the greenhouse of fraud.</p></li><li><p>When they do a correlational analysis but give "policy implications" as if they were doing a causal one, they're not walking around the garden, they're doing the landscaping of forking paths.</p></li><li><p>When they take a continuous variable and arbitrarily bin it to do subgroup analysis or when they add an&nbsp;<em>ad hoc</em>&nbsp;quadratic term to their regression, they're...fertilizing the garden of forking paths? (Look, there's only so many horticultural metaphors, ok?)</p></li></ul></blockquote><p>If you&#8217;re not familiar with the so-called &#8220;Replication Crisis&#8221; and the way researchers routinely produce spurious findings, I encourage you to read <a href="https://www.lesswrong.com/s/BQBqPowfxjvoee8jw/p/SQAfPKZBAAKYMjx25">this piece</a> by Scott Alexander (part of a <a href="https://www.lesswrong.com/s/BQBqPowfxjvoee8jw">larger series</a> on the reliability of scientific research). Here's an enlightening excerpt about a darkly revealing <a href="http://www.socio.mta.hu/dynamic/simmons_et_al_2011.pdf">psychology paper</a>, beginning with four tricks to make almost any research yield &#8220;statistically significant&#8221; results (practices that are employed by a shamefully large number of actual researchers):</p><blockquote><p>1. Measure multiple dependent variables, then report the ones that are significant. For example, if you&#8217;re measuring whether treatment for a certain psychiatric disorder improves life outcomes, you can collect five different measures of life outcomes &#8211; let&#8217;s say educational attainment, income, self-reported happiness, whether or not ever arrested, whether or not in romantic relationship &#8211; and have a <strong>25%-ish probability one of them will come out at significance by chance</strong>. Then you can publish a paper called &#8220;Psychiatric Treatment Found To Increase Educational Attainment&#8221; without ever mentioning the four negative tests.</p><p>2. Artificially choose when to end your experiment. Suppose you want to prove that yelling at a coin makes it more likely to come up tails. You yell at a coin and flip it. It comes up heads. You try again. It comes up tails. You try again. It comes up heads. You try again. It comes up tails. You try again. It comes up tails again. You try again. It comes up tails again. You note that it came up tails four out of six times &#8211; a 66% success rate compared to expected 50% &#8211; and declare victory. Of course, this result wouldn&#8217;t be significant, and it seems as if this should be a general rule &#8211; that almost by the definition of significance, you shouldn&#8217;t be able to obtain it just be stopping the experiment at the right point. But the authors of the study perform several simulations to prove that this trick is more successful than you&#8217;d think:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!17fv!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F89c8beff-7a7d-466a-b9b9-6497e3dbe1ac_727x375.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!17fv!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F89c8beff-7a7d-466a-b9b9-6497e3dbe1ac_727x375.png 424w, https://substackcdn.com/image/fetch/$s_!17fv!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F89c8beff-7a7d-466a-b9b9-6497e3dbe1ac_727x375.png 848w, https://substackcdn.com/image/fetch/$s_!17fv!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F89c8beff-7a7d-466a-b9b9-6497e3dbe1ac_727x375.png 1272w, https://substackcdn.com/image/fetch/$s_!17fv!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F89c8beff-7a7d-466a-b9b9-6497e3dbe1ac_727x375.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!17fv!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F89c8beff-7a7d-466a-b9b9-6497e3dbe1ac_727x375.png" width="727" height="375" data-attrs="{&quot;src&quot;:&quot;https://bucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com/public/images/89c8beff-7a7d-466a-b9b9-6497e3dbe1ac_727x375.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:375,&quot;width&quot;:727,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!17fv!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F89c8beff-7a7d-466a-b9b9-6497e3dbe1ac_727x375.png 424w, https://substackcdn.com/image/fetch/$s_!17fv!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F89c8beff-7a7d-466a-b9b9-6497e3dbe1ac_727x375.png 848w, https://substackcdn.com/image/fetch/$s_!17fv!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F89c8beff-7a7d-466a-b9b9-6497e3dbe1ac_727x375.png 1272w, https://substackcdn.com/image/fetch/$s_!17fv!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F89c8beff-7a7d-466a-b9b9-6497e3dbe1ac_727x375.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>3. Control for &#8220;confounders&#8221; (in practice, most often gender). I sometimes call this the &#8220;Elderly Hispanic Woman Effect&#8221; after drug trials that find that their drug doesn&#8217;t have significant effects in the general population, but it&nbsp;<em>does</em>&nbsp;significantly help elderly Hispanic women. The trick is you split the population into twenty subgroups (young white men, young white women, elderly white men, elderly white women, young black men, etc), in one of those subgroups it will achieve significance by pure chance, and so you declare that your drug must just somehow be a perfect fit for elderly Hispanic women&#8217;s unique body chemistry. This is not&nbsp;<em>always</em>&nbsp;wrong (some antihypertensives have notably different efficacy in white versus black populations) but it is&nbsp;<em>usually</em>&nbsp;suspicious.</p><p>4. Test different conditions and report the ones you like. For example, suppose you are testing whether vegetable consumption affects depression. You conduct the trial with three arms: low veggie diet, medium veggie diet, and high veggie diet. You now have four possible comparisons &#8211; low-medium, low-high, medium-high, low-medium-high trend). One of them will be significant 20% of the time, so you can just report that one: &#8220;People who eat a moderate amount of vegetables are less likely to get depression than people who eat excess vegetables&#8221; sounds like a perfectly reasonable result.</p><p>Then they run simulations to show exactly how much more likely you are to get a significant result in random data by employing each trick:</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Nn02!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fe85ff505-6e96-4986-9c84-563d5a1ba75c_450x239.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Nn02!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fe85ff505-6e96-4986-9c84-563d5a1ba75c_450x239.png 424w, https://substackcdn.com/image/fetch/$s_!Nn02!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fe85ff505-6e96-4986-9c84-563d5a1ba75c_450x239.png 848w, https://substackcdn.com/image/fetch/$s_!Nn02!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fe85ff505-6e96-4986-9c84-563d5a1ba75c_450x239.png 1272w, https://substackcdn.com/image/fetch/$s_!Nn02!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fe85ff505-6e96-4986-9c84-563d5a1ba75c_450x239.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Nn02!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fe85ff505-6e96-4986-9c84-563d5a1ba75c_450x239.png" width="450" height="239" data-attrs="{&quot;src&quot;:&quot;https://bucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com/public/images/e85ff505-6e96-4986-9c84-563d5a1ba75c_450x239.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:239,&quot;width&quot;:450,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!Nn02!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fe85ff505-6e96-4986-9c84-563d5a1ba75c_450x239.png 424w, https://substackcdn.com/image/fetch/$s_!Nn02!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fe85ff505-6e96-4986-9c84-563d5a1ba75c_450x239.png 848w, https://substackcdn.com/image/fetch/$s_!Nn02!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fe85ff505-6e96-4986-9c84-563d5a1ba75c_450x239.png 1272w, https://substackcdn.com/image/fetch/$s_!Nn02!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fe85ff505-6e96-4986-9c84-563d5a1ba75c_450x239.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><p>The image demonstrates that by using all four tricks, you can squeeze random data into a result significant at the p &lt; 0.05 level about 61% of the time.</p></blockquote><p>Try <a href="https://shinyapps.org/apps/p-hacker/">this p-hacker demo</a> to see for yourself.</p><p>Due to these research practices, as well as other distortions like <a href="https://en.wikipedia.org/wiki/Publication_bias">publication bias</a>, the existing social science literature is far less reliable than we would expect even if we concede that the research itself is incredibly challenging.</p><p>However, there is a real question as to whether a non-expert blogger with only an undergraduate education (in <em>one </em>social science, economics) is really equipped to identify credible research. But <em>professional researchers themselves </em>don&#8217;t seem to be very good at this either. The graph below shows how papers that *failed* to replicate are cited <em>just as often</em> as papers that passed replication.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!GLeJ!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fee617ea5-c6c7-4725-a523-622b5ece6054_588x397.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!GLeJ!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fee617ea5-c6c7-4725-a523-622b5ece6054_588x397.png 424w, https://substackcdn.com/image/fetch/$s_!GLeJ!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fee617ea5-c6c7-4725-a523-622b5ece6054_588x397.png 848w, https://substackcdn.com/image/fetch/$s_!GLeJ!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fee617ea5-c6c7-4725-a523-622b5ece6054_588x397.png 1272w, https://substackcdn.com/image/fetch/$s_!GLeJ!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fee617ea5-c6c7-4725-a523-622b5ece6054_588x397.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!GLeJ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fee617ea5-c6c7-4725-a523-622b5ece6054_588x397.png" width="588" height="397" data-attrs="{&quot;src&quot;:&quot;https://bucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com/public/images/ee617ea5-c6c7-4725-a523-622b5ece6054_588x397.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:397,&quot;width&quot;:588,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:10903,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!GLeJ!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fee617ea5-c6c7-4725-a523-622b5ece6054_588x397.png 424w, https://substackcdn.com/image/fetch/$s_!GLeJ!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fee617ea5-c6c7-4725-a523-622b5ece6054_588x397.png 848w, https://substackcdn.com/image/fetch/$s_!GLeJ!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fee617ea5-c6c7-4725-a523-622b5ece6054_588x397.png 1272w, https://substackcdn.com/image/fetch/$s_!GLeJ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fee617ea5-c6c7-4725-a523-622b5ece6054_588x397.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">From the FantasticAnachronism piece linked to above.</figcaption></figure></div><p>One workaround here is to use broadly applicable rules of thumb to assess a theory/finding. Regardless of the specific field, if a paper has a small sample size, a p-value close to 0.05, uses regression analysis and attempts to &#8220;control for confounders,&#8221; has small effect sizes, and/or conducts extensive subgroup analysis/turns continuous variables into categorical ones, it should probably be regarded as low-quality.</p><p>Likewise, papers that have large sample sizes, <a href="https://psyarxiv.com/mky9j">very small p-values</a>, large effect sizes, etc.; use a randomized controlled trial methodology or another method capable of <a href="https://ftp.cs.ucla.edu/pub/stat_ser/r350.pdf">establishing causality</a>; <a href="https://en.wikipedia.org/wiki/Bonferroni_correction#Alternatives">correct for multiple comparisons</a>; <a href="https://www.cos.io/initiatives/prereg">pre-register</a> their study designs so they can&#8217;t adjust their methods as they go (as described by Scott Alexander above); etc. are more likely to have reliable findings. Again, you can draw these conclusions based on just a few metrics (sample size (<em>n</em>), significance level (<em>p</em>), etc.) even if you know little about the subject matter under study.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://irrationalactor.substack.com/?utm_source=substack&amp;utm_medium=email&amp;utm_content=share&amp;action=share&quot;,&quot;text&quot;:&quot;Share Irrational Actor&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://irrationalactor.substack.com/?utm_source=substack&amp;utm_medium=email&amp;utm_content=share&amp;action=share"><span>Share Irrational Actor</span></a></p><div><hr></div><h2>Blogging as a (promising) genre of writing</h2><p>So I definitely want to use this as a space to examine elements of social science (especially economics), and use theories and evidence I deem credible to understand the institutions and forces most relevant in the world of 2021. I also want to try my hand at making predictions, and hope to weigh in on public policy debates and current events based on my understanding of the relevant theory, literature, and (sometimes) my own analysis. But I have another motivation for starting this blog that I want to touch on briefly.</p><p>Blogs as a genre of writing are situated somewhere between longer-form essays/articles/op-eds (which do a good job articulating things carefully and persuasively at the expense of greater time &amp; effort by the author and fewer opportunities for reader engagement), and social media posts (which basically do the opposite). They strike a nice balance between quantity and quality of posts.</p><p>They also allow for iterative improvement of one&#8217;s ideas (blog post idea &#8212;&gt; writing the post itself and clarifying/refining my thoughts &#8212;&gt; after posting, engaging with reader feedback/criticism &#8212;&gt; a new, &#8220;better&#8221; idea that incorporates what I learned through writing and reading responses to the first post &#8212;&gt; rinse and repeat). The Substack platform is especially good at facilitating this process thanks to its user-friendly social features (leave a comment on this post if you want to see for yourself!). And, because blog posts are usually edited more lightly than essays/news articles and churned out more quickly, this iterative process occurs at a reasonable clip.</p><p>So welcome to Irrational Actor, and I hope you will engage critically and thoughtfully with the ideas I express here. And please share your own ideas as well in the comments section. The more people that read and comment on these posts, the higher the quality of comment-section discourse, so please share this blog with friends and family if you think they would find it interesting!</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.irrationalactor.com/p/coming-soon/comments&quot;,&quot;text&quot;:&quot;Leave a comment&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.irrationalactor.com/p/coming-soon/comments"><span>Leave a comment</span></a></p>]]></content:encoded></item></channel></rss>