<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>The Accidental Developer &#187; online ads</title>
	<atom:link href="http://osric.com/chris/accidental-developer/tag/online-ads/feed/" rel="self" type="application/rss+xml" />
	<link>http://osric.com/chris/accidental-developer</link>
	<description>What if Gregor Samsa awoke a computer programmer?</description>
	<lastBuildDate>Mon, 30 Apr 2012 21:16:57 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.2.1</generator>
		<item>
		<title>Online Advertising Click-Thru Rates, Revisited</title>
		<link>http://osric.com/chris/accidental-developer/2011/05/online-advertising-click-thru-rates-revisited/</link>
		<comments>http://osric.com/chris/accidental-developer/2011/05/online-advertising-click-thru-rates-revisited/#comments</comments>
		<pubDate>Tue, 17 May 2011 02:23:42 +0000</pubDate>
		<dc:creator>Chris Herdt</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[advertising]]></category>
		<category><![CDATA[chi-square]]></category>
		<category><![CDATA[click-thru rates]]></category>
		<category><![CDATA[ctr]]></category>
		<category><![CDATA[online ads]]></category>
		<category><![CDATA[statistics]]></category>

		<guid isPermaLink="false">http://osric.com/chris/accidental-developer/?p=388</guid>
		<description><![CDATA[Revisiting a post from 2 years ago, I re-analyze online advertisement click-thru rates using a different statistical technique (a chi-square test).]]></description>
			<content:encoded><![CDATA[<p>A couple years ago, I wrote <a href="http://osric.com/chris/accidental-developer/2009/12/online-advertisements-and-statistical-analysis/">Online Advertisements and Statistical Analysis</a>, in which I did my best to show that a past study of online advertising click-thru rates (CTRs) wasn&#8217;t worth the pixels it was printed on.</p>
<p>About a week ago, my wife and I were visiting friends, and I found myself in a room with 3 neuroscientists. The topic of statistics came up, and I managed to insert into conversation my small triumph in analyzing the click-thru study and determining both a confidence interval and the number of tests that would need to be run in order to have a meaningful confidence interval. &#8220;Sure,&#8221; one of the scientists says, &#8220;but what you should <em>really</em> do instead is a chi-square test for goodness-of-fit.&#8221;<br />
<span id="more-388"></span><br />
I had no idea what that meant, or even how it was spelled (I was thinking <em>Kai</em> at first instead of <em>chi</em>), but I found a description in my statistics textbook.</p>
<p>In the original post, there were 6 different banner advertisements, which varied only slightly. Each one was run for 30,000 impressions, which resulted in 6 click-frequencies:</p>
<table border="1" cellspacing="0">
<thead>
<tr>
<th>Ad</th>
<th>A</th>
<th>B</th>
<th>C</th>
<th>D</th>
<th>E</th>
<th>F</th>
</tr>
</thead>
<tbody>
<tr>
<td>Clicks per 30,000</td>
<td>81</td>
<td>84</td>
<td>90</td>
<td>96</td>
<td>99</td>
<td>108</td>
</tr>
</tbody>
</table>
<p>The chi-square test calls not just for <em>observed</em> values, but also for <em>expected</em> values:</p>
<p><img src="http://osric.com/chris/accidental-developer/wp-content/uploads/2011/05/chi-square.png" alt="Equation: sum of the square of the observed value less expected value over the expected value" title="chi-square" width="314" height="23" class="aligncenter size-full wp-image-391" /></p>
<p>For the expected value, I used the mean of all 6 ads, 93 clicks per 30,000 ad impressions. (I don&#8217;t know if this is the best value to pick, but it falls in line with the textbook examples.)</p>
<p>I got a chi-square value of 5.42. I wasn&#8217;t positive in this case if there were 5 degrees of freedom, or 4, but in either case, using a lookup table, it suggested that the <em>p</em>-value was greater than 0.1, which further suggests that the differences in clicks may have been due to chance, rather than differences in ad design.</p>
<p>My wife uses several stats software packages, so we entered my data into <a href="http://www.graphpad.com/prism/prism.htm">GraphPad Prism</a> and ran a chi-square test there. It came up with a chi-square value very close to mine (though not identical&#8211;I&#8217;ll have to check my figures again), and a <em>p</em>-value of 0.37. That value, if I interpret things correctly, means there was a 37% chance that the results were due to random chance. </p>
<p>I realized part-way through this exercise that there is a way to trick people into believing the truth without any analysis at all, though. Instead of focusing on the clicks, focus on the instances where people didn&#8217;t click. Sure, if one ad gets 81 clicks and another gets 108 clicks, one seems clearly superior to the other. If you instead compare 29,919 non-clicks to 29,892 non-clicks, the difference seems trivial.</p>
<p>I am more satisfied with my result, though. We should be skeptical when people try to convince us that such small differences are significant over so few tests. </p>
]]></content:encoded>
			<wfw:commentRss>http://osric.com/chris/accidental-developer/2011/05/online-advertising-click-thru-rates-revisited/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Online Advertisements and Statistical Analysis</title>
		<link>http://osric.com/chris/accidental-developer/2009/12/online-advertisements-and-statistical-analysis/</link>
		<comments>http://osric.com/chris/accidental-developer/2009/12/online-advertisements-and-statistical-analysis/#comments</comments>
		<pubDate>Thu, 03 Dec 2009 18:21:49 +0000</pubDate>
		<dc:creator>Chris Herdt</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[banner ads]]></category>
		<category><![CDATA[click-thru rate]]></category>
		<category><![CDATA[ctr]]></category>
		<category><![CDATA[online ads]]></category>
		<category><![CDATA[online advertising]]></category>
		<category><![CDATA[statistics]]></category>
		<category><![CDATA[stats]]></category>

		<guid isPermaLink="false">http://osric.com/chris/accidental-developer/?p=249</guid>
		<description><![CDATA[Quite a few years ago, I was in the online advertising business. My team and I created banner ads to run alongside web site content, to entice viewers to click on ads and find out more about advertiser offers. We scheduled ads to run alongside specific content. We targeted ads towards users in specific geographic [...]]]></description>
			<content:encoded><![CDATA[<p>Quite a few years ago, I was in the online advertising business. My team and I created banner ads to run alongside web site content, to entice viewers to click on ads and find out more about advertiser offers. We scheduled ads to run alongside specific content. We targeted ads towards users in specific geographic regions, thanks to a browser cookie that told us their ZIP code. And we constantly managed inventory.</p>
<p>Although we made animated ads, we avoided anything that blinked. There were no monkeys to punch.</p>
<p>Click-Through Rate (or Click-Thru Rate or CTR) was a key measurement of an ad&#8217;s success. Although at first we would sometimes see click CTRs between 1-2% (meaning that an ad was clicked between 10 and 20 out of every 1000 views, or <em>impressions</em>), as online advertising proliferated, and as our systems got better at filtering out false impressions and clicks from various robots, crawlers, and spiders, CTRs trended much lower: 0.25% suddenly looked good, and 0.10% was not uncommon in some cases. That&#8217;s 1 click for every 1000 impressions.</p>
<p>That&#8217;s why we were insanely interested in a blog post we found, now presumably lost to the ages, that ran a set of 6 banner ads, which varied only slightly, and analyzed the results to determine what aspects of the ads could improve CTRs. Did including the phrase &#8220;click here&#8221; really help? If the words &#8220;click here&#8221; were in blue and underlined, like a typical web link, would that improve the CTR?<br />
<span id="more-249"></span><br />
They ran 30,000 impressions of each ad. I don&#8217;t recall if they set a frequency cap (wherein you limit the number of impressions a specific viewer sees); I don&#8217;t believe they did. I don&#8217;t recall exact details of each ad or the results, but it looked something like this:</p>
<ol>
<li>Ad for amazing offer: 0.27% CTR</li>
<li>Same ad with &#8220;click here&#8221; in black on the <em>left</em> side of the ad: 0.28% CTR</li>
<li>&#8220;Click here&#8221; in black on the <em>right</em> side of the ad: 0.30% CTR</li>
<li>&#8220;Click here&#8221; in blue on the right side of the ad: 0.32% CTR</li>
<li>&#8220;Click here&#8221; underlined in blue on the right side of the ad: 0.33% CTR</li>
<li>&#8220;Click here&#8221; in blue, inside a button on the right side of the ad: 0.36% CTR</li>
</ol>
<p>Naturally, the result of this was that all of our ads soon had a gray button in the lower-left corner with the words &#8220;Click Here&#8221; in it, underlined and in blue:</p>
<div id="attachment_254" class="wp-caption alignnone" style="width: 478px"><img src="http://osric.com/chris/accidental-developer/wp-content/uploads/2009/12/amazing-offer-click-here_468x60.gif" alt="Example banner ad" title="Example banner ad" width="468" height="60" class="size-full wp-image-254" /><p class="wp-caption-text">Example banner ad</p></div>
<p>I was a bit skeptical, though. Could we really say from 30,000 impressions that a 0.33% CTR is significantly different from 0.36%? It&#8217;s a difference of 9 clicks. Could that be attributed to random chance?</p>
<p>I don&#8217;t have a background in statistics, so I asked my father, a scientist. He handed me a 1000-page epidemiology textbook and said&#8211;and I love this part&#8211;that a banner ad click is a lot like a disease state: an individual either has the disease (a click) or does not have the disease. Needless to say, I didn&#8217;t make a lot of headway into the world of epidemiology, but the question still troubled me.</p>
<p>Now I am taking an introductory class on statistical analysis, and although my analysis may oversimplify things greatly, I think it is safe to say that we should not have concluded that every ad needed a button-like box with the words &#8220;click here&#8221; in blue in the lower-right corner.</p>
<p>If we look at any one of the banners in isolation, the CTR is really just a sample <del datetime="2009-12-07T15:49:44+00:00">mean</del> proportion [thanks to Patrick for the correction]. We could run millions of impressions of the same banner&#8211;would it have the same CTR? What is the standard error? To find the confidence interval for the best performing ad, we can run it through this equation:</p>
<p><code>p +/- z*sqrt((p(1 - p))/n)<br />
p = 0.0036, z = 1.96 (for 95% confidence), and n = 30,000.</code></p>
<p>The result? 0.36% +/- 0.07%. We are 95% confident that the true population CTR is somewhere between 0.29% and 0.43%. Well&#8211;yikes! I&#8217;m 95% confident that our measurement isn&#8217;t very precise. When we&#8217;re dealing with such low proportions, we could really use more precision. We would need to run a test with more than 30,000 impressions.</p>
<p>What if we wanted to run a test where we were 95% confident that our value was within just one-one hundredth of a percent (0.01%) of the population mean? In other words, CTR% +/- 0.005%? We have an equation for that too:</p>
<p><code>n = (z^2*p(1-p))/e^2<br />
n = desired number of impressions, z = 1.96 (for 95% confidence), p = 0.0036, e = 0.00005</code></p>
<p>To be 95% confident that our sample CTR is within 0.01% of the population CTR, we would need to run<br />
<em><strong>5,517,522</strong></em> impressions.</p>
<p>Although the data presented in that blog post from years ago seemed compelling, I think I was right to be skeptical. As I said, this is based on what I&#8217;ve learned from an introductory course on statistical analysis. If you think I&#8217;m way off base, feel free to enlighten me in the comments.</p>
]]></content:encoded>
			<wfw:commentRss>http://osric.com/chris/accidental-developer/2009/12/online-advertisements-and-statistical-analysis/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
	</channel>
</rss>

