portman | 8 years ago

Isn't this particular test susceptible to confounding effects? Traffic fluctuates day-to-day, week-to-week, and month-to-month, so how can you be sure the difference was due to the presence or absence of ads and not something else? If you randomize at the visitor level, you sample from both high- and low-traffic days and control for any external fluctuations.

gwern|8 years ago

Because you're randomizing at the 2-day level, on average there will be just as many advertising/high-traffic days as advertising/low-traffic days, and as many no-advertising/high-traffic days as no-advertising/low-traffic days. The randomization is unaffected by traffic and uncorrelated with it. The unit of analysis is each day, not each visitor. This is why it has to be run for several months; otherwise you don't wind up with a decent n=50 pairs.

That's the tradeoff here: it lets you look at the totals, but it takes a lot longer than randomizing per visitor, in which case you could often finish the test in a few days.
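
To make the "uncorrelated with traffic" point concrete, here is a minimal sketch, not the actual analysis from the experiment. It assumes a made-up traffic series with a weekly cycle and no ad effect at all, assigns each 2-day block to ads on or off by coin flip, and checks that the ads-on minus ads-off difference averages out near zero over many re-randomizations. The function names, the traffic numbers, and the 100-day length are all hypothetical.

```python
import random
import statistics


def assign_blocks(n_days, block_len=2, seed=0):
    """Coin-flip each block of `block_len` consecutive days to ads on (True) or off (False)."""
    rng = random.Random(seed)
    assignment = []
    for _ in range(0, n_days, block_len):
        ads_on = rng.random() < 0.5
        assignment.extend([ads_on] * block_len)
    return assignment[:n_days]


def simulate_traffic(n_days, seed=1):
    """Hypothetical daily traffic: higher on 5 of every 7 days, plus noise, with NO ad effect."""
    rng = random.Random(seed)
    return [
        1000 + (300 if day % 7 < 5 else 0) + rng.gauss(0, 50)
        for day in range(n_days)
    ]


def mean_difference(traffic, assignment):
    """Mean traffic on ads-on days minus mean traffic on ads-off days."""
    on = [t for t, a in zip(traffic, assignment) if a]
    off = [t for t, a in zip(traffic, assignment) if not a]
    return statistics.mean(on) - statistics.mean(off)


n_days = 100  # 50 two-day blocks
traffic = simulate_traffic(n_days)

# Re-randomize the same traffic series many times; the assignment never looks at traffic.
diffs = [
    mean_difference(traffic, assign_blocks(n_days, seed=s))
    for s in range(2000)
]
print("average difference across randomizations:", round(statistics.mean(diffs), 1))
print("spread of a single randomization:", round(statistics.stdev(diffs), 1))
```

Any single randomization can still land on a noticeable difference by chance, which is what the second number shows. That spread shrinks as the number of blocks grows, and that is the reason the test needs months rather than days.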

jackgolding|8 years ago

The stats went well over my head, but as a web analyst I thought maybe you could have asked a simpler question. Pick a segment of your site, such as visits from Google Search that landed on your homepage (most likely people who searched "gwern"), which should reduce a lot of those spikes.
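
A rough sketch of what that segmenting could look like, assuming visit records with a day, a traffic source, and a landing page. The `Visit` fields, the source labels, the page paths, and the sample rows are all invented for illustration and do not come from any real analytics export.

```python
from collections import Counter
from dataclasses import dataclass


@dataclass(frozen=True)
class Visit:
    # Hypothetical record layout; real analytics exports will differ.
    day: str
    source: str
    landing_page: str


def segment_daily_counts(visits, source="google_search", landing_page="/"):
    """Count visits per day, keeping only one source and one landing page."""
    return Counter(
        v.day
        for v in visits
        if v.source == source and v.landing_page == landing_page
    )


# Made-up sample rows, only to show the filter in action.
visits = [
    Visit("2017-01-01", "google_search", "/"),
    Visit("2017-01-01", "hacker_news", "/some-essay"),
    Visit("2017-01-01", "google_search", "/"),
    Visit("2017-01-02", "google_search", "/some-essay"),
    Visit("2017-01-02", "google_search", "/"),
]

counts = segment_daily_counts(visits)
for day in sorted(counts):
    print(day, counts[day])
```

The per-day counts for the segment could then be compared between ads-on and ads-off days in place of the site-wide totals, on the assumption that referral spikes mostly arrive from other sources.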