Statistics

Stan vs PyMC3 vs Bean Machine

I have been a light user of Stan and RStan for some time and while there are a lot of things I really like about the language (such as the awesome community you can turn to for support and ShinyStan for inspecting Stan output) there are also a few things that I find frustrating.

Last updated on Jan 14, 2022 11 min read

Conventional Attrition Tests Don't Make Much Sense - Here's a Better Way

A while back, I was involved in an education RCT that had pretty high (40% or so) attrition. What’s worse, the remaining treatment and control students appeared to be quite different from each other.

Last updated on Dec 10, 2021 4 min read

Index Variable Weirdness

There are many instances where you have a bunch of variables and you need to boil them down to one or just a few. For example, you may be testing the effect of an education program on students’ confidence, self-efficacy, and learning levels.

Last updated on Dec 10, 2021 4 min read

Estimating seroprevalence with data from an imperfect test on a convenience sample

Update Jan, 2022: Since this post was published in May 2020, Gelman and Carpenter (2020) have published a more comprehensive analysis on how to adjust for test imperfections using a Bayesian approach which goes beyond many of the ideas here.

Last updated on Jan 9, 2022 8 min read

An Alternative Approach to Power Calculations

The typical approach to power calculations goes something like this: first, the evaluator estimates the smallest MDE for which the intervention would be cost-effective. Second, the evaluator calculates the sample required to detect that MDE.

Last updated on Dec 10, 2021 5 min read

Three Stage Sampling

One of IDinsight’s project teams is in the process of designing the sampling strategy for a large scale household survey and is considering using a three stage sampling design in which they would first select districts, then villages (or urban wards), and then households.

Last updated on Jan 7, 2022 5 min read

Simple Random Sampling vs. PPS Sampling

A question came up on one of our evaluations on whether we should use simple random sampling (SRS) or probability proportional to size (PPS) sampling when selecting villages (our primary sampling units) for a matching study.

Last updated on Dec 10, 2021 4 min read

Fixed Effects vs Difference-in-Differences

TL;DR: When you have longitudinal data, you should use fixed effects or ANCOVA rather than difference-in-differences since a difference-in-difference specification will spit out incorrect variance estimates. If the data is from a randomized trial, ANCOVA is probably a better bet.

Last updated on May 25, 2022 5 min read

Multiple Hypothesis Testing

This week, I volunteered to read and summarize one of the articles for IDinsigh’s tech team’s book club. The topic for this week is multiple hypothesis testing and the article I volunteered to summarize is “Multiple Inference and Gender Differences in the Effects of Early Intervention: A Reevaluation of the Abecedarian, Perry Preschool, and Early Training Projects” by Michael Anderson.

Last updated on Dec 10, 2021 6 min read