Paper Sharing Column ← Clicking here leads to a column containing some introductory articles I've written (mainly in Chinese) for some excellent published/working papers. These are usually the papers I shared at workshops.
Congratulations to AJR.
2024-10-14
The history of urban economics
2024-09-27
I happened to see this. It's really nice to read this history: https://x.com/Undercoverhist/status/1372999869204467718
The full paper: https://papers.ssrn.com/sol3/papers.cfm?abstract_id=4450141
A revised and translated Chinese version by Shanghai-Hong Kong Development Institute: https://fddi.fudan.edu.cn/37/f7/c21253a276471/page.htm
Many thanks to Prof. Cherrier and her co-author.
Establishing my personal website!
2024-09-04
Just launched the site! It's a bit tricky to write the page by myself, but I'm glad I picked up some HTML when I was learning about web crawlers. I might share some of my academic stuff and daily life here in the future, and I'll also try to archive some of my older records.
How to correctly use cluster-robust inference?
2023-07-18
Paper: MacKinnon, James G., Morten Ørregaard Nielsen, and Matthew D. Webb, "Cluster-robust inference: A guide to empirical practice," Journal of Econometrics, 232 (2023), 272–299.
This is a really useful paper on cluster-robust inference. I've given a few presentations on it, totaling about 3 hours, in our workshop. I also added some extra materials and shared my own thoughts beyond what's covered in the paper, which can be found in the [slides]. Plus, I introduce how to use some of the mentioned inference methods in Stata.
Here is some key points:
- When deciding to use asymptotic inference, it is recommended to use asymptotic standard errors calculated from jackknife (leave-one-out) method when sample size is small, since it has better small-sample properties (it is unbiased) and is relatively conservative.
- As the sample size increases, there seems to be a lower bound on the shrinkage of the cluster-robust standard error, and thus the underestimation of the standard error due to ignoring the clustered structure becomes increasingly larger. (In other words, it is more necessary to treat clustered structure carefully when the sample size is large.)
- When using DD estimates, it is important to cluster at least to the geographic level where the treatment is assigned.
- If the number of treatment clusters are small, don't use asymptotic cluster-robust standard errors but use wide-cluster bootstrap standard errors.
- In the vast majority of cases, adding fixed effects is not the reason for not clustering.
-
There are two rules of thumb to decide the cluster level:
- An intuitive rule: to choose the most coarse one among all possible levels, up to the point at which there is concern about having too few clusters
- A conservative rule: to report the largest standard error for the coefficient of interest across all estimated at varying possible cluster levels.
- There are some pre-tests that are helpful in deciding the cluster level, but pre-tests always lead to over-reject because they in and of themselves could make mistakes. Therefore, The ideal procedure is to cluster based on the second rule of thumb and use pre-tests to further support the choice.
- The permutation test ("placebo test") commonly used in DD estimation (especially in Chinese literature) is used to support the statistic inference rather than the causal identification.
- The asymptotic inference relies on the central limit theorem (CLT). The number of clusters and the heterogeneity among clusters both determine whether asymptotic theories work well. Therefore, researchers should report the number of clusters and the heterogeneity among clusters. Additionally, it implies that there will not be a universal threshold of clusters number \( G^* \) beyond which we can be assured of effective asymptotic inference.
- When the number of clusters is small or there are huge heterogeneity among clusters, the asymptotic standard errors usually fail to work effectively.
- In addition to the default asymptotic standard errors in Stata, it's always recommended to report p-values calculated using wilde bootstrap inference, especially restricted wild-cluster bootstrap (WCR).