In social science research, the existence of boundaries seems to imply a powerful force. Whether in politics, economics, or epidemiology, this force shapes how we understand cause and effect. Among them, regression discontinuity design (RDD), as a quasi-experimental design method, uses these boundaries to observe the effect of intervention measures. The core of this method is to set a clear critical point for intervention, and the processing status of the subject is determined based on the level of the critical point. All this seems to change our understanding of causal inference, especially in situations where randomized experiments are difficult to implement.
Regression discontinuous designs can provide strong evidence of the effect of an intervention with little need for random assignment.
The regression discontinuity design was originally proposed by Donald Sislethweit and Donald Campbell in 1960, and its earliest application was in the evaluation of scholarship programs. In recent years, this design has gradually received widespread attention and has even been compared with randomized controlled trials (RCTs) to prove its internal validity. However, many researchers still warn that relying solely on this method for causal inference is limited because it cannot completely eliminate the interference of potentially confounding variables.
When we look at merit-based scholarships, it’s not hard to see the logic behind them. Assuming a scholarship cutoff of 80%, students who barely meet this threshold will be awarded scholarships, while those who do not will not. In this way, 79% and 81% of students are often very similar in many aspects, but the awarding of scholarships makes the academic performance of these two groups significantly different. Comparing the outcomes for these two groups of students allows inferences to be made about the local treatment effects of the scholarship.
“The realm is powerful not only because it provides us with an entry point, but also because it reveals the subtle differences between different groups.”
There are two main estimation methods for regression discontinuous designs: non-parametric estimation and parametric estimation. The most common of the nonparametric methods is local linear regression, which provides reliable estimates based on data near critical points. Compared with parametric methods that use complex models, non-parametric methods in this scenario can reduce the bias caused by data far from the critical point. Parameter estimation usually uses polynomial regression to fit the data more flexibly.
“Whether it is a non-parametric or parametric method, the core principle is to minimize the estimation error, which is the basis for effective research.”
When properly implemented and carefully analyzed, RDD can provide local, unbiased estimates of treatment effects. This design is more ethically and practically feasible than a randomized experiment. However, the effectiveness of RDD still depends on the correctness of the model and the correlation between the results.
One challenge is identifying and evaluating other confounding factors that may intersect at the same critical point, causing confounding effects. For example, when a treatment has the same effect on legal drinking age and gambling-related age, this makes it difficult to obtain accurate estimates.
With the deepening of research on discontinuous design, researchers have begun to explore more complex designs, such as fuzzy RDD and regression kink design (RKD). These methods aim to better explain and capture causal effects, and can still provide valuable insights for policy development even when tipping points are not fully adhered to.
“Regression discontinuity designs have led to innovations in research methods in many fields, prompting us to reexamine the possibility of quantifying causal effects.”
In summary, regression discontinuity design is not only an important tool in statistics, but also an important bridge in social science research. Is it possible to better understand the power of boundaries through ongoing research?