Browsing by Author "Li, Fan"
Now showing 1 - 2 of 2
Results Per Page
Sort Options
- Representation Learning Based Causal Inference in Observational StudiesLu, Danni (Virginia Tech, 2021-02-22)This dissertation investigates novel statistical approaches for causal effect estimation in observational settings, where controlled experimentation is infeasible and confounding is the main hurdle in estimating causal effect. As such, deconfounding constructs the main subject of this dissertation, that is (i) to restore the covariate balance between treatment groups and (ii) to attenuate spurious correlations in training data to derive valid causal conclusions that generalize. By incorporating ideas from representation learning, adversarial matching, generative causal estimation, and invariant risk modeling, this dissertation establishes a causal framework that balances the covariate distribution in latent representation space to yield individualized estimations, and further contributes novel perspectives on causal effect estimation based on invariance principles. The dissertation begins with a systematic review and examination of classical propensity score based balancing schemes for population-level causal effect estimation, presented in Chapter 2. Three causal estimands that target different foci in the population are considered: average treatment effect on the whole population (ATE), average treatment effect on the treated population (ATT), and average treatment effect on the overlap population (ATO). The procedure is demonstrated in a naturalistic driving study (NDS) to evaluate the causal effect of cellphone distraction on crash risk. While highlighting the importance of adopting causal perspectives in analyzing risk factors, discussions on the limitations in balance efficiency, robustness against high-dimensional data and complex interactions, and the need for individualization are provided to motivate subsequent developments. Chapter 3 presents a novel generative Bayesian causal estimation framework named Balancing Variational Neural Inference of Causal Effects (BV-NICE). Via appealing to the Robinson factorization and a latent Bayesian model, a novel variational bound on likelihood is derived, explicitly characterized by the causal effect and propensity score. Notably, by treating observed variables as noisy proxies of unmeasurable latent confounders, the variational posterior approximation is re-purposed as a stochastic feature encoder that fully acknowledges representation uncertainties. To resolve the imbalance in representations, BV-NICE enforces KL-regularization on the respective representation marginals using Fenchel mini-max learning, justified by a new generalization bound on the counterfactual prediction accuracy. The robustness and effectiveness of this framework are demonstrated through an extensive set of tests against competing solutions on semi-synthetic and real-world datasets. In recognition of the reliability issue when extending causal conclusions beyond training distributions, Chapter 4 argues ascertaining causal stability is the key and introduces a novel procedure called Risk Invariant Causal Estimation (RICE). By carefully re-examining the relationship between statistical invariance and causality, RICE cleverly leverages the observed data disparities to enable the identification of stable causal effects. Concretely, the causal inference objective is reformulated under the framework of invariant risk modeling (IRM), where a population-optimality penalty is enforced to filter out un-generalizable effects across heterogeneous populations. Importantly, RICE allows settings where counterfactual reasoning with unobserved confounding or biased sampling designs become feasible. The effectiveness of this new proposal is verified with respect to a variety of study designs on real and synthetic data. In summary, this dissertation presents a flexible causal inference framework that acknowledges the representation uncertainties and data heterogeneities. It enjoys three merits: improved balance to complex covariate interactions, enhanced robustness to unobservable latent confounders, and better generalizability to novel populations.
- Two essays on institutional investorsLi, Fan (Virginia Tech, 2020-07-01)In the first essay, we study mutual funds' voting on compensation-related proposals initiated by corporate management. Compared with proposals on other topics, proposals on compensation issues are more likely to be challenged by mutual funds. Consistent with active institutional influence, mutual funds are more likely to vote against management at portfolio firms that make more excess CEO pay or depict other symptoms of poor governance such as bad performance and CEO entrenchment. Both active and passive funds' votes are significant drivers of the voting outcome of a proposal. Failed proposals are associated with lower CEO pay, especially excess pay, in the following year. Say-on-pay proposals opposed by more mutual funds are also followed by lower excess CEO pay. Collectively, evidence in this paper suggests that institutions (including passive institutions) play an important role in setting CEO pay through the voting channel. The second essay examines the equity loan supply for short selling. Using detailed stock lending data, we show that active equity funds, on average, are informed, stock lenders. The stocks they lend outperform those that they do not. The stocks they recall and sell perform worse in the future than those that remain on loan. These funds avoid lending stocks when lending fees are extremely high and use the shorting market's signals to form stock-selling decisions. Our findings help explain why institutional investors lend stocks. They also highlight a new source of short-sale constraints arising from the informed loan supply.