Seeing the Forest for the Trees: New approaches to Characterizing and Forecasting Cascades
Files
TR Number
Date
Authors
Journal Title
Journal ISSN
Volume Title
Publisher
Abstract
Cascades are a popular construct to observe and study information propagation (or diffusion) in social media such as Twitter and are defined using notions of influence, activity, or discourse commonality (e.g., hashtags). While these notions of cascades lead to different perspectives, primarily cascades are modeled as trees. We argue in this thesis an alternative viewpoint of cascades as forests (of trees) which yields a richer vocabulary of features to understand information propagation. We propose to develop a framework to extract forests and analyze their growth by studying their evolution at the tree-level and at the node-level. Furthermore, we outline four different problems that use the forest framework. First, we show that such forests of information cascades can be used to design counter-contagion algorithms to disrupt the spread of negative campaigns or rumors. Secondly, we demonstrate how such forests of information cascades can give us a rich set of features (structural and temporal), which can be used to forecast information flow. Thirdly, we argue that cascades modeled as forests can help us glean social network sensors to detect future contagious outbreaks that occur in the social network. To conclude, we show preliminary results of an approach - a generative model, that can describe information cascades modeled as forests and can generate synthetic cascades with empirical properties mirroring cascades extracted from Twitter.