Making diffusion work for you: Classification sans text, finding culprits and filling missing values

dc.contributor.authorSundareisan, Shashidharen
dc.contributor.committeechairPrakash, B. Adityaen
dc.contributor.committeememberBatra, Dhruven
dc.contributor.committeememberRamakrishnan, Narenen
dc.contributor.departmentComputer Scienceen
dc.date.accessioned2014-07-25T08:00:49Zen
dc.date.available2014-07-25T08:00:49Zen
dc.date.issued2014-07-24en
dc.description.abstractCan we find people infected with the flu virus even though they did not visit a doctor? Can the temporal features of a trending hashtag or a keyword indicate which topic it belongs to without any textual information? Given a history of interactions between blogs and news websites, can we predict blogs posts/news websites that are not in the sample but talk about the "the state of the economy" in 2008? These questions have two things in common: a network (social networks or human contact networks) and a virus (meme, keyword or the flu virus) diffusing over the network. We can think of interactions like memes, hashtags, influenza infections, computer viruses etc., as viruses spreading in a network. This treatment allows for the usage of epidemiologically inspired models to study or model these interactions. Understanding the complex propagation dynamics involved in information diffusion with the help of these models uncovers various non-trivial and interesting results. In this thesis we propose (a) A fast and efficient algorithm NetFill, which can be used to find quantitatively and qualitatively correct infected nodes, not in the sample and finding the culprits and (b) A method, SansText that can be used to find out which topic a keyword/hashtag belongs to just by looking at the popularity graph of the keyword without textual analysis. The results derived in this thesis can be used in various areas like epidemiology, news and protest detection, viral marketing and it can also be used to reduce sampling errors in graphs.en
dc.description.degreeMaster of Scienceen
dc.format.mediumETDen
dc.identifier.othervt_gsexam:3444en
dc.identifier.urihttp://hdl.handle.net/10919/49678en
dc.publisherVirginia Techen
dc.rightsIn Copyrighten
dc.rights.urihttp://rightsstatements.org/vocab/InC/1.0/en
dc.subjectData Miningen
dc.subjectSocial Networksen
dc.subjectEpidemiologyen
dc.subjectCulpritsen
dc.subjectMissing nodesen
dc.subjectDiffusionen
dc.subjectProtestsen
dc.subjectClassificationen
dc.titleMaking diffusion work for you: Classification sans text, finding culprits and filling missing valuesen
dc.typeThesisen
thesis.degree.disciplineComputer Science and Applicationsen
thesis.degree.grantorVirginia Polytechnic Institute and State Universityen
thesis.degree.levelmastersen
thesis.degree.nameMaster of Scienceen

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Sundareisan_S_T_2014.pdf
Size:
2.07 MB
Format:
Adobe Portable Document Format

Collections