Social Network Project for IDEAL in CS5604

dc.contributor.author: Harb, Islam
dc.contributor.author: Jin, Yilong
dc.contributor.author: Cedeno, Vanessa
dc.contributor.author: Mallampati, Sai Ravi Kiran
dc.contributor.author: Bulusu, Bhaskara Srinivasa Bharadwaj
dc.date.accessioned: 2015-05-13T16:34:36Z
dc.date.available: 2015-05-13T16:34:36Z
dc.date.issued: 2015-05-11
dc.description.abstract: The IDEAL (Integrated Digital Event Archiving and Library) project involves VT faculty, staff, and students, along with collaborators around the world, in archiving important events and in integrating digital library and archiving approaches to support research and development related to those events. An objective of the CS5604 (Information Retrieval) course in Spring 2015 was to build a state-of-the-art information retrieval system in support of the IDEAL project. Students were divided into eight groups, each of which became expert in a specific theme of high importance to the development of the tool. The identified themes were Classifying Types, Extraction and Feature Selection, Clustering, Hadoop, LDA, NER, Reducing Noise, Social Networks and Importance, and Solr and Lucene. Our goal as a class was to return documents relevant to an arbitrary user query from a collection of tweets and the web pages they reference. The goal of the Social Network and Importance group was to develop a query-independent importance methodology for these tweets and web pages based on social-network considerations. This report proposes a method for assigning importance to the tweets and web pages using non-content features. We define two groups of features for the ranking: Twitter-specific features and account authority features. To determine the best set of features, we also analyze the individual effect of each feature on the output importance. In the end, an “importance” value is associated with each document to aid searching and browsing using Solr.
dc.description.sponsorship: IDEAL Project, US National Science Foundation grant IIS-1319578
dc.identifier.uri: http://hdl.handle.net/10919/52264
dc.language.iso: en_US
dc.rights: Creative Commons CC0 1.0 Universal Public Domain Dedication
dc.rights.uri: http://creativecommons.org/publicdomain/zero/1.0/
dc.subject: Tweets
dc.subject: Webpages
dc.subject: Ranking
dc.subject: Importance Value
dc.subject: Social Network
dc.title: Social Network Project for IDEAL in CS5604
dc.title.alternative: Tweets and Webpages Importance Values Calculation
dc.type: Presentation
dc.type: Software
dc.type: Technical report

Files

Original bundle
Now showing 1 - 5 of 18
Name:
Final Presentation.pdf
Size:
618.27 KB
Format:
Adobe Portable Document Format
Description:
Presentation_SN
Name:
Final Presentation.pptx
Size:
410.19 KB
Format:
Microsoft Powerpoint
Description:
Presentation_SN
Name:
basic_classes.py
Size:
4.99 KB
Format:
Plain Text
Description:
Tweet Script 1
Name:
license.txt
Size:
34.32 KB
Format:
Plain Text
Description:
Tweet Script 2
Name:
main.py
Size:
4.06 KB
Format:
Plain Text
Description:
Tweet Script 3
License bundle
Now showing 1 - 1 of 1
Name:
license.txt
Size:
1.5 KB
Format:
Item-specific license agreed upon to submission