Event Trend Detector

dc.contributor.author: Ward, Ryan
dc.contributor.author: Lee, Jun
dc.contributor.author: Beard, Stuart
dc.contributor.author: Edwards, Skylar
dc.contributor.author: Su, Spencer
dc.date.accessioned: 2018-05-10T06:54:25Z
dc.date.available: 2018-05-10T06:54:25Z
dc.date.issued: 2018-05-07
dc.description.abstract: The Global Event and Trend Archive Research (GETAR) project is supported by NSF (IIS-1619028 and 1619371) through 2019. It will devise interactive, integrated, digital library/archive systems coupled with linked and expert-curated webpage/tweet collections. In support of GETAR, the 2017 project built a tool that scrapes the news to identify important global events. It generates seeds (URLs of relevant webpages, as well as Twitter-related hashtags, keywords, and mentions). A display of the results can be seen from the hall outside 2030 Torgersen Hall. This project extends that work in three ways. First, the quality of the earlier work has been improved, as shown by changes to the clustering algorithm and to the user interface that displays clusters of global events. Second, in addition to events reported in the news, trends have been identified, and a database of trends and related events was built, with a corresponding user interface designed to the client's preferences. Third, the results of the detection are connected to software for collecting tweets and crawling webpages, so automated daily runs find and archive webpages related to each trend and event. The final deliverables include a trend detection feature based on Reddit news, integration of Google Trends into trend detection, an improved clustering algorithm that produces more accurate k-means clusters, an improved UI for important global events that follows the client's requests, and an aesthetically pleasing UI to display the trend information. Work accomplished included setting up a table of tagged entities for trend detection, configuring the clustering and trends database to work with the team's personal machines, and completing the deliverables. Many lessons were learned regarding the importance of using existing tools, starting early, doing research, having regular meetings, and having good documentation.
dc.description.notes: EventTrendDetectorReport: This file covers the project in its entirety, from start to finish. It includes a developer's manual, a user manual, and key design aspects. Refer to this file for any questions about the trend detection project. See the Word and PDF versions. EventTrendDetectorPresentation: This file is the final presentation given on the trend detector. It covers key design aspects and the technologies used. See the PowerPoint and PDF versions. CS4624-EventTrendDetector: This zip file contains the Python scripts and PHP files needed to run the code on a local machine. It also contains 20 SVG files with graph data from Reddit and Google.
dc.description.sponsorship: NSF (IIS-1619028 and 1619371)
dc.identifier.uri: http://hdl.handle.net/10919/83205
dc.language.iso: en_US
dc.publisher: Virginia Tech
dc.rights: In Copyright
dc.rights.uri: http://rightsstatements.org/vocab/InC/1.0/
dc.subject: Trend Detection
dc.subject: Trends
dc.subject: Python
dc.subject: GETAR
dc.subject: Reddit
dc.subject: Google
dc.subject: News trends
dc.title: Event Trend Detector
dc.type: Presentation
dc.type: Report
dc.type: Software

Files

Original bundle
Name: EventTrendDetectorPresentation.pdf
Size: 1.14 MB
Format: Adobe Portable Document Format

Name: EventTrendDetectorPresentation.pptx
Size: 4.28 MB
Format: Microsoft Powerpoint XML

Name: CS4624-EventTrendDetector.zip
Size: 151.84 KB
Format:

Name: EventTrendDetectorReport.docx
Size: 6.21 MB
Format: Microsoft Word XML

Name: EventTrendDetectorReport.pdf
Size: 1.48 MB
Format: Adobe Portable Document Format

License bundle
Name: license.txt
Size: 1.5 KB
Format: Item-specific license agreed upon to submission
Description: