Virginia Tech
    • Log in
    View Item 
    •   VTechWorks Home
    • Student Works
    • CS6604: Digital Libraries
    • View Item
    •   VTechWorks Home
    • Student Works
    • CS6604: Digital Libraries
    • View Item
    JavaScript is disabled for your browser. Some features of this site may not work without it.

    Unsupervised Event Extraction from News and Twitter

    Thumbnail
    View/Open
    Final presentation (pptx version) (1.478Mb)
    Downloads: 250
    Final presentation (PDF version) (848.0Kb)
    Downloads: 700
    Middle term presentation (pptx version) (1.093Mb)
    Downloads: 201
    Middle term presentation (PDF version) (684.0Kb)
    Downloads: 171
    A large news data set about the Ukraine Crisis story (6.014Mb)
    Downloads: 212
    A small news data set related to Apple Inc. (554.8Kb)
    Downloads: 318
    Final report (docx version) (317.6Kb)
    Downloads: 617
    Final report (PDF version) (789.8Kb)
    Downloads: 1278
    Date
    2014-05-11
    Author
    Xuan, Zhang
    Wei, Huang
    Ji, Wang
    Tianyu, Geng
    Metadata
    Show full item record
    Abstract
    Living in the age of big data, we are facing massive information every day, especially that from the mainstream news and the social networks. Due to its gigantic volume, one may get frustrated when trying to identify the key information which really matters. Thus, how to summarize the key information from the enormous amount of news and tweets becomes essential. Addressing this problem, this project explores the approaches to extract key events from newswires and Twitter data in an unsupervised manner, where Topic Modeling and Named Entity Recognition have been applied. Various methods have been tried regarding the different traits of news and tweets. The relevance between the news events and the corresponding Twitter events is studied as well. Tools have been developed to implement and evaluate these methods. Our experiments show that these tools can effectively extract key events from the news and tweets data sets. The tools, documents and data sets can be used for educational purposes and as a part of the IDEAL project of Virginia Tech.
    URI
    http://hdl.handle.net/10919/47954
    Collections
    • CS6604: Digital Libraries [19]

    If you believe that any material in VTechWorks should be removed, please see our policy and procedure for Requesting that Material be Amended or Removed. All takedown requests will be promptly acknowledged and investigated.

    Virginia Tech | University Libraries | Contact Us
     

     

    VTechWorks

    AboutPoliciesHelp

    Browse

    All of VTechWorksCommunities & CollectionsBy Issue DateAuthorsTitlesSubjectsThis CollectionBy Issue DateAuthorsTitlesSubjects

    My Account

    Log inRegister

    Statistics

    View Usage Statistics

    If you believe that any material in VTechWorks should be removed, please see our policy and procedure for Requesting that Material be Amended or Removed. All takedown requests will be promptly acknowledged and investigated.

    Virginia Tech | University Libraries | Contact Us