Unsupervised Event Extraction from News and Twitter
dc.contributor.author | Xuan, Zhang | en |
dc.contributor.author | Wei, Huang | en |
dc.contributor.author | Ji, Wang | en |
dc.contributor.author | Tianyu, Geng | en |
dc.date.accessioned | 2014-05-11T13:54:39Z | en |
dc.date.available | 2014-05-11T13:54:39Z | en |
dc.date.issued | 2014-05-11 | en |
dc.description | We appreciate the help of our client, Mohamed Magdy, a Ph.D. student of DLRL, Virginia Tech. We also thank NSF, who has funded the IDEAL project (NSF IIS – 1319578). Since we are working on some publications based on this project, we are not going to share our source code at this moment. We’ll consider sharing that after papers are published. | en |
dc.description.abstract | Living in the age of big data, we are facing massive information every day, especially that from the mainstream news and the social networks. Due to its gigantic volume, one may get frustrated when trying to identify the key information which really matters. Thus, how to summarize the key information from the enormous amount of news and tweets becomes essential. Addressing this problem, this project explores the approaches to extract key events from newswires and Twitter data in an unsupervised manner, where Topic Modeling and Named Entity Recognition have been applied. Various methods have been tried regarding the different traits of news and tweets. The relevance between the news events and the corresponding Twitter events is studied as well. Tools have been developed to implement and evaluate these methods. Our experiments show that these tools can effectively extract key events from the news and tweets data sets. The tools, documents and data sets can be used for educational purposes and as a part of the IDEAL project of Virginia Tech. | en |
dc.description.sponsorship | Mohamed Magdy, DLRL, Virginia Tech | en |
dc.description.sponsorship | NSF (Grant NSF IIS - 1319578) | en |
dc.identifier.uri | http://hdl.handle.net/10919/47954 | en |
dc.language.iso | en_US | en |
dc.rights | In Copyright | en |
dc.rights.uri | http://rightsstatements.org/vocab/InC/1.0/ | en |
dc.subject | Unsupervised event extraction | en |
dc.subject | Topic model | en |
dc.subject | Named entity recognition | en |
dc.subject | Newstream and Twitter | en |
dc.subject | Deep learning | en |
dc.title | Unsupervised Event Extraction from News and Twitter | en |
dc.title.alternative | IDEAL Computational Linguistics Prototype | en |
dc.type | Presentation | en |
dc.type | Technical report | en |
Files
Original bundle
1 - 5 of 8
- Name:
- Unsupervised Event Extraction-Final Presentation.pptx
- Size:
- 1.48 MB
- Format:
- Microsoft Powerpoint XML
- Description:
- Final presentation (pptx version)
Loading...
- Name:
- Unsupervised Event Extraction-Final Presentation.pdf
- Size:
- 848.02 KB
- Format:
- Adobe Portable Document Format
- Description:
- Final presentation (PDF version)
- Name:
- Unsupervised Event Extraction-MidTerm Presentation.pptx
- Size:
- 1.09 MB
- Format:
- Microsoft Powerpoint XML
- Description:
- Middle term presentation (pptx version)
Loading...
- Name:
- Unsupervised Event Extraction-MidTerm Presentation.pdf
- Size:
- 684.05 KB
- Format:
- Adobe Portable Document Format
- Description:
- Middle term presentation (PDF version)
- Name:
- UkraineCrisisNews.zip
- Size:
- 6.01 MB
- Format:
- Unknown data format
- Description:
- A large news data set about the Ukraine Crisis story
License bundle
1 - 1 of 1
- Name:
- license.txt
- Size:
- 1.5 KB
- Format:
- Item-specific license agreed upon to submission
- Description: