Integrated Digital Event Archiving and Library (IDEAL): Preview of Award 1319578 - Annual Project Report

dc.contributor: Virginia Tech. Department of Computer Science. Digital Library Research Laboratory [en]
dc.contributor.author: Fox, Edward A. [en]
dc.contributor.author: Hanna, Kristine [en]
dc.contributor.author: Kavanaugh, Andrea L. [en]
dc.contributor.author: Sheetz, Steven D. [en]
dc.contributor.author: Shoemaker, Donald J. [en]
dc.contributor.department: Digital Library Research Laboratory [en]
dc.contributor.department: Computer Science [en]
dc.date.accessed: 2014-09-18 [en]
dc.date.accessioned: 2015-05-29T20:36:39Z [en]
dc.date.available: 2015-05-29T20:36:39Z [en]
dc.date.issued: 2014-07-09 [en]
dc.description.abstract: The goals of this project are to ingest tweets and Web-based content from social media and the general Web, including news and governmental information. In addition to archiving materials found, the project team will build an information system that includes related metadata and knowledge bases, consistent with the 5S (Societies, Scenarios, Spaces, Structures, Streams) framework, along with results from our intelligent focused crawler, to support comprehensive access to event related content. With the support of key partners, the IDEAL team will undertake important research, education, and dissemination efforts, to achieve three complementary objectives: 1. Collecting: The project team will spot, identify, and make sense of interesting events. We also will accept specific or general requests about types of events. Given resource and sampling constraints, we will integrate methods to identify appropriate URLs as seeds, and specify when to start crawling and when to stop, with regard to each event or sub-event. We will integrate focused crawling and filtering approaches in order to ingest content and generate new collections, with high precision and recall. 2. Archiving & Accessing: Permanent archiving, and access to those archives, will be ensured by our partner, Internet Archive (IA). Immediate access to ingested content will be facilitated through big data software built on top of our new Hadoop cluster. 3. Analyzing & Visualizing: We will provide a wide range of integrated services beyond the usual (faceted) browsing and searching, including: classification, clustering, summarization, text mining, theme and topic identification, and visualization. [en]
dc.format.extent: 10 pages [en]
dc.format.mimetype: application/pdf [en]
dc.identifier.citation: Fox, Edward A., Kristine Hanna, Andrea Kavanaugh, Steven Sheetz, Donald Shoemaker. Integrated Digital Event Archiving and Library (IDEAL). 2014 [en]
dc.identifier.uri: http://hdl.handle.net/10919/52853 [en]
dc.identifier.url: http://eventsarchive.org/sites/default/files/IDEALyear1Submitted.pdf [en]
dc.language.iso: en_US [en]
dc.rights: In Copyright [en]
dc.rights.uri: http://rightsstatements.org/vocab/InC/1.0/ [en]
dc.subject: Digital event archiving [en]
dc.subject: Twitter [en]
dc.subject: Social media [en]
dc.subject: Information systems [en]
dc.title: Integrated Digital Event Archiving and Library (IDEAL): Preview of Award 1319578 - Annual Project Report [en]
dc.type: Technical report [en]
dc.type.dcmitype: Text [en]

Files

Original bundle
Name: 2014_Integrated_Digital_Event_Archiving.pdf
Size: 180.88 KB
Format: Adobe Portable Document Format