IDEAL Pages

dc.contributor.authorFarghally, Mohammeden
dc.contributor.authorElbery, Ahmeden
dc.date.accessioned2014-05-11T00:13:48Zen
dc.date.available2014-05-11T00:13:48Zen
dc.date.issued2014-05-10en
dc.descriptionThe submitted files include the full technical report, midterm presentation, final presentation, and complete source code for the document indexing as well as for the Web interface. We would like to acknowledge NSF for funding the project under the grant IIS - 1319578: Integrated Digital Event Archiving and Library (IDEAL) For a working URL of the project results, please contact Mohamed Magdy (mmagdy@vt.edu) or Edward Fox (fox@vt.edu) or visit http://www.eventsarchive.org/en
dc.description.abstractThe main goal of this project is to provide a convenient Web enabled interface to a large collection of event-related webpages supporting the two main services of browsing and searching. We first studied the events and decided what fields are required to build the events index based on the dataset available to us. We then configured a SolrCloud with a collection based on these fields in the Schema.xml file. Then we built a Hadoop Map-Reduce function along with SolrCloud to index documents related to the data about 60 events crawled from the Web. Then we were able to find a way to interface with the Solr server and indexed documents through a PHP server application. Finally, we were able to design a convenient user interface that allows users to browse the documents by event category and event name as well as to search the document collection for particular keywords.en
dc.description.sponsorshipMohammed Magdy, Digital Libraries lab , Virginia Techen
dc.description.sponsorshipNSF IIS - 1319578: Integrated Digital Event Archiving and Library (IDEAL)en
dc.identifier.urihttp://hdl.handle.net/10919/47952en
dc.language.isoen_USen
dc.rightsCreative Commons CC0 1.0 Universal Public Domain Dedicationen
dc.rights.urihttp://creativecommons.org/publicdomain/zero/1.0/en
dc.subjectEvents Collectionen
dc.subjectWeb Interfaceen
dc.subjectIndexing large collectionsen
dc.subjectSolr Clouden
dc.subjectHadoopen
dc.subjectSolarium clienten
dc.titleIDEAL Pagesen
dc.typePresentationen
dc.typeSoftwareen
dc.typeTechnical reporten

Files

Original bundle
Now showing 1 - 5 of 7
Loading...
Thumbnail Image
Name:
IDEAL WebPages Midterm.pdf
Size:
197.84 KB
Format:
Adobe Portable Document Format
Description:
Midterm Project Presentation in PDF
Name:
IDEAL WebPages Midterm.pptx
Size:
103.99 KB
Format:
Microsoft Powerpoint XML
Description:
Midterm Project Presentation in PPTX
Loading...
Thumbnail Image
Name:
IDEALwebages Final.pdf
Size:
1.06 MB
Format:
Adobe Portable Document Format
Description:
Final Project Presentation in PDF
Name:
IDEALwebages Final.pptx
Size:
1.04 MB
Format:
Microsoft Powerpoint XML
Description:
Final Project Presentation in PPTX
Name:
IDEAL_PAGES.zip
Size:
4.28 MB
Format:
Unknown data format
Description:
Project's complete source code (Indexing + Web Interface)
License bundle
Now showing 1 - 1 of 1
Name:
license.txt
Size:
1.5 KB
Format:
Item-specific license agreed upon to submission
Description: