IDEAL Pages
dc.contributor.author | Farghally, Mohammed | en |
dc.contributor.author | Elbery, Ahmed | en |
dc.date.accessioned | 2014-05-11T00:13:48Z | en |
dc.date.available | 2014-05-11T00:13:48Z | en |
dc.date.issued | 2014-05-10 | en |
dc.description | The submitted files include the full technical report, midterm presentation, final presentation, and complete source code for the document indexing as well as for the Web interface. We would like to acknowledge NSF for funding the project under the grant IIS - 1319578: Integrated Digital Event Archiving and Library (IDEAL) For a working URL of the project results, please contact Mohamed Magdy (mmagdy@vt.edu) or Edward Fox (fox@vt.edu) or visit http://www.eventsarchive.org/ | en |
dc.description.abstract | The main goal of this project is to provide a convenient Web enabled interface to a large collection of event-related webpages supporting the two main services of browsing and searching. We first studied the events and decided what fields are required to build the events index based on the dataset available to us. We then configured a SolrCloud with a collection based on these fields in the Schema.xml file. Then we built a Hadoop Map-Reduce function along with SolrCloud to index documents related to the data about 60 events crawled from the Web. Then we were able to find a way to interface with the Solr server and indexed documents through a PHP server application. Finally, we were able to design a convenient user interface that allows users to browse the documents by event category and event name as well as to search the document collection for particular keywords. | en |
dc.description.sponsorship | Mohammed Magdy, Digital Libraries lab , Virginia Tech | en |
dc.description.sponsorship | NSF IIS - 1319578: Integrated Digital Event Archiving and Library (IDEAL) | en |
dc.identifier.uri | http://hdl.handle.net/10919/47952 | en |
dc.language.iso | en_US | en |
dc.rights | Creative Commons CC0 1.0 Universal Public Domain Dedication | en |
dc.rights.uri | http://creativecommons.org/publicdomain/zero/1.0/ | en |
dc.subject | Events Collection | en |
dc.subject | Web Interface | en |
dc.subject | Indexing large collections | en |
dc.subject | Solr Cloud | en |
dc.subject | Hadoop | en |
dc.subject | Solarium client | en |
dc.title | IDEAL Pages | en |
dc.type | Presentation | en |
dc.type | Software | en |
dc.type | Technical report | en |
Files
Original bundle
1 - 5 of 7
Loading...
- Name:
- IDEAL WebPages Midterm.pdf
- Size:
- 197.84 KB
- Format:
- Adobe Portable Document Format
- Description:
- Midterm Project Presentation in PDF
- Name:
- IDEAL WebPages Midterm.pptx
- Size:
- 103.99 KB
- Format:
- Microsoft Powerpoint XML
- Description:
- Midterm Project Presentation in PPTX
Loading...
- Name:
- IDEALwebages Final.pdf
- Size:
- 1.06 MB
- Format:
- Adobe Portable Document Format
- Description:
- Final Project Presentation in PDF
- Name:
- IDEALwebages Final.pptx
- Size:
- 1.04 MB
- Format:
- Microsoft Powerpoint XML
- Description:
- Final Project Presentation in PPTX
- Name:
- IDEAL_PAGES.zip
- Size:
- 4.28 MB
- Format:
- Unknown data format
- Description:
- Project's complete source code (Indexing + Web Interface)
License bundle
1 - 1 of 1
- Name:
- license.txt
- Size:
- 1.5 KB
- Format:
- Item-specific license agreed upon to submission
- Description: