VTechWorks Repository :: Browsing by Author "Chitturi, Kiran"

Browsing by Author "Chitturi, Kiran"

Now showing 1 - 10 of 10

Building CTRnet Digital Library Services using Archive-It and LucidWorks Big Data Software
Chitturi, Kiran (Virginia Tech, 2014-03-27)
When a crisis occurs, information flows rapidly in the Web through social media, blogs, and news articles. The shared information captures the reactions, impacts, and responses from the government as well as the public. Later, researchers, scholars, students, and others seek information about earlier events, sometimes for cross-event analysis or comparison. There are very few integrated systems which try to collect and permanently archive the information about an event and provide access to the crisis information at the same time. In this thesis, we describe the CTRnet Digital Library and Archive which aims to permanently archive crisis event information by using Archive-It services and then provide access to the archived information by using LucidWorks Big Data software. Through the Big Data (LWBD) software, we take advantage of text extraction, clustering, similarity, annotation, and indexing services and build digital libraries with the generated metadata that will be helpful for the system stakeholders to locate information about an event. Through this study, we collected data for 46 crises events using Archive-It. We built a CTRnet DL prototype and its services for the ``Boston Marathon Bombing" collection by using the components of LucidWorks Big Data. Running LucidWorks Big Data on a 30 node Hadoop cluster accelerates the sub-workflows processing and also provides fault tolerant execution. LWBD sub-workflows, ``ingest" and ``extract", processed the textual data present in the WARC files. Other sub-workflows ``kmeans", ``simdoc", and ``annotate" helped in grouping the search-results, deleting the duplicates and providing metadata for additional facets in the CTRnet DL prototype, respectively.
Crawling
Fox, Edward A.; Khandeparker, Ashwin S. (2012-11-28)
This module covers the basic concepts of Web crawling, policies, techniques and how these can be applied to Digital Libraries.
Crisis, Tragedy, and Recovery Network Digital Library (CTRnet)
Chitturi, Kiran; Fox, Edward A. (2013-01-10)
This presentation outlines the goals of the Crisis, Tragedy and Recovery Network (CTRnet) project. These goals include researching the problems of integrating content, community, services related to crisis, tragedies, and recovery; integrating heterogeneous information in a specific domain, making it accessible, and preserving it for long-term reuse; extending the scope of digital libraries so they are closely but flexibly coupled with a wide variety of services to support diverse emerging communities; and supporting information exploration with advanced methods (Stepping Stones and Pathways (SSP), PathRank, and Storytelling) that facilitate searching, browsing, and discovery.
Information Retrieval System Evaluation
Wei, Shiyi; Suwardiman, Victoria; Swaminathan, Anand (2012-10-03)
The module introduces the evaluation in information retrieval. It focuses on the standard measurement of system effectiveness through relevance judgments.
LucidWorks: Advanced Searching cURL
Makkapati, Hemanth; Subbiah, Rajesh; Kaw, Rushi (2012-10-07)
This module focuses on advanced search techniques using Apache Solr through cURL. Successful completion of this module will enable students to employ advanced search techniques based on multi-values, multi-fields, phrase queries, query term proximity, boosting, etc. Also, students will be able to sort and display returned results in various ways.
Overview of LucidWorks Big Data Software
Chitturi, Kiran (2012-09-16)
This module introduces the basic concepts and the overview of LucidWorks Big Data software that is specifically designed for searching, discovery, and analysis of massive content sets.
Real-time Archiving of Spontaneous Events (Use-Case: Hurricane Sandy)
Chitturi, Kiran (2012-12-03)
This presentation describes the goals and purpose of the Crisis, Tragedy and Recovery network (CTRnet) and how CTRnet's efforts can be applied to real-life disasters such as Hurricane Sandy.
Relevance Feedback and Query Expansion
Wu, Sichao; Zhang, Yao (2012-10-17)
This module introduces the methods to improve the recall of information retrieval systems, mainly focuses on relevance feedback and query expansion.
Text Classification Using Mahout
Alam, Maksudul; Arifuzzaman, S. M.; Bhuiyan, Md Hasanuzzaman (2012-11-06)
This module focuses on classification of text using Apache Mahout. After successful completion of this module, students will be able to explain and apply methods of classification, correctly classify a set of documents using Apache Mahout, and construct and apply workflows for text classification using Apache Mahout.
Text Clustering Using LucidWorks and Apache Mahout
Chen, Liangzhe; Lin, Xiao; Wood, Andrew (2012-11-17)
This module introduces algorithms and evaluation metrics for flat clustering. We focus on the usage of LucidWorks big data analysis software and Apache Mahout, an open source machine learning library in clustering of document collections with the k-means algorithm.

Browsing by Author "Chitturi, Kiran"

Results Per Page

Sort Options