Segmentation Algorithm

dc.contributor.author: Gad, Samah [en]
dc.date.accessioned: 2014-04-09T18:27:23Z [en]
dc.date.available: 2014-04-09T18:27:23Z [en]
dc.date.issued: 2014-04-09 [en]
dc.description.abstract: We developed a dynamic temporal segmentation algorithm that wraps around topic modeling algorithms to identify change points where significant shifts in topics occur. The main task of the segmentation algorithm is to automatically partition the total time period covered by the documents in the collection so that segment boundaries mark important periods of temporal evolution and re-organization. The algorithm moves through the data in temporal order and evaluates two adjacent windows at a time, assuming a given segmentation granularity (e.g., discrete days, weeks, or months). This granularity varies from one application to another and is decided by domain experts. We evaluate adjacent windows by comparing their underlying topic distributions and quantifying the terms they have in common along with those terms' probabilities. We quantify common terms by their overlap, which can be captured in a contingency table. [en]
dc.description.sponsorship: NEH Office of Digital Humanities Digging into Data Program [en]
dc.identifier.uri: http://hdl.handle.net/10919/47101 [en]
dc.language.iso: en_US [en]
dc.rights: In Copyright [en]
dc.rights.uri: http://rightsstatements.org/vocab/InC/1.0/ [en]
dc.title: Segmentation Algorithm [en]
dc.type: Software [en]
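
The window comparison described in the abstract can be illustrated with a minimal sketch. This is not the code in Segmentation_Algorithm.zip. It assumes each window has already been reduced by some topic model to a list of topics, each represented as a term-to-probability mapping. The names overlap_table, shared_mass, and boundary_scores, the min-based probability overlap, and the best-match averaging are all hypothetical choices made for illustration.

```python
from typing import Dict, List

# Assumed representation: a topic maps each term to its probability in that topic.
Topic = Dict[str, float]


def overlap_table(left: List[Topic], right: List[Topic]) -> List[List[int]]:
    """Contingency table: cell [i][j] counts the terms shared by
    topic i of the left window and topic j of the right window."""
    return [[len(set(a) & set(b)) for b in right] for a in left]


def shared_mass(a: Topic, b: Topic) -> float:
    """Probability mass on shared terms, using the smaller of the two
    probabilities for each term (an illustrative choice, not the author's)."""
    return sum(min(p, b[t]) for t, p in a.items() if t in b)


def boundary_scores(windows: List[List[Topic]]) -> List[float]:
    """Hypothetical change score for each pair of adjacent windows:
    1 minus the mean, over left-window topics, of the best shared mass
    with any right-window topic. Higher means a larger topic shift."""
    scores = []
    for left, right in zip(windows, windows[1:]):
        best = [max(shared_mass(a, b) for b in right) for a in left]
        scores.append(1.0 - sum(best) / len(best))
    return scores


# Toy data at a weekly granularity (made up for this sketch).
week1 = [{"election": 0.5, "vote": 0.3, "poll": 0.2}, {"flu": 0.6, "vaccine": 0.4}]
week2 = [{"election": 0.4, "vote": 0.4, "ballot": 0.2}, {"flu": 0.5, "vaccine": 0.5}]
week3 = [{"storm": 0.7, "flood": 0.3}, {"flu": 0.5, "clinic": 0.5}]

table_12 = overlap_table(week1, week2)  # [[2, 0], [0, 2]]
table_23 = overlap_table(week2, week3)  # [[0, 0], [0, 1]]
scores = boundary_scores([week1, week2, week3])
```

In this toy data the second boundary scores higher than the first, so a threshold or peak-picking rule (again an assumption, since the record does not describe one) would place a segment boundary between week2 and week3.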

Files

Original bundle
Name: Segmentation_Algorithm.zip
Size: 88.07 MB
Format: Unknown data format
License bundle
Name: license.txt
Size: 1.5 KB
Format: Item-specific license agreed upon to submission