A Dynamic Characteristic Aware Index Structure Optimized for Real-world Datasets

dc.contributor.author: Yang, Jin (en)
dc.contributor.author: Yoon, Heejin (en)
dc.contributor.author: Yun, Gyeongchan (en)
dc.contributor.author: Noh, Sam (en)
dc.contributor.author: Choi, Young-ri (en)
dc.date.accessioned: 2025-01-09T17:38:12Z (en)
dc.date.available: 2025-01-09T17:38:12Z (en)
dc.date.issued: 2024-12 (en)
dc.date.updated: 2025-01-01T08:52:44Z (en)
dc.description.abstract: Many real-world datasets are complex and dynamic: their key densities vary across the key space, and their key distributions change over time. It is challenging for an index structure to efficiently support all key operations for data management, in particular search, insert, and scan, on such dynamic datasets. In this paper, we present DyTIS (Dynamic dataset Targeted Index Structure), an index that targets dynamic datasets. DyTIS, though based on the structure of Extendible hashing, leverages the cumulative distribution function (CDF) of a dataset's key distribution, and learns and adjusts its structure as the dataset grows. The key novelty behind DyTIS is to group keys by their natural key order and to keep the keys in each bucket sorted, so that scan operations can be supported within a hash index. We also define what we refer to as a dynamic dataset and propose a means to quantify its dynamic characteristics. Our experimental results show that DyTIS provides higher performance than the state-of-the-art learned index for the dynamic datasets considered. We also analyze the effects of the dynamic characteristics of datasets, including sequential datasets, as well as the effect of multiple threads on the performance of the indexes. (en)
dc.description.version: Accepted version (en)
dc.format.mimetype: application/pdf (en)
dc.identifier.doi: https://doi.org/10.1145/3707642 (en)
dc.identifier.uri: https://hdl.handle.net/10919/124023 (en)
dc.language.iso: en (en)
dc.publisher: ACM (en)
dc.rights: Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International (en)
dc.rights.holder: The author(s) (en)
dc.rights.uri: http://creativecommons.org/licenses/by-nc-sa/4.0/ (en)
dc.title: A Dynamic Characteristic Aware Index Structure Optimized for Real-world Datasets (en)
dc.type: Article - Refereed (en)
dc.type.dcmitype: Text (en)
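
Illustrative sketch (not part of the record, and not the authors' implementation): the abstract describes grouping keys by natural key order inside an Extendible-hashing-style structure and keeping each bucket sorted so that scans work. One common way to get that behavior is to index the directory by the most significant key bits, which is what the Python sketch below assumes. The names `OrderedExtendibleHash`, `Bucket`, `KEY_BITS`, and `capacity` are hypothetical, keys are assumed to be 16-bit unsigned integers, and the CDF-based learning and restructuring that DyTIS performs is omitted entirely.

```python
from bisect import bisect_left

KEY_BITS = 16  # assumed key width for this sketch


class Bucket:
    def __init__(self, depth):
        self.depth = depth  # number of leading key bits this bucket covers
        self.keys = []      # kept sorted at all times
        self.vals = []


class OrderedExtendibleHash:
    """Extendible-hashing-style directory indexed by the top key bits."""

    def __init__(self, capacity=4):
        self.capacity = capacity
        self.global_depth = 0
        self.dir = [Bucket(0)]

    def _slot(self, key):
        # Using the most significant bits keeps directory slots in key order.
        return key >> (KEY_BITS - self.global_depth)

    def search(self, key):
        b = self.dir[self._slot(key)]
        i = bisect_left(b.keys, key)
        return b.vals[i] if i < len(b.keys) and b.keys[i] == key else None

    def insert(self, key, val):
        b = self.dir[self._slot(key)]
        i = bisect_left(b.keys, key)
        if i < len(b.keys) and b.keys[i] == key:
            b.vals[i] = val  # overwrite an existing key
            return
        b.keys.insert(i, key)
        b.vals.insert(i, val)
        # Split until the bucket holding `key` fits (or cannot split further).
        while len(b.keys) > self.capacity and b.depth < KEY_BITS:
            self._split(b)
            b = self.dir[self._slot(key)]

    def _split(self, b):
        if b.depth == self.global_depth:
            # Double the directory; each old slot becomes two adjacent slots.
            self.dir = [x for x in self.dir for _ in range(2)]
            self.global_depth += 1
        pivot_bit = KEY_BITS - b.depth - 1
        lo, hi = Bucket(b.depth + 1), Bucket(b.depth + 1)
        for k, v in zip(b.keys, b.vals):  # sorted input keeps halves sorted
            target = hi if (k >> pivot_bit) & 1 else lo
            target.keys.append(k)
            target.vals.append(v)
        # Slots that pointed at b are contiguous; repoint them half and half.
        span = 1 << (self.global_depth - b.depth)
        start = self.dir.index(b)
        for j in range(span):
            self.dir[start + j] = lo if j < span // 2 else hi

    def scan(self, start_key, count):
        """Return up to `count` (key, value) pairs with key >= start_key."""
        out = []
        slot = self._slot(start_key)
        while slot < len(self.dir) and len(out) < count:
            b = self.dir[slot]
            i = bisect_left(b.keys, start_key)
            out.extend(zip(b.keys[i:], b.vals[i:]))
            while slot < len(self.dir) and self.dir[slot] is b:
                slot += 1  # skip the remaining slots of the same bucket
        return out[:count]
```

Because both the directory slots and the keys inside each bucket follow key order, `scan` only has to walk buckets left to right. Note that in this plain sketch a cluster of nearby keys forces repeated splits and directory doublings; handling such skew is the kind of problem the abstract says DyTIS addresses by using the CDF of the key distribution.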

Files

Original bundle
Name: 3707642.pdf
Size: 1.84 MB
Format: Adobe Portable Document Format
Description: Accepted version
License bundle
Name: license.txt
Size: 1.5 KB
Format: Item-specific license agreed upon to submission