Are Repositories Impeding Big Data Reuse?

dc.contributor.authorXie, Zhiwuen
dc.contributor.authorGalad, Andrejen
dc.contributor.authorChen, Yinlinen
dc.contributor.authorFox, Edward A.en
dc.date.accessioned2016-06-26T16:07:09Zen
dc.date.available2016-06-26T16:07:09Zen
dc.date.issued2016-06-14en
dc.description.abstractIn this intentionally provocative presentation, we question the scalability of popular digital repositories and whether they are suitable for big data reuse. Are the layers of API these repositories have painted over file system primitives necessary? How essential is it for the repository to insist on being the sole manager of the content, and arranging files in ways to prevent access other than from their own APIs? We explore these questions from the perspective of big data reuse, and describe controlled reuse experiments against Fedora 4 to evaluate the cost of these practices.en
dc.description.sponsorshipIMLS: LG-71-16-0037en
dc.identifier.urihttp://hdl.handle.net/10919/71474en
dc.language.isoen_USen
dc.publisherVirginia Techen
dc.relation.ispartof11th International Conference on Open Repositories (OR2016)en
dc.rightsCreative Commons Attribution-NonCommercial-ShareAlike 3.0 United Statesen
dc.rights.urihttp://creativecommons.org/licenses/by-nc-sa/3.0/us/en
dc.subjectInstitutional repositoryen
dc.subjectData managementen
dc.subjectBig dataen
dc.subjectScalabilityen
dc.subjectThroughputen
dc.titleAre Repositories Impeding Big Data Reuse?en
dc.typeArticleen

Files

Original bundle
Now showing 1 - 2 of 2
Loading...
Thumbnail Image
Name:
OR2016-repo-throughput.pdf
Size:
151.06 KB
Format:
Adobe Portable Document Format
Description:
Preprint: submitted version
Loading...
Thumbnail Image
Name:
2016-OR-big-data-slides.pdf
Size:
12.72 MB
Format:
Adobe Portable Document Format
Description:
Slides for presentation at OR2016
License bundle
Now showing 1 - 1 of 1
Name:
license.txt
Size:
1.5 KB
Format:
Item-specific license agreed upon to submission
Description: