Are Repositories Impeding Big Data Reuse?
| dc.contributor.author | Xie, Zhiwu | en |
| dc.contributor.author | Galad, Andrej | en |
| dc.contributor.author | Chen, Yinlin | en |
| dc.contributor.author | Fox, Edward A. | en |
| dc.date.accessioned | 2016-06-26T16:07:09Z | en |
| dc.date.available | 2016-06-26T16:07:09Z | en |
| dc.date.issued | 2016-06-14 | en |
| dc.description.abstract | In this intentionally provocative presentation, we question the scalability of popular digital repositories and whether they are suitable for big data reuse. Are the layers of APIs that these repositories have painted over file system primitives necessary? How essential is it for a repository to insist on being the sole manager of its content, and to arrange files in ways that prevent access other than through its own APIs? We explore these questions from the perspective of big data reuse, and describe controlled reuse experiments against Fedora 4 to evaluate the cost of these practices. | en |
| dc.description.sponsorship | IMLS: LG-71-16-0037 | en |
| dc.identifier.uri | http://hdl.handle.net/10919/71474 | en |
| dc.language.iso | en_US | en |
| dc.publisher | Virginia Tech | en |
| dc.relation.ispartof | 11th International Conference on Open Repositories (OR2016) | en |
| dc.rights | Creative Commons Attribution-NonCommercial-ShareAlike 3.0 United States | en |
| dc.rights.uri | http://creativecommons.org/licenses/by-nc-sa/3.0/us/ | en |
| dc.subject | Institutional repository | en |
| dc.subject | Data management | en |
| dc.subject | Big data | en |
| dc.subject | Scalability | en |
| dc.subject | Throughput | en |
| dc.title | Are Repositories Impeding Big Data Reuse? | en |
| dc.type | Article | en |