Library Tweets Conversion

dc.contributor.authorDhakal, Pranaven
dc.contributor.authorBhargava, Yashen
dc.contributor.authorHerms, Annaen
dc.contributor.authorPowell, Kennethen
dc.contributor.authorBurdisso, Danielen
dc.date.accessioned2021-12-17T01:13:25Zen
dc.date.available2021-12-17T01:13:25Zen
dc.date.issued2021-12-16en
dc.description.abstractThe Digital Library Research Laboratory (DLRL) has collected billions of tweets over the course of years. These tweets were gathered using three different data collection tools, and have been organized into collections based on keywords. The different collection tools used were: Social Feed Manager (SFM), yourTwapperKeeper (YTK), and Digital Methods Initiative Twitter Capture and Analysis Toolset (DMI-TCAT). Because each of these tools store the tweets differently, the DLRL aims to consolidate these tweets so the Library can provide a service that allows the campus to easily access and use this data. Our job was to come up with a unified JSON format that all of these tweets could be represented by and to provide a way to convert them to this new format. Additionally, we had to provide suitable collection-level information for each distinct data collection that showed the connections between tweets and the collections they belonged to. To accomplish this, we have six conversion scripts. Three of these are for converting the individual tweets, and three of them are for compiling the collection-level metadata and preserving the relationship between tweets and collections. When run with the Twitter data, they provide a unified way to digest all of the collected data regardless of which method it was obtained by.en
dc.description.notesPDF of final report = Library6BtweetsReport.pdf. PDF of final presentation = Library6BtweetsPresentation.pdf. Editable version of the report (e.g., a Word document) = Library6BtweetsReport.docx. Editable version of the presentation (e.g., a PowerPoint file) = Library6BtweetsReport.pptx.en
dc.identifier.urihttp://hdl.handle.net/10919/107086en
dc.language.isoen_USen
dc.publisherVirginia Techen
dc.rightsCC0 1.0 Universalen
dc.rights.urihttp://creativecommons.org/publicdomain/zero/1.0/en
dc.subjectYTKen
dc.subjectyourtwapperkeeperen
dc.subjectTwitteren
dc.subjectData conversionen
dc.subjectCollection-levelen
dc.subjectPythonen
dc.subjecttweeten
dc.subjectDigital Methods Initiative Twitter Capture and Analysis Toolseten
dc.subjectDMI-TCATen
dc.subjectSocial Feed Manageren
dc.subjectSFMen
dc.subjectMySQLen
dc.subjectJSONen
dc.subjectLibraryen
dc.subjectLibrary tweets dataen
dc.titleLibrary Tweets Conversionen
dc.typePresentationen
dc.typeReporten

Files

Original bundle
Now showing 1 - 4 of 4
Loading...
Thumbnail Image
Name:
Library6BtweetsReport.pdf
Size:
1.28 MB
Format:
Adobe Portable Document Format
Name:
Library6BtweetsReport.docx
Size:
1.36 MB
Format:
Microsoft Word XML
Loading...
Thumbnail Image
Name:
Library6BtweetsPresentation.pdf
Size:
1.97 MB
Format:
Adobe Portable Document Format
Name:
Library6BtweetsPresentation.pptx
Size:
3.33 MB
Format:
Microsoft Powerpoint XML
License bundle
Now showing 1 - 1 of 1
Name:
license.txt
Size:
1.5 KB
Format:
Item-specific license agreed upon to submission
Description: