A Discovery Portal for Twitter Collections

dc.contributor.authorCasery, Christinaen
dc.contributor.authorAnderson, Quinnen
dc.contributor.authorOmotosho, Abdulen
dc.contributor.authorPatel, Kirtien
dc.contributor.authorJohnson, Adrianen
dc.date.accessioned2024-12-17T23:16:19Zen
dc.date.available2024-12-17T23:16:19Zen
dc.date.issued2024-12-15en
dc.description.abstractThis report documents the continuation of a project begun by previous students three years ago in 2021. About six billion Tweets have been collected in three formats, Social Feed Manager (SFM), yourTwapperKeeper (YTK), and Digital Methods Initiative Twitter Capture and Analysis Toolset (DMI-TCAT), by the Digital Library Research Laboratory (DLRL) at Virginia Tech. The overall goal of this project is to organize these Tweets into event collections and consolidate the collection information that is stored in three different schemas and databases into one web app, making the data more accessible. In Fall 2021, the Library6BTweet team designed an individual Tweet and collection-level Tweet schema. They also worked on converting Tweet data. In Spring 2022, the Twitter Collections team optimized the conversion scripts, converted Tweet data, and looked into implementing a machine learning model to categorize Tweets. In Spring 2024, the Twitter Database Discovery Portal team consolidated the collected data into a local mongo database and built a web app with minimal features that display the collected data and allows the user to search and filter the collections. The Twitter Database Discovery Portal team did not complete extracting the data from the SFM database. Our team’s goal is to build upon the past team’s contributions to finish extracting the data from the SFM database and add new features to the web app.en
dc.description.sponsorshipProfessor Mohamed Faragen
dc.identifier.urihttps://hdl.handle.net/10919/123824en
dc.titleA Discovery Portal for Twitter Collectionsen

Files

Original bundle
Now showing 1 - 5 of 9
Name:
env
Size:
38 B
Format:
Unknown data format
Name:
tweets.json
Size:
1.46 MB
Format:
Unknown data format
Name:
MAPPING OF SFM DATABASE.xlsx
Size:
504.64 KB
Format:
Microsoft Excel XML
Name:
user_collections_tweets.jsonl
Size:
653.41 KB
Format:
Unknown data format
Name:
TwitterCollections_Collection_Table_for_IA20180620_Labeled_Fall24.xlsx
Size:
373.97 KB
Format:
Microsoft Excel XML
License bundle
Now showing 1 - 1 of 1
Name:
license.txt
Size:
1.5 KB
Format:
Item-specific license agreed upon to submission
Description: