Temporal Topic Embeddings with a Compass
dc.contributor.author | Palamarchuk, Daniel Andrew | en |
dc.contributor.committeechair | North, Christopher L. | en |
dc.contributor.committeemember | Danielson, Thomas Lee | en |
dc.contributor.committeemember | Mayer, Brian Benjamin | en |
dc.contributor.committeemember | Wang, Xuan | en |
dc.contributor.department | Computer Science and#38; Applications | en |
dc.date.accessioned | 2024-05-23T08:00:35Z | en |
dc.date.available | 2024-05-23T08:00:35Z | en |
dc.date.issued | 2024-05-22 | en |
dc.description.abstract | Aligning Word2vec word embeddings using a compass in a system of Compass-aligned Distributional Embeddings (CADE) creates stable and accurate temporal word embeddings. This thesis seeks to expand the CADE framework into the area of dynamic topic modeling (DTM), where temporal word2vec embeddings can be used to describe temporally and unsupervised evolving topics. It also seeks to improve upon the CADE framework through a theoretical and experimental exploration of compass parameters, cluster and topic generation techniques, and topic descriptor creation. This method of Temporal Topic Embeddings with a Compass (TTEC) will be compared to other DTM techniques in the ability to create coherent and diverse clusters and will be shown to be competitive compared to traditional and transformer-aided DTM architectures. In addition to a qualitative discussion of results, there will be a political theoretical overview of the nature of this technique and potential use cases, with interviews from political actors of various backgrounds as to how the technique and machine learning as a whole can be used in the organizational setting. | en |
dc.description.abstractgeneral | Diachronic word embeddings look at how the context words appear in evolve over time. Dynamic Topic Modeling (DTM) is the ability to computationally discover topics and how they evolve over time. This thesis creates a DTM technique called Temporal Topic Embeddings with a Compass (TTEC) based off diachronic word embeddings, allowing a user to simultaneously look at word and topic evolution over time. There is also an exploration of the use case of TTEC and similar machine learning models within various political organizational settings through interviews. | en |
dc.description.degree | Master of Science | en |
dc.format.medium | ETD | en |
dc.identifier.other | vt_gsexam:40595 | en |
dc.identifier.uri | https://hdl.handle.net/10919/119057 | en |
dc.language.iso | en | en |
dc.publisher | Virginia Tech | en |
dc.rights | Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International | en |
dc.rights.uri | http://creativecommons.org/licenses/by-nc-sa/4.0/ | en |
dc.subject | diachronic | en |
dc.subject | neural network | en |
dc.subject | hci | en |
dc.subject | bureaucracy | en |
dc.subject | legibility | en |
dc.title | Temporal Topic Embeddings with a Compass | en |
dc.type | Thesis | en |
thesis.degree.discipline | Computer Science & Applications | en |
thesis.degree.grantor | Virginia Polytechnic Institute and State University | en |
thesis.degree.level | masters | en |
thesis.degree.name | Master of Science | en |
Files
Original bundle
1 - 1 of 1