Study of Pretraining Bias and Frequencies

dc.contributor.author: Taware, Rutuja Murlidhar
dc.contributor.committeechair: Ramakrishnan, Narendran
dc.contributor.committeemember: Lourentzou, Ismini
dc.contributor.committeemember: Lu, Chang Tien
dc.contributor.department: Computer Science and Applications
dc.date.accessioned: 2023-07-11T08:00:33Z
dc.date.available: 2023-07-11T08:00:33Z
dc.date.issued: 2023-07-10
dc.description.abstract: Language models used in an in-context learning setting have been adapted to a wide range of tasks. Recent work has shown that pretraining data affects the in-context performance of language models. In this work, we experiment with numbers that occur with high and low frequency in the pretraining data to understand the impact of term frequency on the model's performance. We also experiment with random and adversarial demonstrations to understand the pretraining bias present in the model. Through these experiments, we show the importance of the pretraining frequencies of the numbers appearing in the demonstrations and explain how highly frequent terms can be used in demonstrations to achieve better task performance. We also show the impact of pretraining bias on the model's performance and explain how the model overcomes this bias when given more demonstrations.
dc.description.abstractgeneral: Recent work focuses on understanding and improving the arithmetic capabilities of state-of-the-art (SOTA) systems in Natural Language Processing (NLP). This work designs and performs novel experiments to analyze the impact of training data on the performance of such systems. Through these experiments, it showcases interesting properties of SOTA systems that will promote future research toward understanding them better and help in building better downstream applications.
dc.description.degree: Master of Science
dc.format.medium: ETD
dc.identifier.other: vt_gsexam:37728
dc.identifier.uri: http://hdl.handle.net/10919/115712
dc.language.iso: en
dc.publisher: Virginia Tech
dc.rights: In Copyright
dc.rights.uri: http://rightsstatements.org/vocab/InC/1.0/
dc.subject: In-Context Learning
dc.subject: Pretraining
dc.subject: Frequency
dc.subject: Bias
dc.subject: Language Model
dc.title: Study of Pretraining Bias and Frequencies
dc.type: Thesis
thesis.degree.discipline: Computer Science and Applications
thesis.degree.grantor: Virginia Polytechnic Institute and State University
thesis.degree.level: masters
thesis.degree.name: Master of Science
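
To make the experimental setup described in the abstract concrete, below is a minimal illustrative sketch of building few-shot arithmetic prompts whose demonstrations draw operands from a high-frequency or a low-frequency pool, with an option for adversarial (wrong-answer) demonstrations. The operand pools, prompt wording, and helper names are assumptions made for illustration only; they are not the thesis's actual data, prompts, or code.

    import random

    # Hypothetical operand pools; in the thesis setting these would be chosen
    # based on measured term frequencies in the model's pretraining corpus.
    HIGH_FREQ_OPERANDS = [1, 2, 5, 10, 100]    # assumed to appear often in pretraining data
    LOW_FREQ_OPERANDS = [937, 461, 728, 853]   # assumed to appear rarely in pretraining data

    def make_demo(a, b, adversarial=False):
        """Format one addition demonstration; an adversarial demo shows a wrong answer."""
        answer = a + b
        if adversarial:
            answer += random.randint(1, 9)  # deliberately corrupt the label
        return f"Q: What is {a} plus {b}?\nA: {answer}"

    def build_prompt(pool, k=4, adversarial=False):
        """Build a k-shot prompt from the given operand pool, ending with a test query."""
        demos = [make_demo(random.choice(pool), random.choice(pool), adversarial)
                 for _ in range(k)]
        test = f"Q: What is {random.choice(pool)} plus {random.choice(pool)}?\nA:"
        return "\n\n".join(demos + [test])

    if __name__ == "__main__":
        random.seed(0)
        print(build_prompt(HIGH_FREQ_OPERANDS, k=2))                     # high-frequency demos
        print()
        print(build_prompt(LOW_FREQ_OPERANDS, k=2, adversarial=True))    # adversarial, low-frequency demos

Prompts built this way could be sent to a language model to compare accuracy across the frequency conditions and demonstration types.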

Files

Original bundle
Name: Taware_R_T_2023.pdf
Size: 2.21 MB
Format: Adobe Portable Document Format