Addressing the Challenges of Mental Health Conversations with Large Language Models
| dc.contributor.author | Shiwakoti, Shuvam | en |
| dc.contributor.author | Shah, Siddhant Bikram | en |
| dc.contributor.author | Razzak, Imran | en |
| dc.contributor.author | Thapa, Surendrabikram | en |
| dc.contributor.author | Naseem, Usman | en |
| dc.date.accessioned | 2025-08-12T13:05:05Z | en |
| dc.date.available | 2025-08-12T13:05:05Z | en |
| dc.date.issued | 2025-05-08 | en |
| dc.date.updated | 2025-08-01T07:49:58Z | en |
| dc.description.abstract | Virtual Mental Health Assistants offer a promising solution to address the growing demand for accessible and scalable mental healthcare. However, existing dialogue generation models struggle with the complexities inherent in mental health conversations. In this paper, we explore the limitations of current Medical Dialogue Generation models by conducting experiments on the large language model ChatMGL.We propose modifications to ChatMGL, including finetuning the model on a mental health dataset without proximal policy optimization and incorporating dialogue act labels, to enhance its ability to handle the complex nature of mental health dialogues. Our results demonstrate that these modifications outperform baseline models in terms of ROUGE and BERT scores. Our work suggests that specialized fine-tuning and incorporating domain-specific knowledge can improve the efficacy of virtual assistants for mental health support. | en |
| dc.description.version | Published version | en |
| dc.format.mimetype | application/pdf | en |
| dc.identifier.doi | https://doi.org/10.1145/3701716.3718374 | en |
| dc.identifier.uri | https://hdl.handle.net/10919/137460 | en |
| dc.language.iso | en | en |
| dc.publisher | ACM | en |
| dc.rights | Creative Commons Attribution 4.0 International | en |
| dc.rights.holder | The author(s) | en |
| dc.rights.uri | http://creativecommons.org/licenses/by/4.0/ | en |
| dc.title | Addressing the Challenges of Mental Health Conversations with Large Language Models | en |
| dc.type | Article - Refereed | en |
| dc.type.dcmitype | Text | en |