Utilizing RoBERTa model for Churn prediction through clustered contextual conversation opinion mining

Ibitoye, Ayodeji ORCID: https://orcid.org/0000-0002-5631-8507 and Onifade, Olufade F.W (2023) Utilizing RoBERTa model for Churn prediction through clustered contextual conversation opinion mining. International Journal of Intelligent Systems and Applications (IJISA), 15 (6). pp. 1-8. ISSN 1740-8865 (Print), 1740-8873 (Online) (doi:10.5815/ijisa.2023.06.01)

PDF (VoR)
46086_IBITOYE_Utilizing_RoBERTa_model_for_Churn_prediction_through_clustered_contextual_conversation_opinion_mining.pdf - Published Version
Available under License Creative Commons Attribution.

Official URL: https://doi.org/10.5815/ijisa.2023.06.01

Abstract

In the computational study and automatic recognition of opinions in free text, certain words in a sentence are used to decide its sentiment. Analysing each customer’s opinion individually in churn management is effective for personalised recommendations, but a single opinion is often insufficient for contextualised content mining. Personalised recommendations are also time-consuming and do not provide a complete picture of the overall sentiment across a business's community of customers. To help businesses identify widespread issues affecting a large segment of their customers, and so reveal patterns and trends in different customer churn behaviours, we developed clustered contextualised conversations as an opinion set for integration with the RoBERTa model. The resulting churn behavioural opinion clusters disambiguate short messages and characterise content collectively by context, going beyond keyword-based sentiment matching for effective mining. Based on the predicted opinion threshold, a customer churn category with matching concepts was generated for group-based personalised decision support. The baseline RoBERTa model on the contextually clustered opinions, trained with a batch size of 16, a learning rate of 2e-5, 8 epochs, a maximum sequence length of 128 and otherwise standard hyperparameters, achieved an accuracy of 92%, precision of 88%, recall of 86% and F1 score of 84% on a test set of 30%.
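
The training setup quoted in the abstract can be written down as a small configuration sketch. Only the batch size, learning rate, epoch count, maximum sequence length and 30% test fraction come from the abstract; the class name, the helper functions and the corpus size of 10,000 opinions are hypothetical, chosen for illustration, and the paper's actual training code may differ.

```python
import math
from dataclasses import dataclass


@dataclass(frozen=True)
class FineTuneConfig:
    # Values reported in the abstract.
    batch_size: int = 16
    learning_rate: float = 2e-5
    epochs: int = 8
    max_seq_length: int = 128
    test_fraction: float = 0.30


def split_sizes(n_examples: int, cfg: FineTuneConfig) -> tuple[int, int]:
    """Return (train, test) example counts for a simple hold-out split."""
    n_test = round(n_examples * cfg.test_fraction)
    return n_examples - n_test, n_test


def total_optimizer_steps(n_train: int, cfg: FineTuneConfig) -> int:
    """Mini-batch updates over all epochs; the last partial batch counts."""
    return math.ceil(n_train / cfg.batch_size) * cfg.epochs


cfg = FineTuneConfig()
# 10_000 is a hypothetical corpus size, not a figure from the paper.
n_train, n_test = split_sizes(10_000, cfg)
steps = total_optimizer_steps(n_train, cfg)
print(n_train, n_test, steps)
```

Under these assumptions, the split leaves 7,000 training and 3,000 test opinions, and training runs for 3,504 optimiser steps.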

Item Type:	Article
Uncontrolled Keywords:	Churn prediction; opinion mining; RoBERTa model; customer relationship management; decision support
Subjects:	Q Science > Q Science (General); Q Science > QA Mathematics; Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Faculty / School / Research Centre / Research Group:	Faculty of Engineering & Science; Faculty of Engineering & Science > School of Computing & Mathematical Sciences (CMS)
Last Modified:	04 Jul 2024 23:53
URI:	http://gala.gre.ac.uk/id/eprint/46086
