Sabaragamuwa University of Sri Lanka

DEVELOPMENT OF A PREDICTION MODEL IN IDENTIFYING THE CHURNERS AND NON-CHURNERS

Show simple item record

dc.contributor.author Prasanth, S
dc.contributor.author Rathnayake, R.M.K.T
dc.date.accessioned 2021-01-06T17:37:48Z
dc.date.available 2021-01-06T17:37:48Z
dc.date.issued 2019-11-14
dc.identifier.uri http://repo.lib.sab.ac.lk:8080/xmlui/handle/123456789/534
dc.description.abstract Customers play a vital role in the overall function of the telecommunication industry. So, it is important for any industry to prevent the tendency of churning of the customers from an organization. In order to do so, an effective churn prediction model need to be developed in advance. In this view, the sole objective of this research is to develop an appropriate application with the intention of finding the churners and non-churners from any given number of customer data. During this research process, 10,000 post-paid subscriber details with 20 attributes were obtained from a local telecommunication company and a thorough analysis was executed. Among the above number, it was observed that 4888 were churners and the rest 5112 were nonchurners. To find the best algorithm for the development of the final prediction model, several supervised machine learning techniques were incorporated and a proper comparison was done against certain evaluation metrics such as Accuracy, Mean Squared Error (MSE), Precision , Recall and so on. In fact, the following supervised machine learning techniques namely Random Forest, XGBoost, AdaBoost, Logistic Regression, Neural Network, Support Vector Machine, and Decision Tree were experimented with the given data set. As an initial step, the given data were pre-processed and feature engineering was performed with the help of correlation analysis. From the results obtained, 17 attributes out of 20 were identified as the most important aspects to cover the entire data. Consequently, the whole data were fed into aforementioned techniques for the purpose of finding the best one. In this process, more preferable results were obtained from the ensemble approaches such as Random Forest, XGBoost and AdaBoost. Eventually, it was found that XGBoost had the highest accuracy of 82.90% and lowest error rate was 17.1%. In addition to this, five (5) fold cross validation too had been performed for the purpose of ensuring the highest accurate results by the XGBoost incorporated with different percentages of training and testing data. Further, with the intention of getting an increase from the accuracy already obtained, hyper-parameter tuning was done with XGBoost and thus this attempt resulted an accuracy of 83.13%. en_US
dc.language.iso en_US en_US
dc.publisher Sabaragamuwa University of Sri Lanka en_US
dc.subject Churn en_US
dc.subject Machine learning en_US
dc.subject Prediction en_US
dc.subject XGBoost en_US
dc.subject Adaboost en_US
dc.title DEVELOPMENT OF A PREDICTION MODEL IN IDENTIFYING THE CHURNERS AND NON-CHURNERS en_US
dc.type Book en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search DSpace


Advanced Search

Browse

My Account