Sabaragamuwa University of Sri Lanka

A machine learning approach to optimize coagulant dosage: A case study at the Demodara water treatment plant

Show simple item record

dc.contributor.author Didulani, A.V.T.
dc.contributor.author Nusnan, A.M.
dc.contributor.author Farhath, F.R.
dc.contributor.author Chathurangi, K.A.A.
dc.contributor.author Thushari, G.N.N.
dc.contributor.author Dalumuragama, S
dc.date.accessioned 2026-01-17T06:58:39Z
dc.date.available 2026-01-17T06:58:39Z
dc.date.issued 2025-12-03
dc.identifier.issn 2815-0341
dc.identifier.uri http://repo.lib.sab.ac.lk:8080/xmlui/handle/susl/5176
dc.description.abstract Water is one of the fundamental necessities of human life. Ensuring clean and safe water for communities is essential. The main steps in the water treatment process include coagulation and flocculation, sedimentation, filtration, and disinfection. This study focuses on introducing a machine learning-based approach to optimise coagulant dosage in the water treatment process at the Demodara Water Treatment Plant. Currently, the traditional jar test is used to determine coagulant dosage; however, it is time-consuming and labour-intensive. In this study, historical jar test data from 2019 to 2025 were collected and preprocessed to develop predictive models. Water quality parameters such as pH, turbidity, and colour were applied to six machine learning models, such as Random Forest, XGBoost, LightGBM, CatBoost, Artificial Neural Networks (ANN) and K-Nearest Neighbours (KNN) to predict the optimal dosage of Poly Aluminium Chloride (PAC), the coagulant used at the Demodara Water Treatment Plant. All models were trained using dedicated training and testing datasets. Hyperparameter tuning was conducted using grid search with cross-validation, and model performance was assessed using mean absolute error (MAE), mean squared error (MSE), and the coefficient of determination (R2). Among these models, the highest predictive accuracy was achieved in Random Forest. The R2 values resulting from the Random Forest were 0.93, 0.89, and 0.93 for pH, turbidity, and colour, respectively. Corresponding MAEs were 0.05 for pH, 2.93 for turbidity, and 11.04 for colour, while MSEs were 0.01, 25.94, and 416.29, respectively. The best-performing model was integrated into a graphical user interface to support real-time plant operations. This machine learning-based approach enhances consistency in treatment quality, reduces chemical waste, and minimises human intervention. It will provide operators with an interpretable, convenient, and scalable solution to improve the efficiency and reliability of the water treatment process. Future work will focus on incorporating real-time sensor data to enable dynamic dosage adjustments, further increasing the practicality of ML integration in real-world settings. en_US
dc.language.iso en en_US
dc.publisher Sabaragamuwa University of Sri Lanka en_US
dc.subject Coagulant dosage en_US
dc.subject Machine learning en_US
dc.subject Random forest en_US
dc.subject Regression en_US
dc.subject Water treatment en_US
dc.title A machine learning approach to optimize coagulant dosage: A case study at the Demodara water treatment plant en_US
dc.type Article en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search DSpace


Advanced Search

Browse

My Account