Enhancing Sinhala Hate Speech and Offensive Language Detection through XAI

Pilapitiya, H.M.H.N.; Ishanka, U.A.P.

Digital Library | SUSL Home
→
Research Publications
→
Proceedings
→
Workshops, Seminars, Symposiums ect
→
Faculty of Computing
→
COMPUTING UNDERGRADUATE RESEARCH SYMPOSIUM
→
ComURS2026 Computing Undergraduate Research Symposium : Abstracts
→
View Item

dc.contributor.author	Pilapitiya, H.M.H.N.
dc.contributor.author	Ishanka, U.A.P.
dc.date.accessioned	2026-05-26T08:19:02Z
dc.date.available	2026-05-26T08:19:02Z
dc.date.issued	2026-01-28
dc.identifier.isbn	978-624-5727-44-5
dc.identifier.uri	http://repo.lib.sab.ac.lk:8080/xmlui/handle/susl/5301
dc.description.abstract	The rapid growth of social media has sped up the spread of hatred, abusive and offensive content, creating an urgent need for automated detection systems especially for low-resource languages such as Sinhala. This study develops, and evaluates standard deep learning (DL) models, transformerbased architectures, and a newly proposed hybrid model using DL models with Sinhala Offensive Language Dataset (SOLD) for detecting hate speech in Sinhala. Among Conventional DL models, Bi-LSTM demonstrated the strongest performance with 82% accuracy and a ROC score of 0.88, despite the challenge of informal expressions, and rich morphology in Sinhala language. Among transformer-based models, XLM-R large achieved the best results with 84% accuracy and an ROC of 0.92, demonstrating their effectiveness in modeling nuanced semantic and syntactic structures in Sinhala online discourse. Moreover, a hybrid model integrating multiple DL models was developed and evaluated, achieving superior performance with an accuracy of 86% with a ROC of 0.93 on Sinhala hate speech detection task, outperforming all baseline deep learning and transformer-based models. Beyond predictive performance, this study also contributes important interpretability of model predictions with Explainable AI (XAI) techniques - SHAP and LIME - which detail the local and global contributions of tokens. These explanations provide clear pathways to make decisions about models. Overall, this study presents a comprehensive evaluation about model’s performance and interpretability analysis of model’s predictions to detect hate speech in Sinhala with exciting brevity for future multilingual and explainable NLP applications.	en_US
dc.language.iso	en	en_US
dc.publisher	Faculty of Computing. Sabaragamuwa University of Sri Lanka.	en_US
dc.subject	Deep learning	en_US
dc.subject	Hybrid model	en_US
dc.subject	Sinhala hate speech	en_US
dc.subject	Transformerbased models	en_US
dc.subject	XAI	en_US
dc.title	Enhancing Sinhala Hate Speech and Offensive Language Detection through XAI	en_US
dc.type	Article	en_US