Sabaragamuwa University of Sri Lanka

Enhancing Sinhala Hate Speech and Offensive Language Detection through XAI

Show simple item record

dc.contributor.author Pilapitiya, H.M.H.N.
dc.contributor.author Ishanka, U.A.P.
dc.date.accessioned 2026-05-26T08:19:02Z
dc.date.available 2026-05-26T08:19:02Z
dc.date.issued 2026-01-28
dc.identifier.isbn 978-624-5727-44-5
dc.identifier.uri http://repo.lib.sab.ac.lk:8080/xmlui/handle/susl/5301
dc.description.abstract The rapid growth of social media has sped up the spread of hatred, abusive and offensive content, creating an urgent need for automated detection systems especially for low-resource languages such as Sinhala. This study develops, and evaluates standard deep learning (DL) models, transformerbased architectures, and a newly proposed hybrid model using DL models with Sinhala Offensive Language Dataset (SOLD) for detecting hate speech in Sinhala. Among Conventional DL models, Bi-LSTM demonstrated the strongest performance with 82% accuracy and a ROC score of 0.88, despite the challenge of informal expressions, and rich morphology in Sinhala language. Among transformer-based models, XLM-R large achieved the best results with 84% accuracy and an ROC of 0.92, demonstrating their effectiveness in modeling nuanced semantic and syntactic structures in Sinhala online discourse. Moreover, a hybrid model integrating multiple DL models was developed and evaluated, achieving superior performance with an accuracy of 86% with a ROC of 0.93 on Sinhala hate speech detection task, outperforming all baseline deep learning and transformer-based models. Beyond predictive performance, this study also contributes important interpretability of model predictions with Explainable AI (XAI) techniques - SHAP and LIME - which detail the local and global contributions of tokens. These explanations provide clear pathways to make decisions about models. Overall, this study presents a comprehensive evaluation about model’s performance and interpretability analysis of model’s predictions to detect hate speech in Sinhala with exciting brevity for future multilingual and explainable NLP applications. en_US
dc.language.iso en en_US
dc.publisher Faculty of Computing. Sabaragamuwa University of Sri Lanka. en_US
dc.subject Deep learning en_US
dc.subject Hybrid model en_US
dc.subject Sinhala hate speech en_US
dc.subject Transformerbased models en_US
dc.subject XAI en_US
dc.title Enhancing Sinhala Hate Speech and Offensive Language Detection through XAI en_US
dc.type Article en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search DSpace


Advanced Search

Browse

My Account