Optimizing Red and White Wine Quality Prediction Using Hybrid Ensemble Learning and Interpretable Feature Engineering

Dissanayake, D.K.H.; Kumara, B.T.G.S.

Digital Library | SUSL Home
→
Research Publications
→
Proceedings
→
Workshops, Seminars, Symposiums ect
→
Faculty of Computing
→
COMPUTING UNDERGRADUATE RESEARCH SYMPOSIUM
→
ComURS2026 Computing Undergraduate Research Symposium : Abstracts
→
View Item

Optimizing Red and White Wine Quality Prediction Using Hybrid Ensemble Learning and Interpretable Feature Engineering

Dissanayake, D.K.H.; Kumara, B.T.G.S.

URI: http://repo.lib.sab.ac.lk:8080/xmlui/handle/susl/5334

Date: 2026-01-28

Abstract:

The conventional method of wine quality prediction has always been based on sensory testing by human analyst and this has been subjective as well as time consuming, and this type of evaluation is hard to scale to the modern wine production system. The current development of machine learning allowed predicting wine quality based on physicochemical data; however, the current models are often characterized by the class imbalance, low interpretability, and poor consistency in working with red and white wine samples. This paper hypothesizes a hybrid ensemble learning model, which combines classification and regression models with interpretable feature engineering to enhance predictive performance and interpretability. The dataset of red and white wines in the region of Vinho Verde was utilized, with comprehensive preparations and feature scaling, feature selection, and resampling, including SMOTE and SMOTE-ENN to overcome an uneven distribution of classes. Several models of base line, such as Support Vector Machines, Naïve Bayes, Ridge Regression, Artificial Neural Networks, Random Forest, and Gradient Boosting, were compared and tested against ensemble and stacked hybrid models. The performance of hybrid ensemble models is shown to be more accurate, has a higher recall and a larger F1-score, especially when compared to single baseline models. SHAP and LIME analysis of interpretability found alcohol content to be a positively strong predictor of wine quality, whereas volatile acidity was significantly negative. The explanations present practical recommendations that can be used by the winemakers in an attempt to make better production choices. Altogether, the suggested framework provides the best predictive results with reasonable interpretability, justifying the application of machine learning-based wine quality assessment in the wine industry practically.

Show full item record

Files in this item

Name: ComURS-2026_(2)-p ...

Size: 43.73Kb

Format: PDF

View/Open

This item appears in the following Collection(s)

ComURS2026 Computing Undergraduate Research Symposium : Abstracts [54]
"Next-Gen Solutions for a Digitally Connected World"

Optimizing Red and White Wine Quality Prediction Using Hybrid Ensemble Learning and Interpretable Feature Engineering

Optimizing Red and White Wine Quality Prediction Using Hybrid Ensemble Learning and Interpretable Feature Engineering

Abstract:

Files in this item

This item appears in the following Collection(s)

Search DSpace

Browse

All of DSpace

This Collection

My Account