ABSTRACT
PREDICTING WATER CONSUMPTION BY POOR URBAN HOUSEHOLDS USING GEOSPATIAL TECHNOLOGY AND MACHINE LEARNING TECHNIQUES
Journal: Malaysian Journal of Geosciences (MJG)
Author: Taiwo, Tolu A., Olusina J. O., Hamid-Mosaku A. I., Abiodun O. E.
This is an open access article distributed under the Creative Commons Attribution License CC BY 4.0, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited
DOI: 10.26480/mjg.01.2025.10.17

This study examines the performance of predictive machine learning models implemented with an integrated tool of geospatial technology (GST) and machine learning techniques (MLT). Historical data of daily volume of water consumed in dry and wet seasons was gathered through questionnaires, and integrated with socioeconomic data, weather data, property data and geospatial data. The datasets were passed through Principal Component Analysis algorithm to select few features that explain the variability in all the original variables. The selected features were inputted into four predictive models – Multilinear Regression (MLR), Random Forest (RF), Support Vector Regression (SVR), and Artificial Neural Networks (ANN). Three error metrics, Mean Absolute Error (MAE), Root Mean Squared Error (RMSE) and R squared (R2) score were used to measure the model performances. All four models performed very well in predicting volume of water consumed by poor urban households as they produced RMSE of 57 litres, 46 litres, 63 litres and 52 litres respectively during training, and R2 score of 88%, 92%, 88% and 90% respectively. Significance test performed at 95% confidence level shows that there is significant difference between volume of water consumed during dry and wet seasons.