Status : Verified
| Personal Name | Romano, Al R. |
|---|---|
| Resource Title | Feature Selection-Driven Supervised Machine Learning for Dissolved Oxygen Prediction Using Multidomain Environmental Parameters in a Major Fishing Lake |
| Date Issued | 6 July 2025 |
| Abstract | Dissolved oxygen (DO) is a critical indicator of water quality, as it directly affects the health and survival of aquatic organisms, including those raised in aquaculture systems. This study investigates the feasibility of using supervised machine learning-based models for predicting DO levels in Sampaloc Lake, San Pablo, Laguna, utilizing Filter and Wrapper methods of feature selection. The analysis is based on a dataset comprising 104 historical records collected from 2010 to 2022, incorporating a total of 21 features. These features encompass a range of water quality, physical, and hydroclimatic parameters, which are integrated as predictors to develop reliable predictive models. Both Filter and Wrapper feature selection methods proved effective in identifying influential parameters and predicting DO levels. Among the models, XGBoost Regression (XGBR) emerged as the top performer for the Filter Method with R2 = 0.95, RMSE=0.14, and MAE = 0.10. For the Wrapper Method, Random Forest (RF) demonstrates better generalization, with higher test R2 (0.91) and lower RMSE (0.39) and MAE (0.22) values, making it the most suitable representative for the Wrapper Method. Comparing the two feature selection approaches, the Filter Method was found to be superior overall. The SHAP value analyses for both the Filter and Wrapper feature selection methods consistently identified biochemical oxygen demand (BOD), air temperature (AT), and surface pressure (SP) as the most influential predictors of DO concentration in Sampaloc Lake. Overall, this study concludes that the Filter Method with XGBR offers the most reliable and effective framework for predicting DO levels in Sampaloc Lake, contributing valuable insights for water quality management and sustainable aquaculture practices. |
| Degree Course | MS Environmental Engineering |
| Language | English |
| Keyword | Dissolved Oxygen, Machine Learning, Water Quality Prediction, Aquaculture, Feature Selection |
| Material Type | Thesis/Dissertation |
Preliminary Pages
9.31 Mb
Category : P - Author wishes to publish the work personally.
Access Permission : Limited Access
