Study of evaluation metrics while predicting the yield of lettuce plants in indoor farms using machine learning models
2023 (English)Independent thesis Advanced level (degree of Master (One Year)), 10 credits / 15 HE credits
Student thesis
Abstract [en]
A key challenge for maximizing the world’s food supply is crop yield prediction. In this study, three machine models are used to predict the fresh weight (yield) of lettuce plants that are grown inside indoor farms hydroponically using the vertical farming infrastructure, namely, support vector regressor (SVR), random forest regressor (RFR), and deep neural network (DNN).The climate data, nutrient data, and plant growth data are passed as input to train the models to understand the growth pattern based on the available features. The study of evaluation metrics majorly covers Mean Squared Error (MSE), Root Mean Squared Error (RMSE), Mean Absolute Error (MAE), R-squared, and Adjusted R-squared values.The results of the project have shown that the Random Forest with all the features is the best model having the best results with the least cross-validated MAE score and good cross-validated Adjusted R-squared value considering that the error of the prediction is minimal. This is followed by the DNN model with minor differences in the resulting values. The Support Vector Regressor (SVR) model gave a very poor performance with a huge error value that cannot be afforded in this scenario. In this study, we have also compared various evaluating metrics mentioned above and considered the cross-validated MAE and cross-validated Adjusted R-squared metrics. According to our study, MAE had the lowest error value, which is less sensitive to the outliers and adjusted R-squared value helps to understand the variance of the target variable with the predictor variable and adjust the metric to prevent the issues of overfitting.
Place, publisher, year, edition, pages
2023. , p. 51
Keywords [en]
machine learning, lettuce yield prediction, Regression, SVR, RF, DNN, MAE, MSE, RMSE, R-squared, Adjusted R-squared
National Category
Other Computer and Information Science
Identifiers
URN: urn:nbn:se:his:diva-23198OAI: oai:DiVA.org:his-23198DiVA, id: diva2:1796493
External cooperation
Ljusgårda (Erik Lundgren, Anders Svensson, Shirin Moradi)
Subject / course
Informationsteknologi
Educational program
Data Science - Master’s Programme
Supervisors
Examiners
2023-09-122023-09-122023-09-12Bibliographically approved