The Implementation of Machine Learning and Deep Learning Algorithms for Crop Yield Prediction in Agriculture
Download 0.67 Mb. Pdf ko'rish
|
AGRI ARTIC 2nd Rahimov
- Bu sahifa navigatsiya:
- 4. Results and discussion
3. Data preprocessing
Figure 2 illustrates the data preprocessing process used for model learning. Initially, the dataset was imported into Python from kaggle.com. Additional features were added to create a new dataset. Next, normalization was performed to analyze the data. Finally, certain specification data were identified as model learning data, while the Base specification data were set aside to evaluate the performance of the generated predictive model. Figure 2. The preprocessing process for prediction crop yield data. 4. Results and discussion 4.1.Evaluation metrics This study utilizes dataset that collected during over 20 years, including measure of rainfall, productivity of the each year, temperature. Machine learning techniques, including a multivariate regression (MR), deep neural networks (DNN), and multiple linear regression predict, were employed to construct the predictive model using Python, Scikit-learn and Seaborn libraries. The predictive performance of the models was evaluated using a mean absolute error (MAE), and a root mean squared error (RMSE), while a separate dataset was used to test and verify the selected model. The test included assessing the performance of each prediction model on a separate dataset and generating graphs to compare the predicted and actual values of crop yield such as changing temperature, rainfall. For regression problems MAE and RMSE metrics are most implemented. In this section we compare the results taken from four models through MAE and RMSE according to mentioned four algorithms in section 3. MAE = 1 𝑛 ∑ |𝑦 𝑎𝑐𝑡 − 𝑦 𝑝𝑟𝑒𝑑 | 𝑛 𝑖=1 (1) RMSE = √ ∑ (𝑦 𝑎𝑐𝑡 − 𝑦 𝑝𝑟𝑒𝑑 ) 𝑛 𝑖=1 2 𝑛 (2) Where: n is the number of data points, y pred is the predicted value of the dependent variable for the i th data point, Bulletin of TUIT: Management and Communication Technologies Nodir Rahimov, Dilmurod Khasanov 2023.Vol-2(4) y act is the actual value of the dependent variable for the i th data point. 4.2.Multivariate Regression Prediction Model Performance The performance evaluation of the predictive model is presented in Table 2, where the MR model exhibits RMSE values of 83256.2 and 84955.1, and MAE values of 93365.8 and 64242.0 when predicting the crop yield prediction, respectively, based on the test dataset. The prediction results of the MR model for the test dataset are visualized in Figure 3 and Figure 4. Table 2. The performance evaluation results of the MR prediction model. Metric MAE RMSE Target Train Test Train Test Predictio n 63365. 8 64242. 0 83256. 2 84955. 1 Figure 3. High-correlation among features. Figure 4. True values (blue) and predictions (orange). 4.3.Multiple Linear Regression Prediction Model Performance The performance evaluation of the predictive model is presented in Table 3, where the MLR model exhibits MAE values of 63879.3 and 64099.9, and RMSE values of 84145.8 and 84254.6 when predicting the crop yield prediction, respectively, based on the test dataset. The prediction results of the MLR model for the test dataset are visualized in Figure 5 and Figure 6. Table 3. The performance evaluation results of the MLR prediction model. Metric MAE RMSE Target Train Test Train Test Prediction 63879.3 64099.9 84145.8 84254.6 Figure 5. The dynamics Figure 6. True values of crop by years. and predictions. Bulletin of TUIT: Management and Communication Technologies Nodir Rahimov, Dilmurod Khasanov 2023.Vol-2(4) 4.4.Deep Neural Network Prediction Model Performance The performance evaluation of the predictive model is presented in Table 4, where the DNN model exhibits MAE values of 63713.9 and 63747.2, and RMSE values of 83510.5 and 83493.9 when predicting the crop yield prediction, respectively, based on the test dataset. Table 4. The performance evaluation results of the DNN prediction model. Metric MAE RMSE Target Train Test Train Test Prediction 63713.9 63747.2 83510.5 83493.9 The study has revealed that multiple linear regression (MLR) outperforms other algorithms that were evaluated in terms of dataset size, sorting, and key features. Although models based on deep neural network (DNN) and multiple regression (MR) algorithms have been observed to be highly effective in certain circumstances, MLR has been identified as the most optimal algorithm for predicting crop yield based on the research findings. 4.5. Gradient Boosting Regressor Tree Model Performance The performance evaluation of the predictive model is presented in Table 5, where the GBRT model exhibits MAE values of 61378.7 and 61749.5, and RMSE values of 79139.3 and 79641.6 when predicting the crop yield prediction, respectively, based on the test dataset. Table 5. The performance evaluation results of the GBRT prediction model. Metric MAE RMSE Target Train Test Train Test Prediction 61378.7 61749.5 79139.3 79641.6 Table 6. The SOTA comparison of models. Metrics → Models ↓ Root Mean squared error (./1000 ha) Mean absolute error (./1000 ha) Mean percentage error (%) MR 84.10 61.8 83 MLR 84.2 63.98 80 DNN 83.5 62.7 82 GBRT 70.39 59.5 88 Download 0.67 Mb. Do'stlaringiz bilan baham: |
Ma'lumotlar bazasi mualliflik huquqi bilan himoyalangan ©fayllar.org 2024
ma'muriyatiga murojaat qiling
ma'muriyatiga murojaat qiling