Quantifying the Relationship between Adaptive Traits and Agro-climatic Conditions
Abstract
Background & Aim: Durum wheat is an economically important and regularly eaten food for billions of people in the world. In the International Center for Agriculture Research in the Dry Areas (ICARDA), genbanks are using Focused Identification of the Germplasm Strategy (FIGS) to find out and quantify relationships between agro-climatic conditions and the presence of specific traits. Hence, the study is aimed to investigate the predictive value of various types of long-term agro-climatic variables on the future values of different traits.
Method: Ordinary multiple linear regression with stepwise variable selection method on the complete data set, and multiple linear regression models with predictors selected by penalized methods with mean square error cross-validation as a model selection criterion, are used to analyze 238 durum wheat landraces. Each of the models are fitted on Days to Heading and Days to Maturity response variables with 57 predictor variables, independently. Ordinary least square and weighted least square estimation methods were used.
Result: Findings implied that there is high multicollinearity among the predictor variables. It is found that there are some predictors which affect positively and some others affect negatively for both Days to Heading and Days to Maturity using both ordinary and shrinkage based models. It is revealed that the prediction from the lasso based model is not that much reasonable. Furthermore, for the Days to Heading showed that there seems better prediction as their predicted value increase continuously as a function of the actual values though there is considerable variability.
Conclusion: In conclusion, inferences and predictions by the ordinary MLR models are not trusted due to the presence of multicollinearity, and violation of some model assumptions. However, predictions using the models with predictors selected by the shrinkage methods may be better as the effects of the variability on these methods are minimal. Moreover, the WLS methods might give more sensible predictions than the OLS estimation methods. Better predictions were found on the Days to Heading.
2- Christensen, L. A. (1997, March). Introduction to building a linear regression model. In Proceedings of the Twenty-Second Annual SAS Users Group International Conference.
3- Cohen, R. A. (2006, March). Introducing the GLMSELECT procedure for model selection. In Proceedings of the Thirty-First Annual SAS Users Group International Conference.
4- Del Moral, L. F., Rharrabti, Y., Villegas, D., & Royo, C. (2003). Evaluation of Grain Yield and Its Components in Durum Wheat under Mediterranean Conditions Funding for this study was provided by the Spanish government throughout INIA Project SC97039-C2 and CICYT Project AGF99-0611-CO3. Agronomy Journal, 95(2), 266-274.
5-Dias, A. S., & Lidon, F. C. (2009). Evaluation of grain filling rate and duration in bread and durum wheat, under heat stress after anthesis. Journal of Agronomy and Crop Science, 195(2), 137-147.
6- Fonti, V., & Belitser, E. (2017). Feature Selection using LASSO
7- Friedman, J., Hastie, T., & Tibshirani, R. (2001). The elements of statistical learning (Vol. 1, pp. 337-387). New York: Springer series in statistics.
8- Giunta, F., Motzo, R., & Deidda, M. (1993). Effect of drought on yield and yield components of durum wheat and triticale in a Mediterranean environment. Field Crops Research, 33(4), 399-409
9- Gunes, F. (2015). Penalized regression methods for linear models in SAS/STAT R. In Proceedings of the SAS Global Forum 2015 Conference. Cary, NC: SAS Institute Inc. http://support. sas. com/rnd/app/stat/papers/2015/PenalizedRegression_Li nearModels. pdf.
10- Kabbaj, H., Sall, A. T., Al-Abdallat, A., Geleta, M., Amri, A., Filali-Maltouf, A., ... & Bassi, F. M. (2017). Genetic diversity within a global panel of durum wheat (Triticum durum) landraces and modern germplasm reveals the history of alleles exchange. Frontiers in plant science, 8, 1277.
11- Khan, M. H., Hassan, G., Khan, N., & Khan, M. A. (2003). Efficacy of different herbicides for controlling broadleaf weeds in wheat. Asian J. Plant Sci, 2(3), 254256.
12- Khazaei, H., Street, K., Bari, A., Mackay, M., & Stoddard, F. L. (2013). The FIGS (Focused Identification of Germplasm Strategy) approach identifies traits related to drought adaptation in Vicia faba genetic resources. PLoS One, 8(5), e63107.
13- Maçãs, B., Gomes, M. C., Dias, A. S., & Coutinho, J. (2000). The tolerance of durum wheat to high temperatures during grain filling. Options
Méditerranéennes. Durum wheat improvement in the Mediterranean region: new challenges, 257-261.
14- Maccaferri, M., Sanguineti, M. C., Demontis, A., El-Ahmed, A., Garcia del Moral, L., Maalouf, F., ... & Royo, C. (2010). Association mapping in durum wheat grown across a broad range of water regimes. Journal of experimental botany, 62(2), 409-438
15-Mackay, M., von Bothmer, R., & Skovmand, B. (2005). Conservation and utilization of plant genetic resources–future directions. Czech Journal of Genetics and Plant Breeding, 41(335.344).
16-Nishida, K. (2017). Skewing Methods for Variance-Stabilizing Local Linear Regression Estimation. arXiv preprint arXiv:1704.04356.
17-Ottman, M. J., Kimball, B. A., White, J. W., & Wall, G. W. (2012). Wheat growth response to increased temperature from varied planting dates and supplemental infrared heating. Agronomy Journal, 104(1), 7-16.
18-Rao, N. K., Hanson, J., Dulloo, M. E., Ghosh, K., & Nowell, A. (2006). Manual of seed handling in genebanks (No. 8). Bioversity International.
19-Romano, J. P., & Wolf, M. (2017). Resurrecting weighted least squares. Journal of Econometrics, 197(1), 1-19.
20-Tibshirani, R. (1996). Regression shrinkage and selection via the lasso. Journal of the Royal Statistical Society. Series B (Methodological), 267-288.
21-Tibshirani, R., Wainwright, M., & Hastie, T. (2015). Statistical learning with sparsity: the lasso and generalizations. Chapman and Hall/CRC.
22-Van der Kooij, A. J., & Meulman, J. J. (2008). Regularization with ridge penalties, the lasso, and the elastic net for regression with optimal scaling transformations. Submitted for publication.
23-van Hintum, T. J., Brown, A. H. D., Spillane, C., & Hodkin, T. (2000). Core collections of plant genetic resources (No. 3). Bioversity International.
24-Wu, W., May, R., Dandy, G. C., & Maier, H. R. (2012). A method for comparing data splitting approaches for developing hydrological ANN models (Doctoral dissertation, International Environmental Modelling and Software Society (iEMSs)).
25-Yaffee, R. A. (2002). Robust regression analysis: some popular statistical package options. ITS Statistics, Social Science and Mapping Group, New York State University, downloaded on Dec, 23, 2009.
26-Zou, H. (2006). The adaptive lasso and its oracle properties. Journal of the American statistical association, 101(476), 1418-1429.
27-Zou, H., & Hastie, T. (2005). Regularization and variable selection via the elastic net. Journal of the Royal Statistical Society: Series B (Statistical Methodology), 67(2), 301-320.
28-https://www.azdhs.gov/documents/preparedness/st ate-laboratory/lab-licensurecertification/technical resources/calibration-training/11-weighted-leastsquaresregression-calib.pdf. Accessed on May 5, 2018.
Files | ||
Issue | Vol 5 No 2 (2019) | |
Section | Original Article(s) | |
DOI | https://doi.org/10.18502/jbe.v5i2.2343 | |
Keywords | ||
Cross-validation Mean Square Error MLR Penalized Methods Lasso Elastic net Bias-Variance Trade-off Weighted Least Square |
Rights and permissions | |
This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License. |