Comparative accuracy of different machine learning classifiers for characterizing varieties of pulse crops
DOI:
https://doi.org/10.59797/jfl.v32i1.706Keywords:
Mungbean, Chickpea, Lentil, Fieldpea, Moong, Urdbean, K-NNAbstract
Important machine learning classifiers viz., Logistic Regression, Linear Discriminant Analysis, K-Nearest Neighbor’s and Naïve-Bayes were subjected for studing their accuracy, precision and recall accommodating 100 dataset each of 12 varieties (Awrodhi, C 235, JG 14, K 850, KWR 108, Pragati, PG 186, Pusa 362, Radhey, Shubhra, Udai, Ujjawal) of chickpea, 10 of lentil (DPL 62, HUL 57, IPL 316, K 75, NDL 1, PL 6, PL 8, PL 406, Pusa Vaibhav, Shekhar), 11 of fieldpea (Adarsh, Aman, Arkel, Azad pea 1, HUDP 15, Indra, IPF 4-9, IPFD 10-12, Prakash, Rachna, Vikash) 11 moong (PDM 139, HUM 12, HUM 16, IPM 02-3, IPM 02-4, Meha, NDM 1, Pant Moong 6, Pusa Vishal, Samrat, Sweta) and 11 of urdbean (Azad 1, Azad 2, Azad 3, IPU 02-43, NDU 1, PU 31, PU 40, Shekhar 1, Shekhar 2, T 9, Uttara) on the basis of their most important metric traits viz., plant height, size of leaf/leaflets, number of branches per plant, days to 50 per cent flowering, number of pods per plant, pod length, number of seed per pod and seed size in order to find out comparatively the best one model for characterizing the varieties. The average accuracy of Logistic Regression, Linear Discriminant Analysis, K-Nearest Neighbor’s and Naïve-Bayes were varied over the pulse crops. The precision and recall of test data set of all crops varieties were 100%. The K-NN model was thus found to be out performed over other models under studied and could therefore effectively be utilized for characterizing, classifying and/or identifying the varieties of pulse crops.
References
Agrawal PK, Singh P, Kanaujia P, Gangawar P, Das A and Dixit GP. 2015. Molecular diversity and genetic enhancement among major pulse crops of India. Pulses: challanges & opportunities under changing climatic scenario (Eds: GP Dixit, Jagdish Singh and NP Singh), ICAR-IIPR, Kanpur, India. pp. 52-76
Agresti A. 1996. An introduction to categorical data analysis. wiley, New York.
Bishop CM. 2007. Pattern recognition and machine learning. Springer, New York.
Camelo-Méndez GA, Camacho-Díaz BH, Del Villar-Martínez AA, Arenas-Ocampo ML, Bello-Pérez LA and Jiménez-Aparicio AR. 2012. Digital image analysis of diverse Mexican rice cultivars. Journal of Science, Food and Agriculture 92: 2709–2714
Hobson DM, Carter RM and Yan Y. 2007. Characterization and identification of rice grains through digital image analysis. IEEE instrumentation and measurement technology conference. Warsaw.pp.1-5
Dheer P. 2019. Distinguishing of rice varieties by using machine learning models. International Journal of Advanced Research in Computer and Communication Engineering 8(1): 55-57
Dheer P and Singh RK. 2019. Identification of Indian rice varieties using machine learning classifiers. Plant Archives 19(1): 155-158
Dheer P, Purshottam and Singh V. 2019. Classifying wheat varieties using machine learning model. Journal of Pharmacognosy and Phytochemistry 1(3): 47-49
Duda RO, Hart P and Stork DG. 2000. Pattern classification. 2nd ed. John Wiley and Sons, New York.
Kong W, Zhang C, Liu F, Nie P and He Y. 2013. Rice seed cultivar identification using near-infrared hyper spectral imaging and multivariate data analysis. Sensors 13(7): 8916-8927
Mitchell T. 1997. Machine Learning. McGraw Hill, NY. Winter P, Pfaff T, Udupa SM, Huttel B, Sharma PC, Sahi S, Arreguin- Espinoza R, Weigand F, Muehlbaur FJ and Kahl G. 1999.
Characterization and mapping of sequence-tagged microsatellite sites in chickpea (Cicer arietinum L.) genome. Molecular Genomics and Genetics 262(1): 90-101
Yadav RDS and Srivastava JP. 2002. DUS characteristics of chickpea varieties. Seed Tech News 32(1): 29-30




