Application Analysis of Supervised and Unsupervised Methods for Clustering and Classification of a Wheat Seeds Dataset

  • Mirella M. de O. Carneiro Federal University of Rio de Janeiro, Rio de Janeiro, RJ
  • Milena F. Pinto Federal Center for Technological Education of Rio de Janeiro, Rio de Janeiro, RJ
  • Alessandro R. L. Zachi Federal Center for Technological Education of Rio de Janeiro, Rio de Janeiro, RJ
Keywords: Supervised Learning, Decision Tree, Unsupervised Learning, K-Means, Kohonen Neural Network, Wheat Seeds

Abstract

This project aims to apply easy-to-implement supervised and unsupervised learning methods to do a cluster and classification analysis of the information from a wheat seeds dataset. In addition, it intends to thoroughly evaluate, employing post-processing techniques, the efficiency of the models produced from them, proving that it is not necessary to use complex procedures in this database, since the obtained results for clustering and classification were highly similar to the dataset original labels. The decision tree was chosen as the classification algorithm. Furthermore, k-means and Kohonen neural network were selected as the clustering methods. Additionally, pre-processing and exploratory data analysis techniques were explained in detail and employed in order to maximize the final quality of the model developed.
Published
2022-10-19
Section
Articles