Feature discovery in NIR spectroscopy based Rocha pear classification
Daniel, M.; Guerra, R.; Brázio, A.; Rodrigues, D.; Cavaco, A. M.; Antunes, M. D.; Valente de Oliveira, J. (2021).
Abstract:
Non-invasive techniques for automatic fruit classification are gaining importance in the global agro-industry as they allow for optimizing harvesting, storage, management, and distribution decisions. Visible, near infra-red (NIR) diffuse reflectance spectroscopy is one of the most employed techniques in such fruit classification. Typically, after the acquisition of a fruit reflectance spectrum the wavelength domain signal is preprocessed and a classifier is designed. Up to now, little or no work considered the problem of feature generation and selection of the reflectance spectrum. This work aims at filling this gap, by exploiting a feature engineering phase before the classifier. The usual approach where the classifier is fed directly with the reflectances measured at each wavelength is contrasted with the proposed division of the spectra into bands and their characterization in wavelength, frequency, and wavelength-frequency domains. Feature selection is also applied for optimizing efficiency, predictive accuracy, and for mitigating over-training. A total of 3050 Rocha pear samples from different origins and harvest years are considered. Statistical tests of hypotheses on classification results of soluble solids content – a predictor of both fruit sweetness and ripeness – show that the proposed preliminary phase of feature engineering outperforms the usual direct approach both in terms of accuracy and in the number of necessary features. Moreover, the method allows for the identification of features that are physical chemistry meaningful.
Keywords:Feature extraction; Feature selection; Data analysis; Classification; Machine learning.
Consulte este Artigo Científico aqui.