Extraction of Data Features for Neuro-Classifier Input

Cover Page

Abstract


The problem of essential data compression to be input to ANN-classifier without loosing significant information is considered on the example of the quite substantial task of the genetic protein structure analysis, which is important for genetic biology researches in radiobiology and, especially, in agricultural. Such analysis is usually carried out by studying ElectroPhoretic Spectra (EPS) of gliadin (alcohol soluble protein) of the inspected grain cultivar. EPS digitization produces a densitogram with 4 thousands counts, which most informative features must be extracted to be input to ANN. Besides these data require special preprocessing for densitogram smoothing, pedestal eliminating, as well as compensating such digitization orocess defects as signal noise, variability of spectrum borders and illumination, their non-linear starches due to electrophoresis nonstationarity.
Several alternative approaches to features extracting were studied: (1) the densitogram coarsing into 200 averaged measurements; (2) the principal component analysis; (3) recognition of all well-pronounced peaks in order to evaluate their parameters to be input to ANN; (4)-(5) data compression by both discrete Fourier (DFT) and wavelet (DWT) transformations. These methods have been used for feature extraction from samples formed by experts for 30 different sorts. Then extracted features were used to train ANN of three-layer perceptron type. The comparative study of the recognition efficiency with data compressed by the methods listed above shows their high sensitivity to the number of sorts to be classified. Only DFT and DWT approaches could keep the efficiency on the level 95-97% up to 20 sorts.
A further development of feature extraction methods and a study of possibility to develop a hierarchy of classifying ANNs are intended.

G A Ososkov

Joint Institute for Nuclear Research

Email: ososkov@jinr.ru
Лаборатория информационных технологий; Объединённый институт ядерных исследований; Joint Institute for Nuclear Research

D A Baranov

Joint Institute for Nuclear Research

Лаборатория информационных технологий; Объединённый институт ядерных исследований; Joint Institute for Nuclear Research

Views

Abstract - 34

PDF (English) - 21


Copyright (c) 2010 Ососков Г.А., Баранов Д.А.

Creative Commons License
This work is licensed under a Creative Commons Attribution 4.0 International License.