Comparative Study of Cluster and Neural Network Methods in the Problem of Protein Structure Analysis

Cover Page

Abstract


This work continues the previous study where the important problem of automatization of differentiation methods of the genetic protein structures according to their electrophoretic spectrums (EPS) was considered. The multicriterion problem of the agriculture cultivar identification by their spectra caused the idea of its solution by an artificial neural network (ANN) trained on an expert data base. In the given paper peculiarities of the neural net use as well as the purposefulness of cluster analysis applications for the EPS classifying are studied. A special model of multidimensional vectors adequately imitating the most essential characteristics of real data obtained after EPS digitalization, denoising and normalization is developed. A numerical experiment is fulfilled on such simulated data stream to study the influence of contamination and distortion factors on the ANN efficiency in order to suppress those factors and improve ANN functioning. Various methods of cluster analysis are also applied to simulated multidimensional data as either an ANN alternative or more soundly as a prior stage of a coarse data classification in some set of detached cultivar groups to be classified next by ANN.

D A Baranov

Joint Institute for Nuclear Research

Email: DmitriyBaranof@gmail.com
Laboratory of Information Technologies

G A Ososkov

Joint Institute for Nuclear Research

Email: ososkov@jinr.ru
Laboratory of Information Technologies

A A Baranov

Moscow State Institute of Radio Engineering, Electronics and Automation

Email: andbar91@yandex.ru

Views

Abstract - 47

PDF (Russian) - 30


Copyright (c) 2014 Баранов Д.А., Ососков Г.А., Баранов А.А.

Creative Commons License
This work is licensed under a Creative Commons Attribution 4.0 International License.