Cybernetics And Systems Analysis
International Theoretical Science Journal
UDC 519.217.2
I.V. Sergienko, A.M. Gupal, A.V. Ostrovskiy

EM ALGORITHM FOR GENE CLASSIFICATION

Abstract. The EM algorithm is considered for the problem of separating mixtures of probability distributions whose components are described by Markov chains, together with the related problem of maximizing the weighted log-likelihood. Auxiliary algorithms are proposed for selecting the initial approximation and the optimal mixture size, as well as a method for approximating the mixture on given data using support vector machines. The results are applied to improve the quality of gene fragment classifiers.

Keywords: Markov chain, classification, gene, bioinformatics, nucleotide, exon, intron, likelihood.
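
To illustrate the kind of problem the abstract describes, the following is a minimal Python sketch of EM for a mixture of first-order Markov chains over the nucleotide alphabet. It is a generic textbook formulation, not the authors' algorithm: the function names, the random soft initialization, the fixed mixture size k, the pseudocount smoothing, and the omission of the initial-state distribution are all assumptions made for illustration. The paper's own initialization, mixture-size selection, and support vector machine steps are not shown.

```python
import math
import random

ALPHABET = "ACGT"
IDX = {c: i for i, c in enumerate(ALPHABET)}


def count_transitions(seq):
    """Return a 4x4 matrix of nucleotide transition counts for one sequence."""
    counts = [[0.0] * 4 for _ in range(4)]
    for a, b in zip(seq, seq[1:]):
        counts[IDX[a]][IDX[b]] += 1.0
    return counts


def chain_log_likelihood(counts, log_trans):
    """Log-likelihood of a sequence (given its first symbol) under one chain."""
    return sum(counts[i][j] * log_trans[i][j]
               for i in range(4) for j in range(4))


def em_markov_mixture(seqs, k, n_iter=50, pseudo=1e-2, seed=0):
    """Hypothetical helper: fit a k-component mixture of Markov chains by EM."""
    rng = random.Random(seed)
    all_counts = [count_transitions(s) for s in seqs]

    # Assumption: random soft responsibilities as the initial approximation.
    resp = []
    for _ in seqs:
        row = [rng.random() + 1e-3 for _ in range(k)]
        total = sum(row)
        resp.append([r / total for r in row])

    for _ in range(n_iter):
        # M-step: the weighted log-likelihood has a closed-form maximizer,
        # namely responsibility-weighted transition frequencies.
        weights = [(sum(r[c] for r in resp) + pseudo) / (len(seqs) + k * pseudo)
                   for c in range(k)]
        log_trans = []
        for c in range(k):
            m = [[pseudo] * 4 for _ in range(4)]
            for r, cnt in zip(resp, all_counts):
                for i in range(4):
                    for j in range(4):
                        m[i][j] += r[c] * cnt[i][j]
            log_trans.append([[math.log(v / sum(row)) for v in row]
                              for row in m])

        # E-step: posterior probability of each component for each sequence,
        # computed in log space for numerical stability.
        new_resp = []
        for cnt in all_counts:
            logs = [math.log(weights[c]) + chain_log_likelihood(cnt, log_trans[c])
                    for c in range(k)]
            top = max(logs)
            exps = [math.exp(v - top) for v in logs]
            total = sum(exps)
            new_resp.append([e / total for e in exps])
        resp = new_resp

    return weights, log_trans, resp
```

Calling `em_markov_mixture(seqs, k=2)` on a list of nucleotide strings returns the mixture weights, the per-component log transition matrices, and the per-sequence responsibilities; taking the component with the largest responsibility for each sequence gives a hard classification.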




Ivan Vasilyevich Sergienko,
Academician of the NAS of Ukraine, Director of the V.M. Glushkov Institute of Cybernetics of the NAS of Ukraine, Kiev,
e-mail: aik@public.icyb.kiev.ua.

Anatoliy Mikhailovich Gupal,
Corresponding Member of the NAS of Ukraine, Professor, Head of Department at the V.M. Glushkov Institute of Cybernetics of the NAS of Ukraine, Kiev,
e-mail: gupal_anatol@mail.ru.

Aleksey Viktorovich Ostrovskiy,
Junior Researcher at the V.M. Glushkov Institute of Cybernetics of the NAS of Ukraine, Kiev,
e-mail: ostrovski.alex@gmail.com.

© 2015 Kibernetika.org. All rights reserved.