An adaptive model of person identification combining speech and image information

Zhang, D.
Ghobakhlou, A.
Kasabov, N.
Item type
Conference Proceedings

The paper introduces a combination of adaptive neural network systems and a statistical method for integrating speech and face image information for person identification. The method allows models of persons to be developed and adjusted on an ongoing basis as new speech and face images become available. It is illustrated by modeling and classifying different persons when speech and face images are presented incrementally. The model contains two sub-networks, one for face image recognition and one for speaker recognition, and a higher-level layer that makes the final decision. In the speaker recognition sub-network, a text-dependent model is built using Evolving Connectionist Systems (ECOS) [1]. In the face image recognition sub-network, a composite profile technique is applied for face image feature extraction, and Zero Instruction Set Computing (ZISC) [2] technology is used to build the neural network. In the higher-level conceptual subsystem, the final recognition decision is made using a statistical method. The experiments show that ECOS and ZISC are appropriate techniques for creating evolving models for the individual tasks of speaker recognition and face recognition. They also show that integrating the speech and image information using the statistical method improves the person identification rate. © 2004 IEEE.
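
The two-sub-network arrangement with a higher-level decision layer can be illustrated with a minimal sketch. The abstract does not say which statistical method the authors use, so the weighted-sum fusion below is an assumption chosen only for illustration. The function names (`normalize`, `identify`), the `speech_weight` parameter, and the example scores are all hypothetical and do not come from the paper.

```python
def normalize(scores):
    """Scale non-negative per-person scores so that they sum to 1."""
    total = sum(scores.values())
    if total <= 0:
        raise ValueError("scores must contain at least one positive value")
    return {person: value / total for person, value in scores.items()}


def identify(speech_scores, face_scores, speech_weight=0.5):
    """Fuse the outputs of two sub-networks and pick the best-matching person.

    Assumption: a weighted sum of normalized scores stands in for the
    paper's unspecified statistical method.
    """
    speech = normalize(speech_scores)
    face = normalize(face_scores)
    # Only persons known to both sub-networks can be fused.
    people = speech.keys() & face.keys()
    if not people:
        raise ValueError("the two sub-networks share no enrolled person")
    fused = {
        person: speech_weight * speech[person]
        + (1.0 - speech_weight) * face[person]
        for person in people
    }
    best = max(fused, key=fused.get)
    return best, fused


# Hypothetical sub-network outputs for two enrolled persons.
speech_out = {"alice": 0.6, "bob": 0.4}
face_out = {"alice": 0.3, "bob": 0.7}

best_equal, fused_equal = identify(speech_out, face_out)
best_speech, fused_speech = identify(speech_out, face_out, speech_weight=0.9)
```

With equal weights the face evidence dominates in this made-up example and the decision is "bob"; weighting speech at 0.9 switches it to "alice". The point of the sketch is that the decision layer sits above two otherwise independent recognizers, so either sub-network can be retrained on new data without changing the fusion step.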

Publisher's version
Rights statement
©2004 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.