Speech is considered as the most powerful form of communication between humans. Speech is a slow varying signal. It gives information about the characteristics of a person, such as the gender, emotional state, health condition, spoken language and the identity of the speaker. In other words, the human voice is a complex signal produced as a result of many transformations happening at many different levels such as: Semantic, Articulatory, Linguistic, and Acoustic. The speech signal is basically created at the vocal cords, travels through the vocal tract, and produced at the speaker’s mouth. The study of speech signals and the dealing with methods of these signals is called speech processing. Some of the best electronics colleges in Nashik are training students to develop multilingual speaker systems to drive innovation in the field.
What is Speaker Identification and its Importance?
Speech identification or recognition is the task about identifying a spoken word or sentence from the database. Language Identification is the process of recognising the words and sentences of a language and determining which language is spoken.
Speaker identification is about identifying the speaker using speech samples spoken by different speakers. A speaker identification system identifies persons from their voice. The voice or speech of every individual speaker is different. This is because of the difference in the size of the larynx, shapes of vocal tract, and other organs involved in voice production. Also, many speakers differ in their style of way of pronunciation, speaking or accent, rhythm, style of intonation, choice of vocabulary etc.
Speaker identification has become a most preferable biometric authentication technique in present days. It uses certain characteristics of the persons to identify the individuals. It is supposed that in the coming years the speaker identification technology will be used to verify the identity of persons, to access a system, to enormous services such as online bank transactions, online shopping and also the transmission and reception of confidential data can be controlled by human speech signals.
The Need for Multilingual Systems Around the World
Many countries around the world have populations who speak multiple languages. Like India, where many people are able to speak more than one language fluently. Therefore, there is a need for the development of a multilingual speaker identification system with new challenges. Instead of designing multiple speech processing systems for different languages our focus is to develop a common multilingual system, with support for multiple languages.
Multilingual Speech Processing
Multilingual Speech Processing is the field of speech technology in which the speech signal of multiple languages of a speaker is analysed to observe the effect of the language on the speech features. On the basis of this observation, a Multilingual Speaker Identification system can be designed for identification of the speaker in multilingual environments. The identity of a multilingual speaker can be determined from the information contained in the Multilanguage speech signal through speaker identification.
The Multilingual Speaker Identification is concerned with identifying unknown speakers from a speech database of multilingual speaker models previously enrolled in the system. The demand for multilingual speaker identification systems increases for countries like India where many people are able to speak more than one language.
Applications of Multilingual Speaker Identification
Multilingual speech processing for multilingual speaker identification is a field of research in speech signal processing and speaker identification which comes together many techniques developed for multilingual speaker identification in single language environment with new approaches that convert it to the multilingual environment In Multilingual Speech Processing for Multilingual Speaker Identification are covered in two sections first is Processing and second one is Identification. In Processing, various features of the Multilingual speech signal have been analysed whereas in Identification unknown speakers will be identified through the speech signal from a speech database of multilingual speakers. A text dependent Multilingual speaker identification i.e. speaker will have to speak predefined sentences for identification has been designed. Multilingual Speaker Identification System has utilisation both in biometric as well as forensic applications. However, several areas remain unwrapped for further research and improvement to the proposed design.
Conclusion
Multilingual speech signal processing is a field of research in speech and language technology. This field combines many of the techniques used in monolingual speech processing systems with new techniques that are challenges in the multilingual domain. A B.Tech Electronics and Telecommunication program is essential for candidates who wish to build a career in developing multilingual systems. Spoken language contains a lot of information such as information about the content of a message and information about the speaker of that message. Content is composed of several levels of linguistic information like phonological information, morphological information, syntactic information, and semantic information.
