Is tonal language a problem for speaker identification (SPID)?

Lakshmi Prasanna P 1, 2, * and Vanlalhriati C 2

1 Speech – Language Pathology, and 2MSc (SLP).
2  Helen Keller’s Institute of Research and Rehabilitation for the Disabled Children, RK Puram, near Neredmet X roads, Secunderabad-500056, Telangana, India.
 
Research Article
World Journal of Advanced Engineering Technology and Sciences, 2022, 07(02), 163–173.
Article DOI: 10.30574/wjaets.2022.7.2.0140
Publication history: 
Received on 27 October 2022; revised on 04 December 2022; accepted on 06 December 2022
 
Abstract: 
The availability of numerous technologies has led to an increase in the usage of bioinformatics in recent years. Siri, Alexa, and other artificial intelligence systems assist us in our daily lives. Voice recognition systems are used to confirm an individual's identity based on particular elements retrieved from his or her voice. In this regard, the current study attempted to assess the proportion of speaker identification in tonal language speaking persons. The study included 20 participants in the age range from 20 to 40 years. All participants were given a few phrases to speak and were recorded. PRAAT software was used to analyze the obtained data. A vector was developed by using the first two formants, which was then utilized to calculate the percentage for speaker identification. From small sample size to bigger sample size, three groups were formed: A-5, B-10, and C-20 speakers. In a lower sample size, results showed a benchmark of 65% for vowel /i:/, which is better for SPID, 60% for /a:/, which is above chance level, and 45% for /u:/, which is below chance level. The authors stated that increasing the sample size has an influence on speaker identification.
 
Keywords: 
Tonal language; Speaker identification; Benchmark; PRAAT software; Artificial Intelligence
 
Full text article in PDF: