ISSN: 2640-8104
Open Journal of Pain Medicine
Mini Review       Open Access      Peer-Reviewed

Attention for the speech of cleft lip and palate in speaker recognition

Lakshmi Prasanna P*

Helen Keller’s Institute of Research and Rehabilitation for the Disabled Children, RK Puram, near Neredmet X roads, Secunderabad-500056, Telangana, India
*Corresponding author: Lakshmi Prasanna P, Associate Professor in Speech-Language Pathology, Helen Keller’s Institute of Research and Rehabilitation for the Disabled Children, RK Puram, near Neredmet X roads, Secunderabad-500056, Telangana, India, E-Mail: lakshmiprasannaspeechpathologist@yahoo.com
Received: 21 November, 2023 | Accepted: 30 November, 2023 | Published: 01 December, 2023
Keywords: Artificial intelligence; Cleft lip and palate; Voice quality; Speaker recognition; Hypernasality

Cite this as

Lakshmi Prasanna P (2023) Attention for the speech of cleft lip and palate in speaker recognition. Open J Pain Med 7(1): 022-024. DOI: 10.17352/ojpm.000036

Copyright License

© 2023 Lakshmi Prasanna P. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Artificial Intelligence (AI) has become indispensable to all people, primarily for the purposes of speaker recognition, voice identification, educational purposes, workplace, and health care. Based on a speaker’s voice characteristics, identification and recognition of the speaker is accomplished. The voice is affected by both intra- and interspeaker variability. In addition to this, a condition known as structural abnormalities can cause resonance, which can seriously affect voice quality. As a result, speakers may experience difficulties when using AI-based devices. The study aims to investigate the effects of speech with cleft lip and palate on speaker recognition. The review stated that even after surgery, some people with cleft lip and palate exhibit hypernasality and poor speech intelligibility depending on the severity of the cleft. The author discovered that artificial intelligence has been applied to surgical procedures. In children with corrected cleft lip and palate, acoustic analysis revealed poor benchmarking for speaker identification. The most prevalent type of hypernasality also affects speech intelligibility. Thus, more research on speaker recognition using different algorithms and hypernasality is essential. These can help speakers who have CLP to use AI freely and without any issues. Even with its flaws, people with CLP can still learn more about using AI.

Introduction

Cleft lip and palate are examples of craniofacial abnormalities, which are congenital structural deformities brought on by abnormal embryological development. Between the fourth and tenth weeks of the developing embryo or foetus, there is a break in the growth of the embryo, which causes cranial variations [1].

Cleft Lip and Palate are the most congenital craniofacial anomalies that show structural defects in the lip, alveolus, nasal floor, and palates (hard/soft). This occurs due to the lack of fusion of the frontonasal and maxillary processes and palatal shelves which shows cleft in hard/soft palates. These are due to the failure of the facial process of the development.

Orofacial clefts occur at varying rates depending on the demographic. Overall, Indians have reported greater rates (one in 500 births), while people with African ancestry have reported lower rates (one in 2,500 births [2]). The prevalence of isolated cleft palate is 2:1 more common in females than in boys. In contrast, the ratio of men to women who have cleft lips, whether they also have cleft palates, is 2:1 [3].

In the United States, cleft lip with or without cleft palate is the second most frequent birth abnormality, affecting one in every 940 births and leading to 4,437 instances per year [4]. An estimated prevalence of 7.75 to 10.63 per 10,000 live births has been reported [5].

The global prevalence of cleft lip [0.3 (95% CI:0.26-0.34)], cleft palate [0.33 (95% CI: 0.28 - 0.38)], and cleft lip and palate [0.45 (95% CI: 0.38 - 0.52)] in every 1000 live births, based on the study [6].

Quality of speech depends on the severity of the cleft lip and palate. It also depends on the type of cleft involved. Usually, speech errors such as articulation, and hyper/hypo nasality, will be seen.

In the field of speaker recognition, it is also important to give attention to the cleft lip and palate population due to the increase of technology available in day-to-day life situations. Post-COVID life has become more inclined to virtual and depends on various identification devices at educational institutes, industries/companies, government access, banking systems, entertainment devices, etc. It is also necessary to improve the quality of life of an individual with cleft lip and palate.

According to the scoping review of a study, an attempt was made to pinpoint the many applications of artificial Intelligence (AI) for helping people with cleft lip and/or palate as well as their current state [7]. Artificial intelligence and cleft lip and/or palate were searched for in the Pubmed, Embase, and IEEE Xplore databases. The study was carried out in accordance with PRISMA- ScR Recommendations. 458 items from the initial search were filtered based on title and abstract. 26 publications were added after the screening, duplication removal, and full-text reading of a few papers. They investigated how AI could be used in cases of cleft lip and/or palate to help with diagnosis, treatment-particularly speech therapy-and prediction.

Speech characteristics

Due to pressure loss through the velopharyngeal port or when a youngster lowers their vocal pitch as a coping mechanism to lessen or mask hoarseness, nasal emission, or hypernasality [1].

According to a study, children with cleft lip or palate have articulation disorders (obligatory and compensatory errors) that impair their ability to understand speech [8]. Their survey confirms that in order to improve speech intelligibility in this population, speech therapy is required.

Quality of life with AI in CLP

AI model that can intelligently and consistently identify the anatomy of cleft lip and nasal deformity and automate the placement of nasolabial marks that can serve as a surgical design guide. We used the high-resolution net architecture, a recently developed family of convolutional neural networks-based deep learning architecture, to train an AI model that can recognise and identify the 21 anthropometric points associated with a cleft lip on images and videos of cleft lips. The cleft AI model was put to the test by measuring the Euclidean distance between anthropometric points that were manually indicated by a skilled cleft surgeon and those produced by our model. Each point’s Normalized Mean Error (NME) was computed. The range of NME readings was 0.029 to 0.055. The cleft-side Crista Philtri inferior (cphi) NME was the largest. Cleft-side alare has the lowest NME value. These failures fell well within the bounds of accepted AI standards benchmarks [9].

After reviewing 44 publications, it was concluded that 12 of them stated that AI-enabled computerized programming software for precise landmark detection, rapid digital cephalometric analysis, clinical decision-making, and treatment prediction [10]. Machine learning can assist in identifying cephalometric predictors of the future need for orthognathic surgery in children who have had their unilateral cleft lip and palate corrected.

AI is used to improve the quality of life of an individual with CLP.

Studies on formants, other automatic methods

The obtained benchmark for SPID in children with CLP results showed that for vowel /i:/ it was 75%, this indicates a better benchmark and can be used for further SPID process. A 60% for vowel /a:/ where it states above chance level and 40% for vowel /u:/ which indicates poor benchmarking [11]. The above-said benchmarking was obtained in a small group that consisted of 5 speakers. There is no significant difference between vowels which could be due to anatomical and functional variations. They stated that the SPID is affected as the no. of speakers size increases.

Quality of speech affected

The ability to speak is an essential life skill, and any age can be impacted by cleft-related palatal, dental, and occlusal abnormalities. When velopharyngeal dysfunction is present in certain patients, speech prosthetics are a suitable treatment option that should be administered by a specialized multidisciplinary team [12].

Due to the overlapping or coarticulation and quality effect on the speech of CLP, there are chances of misperception of the voice. Further, it may impact on quality of life.

How speech recognition is important for children/adults

An individual’s identity is important at any age and in the current situation, biometrics are essential even for children in school, assistive devices, apps/tools, mobiles, or computers. Voice is one of the biometrics which can be used in assistive devices.

How CLP affects speech recognition

Cleft Lip and Palate is a congenital anomaly where the structures are disturbed and can be rectified through surgeries. Due to the anatomical disturbances, the production of speech may also affected and the quality of speech is affected due to the disturbance in the nasal quality. In most of the individuals voice of the cleft lip and palate will be hypernasality, where speaker identification can be difficult to identify a particular individual.

Opinion/focus on speech recognition aspects in patients with CLP

According to a study, the most effective tools for assessing hypernasal speech are Machine Learning (ML) algorithms [13]. In instances where machine learning algorithms can quickly and independently identify hypernasality in a manner that is consistent with professional speech-language pathologists and has a high concordance rate. ML algorithms can supplement speech-language pathologists’ gold standard and increase their reach. Long-term outcome monitoring holds significant potential for enhancing treatment outcomes.

Individuals with cleft lip and palate still exhibit a variety of speech problems and hypernasality despite receiving speech therapy and surgical correction. Speech intelligibility is also affected based on the severity of the cleft. These errors could result from the age of identification and management (surgical/speech therapy). AI is being used more and more in modern times, and speaker identification and recognition issues with CLP speech are still present. To enhance the quality of life for a person with CLP, it would therefore be preferable to focus more on speaker verification/identification and develop enhanced AI techniques.

Significance

Speech intelligibility is affected by structural abnormalities in a different condition called cleft lip and palate. AI usage is most common in the modern era. The current study made an attempt to clarify the applications of AI and the speech issues that CLP children face. Errors may endure for a lifetime. Some people may find it difficult to use AI devices because of their hypernasality, speech errors, and intelligibility issues. AI is currently utilized during surgical procedures. In contrast, in a small group of five children, only 75% of the children were able to identify the speaker based on the vowel /i:/. The current study also examined the possibility that people with abnormal speech patterns could have trouble using speaker recognition software.

Conclusion

Finally, since some people may not be successful or entirely satisfied with using AI devices for their purposes, there should be serious attention paid to cleft lip and palate when using these devices. To match their speech quality, further research should be conducted using different algorithms. Additionally, AI can be used in therapeutic management to enhance the speech quality of people who have cleft lip and palate.

  1. Falzone PSJ, Mary HJ, Karnell MP. Communication disorders associated with cleft palate. Cleft Palate Speech. 2010; 162-198.
  2. Dixon MJ, Marazita ML, Beaty TH, Murray JC. Cleft lip and palate: understanding genetic and environmental influences. Nat Rev Genet. 2011 Mar;12(3):167-78. doi: 10.1038/nrg2933. PMID: 21331089; PMCID: PMC3086810.
  3. Mossey PA, Shaw WC, Munger RG, Murray JC, Murthy J, Little J. Global oral health inequalities: challenges in the prevention and management of orofacial clefts and potential solutions. Adv Dent Res 2009.
  4. Parker SE, Mai CT, Canfield MA, Rickard R, Wang Y, Meyer RE, Anderson P, Mason CA, Collins JS, Kirby RS, Correa A; National Birth Defects Prevention Network. Updated National Birth Prevalence estimates for selected birth defects in the United States, 2004-2006. Birth Defects Res a Clin Mol Teratol. 2010 Dec;88(12):1008-16. doi: 10.1002/bdra.20735. Epub 2010 Sep 28. PMID: 20878909.
  5. Tanaka SA, Mahabir RC, Jupiter DC, Menezes JM. Updating the epidemiology of cleft lip with or without cleft palate. Plast Reconstr Surg. 2012 Mar;129(3):511e-518e. doi: 10.1097/PRS.0b013e3182402dd1. PMID: 22374000.
  6. Salari N, Darvishi N, Heydari M, Bokaee S, Darvishi F, Mohammadi M. Global prevalence of cleft palate, cleft lip and cleft palate and lip: A comprehensive systematic review and meta-analysis. J Stomatol Oral Maxillofac Surg. 2022 Apr;123(2):110-120. doi: 10.1016/j.jormas.2021.05.008. Epub 2021 May 24. PMID: 34033944.
  7. Dhillon H, Chaudhari PK, Dhingra K, Kuo RF, Sokhi RK, Alam MK, Ahmad S. Current Applications of Artificial Intelligence in Cleft Care: A Scoping Review. Front Med (Lausanne). 2021 Jul 28; 8:676490. doi: 10.3389/fmed.2021.676490. PMID: 34395471; PMCID: PMC8355556.
  8. Safaiean A, Jalilevand N, Ebrahimipour M, Asleshirin E, Hiradfar M. Speech intelligibility after repair of cleft lip and palate. Med J Islam Repub Iran. 2017 Dec 15; 31:85. doi: 10.14196/mjiri.31.85. PMID: 29951386; PMCID: PMC6014757.
  9. Sayadi LR, Hamdan US, Zhangli Q, Hu J, Vyas RM. Harnessing the Power of Artificial Intelligence to Teach Cleft Lip Surgery. Plast Reconstr Surg Glob Open. 2022 Jul 25;10(7):e4451. doi: 10.1097/GOX.0000000000004451. Erratum in: Plast Reconstr Surg Glob Open. 2022 Oct 28;10(10):e4638. PMID: 35924000; PMCID: PMC9325328.
  10. Huqh MZU, Abdullah JY, Wong LS, Jamayet NB, Alam MK, Rashid QF, Husein A, Ahmad WMAW, Eusufzai SZ, Prasadh S, Subramaniyan V, Fuloria NK, Fuloria S, Sekar M, Selvaraj S. Clinical Applications of Artificial Intelligence and Machine Learning in Children with Cleft Lip and Palate-A Systematic Review. Int J Environ Res Public Health. 2022 Aug 31;19(17):10860. doi: 10.3390/ijerph191710860. PMID: 36078576; PMCID: PMC9518587.
  11. Prasanna LP, Akansha. Benchmarks for speaker identification in cleft lip and palate children. Unpublished dissertation, Osmania University, Hyderabad. 2022.
  12. Phippen G. Articulating the issues: speech assessment and intervention in cleft lip and palate. Br Dent J. 2023 Jun;234(12):912-917. doi: 10.1038/s41415-023-5954-y. Epub 2023 Jun 23. PMID: 37349440.
  13. Zhang A, Pyon RE, Chen K, Lin AY. Speech Analysis of Patients with Cleft Palate Using Artificial Intelligence Techniques: A Systematic Review. FACE. 2023; 4(3):327-337. doi:10.1177/27325016231187985.
 


Article Alerts

Subscribe to our articles alerts and stay tuned.


Creative Commons License This work is licensed under a Creative Commons Attribution 4.0 International License.



Help ?