MyMedR

Displaying all 5 publications

Abstract:

Sort:

Fulltext Age estimation based on children's voice: a fuzzy-based decision fusion strategy

Mirhassani SM, Zourmand A, Ting HN

ScientificWorldJournal, 2014;2014:534064.
PMID: 25006595 DOI: 10.1155/2014/534064

Automatic estimation of a speaker's age is a challenging research topic in the area of speech analysis. In this paper, a novel approach to estimate a speaker's age is presented. The method features a "divide and conquer" strategy wherein the speech data are divided into six groups based on the vowel classes. There are two reasons behind this strategy. First, reduction in the complicated distribution of the processing data improves the classifier's learning performance. Second, different vowel classes contain complementary information for age estimation. Mel-frequency cepstral coefficients are computed for each group and single layer feed-forward neural networks based on self-adaptive extreme learning machine are applied to the features to make a primary decision. Subsequently, fuzzy data fusion is employed to provide an overall decision by aggregating the classifier's outputs. The results are then compared with a number of state-of-the-art age estimation methods. Experiments conducted based on six age groups including children aged between 7 and 12 years revealed that fuzzy fusion of the classifier's outputs resulted in considerable improvement of up to 53.33% in age estimation accuracy. Moreover, the fuzzy fusion of decisions aggregated the complementary information of a speaker's age from various speech sources.

Matched MeSH terms: Voice/physiology
Detection of Voice Pathology using Fractal Dimension in a Multiresolution Analysis of Normal and Disordered Speech Signals

Ali Z, Elamvazuthi I, Alsulaiman M, Muhammad G

J Med Syst, 2016 Jan;40(1):20.
PMID: 26531753 DOI: 10.1007/s10916-015-0392-2

Voice disorders are associated with irregular vibrations of vocal folds. Based on the source filter theory of speech production, these irregular vibrations can be detected in a non-invasive way by analyzing the speech signal. In this paper we present a multiband approach for the detection of voice disorders given that the voice source generally interacts with the vocal tract in a non-linear way. In normal phonation, and assuming sustained phonation of a vowel, the lower frequencies of speech are heavily source dependent due to the low frequency glottal formant, while the higher frequencies are less dependent on the source signal. During abnormal phonation, this is still a valid, but turbulent noise of source, because of the irregular vibration, affects also higher frequencies. Motivated by such a model, we suggest a multiband approach based on a three-level discrete wavelet transformation (DWT) and in each band the fractal dimension (FD) of the estimated power spectrum is estimated. The experiments suggest that frequency band 1-1562 Hz, lower frequencies after level 3, exhibits a significant difference in the spectrum of a normal and pathological subject. With this band, a detection rate of 91.28 % is obtained with one feature, and the obtained result is higher than all other frequency bands. Moreover, an accuracy of 92.45 % and an area under receiver operating characteristic curve (AUC) of 95.06 % is acquired when the FD of all levels is fused. Likewise, when the FD of all levels is combined with 22 Multi-Dimensional Voice Program (MDVP) parameters, an improvement of 2.26 % in accuracy and 1.45 % in AUC is observed.

Matched MeSH terms: Voice/physiology
Objective and subjective changes in voice after endoscopic sinus surgeries in patients with and without nasal polyps

Wong EHC, Chong AW

Am J Otolaryngol, 2019 12 05;41(2):102367.
PMID: 31831185 DOI: 10.1016/j.amjoto.2019.102367

BACKGROUND: Many studies have looked at the effect of functional endoscopic sinus surgeries (FESS) on nasalance, nasal consonant and nasalized vowels. Only two studies investigated the effect of FESS on vocal sound quality and have not found statistically significant changes before and after operations. The aim of this study was to examine the short-term and long-term objective and subjective changes in the vocal quality of patients after FESS, comparing patients with and without nasal polyps.
METHODS: Sixteen patients were recruited for voice analysis during pre-operative, within two weeks and at least three months post-operatively. Subjective questionnaire was used to assess perception of voice changes.
RESULTS: There were no statistically significant changes in the acoustic parameters of patients with nasal polyposis. In patients with CRS without polyps, there was a statistically significant increase in fundamental frequency (F0) in nasal sound during early follow up. The changes in soft phonation index (SPI) values between the two groups were statistically significant during early follow-ups. Only patients with nasal polyposis perceived a subjective change in their voice post-operatively.
CONCLUSIONS: Clinicians should inform all patients, especially voice professionals about the possible effects of endoscopic sinus surgeries on their voice quality.

Matched MeSH terms: Voice/physiology*
Fulltext Determinants and Effects of Voice Disorders among Secondary School Teachers in Peninsular Malaysia Using a Validated Malay Version of VHI-10

Moy FM, Hoe VC, Hairi NN, Chu AH, Bulgiba A, Koh D

PLoS One, 2015;10(11):e0141963.
PMID: 26540291 DOI: 10.1371/journal.pone.0141963

OBJECTIVES: To establish the prevalence of voice disorder using the Malay-Voice Handicap Index 10 (Malay-VHI-10) and to study the determinants, quality of life, depression, anxiety and stress associated with voice disorder among secondary school teachers in Peninsular Malaysia.
METHODS: This study was divided into two phases. Phase I tested the reliability of the Malay-VHI-10 while Phase II was a cross-sectional study with two-stage sampling. In Phase II, a self-administered questionnaire was used to collect socio-demographic and teaching characteristics, depression, anxiety and stress scale (Malay version of DASS-21); and health-related quality of life (Malay version of SF12-v2). Complex sample analysis was conducted using multivariate Poisson regression with robust variance.
RESULTS: In Phase I, the Spearman correlation coefficient and Cronbach alpha for total VHI-10 score was 0.72 (p < 0.001) and 0.77 respectively; showing good correlation and internal consistency. The ICCs ranged from 0.65 to 0.78 showing fair to good reliability and demonstrating the subscales to be reliable and stable. A total of 6039 teachers participated in Phase II. They were primarily Malays, females, married, had completed tertiary education and aged between 30 to 50 years. A total of 10.4% (95% CI 7.1, 14.9) of the teachers had voice disorder (VHI-10 score > 11). Compared to Malays, a greater proportion of ethnic Chinese teachers reported voice disorder while ethnic Indian teachers were less likely to report this problem. There was a higher prevalence ratio (PR) of voice disorder among single or divorced/widowed teachers. Teachers with voice disorder were more likely to report higher rates of absenteeism (PR: 1.70, 95% CI 1.33, 2.19), lower quality of life with lower SF12-v2 physical (0.98, 95% CI 0.96, 0.99) and mental (0.97, 95% CI 0.96, 0.98) component summary scales; and higher anxiety levels (1.04, 95% CI 1.02, 1.06).
CONCLUSIONS: The Malay-VHI-10 is valid and reliable. Voice disorder was associated with increased absenteeism, marginally associated with reduced health-related quality of life as well as increased anxiety among teachers.

Matched MeSH terms: Voice/physiology
Immediate selective laryngeal reinnervation in vagal paraganglioma patients

Mat Baki M, Clarke P, Birchall MA

J Laryngol Otol, 2018 Sep;132(9):846-851.
PMID: 30180919 DOI: 10.1017/S0022215118000476

OBJECTIVE: This prospective case series aimed to present the outcomes of immediate selective laryngeal reinnervation.
METHODS: Two middle-aged women with vagal paraganglioma undergoing an excision operation underwent immediate selective laryngeal reinnervation using the phrenic nerve and ansa cervicalis as the donor nerve. Multidimensional outcome measures were employed pre-operatively, and at 1, 6 and 12 months post-operatively.
RESULTS: The voice handicap index-10 score improved from 23 (patient 1) and 18 (patient 2) at 1 month post-operation, to 5 (patient 1) and 1 (patient 2) at 12 months. The Eating Assessment Tool 10 score improved from 20 (patient 1) and 24 (patient 2) at 1 month post-operation, to 3 (patient 1) and 1 (patient 2) at 12 months. There was slight vocal fold abduction observed in patient one and no obvious abduction in patient two.
CONCLUSION: Selective reinnervation is safe to perform following vagal paraganglioma excision conducted on the same side. Voice and swallowing improvements were demonstrated, but no significant vocal fold abduction was achieved.

Matched MeSH terms: Voice/physiology