Search for: [Keywords = "speech analysis"]

Search results

Number of results: 4

items per page: 25 50 75

Sort by:

of 1

Audio Feature Space Analysis for Emotion Recognition from Spoken Sentences

Lukasz Smietanka Tomasz Maka

Archives of Acoustics | 2021 | vol. 46 | No 2 | 271-277 | DOI: 10.24425/aoa.2021.136581

Keywords speech analysis classification emotional speech

Download PDF Download RIS Download Bibtex

Abstract

An analysis of low-level feature space for emotion recognition from the speech is presented. The main goal was to determine how the statistical properties computed from contours of low-level features influence the emotion recognition from speech signals. We have conducted several experiments to reduce and tune our initial feature set and to configure the classification stage. In the process of analysis of the audio feature space, we have employed the univariate feature selection using the chi-squared test. Then, in the first stage of classification, a default set of parameters was selected for every classifier. For the classifier that obtained the best results with the default settings, the hyperparameter tuning using cross-validation was exploited. In the result, we compared the classification results for two different languages to find out the difference between emotional states expressed in spoken sentences. The results show that from an initial feature set containing 3198 attributes we have obtained the dimensionality reduction about 80% using feature selection algorithm. The most dominant attributes selected at this stage based on the mel and bark frequency scales filterbanks with its variability described mainly by variance, median absolute deviation and standard and average deviations. Finally, the classification accuracy using tuned SVM classifier was equal to 72.5% and 88.27% for emotional spoken sentences in Polish and German languages, respectively.

Go to article

Authors and Affiliations

Lukasz Smietanka

Tomasz Maka

Faculty of Computer Science and Information Technology, West Pomeranian University of Technology, Szczecin, Poland

Fusing the electromagnetic articulograph, high-speed video cameras and a 16-channel microphone array for speech analysis

Ł. Mik A. Lorenc D. Król R. Wielgat R. Święciński R. Jędryka

Bulletin of the Polish Academy of Sciences Technical Sciences | 2018 | 66 | No 3 | 257-266

Keywords electromagnetic articulography microphone array vision system speech analysis

Download PDF Download RIS Download Bibtex

Authors and Affiliations

Ł. Mik

A. Lorenc

D. Król

R. Wielgat

R. Święciński

R. Jędryka

Comparative Analysis of Classifiers for the Assessment of Respiratory Disorders Using Speech Parameters

Poonam Shrivastava Neeta Tripathi Bikesh Kumar Singh Bhupesh Kumar Dewangan

Archives of Acoustics | 2023 | vol. 48 | No 1 | 13-24 | DOI: 10.24425/aoa.2022.142905

Keywords healthy speech affected speech machine learning classification techniques respiratory disorders speech analysis

Download PDF Download RIS Download Bibtex

Abstract

Non-invasive techniques for the assessment of respiratory disorders have gained increased importance in recent years due to the complexity of conventional methods. In the assessment of respiratory disorders, machine learning may play a very essential role. Respiratory disorders lead to variation in the production of speech as both go hand in hand. Thus, speech analysis can be a useful means for the pre-diagnosis of respiratory disorders. This article aims to develop a machine learning approach to differentiate healthy speech from speech corresponding to different respiratory disorders (affected). Thus, in the present work, a set of 15 relevant and efficient features were extracted from acquired data, and classification was done using different classifiers for healthy and affected speech. To assess the performance of different classifiers, accuracy, specificity (Sp), sensitivity (Se), and area under the receiver operating characteristic curve (AUC) was used by applying both multi-fold cross-validation methods (5-fold and 10-fold) and the holdout method. Out of the studied classifiers, decision tree, support vector machine (SVM), and k-nearest neighbor (KNN) were found more appropriate in providing correct assessment clinically while considering 15 features as well as three significant features (Se > 89%, Sp > 89%, AUC> 82%, and accuracy > 99%). The conclusion was that the proposed classifiers may provide an aid in the simple assessment of respiratory disorders utilising speech parameters with high efficiency. In the future, the proposed approach can be evaluated for the detection of specific respiratory disorders such as asthma, COPD, etc.

Go to article

Authors and Affiliations

Poonam Shrivastava

Neeta Tripathi

Bikesh Kumar Singh

Bhupesh Kumar Dewangan

Department of Electronics and Telecommunication, SSTC Bhilai, India
Department of Biomedical Engineering, National Institute of Technology, Raipur, India
Department of Computer Science and Engineering, School of Engineering, OP Jindal University, Raigarh, India

Speech Analysis as a Tool for Detection and Monitoring of Medical Conditions: A review

Magdalena Igras-Cybulska Daria Hemmerling Mariusz Ziółko Wojciech Datka Ewa Stogowska Michał Kucharski Rafał Rzepka Bartosz Ziółko

Archives of Acoustics | 2023 | vol. 48 | No 3 | 289-315 | DOI: 10.24425/aoa.2023.146640

Keywords speech analysis speech features acoustic parameters linguistic analysis voice biomarkers screening tests

Download PDF Download RIS Download Bibtex

Abstract

The goal of this article is to present and compare recent approaches which use speech and voice analysis as biomarkers for screening tests and monitoring of some diseases. The article takes into account metabolic, respiratory, cardiovascular, endocrine, and nervous system disorders. A selection of articles was performed to identify studies that assess voice features quantitatively in selected disorders by acoustic and linguistic voice analysis. Information was extracted from each paper in order to compare various aspects of datasets, speech parameters, methods of applied analysis and obtained results. 110 research papers were reviewed and 47 databases were summarized. Speech analysis is a promising method for early diagnosis of certain disorders. Advanced computer voice analysis with machine learning algorithms combined with the widespread availability of smartphones allows diagnostic analysis to be conducted during the patient’s visit to the doctor or at the patient’s home during a telephone conversation. Speech analysis is a simple, low-cost, non-invasive and easy-toprovide method of medical diagnosis. These are remarkable advantages, but there are also disadvantages. The effectiveness of disease diagnoses varies from 65% up to 99%. For that reason it should be treated as a medical screening test and should be an indication of the need for classic medical tests.

Go to article

Authors and Affiliations

Magdalena Igras-Cybulska

1 2

e-mail:

ORCID:

Daria Hemmerling

1 2

Mariusz Ziółko

Wojciech Datka

3 4

Ewa Stogowska

Michał Kucharski

Rafał Rzepka

Bartosz Ziółko

1 5

Techmo sp. z o.o., Kraków, Poland
AGH University of Science and Technology, Kraków, Poland
Medical University of Bialystok, Białystok, Poland
Faculty of Medicine, Jagiellonian University, Kraków, Poland
Hokkaido University Kita Ward, Sapporo, Hokkaido, Japan

Search results

Filters

Search results

Audio Feature Space Analysis for Emotion Recognition from Spoken Sentences

Abstract

Authors and Affiliations

Fusing the electromagnetic articulograph, high-speed video cameras and a 16-channel microphone array for speech analysis

Authors and Affiliations

Comparative Analysis of Classifiers for the Assessment of Respiratory Disorders Using Speech Parameters

Abstract

Authors and Affiliations

Speech Analysis as a Tool for Detection and Monitoring of Medical Conditions: A review

Abstract

Authors and Affiliations