Search results

Filters

  • Journals
  • Authors
  • Keywords
  • Date
  • Type

Search results

Number of results: 4
items per page: 25 50 75
Sort by:
Download PDF Download RIS Download Bibtex

Abstract

An analysis of low-level feature space for emotion recognition from the speech is presented. The main goal was to determine how the statistical properties computed from contours of low-level features influence the emotion recognition from speech signals. We have conducted several experiments to reduce and tune our initial feature set and to configure the classification stage. In the process of analysis of the audio feature space, we have employed the univariate feature selection using the chi-squared test. Then, in the first stage of classification, a default set of parameters was selected for every classifier. For the classifier that obtained the best results with the default settings, the hyperparameter tuning using cross-validation was exploited. In the result, we compared the classification results for two different languages to find out the difference between emotional states expressed in spoken sentences. The results show that from an initial feature set containing 3198 attributes we have obtained the dimensionality reduction about 80% using feature selection algorithm. The most dominant attributes selected at this stage based on the mel and bark frequency scales filterbanks with its variability described mainly by variance, median absolute deviation and standard and average deviations. Finally, the classification accuracy using tuned SVM classifier was equal to 72.5% and 88.27% for emotional spoken sentences in Polish and German languages, respectively.
Go to article

Authors and Affiliations

Lukasz Smietanka
1
Tomasz Maka
1

  1. Faculty of Computer Science and Information Technology, West Pomeranian University of Technology, Szczecin, Poland
Download PDF Download RIS Download Bibtex

Abstract

Non-invasive techniques for the assessment of respiratory disorders have gained increased importance in recent years due to the complexity of conventional methods. In the assessment of respiratory disorders, machine learning may play a very essential role. Respiratory disorders lead to variation in the production of speech as both go hand in hand. Thus, speech analysis can be a useful means for the pre-diagnosis of respiratory disorders. This article aims to develop a machine learning approach to differentiate healthy speech from speech corresponding to different respiratory disorders (affected). Thus, in the present work, a set of 15 relevant and efficient features were extracted from acquired data, and classification was done using different classifiers for healthy and affected speech. To assess the performance of different classifiers, accuracy, specificity (Sp), sensitivity (Se), and area under the receiver operating characteristic curve (AUC) was used by applying both multi-fold cross-validation methods (5-fold and 10-fold) and the holdout method. Out of the studied classifiers, decision tree, support vector machine (SVM), and k-nearest neighbor (KNN) were found more appropriate in providing correct assessment clinically while considering 15 features as well as three significant features (Se > 89%, Sp > 89%, AUC> 82%, and accuracy > 99%). The conclusion was that the proposed classifiers may provide an aid in the simple assessment of respiratory disorders utilising speech parameters with high efficiency. In the future, the proposed approach can be evaluated for the detection of specific respiratory disorders such as asthma, COPD, etc.
Go to article

Authors and Affiliations

Poonam Shrivastava
1
Neeta Tripathi
1
Bikesh Kumar Singh
2
Bhupesh Kumar Dewangan
3

  1. Department of Electronics and Telecommunication, SSTC Bhilai, India
  2. Department of Biomedical Engineering, National Institute of Technology, Raipur, India
  3. Department of Computer Science and Engineering, School of Engineering, OP Jindal University, Raigarh, India
Download PDF Download RIS Download Bibtex

Abstract

The goal of this article is to present and compare recent approaches which use speech and voice analysis as biomarkers for screening tests and monitoring of some diseases. The article takes into account metabolic, respiratory, cardiovascular, endocrine, and nervous system disorders. A selection of articles was performed to identify studies that assess voice features quantitatively in selected disorders by acoustic and linguistic voice analysis. Information was extracted from each paper in order to compare various aspects of datasets, speech parameters, methods of applied analysis and obtained results. 110 research papers were reviewed and 47 databases were summarized. Speech analysis is a promising method for early diagnosis of certain disorders. Advanced computer voice analysis with machine learning algorithms combined with the widespread availability of smartphones allows diagnostic analysis to be conducted during the patient’s visit to the doctor or at the patient’s home during a telephone conversation. Speech analysis is a simple, low-cost, non-invasive and easy-toprovide method of medical diagnosis. These are remarkable advantages, but there are also disadvantages. The effectiveness of disease diagnoses varies from 65% up to 99%. For that reason it should be treated as a medical screening test and should be an indication of the need for classic medical tests.
Go to article

Authors and Affiliations

Magdalena Igras-Cybulska
1 2
ORCID: ORCID
Daria Hemmerling
1 2
Mariusz Ziółko
1
Wojciech Datka
3 4
Ewa Stogowska
3
Michał Kucharski
1
Rafał Rzepka
5
Bartosz Ziółko
1 5

  1. Techmo sp. z o.o., Kraków, Poland
  2. AGH University of Science and Technology, Kraków, Poland
  3. Medical University of Bialystok, Białystok, Poland
  4. Faculty of Medicine, Jagiellonian University, Kraków, Poland
  5. Hokkaido University Kita Ward, Sapporo, Hokkaido, Japan

This page uses 'cookies'. Learn more