Search results

Filters

  • Journals
  • Authors
  • Keywords
  • Date
  • Type

Search results

Number of results: 12
items per page: 25 50 75
Sort by:
Download PDF Download RIS Download Bibtex

Abstract

Speech enhancement in strong noise condition is a challenging problem. Low-rank and sparse matrix decomposition (LSMD) theory has been applied to speech enhancement recently and good performance was obtained. Existing LSMD algorithms consider each frame as an individual observation. However, real-world speeches usually have a temporal structure, and their acoustic characteristics vary slowly as a function of time. In this paper, we propose a temporal continuity constrained low-rank sparse matrix decomposition (TCCLSMD) based speech enhancement method. In this method, speech separation is formulated as a TCCLSMD problem and temporal continuity constraints are imposed in the LSMD process. We develop an alternative optimisation algorithm for noisy spectrogram decomposition. By means of TCCLSMD, the recovery speech spectrogram is more consistent with the structure of the clean speech spectrogram, and it can lead to more stable and reasonable results than the existing LSMD algorithm. Experiments with various types of noises show the proposed algorithm can achieve a better performance than traditional speech enhancement algorithms, in terms of yielding less residual noise and lower speech distortion.

Go to article

Authors and Affiliations

Chengli Sun
Conglin Yuan
Download PDF Download RIS Download Bibtex

Abstract

Precise measurement of the sound source directivity not only requires special equipment, but also is time-consuming. Alternatively, one can reduce the number of measurement points and apply spatial interpolation to retrieve a high-resolution approximation of directivity function. This paper discusses the interpolation error for different algorithms with emphasis on the one based on spherical harmonics. The analysis is performed on raw directivity data for two loudspeaker systems. The directivity was measured using sampling schemes of different densities and point distributions (equiangular and equiareal). Then, the results were interpolated and compared with these obtained on the standard 5° regular grid. The application of the spherical harmonic approximation to sparse measurement data yields a mean error of less than 1 dB with the number of measurement points being reduced by 89%. The impact of the sparse grid type on the retrieval error is also discussed. The presented results facilitate optimal sampling grid choice for low-resolution directivity measurements.
Go to article

Authors and Affiliations

Adam Szwajcowski
1
Daniel Krause
2
Anna Snakowska
1

  1. Department of Robotics and Mechatronics, AGH University of Science and Technology, Kraków, Poland
  2. Faculty of Information Technology and Communication Sciences, Tampere University, Tampere, Finland
Download PDF Download RIS Download Bibtex

Abstract

Optimal estimation of water balance components at the local and regional scales is essential for many applications such as integrated water resources management, hydrogeological modelling and irrigation scheduling. Evapotranspiration is a very important component of the hydrological cycle at the soil surface, particularly in arid and semi-arid lands. Mapping evapotranspiration at high resolution with internalised calibration (METRIC), trapezoid interpolation model (TIM), two-source energy balance (TSEB), and soil-plant-atmosphere and remote sensing evapotranspiration (SPARSE) models were applied using Landsat 8 images for four dates during 2014–2015 and meteorological data. Surface energy maps were then generated. Latent heat flux estimated by four models was then compared and evaluated with those measured by applying the method of Bowen ratio for the various days. In warm periods with high water stress differences and with important surface temperature differences, METRIC proves to be the most robust with the root-mean-square error ( RMSE) less than 40 W∙m –2. However, during the periods with no significant surface temperature and soil humidity differences, SPARSE model is superior with the RMSE of 35 W∙m –2. The results of TIM are close to METRIC, since both models are sensitive to the difference in surface temperature. However, SPARSE remains reliable with the RMSE of 55 W∙m –2 unlike TSEB, which has a large deviation from the other models. On the other hand, during the days when the temperature difference is small, SPARSE and TSEB are superior, with a clear advantage of SPARSE serial version, where temperature differences are less important.
Go to article

Authors and Affiliations

Tewfik A. Oualid
1
ORCID: ORCID
Abderrahmane Hamimed
1
ORCID: ORCID
Abdelkader Khaldi
1
ORCID: ORCID

  1. University Mustapha Stambouli of Mascara, Laboratory of Biological Systems and Geomatics, P.O. Box 305, Route de Mamounia, 29000, Mascara, Algeria
Download PDF Download RIS Download Bibtex

Abstract

Single Image Super-Resolution (SISR) through sparse representation has received much attention in the past decade due to significant development in sparse coding algorithms. However, recovering high-frequency textures is a major bottleneck of existing SISR algorithms. Considering this, dictionary learning approaches are to be utilized to extract high-frequency textures which improve SISR performance significantly. In this paper, we have proposed the SISR algorithm through sparse representation which involves learning of Low Resolution (LR) and High Resolution (HR) dictionaries simultaneously from the training set. The idea of training coupled dictionaries preserves correlation between HR and LR patches to enhance the Super-resolved image. To demonstrate the effectiveness of the proposed algorithm, a visual comparison is made with popular SISR algorithms and also quantified through quality metrics. The proposed algorithm outperforms compared to existing SISR algorithms qualitatively and quantitatively as shown in experimental results. Furthermore, the performance of our algorithm is remarkable for a smaller training set which involves lesser computational complexity. Therefore, the proposed approach is proven to be superior based upon visual comparisons and quality metrics and have noticeable results at reduced computational complexity.

Go to article

Authors and Affiliations

Rutul Patel
Vishvjit Thakar
Rutvij Joshi
Download PDF Download RIS Download Bibtex

Abstract

Subspace-based methods have been effectively used to estimate enhanced speech from noisy speech samples. In the traditional subspace approaches, a critical step is splitting of two invariant subspaces associated with signal and noise via subspace decomposition, which is often performed by singular-value decomposition or eigenvalue decomposition. However, these decomposition algorithms are highly sensitive to the presence of large corruptions, resulting in a large amount of residual noise within enhanced speech in low signal-to-noise ratio (SNR) situations. In this paper, a joint low-rank and sparse matrix decomposition (JLSMD) based subspace method is proposed for speech enhancement. In the proposed method, we firstly structure the corrupted data as a Toeplitz matrix and estimate its effective rank value for the underlying clean speech matrix. Then the subspace decomposition is performed by means of JLSMD, where the decomposed low-rank part corresponds to enhanced speech and the sparse part corresponds to noise signal, respectively. An extensive set of experiments have been carried out for both of white Gaussian noise and real-world noise. Experimental results show that the proposed method performs better than conventional methods in many types of strong noise conditions, in terms of yielding less residual noise and lower speech distortion.
Go to article

Authors and Affiliations

Chengli Sun
Jianxiao Xie
Yan Leng
Download PDF Download RIS Download Bibtex

Abstract

Speech emotion recognition is deemed to be a meaningful and intractable issue among a number of do- mains comprising sentiment analysis, computer science, pedagogy, and so on. In this study, we investigate speech emotion recognition based on sparse partial least squares regression (SPLSR) approach in depth. We make use of the sparse partial least squares regression method to implement the feature selection and dimensionality reduction on the whole acquired speech emotion features. By the means of exploiting the SPLSR method, the component parts of those redundant and meaningless speech emotion features are lessened to zero while those serviceable and informative speech emotion features are maintained and selected to the following classification step. A number of tests on Berlin database reveal that the recogni- tion rate of the SPLSR method can reach up to 79.23% and is superior to other compared dimensionality reduction methods.
Go to article

Authors and Affiliations

Jingjie Yan
Xiaolan Wang
Weiyi Gu
LiLi Ma
Download PDF Download RIS Download Bibtex

Abstract

Optimal random network coding is reduced complexity in computation of coding coefficients, computation of encoded packets and coefficients are such that minimal transmission bandwidth is enough to transmit coding coefficient to the destinations and decoding process can be carried out as soon as encoded packets are started being received at the destination and decoding process has lower computational complexity. But in traditional random network coding, decoding process is possible only after receiving all encoded packets at receiving nodes. Optimal random network coding also reduces the cost of computation. In this research work, coding coefficient matrix size is determined by the size of layers which defines the number of symbols or packets being involved in coding process. Coding coefficient matrix elements are defined such that it has minimal operations of addition and multiplication during coding and decoding process reducing computational complexity by introducing sparseness in coding coefficients and partial decoding is also possible with the given coding coefficient matrix with systematic sparseness in coding coefficients resulting lower triangular coding coefficients matrix. For the optimal utility of computational resources, depending upon the computational resources unoccupied such as memory available resources budget tuned windowing size is used to define the size of the coefficient matrix.

Go to article

Authors and Affiliations

Dhawa Sang Dong
Yagnya Murti Pokhrel
Anand Gachhadar
Ram Krishna Maharjan
Faizan Qamar
Iraj Sadegh Amiri
Download PDF Download RIS Download Bibtex

Abstract

The zero attraction affine projection algorithm (ZA-APA) achieves better performance in terms of convergence rate and steady state error than standard APA when the system is sparse. It uses l1 norm penalty to exploit sparsity of the channel. The performance of ZA-APA depends on the value of zero attractor controller. Moreover a fixed attractor controller is not suitable for varying sparsity environment. This paper proposes an optimal adaptive zero attractor controller based on Mean Square Deviation (MSD) error to work in variable sparsity environment. Experiments were conducted to prove the suitability of the proposed algorithm for identification of unknown variable sparse system.

Go to article

Authors and Affiliations

S. Radhika
A. Chandrasekar
S. Nirmalraj
Download PDF Download RIS Download Bibtex

Abstract

The aim of the work is to present the method for designing sparse FIR filters with very low group delay and approximately linear-phase in the passband. Significant reduction of the group delay, e.g. several times in relation to the linear phase filter, may cause the occurrence of undesirable overshoot in the magnitude frequency response. The method proposed in this work consists of two stages. In the first stage, FIR filter with low group delay is designed using minimax constrained optimization that provides overshoot elimination. In the second stage, the same process is applied iteratively to reach sparse solution. Design examples demonstrate the effectiveness of the proposed method.
Go to article

Authors and Affiliations

Jacek Konopacki
1

  1. Faculty of Automatic Control, Electronics and Computer Sciences, Silesian University of Technology
Download PDF Download RIS Download Bibtex

Abstract

Advancement in medical technology creates some issues related to data transmission as well as storage. In real-time processing, it is too tedious to limit the flow of data as it may reduce the meaningful information too. So, an efficient technique is required to compress the data. This problem arises in Magnetic Resonance Imaging (MRI), Electrocardiogram (ECG), Electroencephalogram (EEG), and other medical signal processing domains. In this paper, we demonstrate Block Sparse Bayesian Learning (BSBL) based compressive sensing technique on an Electroencephalogram (EEG) signal. The efficiency of the algorithm is described using the Mean Square Error (MSE) and Structural Similarity Index Measure (SSIM) value. Apart from this analysis we also use different combinations of sensing matrices too, to demonstrate the effect of sensing matrices on MSE and SSIM value. And here we got that the exponential and chi-square random matrices as a sensing matrix are showing a significant change in the value of MSE and SSIM. So, in real-time body sensor networks, this scheme will contribute a significant reduction in power requirement due to its data compression ability as well as it will reduce the cost and the size of the device used for real-time monitoring.
Go to article

Bibliography

[1] Zou, Xiuming, Lei Feng, and Huaijiang Sun. "Compressive Sensing of Multichannel EEG Signals Based on Graph Fourier Transform and Cosparsity." Neural Processing Letters (2019): 1-10.
[2] Tayyib, Muhammad, Muhammad Amir, Umer Javed, M. Waseem Akram, Mussyab Yousufi, Ijaz M. Qureshi, Suheel Abdullah, and Hayat Ullah. "Accelerated sparsity-based reconstruction of compressively sensed multichannel EEG signals." PLoS One 15, no. 1 (2020): e0225397.
[3] Şenay, Seda, Luis F. Chaparro, Mingui Sun, and Robert J. Sclabassi. "Compressive sensing and random filtering of EEG signals using Slepian basis." In 2008 16th European Signal Processing Conference, pp. 1-5. IEEE, 2008.
[4] Gurve, Dharmendra, Denis Delisle-Rodriguez, Teodiano Bastos-Filho, and Sridhar Krishnan. "Trends in Compressive Sensing for EEG Signal Processing Applications." Sensors 20, no. 13 (2020): 3703.
[5] Amezquita-Sanchez, Juan P., Nadia Mammone, Francesco C. Morabito, Silvia Marino, and Hojjat Adeli. "A novel methodology for automated differential diagnosis of mild cognitive impairment and the Alzheimer’s disease using EEG signals." Journal of Neuroscience Methods 322 (2019): 88-95.
[6] R. DeVore, "Nonlinear approximation." Acta Numerica, 7, 51-150. (1998).
[7] A. Einstein, B. Podolsky, N. Rosen, 1935, “Can quantum-mechanical description of physical reality be considered complete?”, Phys. Rev. 47, 777-780.
[8] R. G. Baraniuk, "Compressive sensing, IEEE Signal Proc." Mag 24, no. 4 (2007): 118-120.
[9] Upadhyaya, Vivek, and Mohammad Salim. "Basis & Sensing Matrix as key effecting Parameters for Compressive Sensing." In 2018 International Conference on Advanced Computation and Telecommunication (ICACAT), pp. 1-6. IEEE, 2018.
[10] E. Candes, “Compressive sampling”, In Proc. Int. Congress of Math., Madrid, Spain, Aug. 2006.
[11] E. Candes, J. Romberg, “Quantitative robust uncertainty principles and optimally sparse decompositions”, Found. Compute. Math., 6(2): 227-254, 2006.
[12] E. Candes, J. Romberg, T. Tao, “Robust uncertainty principles: Exact signal reconstruction from highly incomplete frequency information”. IEEE Trans. Inform. Theory, 52(2):489-509, 2006.
[13] E. Candes, J. Romberg, and T. Tao. Stable signal recovery from incomplete and inaccurate measurements. Comm. Pure Appl. Math., 59(8): 1207-1223, 2006.
[14] E. Candes and T. Tao. Near-optimal signal recovery from random projections: Universal encoding strategies? IEEE Trans. Inform. Theory, 52(12): 5406-5425, 2006.
[15] D. Donoho. Compressed sensing. IEEE Trans. Inform. Theory, 52(4):1289-1306, 2006.
[16] S. Kirolos, J. Laska, M. Wakin, M. Duarte, D. Baron, T. Ragheb, Y. Massoud, and R.G. Baraniuk, “Analog-to-information conversion via random demodulation,” in Proc. IEEE Dallas Circuits Systems Workshop, Oct. 2006, pp. 71-74.
[17] Zhang, Zhilin, Tzyy-Ping Jung, Scott Makeig, Bhaskar D. Rao. "Compressed sensing for energy-efficient wireless telemonitoring of noninvasive fetal ECG via block sparse Bayesian learning." IEEE Transactions on Biomedical Engineering 60, no. 2 (2012): 300-309.
[18] https://sccn.ucsd.edu/eeglab/download.php.
[19] Joshi, Amit Mahesh, Vivek Upadhyaya. "Analysis of compressive sensing for non-stationary music signal." In 2016 International Conference on Advances in Computing, Communications, and Informatics (ICACCI), pp. 1172-1176. IEEE, 2016.
[20] Wang, Zhou, Alan C. Bovik, Hamid R. Sheikh, Eero P. Simoncelli. "Image quality assessment: from error visibility to structural similarity." IEEE transactions on image processing 13, no. 4 (2004): 600-612.
[21] Nibheriya, Khushboo, Vivek Upadhyaya, Ashok Kumar Kajla. "To Analysis the Effects of Compressive Sensing on Music Signal with variation in Basis & Sensing Matrix." In 2018 Second International Conference on Electronics, Communication and Aerospace Technology (ICECA), pp. 1121-1126. IEEE, 2018.
[22] Zhang, Zhilin, Tzyy-Ping Jung, Scott Makeig, and Bhaskar D. Rao. "Compressed sensing of EEG for wireless telemonitoring with low energy consumption and inexpensive hardware." IEEE Transactions on Biomedical Engineering 60, no. 1 (2012): 221-224.
Go to article

Authors and Affiliations

Vivek Upadhyaya
1
ORCID: ORCID
Mohammad Salim
1

  1. Malaviya National Institute of Technology, India
Download PDF Download RIS Download Bibtex

Abstract

Inter-turn short circuit (ITSC) is a frequent fault of interior permanent magnet synchronous motors (IPMSM). If ITSC faults are not promptly monitored, it may result in secondary faults or even cause extensive damage to the entire motor. To enhance the reliability of IPMSMs, this paper introduces a fault diagnosis method specifically designed for identifying ITSC faults in IPMSMs. The sparse coefficients of phase current and torque are solved by clustering shrinkage stage orthogonal matching tracking (CcStOMP) in the greedy tracking algorithm.The CcStOMP algorithm can extract multiple target atoms at one time, which greatly improves the iterative efficiency. The multiple features are utilized as input parameters for constructing the random forest classifier. The constructed random forest model is used to diagnose ITSC faults with the results showing that the random forest model has a diagnostic accuracy of 98.61% using all features, and the diagnostic accuracy of selecting three of the most important features is still as high as 97.91%. The random forest classification model has excellent robustness that maintains high classification accuracy despite the reduction of feature vectors, which is a great advantage compared to other classification algorithms. The combination of greedy tracing and the random forest is not only a fast diagnostic model but also a model with good generalisation and anti-interference capability. This non-invasive method is applicable to monitoring and detecting failures in industrial PMSMs.
Go to article

Authors and Affiliations

Jianping Wang
1
Jian Ma
1
ORCID: ORCID
Dean Meng
1
Xuan Zhao
1
Kai Zhang
1
Qiquan Liu
1
Kejie Xu
1

  1. School of Automobile, Chang’an University, Xi’an 710064, China

This page uses 'cookies'. Learn more