Search results

Filters

  • Journals
  • Authors
  • Keywords
  • Date
  • Type

Search results

Number of results: 7
items per page: 25 50 75
Sort by:
Download PDF Download RIS Download Bibtex

Abstract

Filter bank multicarrier waveform is investigated as a potential waveform for visible light communication broadcasting systems. Imaginary inter-carrier and/or inter-symbol interference are causing substantial performance degradation in the filter bank multicarrier system. Direct current-biased optical filter bank multicarrier modulation overcomes all the problems of direct current-biased optical-orthogonal frequency division multiplexing modulation approaches in terms of speed and bandwidth. However, it also wastes a lot of energy while transforming a true bipolar signal into a positive unipolar signal by adding direct current-bias. In this paper, a flip-filter bank multicarrier-based visible light communication system was introduced to overcome this problem. In this system, a bipolar signal is converted to a unipolar signal by isolating the positive and negative parts, turning them to positive and then delivering the signal. Also, a new channel estimation scheme for a flip-filter bank multicarrier system is proposed which improves the channel estimation performance compared to that of each of the conventional schemes. The proposed system performance is measured in terms of bit error rate, normalized mean squared error, and constellation diagram. The superiority of the proposed scheme over other conventional structures has been successfully verified by MATLAB 2020b simulation experiments results. These results are evaluated under indoor visible light communication standard.
Go to article

Bibliography

  1. Kumar, S. & Singh, P. Filter bank multicarrier modulation schemes for visible light communication. Pers. Commun. 113, 2709–2722 (2020). https://doi.org/10.1007/s11277-020-07347-6
  2. Wang, J. Y. et al. Performance analysis and improvement for secure vlc with slipt and random terminals. IEEE Access 8, 73645–73658 (2020). https://doi.org/10.1109/ACCESS.2020.2988470
  3. Chen, R. et al. Visible light communication using DC-biased optical filter bank multi-carrier modulation. in 2018 Global LIFI Congress (GLC) 1–6 (2018). https://doi.org/10.23919/GLC.2018.8319094
  4. Al Hammadi, A., Sofotasios, P. C., Muhaidat, S., Al-Qutayri, M. & Elgala, H. Non-orthogonal multiple access for hybrid VLC-RF networks with imperfect channel state information. IEEE Trans. Veh. Technol. 70, 398–411 (2021). https://doi.org/10.1109/TVT.2020.3044837
  5. Tanaka, Y., Komine, T., Haruyama, S. & Nakagawa, M. Indoor visible communication utilizing plural white LEDs as lighting. in 12th IEEE International Symposium on Personal, Indoor and Mobile Radio Communications (PIMRC) F81–F85 (2001). https://doi.org/10.1109/PIMRC.2001.965300
  6. Mohammed, N. A., Elnabawy, M. M. & Khalaf, A. A. M. PAPR reduction using a combination between precoding andnon-linear companding techniques foraco-ofdm-based VLC systems. Opto-Electron. Rev. 29, 59–70 (2021). https://doi.org/10.24425/opelre.2021.135829
  7. Qasim, A. A., Abdullah, M. F. L. & Talib, R. Adaptive DCO-FBMC in visible light communication. in IOP Conferene: Material Science and Engineering 812018 (2020). https://doi.org/10.1088/1757-899X/767/1/012018
  8. Kumar, S. & Singh, P. Spectral efficient asymmetrically clipped hybrid FBMC for visible light communication. J. Opt. 2021, (2021). https://doi.org/10.1155/2021/8897928
  9. Abouldahab, M. A., Fouad, M. M. & Roshdy, R. A. A proposed preamble based channel estimation method for FBMC in 5G wireless channels. in 35th IEEE National Radio Science Confernce (NRSC) 140–148 (2018). https://doi.org/10.1109/NRSC.2018.8354382
  10. Roshdy, R. A., Aboul-Dahab, M. A. & Fouad, M. M. A modified interference approximation scheme for improving preamble based channel estimation performance in FBMC system. J. Comput. Networks Commun. 12, 19–35 (2020). https://doi.org/10.5121/ijcnc.2020.12102
  11. Sun, J. et al. Channel estimation approach with low pilot overhead in FBMC/OQAM Systems. Commun. Mob. Comput. 2021, 5533399 (2021). https://doi.org/10.1155/2021/5533399
  12. Liu, W., Schwarz, S., Rupp, M. & Jiang, T. Pairs of pilots design for preamble-based channel estimation in OQAM/FBMC systems. IEEE Wirel. Commun. Lett. 10, 488–492 (2021). https://doi.org/10.1109/lwc.2020.3035388
  13. El-Ganiny, M. Y., Klialaf, A. A. M., Hussein, A. I. & Hamed, H. F. A. A preamble based channel estimation methods for FBMC waveform: A comparative study. Procedia Comput. Sci. 182, 63–70 (2020). https://doi.org/10.1016/j.procs.2021.02.009
  14. Kong, D. et al. Preamble-based MMSE channel estimation with low pilot overhead in MIMO-FBMC systems. IEEE Access 8, 148926–148934 (2020). https://doi.org/10.1109/ACCESS.2020.3015809
  15. Hu, S. et al. Training sequence design for efficient channel estimation in MIMO-FBMC systems. IEEE Access 5, 4747–4758 (2017). https://doi.org/10.1109/ACCESS.2017.2688399
  16. Wang, H. Sparse channel estimation for MIMO-FBMC/OQAM wireless communications in smart city applications. IEEE Access 6, 60666–60672 (2018). https://doi.org/10.1109/ACCESS.2018.2875245
  17. Lélé, C., Javaudin, J. P., Legouable, R., Skrzypczak, A. & Siohan, P. Channel estimation methods for preamble-based OFDM/OQAM modulations. Trans. Telecommun. 19, 741–750 (2008). https://doi.org/10.1002/ett.1332
  18. Du, J. & Signell, S. Novel preamble-based channel estimation for OFDM / OQAM systems. in 2009 IEEE International Conference on Communications 1–6 (2009). https://doi.org/10.1109/ICC.2009.5199226
  19. Kofidis, E. & Katselis, D. Improved interference approximation method for preamble-based channel estimation in FBMC/OQAM. in European Signal Processing Conference 1603–1607 (2011). https://doi.org/10.5281/zenodo.42712
  20. Wang, H., Du, W. & Xu, L. Novel preamble design for channel estimation in FBMC/OQAM systems. KSII Trans. Internet Inf. Syst. 10, 3672–3688 (2016). https://doi.org/10.3837/tiis.2016.08.014
  21. Kashani, M. A. & Kavehrad, M. On the performance of single- and multi-carrie modulation schemes for indoor visible light communication systems. in 2014 IEEE Global Communications Conference (GLOBECOM) 2084–2089 (2014). https://doi.org/10.1109/GLOCOM.2014.7037115
  22. Rehman, S. U., Ullah, S., Chong, P. H. J., Yongchareon, S. & Komosny, D. Visible light communication: A system perspective—Overview and challenges. Sensors 19, 1153 (2019). https://doi.org/10.3390/s19051153
  23. Al-Ahmadi, S., Maraqa, O., Uysal, M. & Sait, S. M. Multi-user visible light communications: State-of-the-art and future directions. IEEE Access 6, 70555–70571 (2018). https://doi.org/10.1109/ACCESS.2018.2879885
  24. Shalaby, E. M., Dessouky, M. & Hussin, S. Performance evaluation of UFMC-based VLC systems using a modified SLM technique. Opto-Electron. Rev. 29, 85–90 (2021). https://doi.org/10.24425/opelre.2021.135832
  25. Hussin, S. & Shalaby, E. M. Performance analysis of DFT-S-OFDM waveform for Li-Fi systems. Opto-Electron. Rev. 29, 167–174 (2021). https://doi.org/10.24425/opelre.2021.139753
  26. Qasim, A. A., Mohammedali, H. N., Abdullah, M. F. L., Talib, R. & Dhaam, H. Z. Enhanced Flip-FBMC visible light communication model. J. Electr. Eng. Comput. Sci. 23, 1783–1793 (2021). https://doi.org/10.11591/ijeecs.v23.i3.pp1783-1793
  27. Elgala, H., Mesleh, R., Haas, H. & Pricope, B. OFDM visible light wireless communication based on white LEDs. in IEEE Vehicular Technology Conference (VTC) 2185–2189 (2007) https://doi.org/10.1109/VETECS.2007.451
  28. Yesilkaya, A., Karatalay, O., Ogrenci, A. S. & Panayirci, E. Channel estimation for visible light communications using neural networks. in International Joint Conference on Neural Networks (IJCNN) 320–325 (2016). https://doi.org/10.1109/IJCNN.2016.7727215
Go to article

Authors and Affiliations

Mohamed Y. El-Ganiny
1
Ashraf A. M. Khalaf
2
ORCID: ORCID
Aziza I. Hussein
3
ORCID: ORCID
Hesham F. A. Hamed
4

  1. Department of Electrical Engineering, Higher Technological Institute, 10th of Ramadan City, Sharqia, Egypt
  2. Department of Electrical Engineering, Faculty of Engineering, Minia University, Minia, Egypt
  3. Electrical and Computer Engineering Department, Effat University, Jeddah, Kingdom of Saudi Arabia
  4. Department of Telecommunications Engineering, Egyptian Russian University, Badr City, Egypt
Download PDF Download RIS Download Bibtex

Abstract

This work aims to further compensate for the weaknesses of feature sparsity and insufficient discriminative acoustic features in existing short-duration speaker recognition. To address this issue, we propose the Bark-scaled Gauss and the linear filter bank superposition cepstral coefficients (BGLCC), and the multidimensional central difference (MDCD) acoustic feature extracted method. The Bark-scaled Gauss filter bank focuses on low-frequency information, while linear filtering is uniformly distributed, therefore, the filter superposition can obtain more discriminative and richer acoustic features of short-duration audio signals. In addition, the multi-dimensional central difference method captures better dynamics features of speakers for improving the performance of short utterance speaker verification. Extensive experiments are conducted on short-duration text-independent speaker verification datasets generated from the VoxCeleb, SITW, and NIST SRE corpora, respectively, which contain speech samples of diverse lengths, and different scenarios. The results demonstrate that the proposed method outperforms the existing acoustic feature extraction approach by at least 10% in the test set. The ablation experiments further illustrate that our proposed approaches can achieve substantial improvement over prior methods.
Go to article

Authors and Affiliations

Yunfei Zi
1
Shengwu Xiong
1

  1. School of Computer and Artificial Intelligence, Wuhan University of Technology Wuhan, China
Download PDF Download RIS Download Bibtex

Abstract

The main objective of this paper is to produce an applications-oriented review covering infrared techniques and devices. At the beginning infrared systems fundamentals are presented with emphasis on thermal emission, scene radiation and contrast, cooling techniques, and optics. Special attention is focused on night vision and thermal imaging concepts. Next section concentrates shortly on selected infrared systems and is arranged in order to increase complexity; from image intensifier systems, thermal imaging systems, to space-based systems. In this section are also described active and passive smart weapon seekers. Finally, other important infrared techniques and devices are shortly described, among them being: non-contact thermometers, radiometers, LIDAR, and infrared gas sensors.

Go to article

Authors and Affiliations

A. Rogalski
K. Chrzanowski
Download PDF Download RIS Download Bibtex

Abstract

A traditional frequency analysis is not appropriate for observation of properties of non-stationary signals. This stems from the fact that the time resolution is not defined in the Fourier spectrum. Thus, there is a need for methods implementing joint time-frequency analysis (t/f) algorithms. Practical aspects of some representative methods of time-frequency analysis, including Short Time Fourier Transform, Gabor Transform, Wigner-Ville Transform and Cone-Shaped Transform are described in this paper. Unfortunately, there is no correlation between the width of the time-frequency window and its frequency content in the t/f analysis. This property is not valid in the case of a wavelet transform. A wavelet is a wave-like oscillation, which forms its own “wavelet window”. Compression of the wavelet narrows the window, and vice versa. Individual wavelet functions are well localized in time and simultaneously in scale (the equivalent of frequency). The wavelet analysis owes its effectiveness to the pyramid algorithm described by Mallat, which enables fast decomposition of a signal into wavelet components.

Go to article

Authors and Affiliations

Andrzej Majkowski
Marcin Kołodziej
Remigiusz J. Rak
Download PDF Download RIS Download Bibtex

Abstract

Visible light communication based on a filter bank multicarrier holds enormous promise for optical wireless communication systems, due to its high-speed and unlicensed spectrum. Moreover, visible light communication techniques greatly impact communication links for small satellites like cube satellites, and pico/nano satellites, in addition to inter-satellite communications between different satellite types in different orbits. However, the transmitted visible signal via the filter bank multicarrier has a high amount of peak-to-average power ratio, which results in severe distortion for a light emitting diode output. In this work, a scheme for enhancing the peak-to-average power ratio reduction amount is proposed. First, an algorithm based on generating two candidates signals with different peak-to-average power ratio is suggested. The signal with the lowest ratio is selected and transmitted. Second, an alternate direct current-biased approach, which is referred to as the addition reversed method, is put forth to transform transmitted signal bipolar values into actual unipolar ones. The performance is assessed through a cumulative distribution function of peak-to-average power ratio, bit error rate, power spectral density, and computational complexity. The simulation results show that, compared to other schemes in literature, the proposed scheme attains a great peak-to-average power ratio reduction and improves the bit the error rate performance with minimum complexity overhead. The proposed approach achieved about 5 dB reduction amount compared to companding technique, 5.5 dB compared to discrete cosine transform precoding, and 8 dB compared to conventional direct current bias of an optical filter bank multicarrier. Thus, the proposed scheme reduces the complexity overhead by 15.7% and 55.55% over discrete cosine transform and companding techniques, respectively.
Go to article

Authors and Affiliations

Radwa A. Roshdy
1
ORCID: ORCID
Aziza I. Hussein
2
ORCID: ORCID
Mohamed M. Mabrook
3 4
ORCID: ORCID
Mohammed A. Salem
ORCID: ORCID

  1. Department of Electrical Engineering, Higher Technological Institute, 10th of Ramadan City, Egypt
  2. Electrical & Computer Eng. Dept., Effat University, Jeddah, Saudi Arabia
  3. Space Communication Dept., Faculty of Navigation Science & Space Technology, Beni-Suef University, Beni-Suef, Egypt
  4. Department of Communication and Computer Engineering, Faculty of Engineering, Nahda University in Beni-Suef, Egypt
Download PDF Download RIS Download Bibtex

Abstract

The design of a low complexity multiplier-less narrow transition band filter bank for the channelizer of multistandard software-defined radio (SDR) is investigated in this paper. To accomplish this, the modal filter and complementary filter in the upper and lower branches of the conventional Frequency Response Masking (FRM) architecture are replaced with two power-complementary and linear phase filter banks. Secondly, a new masking strategy is proposed to fully exploit the potential of the numerous spectra replicas produced by the interpolation of the modal filter, which was previously ignored in the existing FRM design. In this scheme, the two masking filters are appropriately modulated and alternately masked over the spectra replicas from 0 to 2π, to generate even and odd channels. This Alternate Masking Scheme (AMS) increases the potency of the Modified FRM (ModFRM) architecture for the design of computationally efficient narrow transition band uniform filter bank (termed as ModFRM-FB). Finally, by combining the adjoining ModFRM-FB channels, Non-Uniform ModFRMFB (NUModFRM-FB) for extracting different communication standards in the SDR channelizer is created. To reduce the total power consumption of the architecture, the coefficients of the proposed system are made multiplier-less using Matching Pursuits Generalized Bit-Planes (MPGBP) algorithm. In this method, filter coefficients are successively approximated using a dictionary of vectors to give a sum-of-power-of-two (SOPOT) representation. In comparison to all other general optimization techniques, such as genetic algorithms, the suggested design method stands out for its ease of implementation, requiring no sophisticated optimization or exhaustive search schemes. Another notable feature of the suggested approach is that, in comparison to existing methods, the design time for approximation has been greatly reduced. To further bring down the complexity, adders are reused in recurrent SOPOT terms using the Common Subexpression Elimination (CSE) technique without compromising the filter performance.
Go to article

Authors and Affiliations

A.K. Parvathi
1
V. Sakthivel
1

  1. National Institute of Technology, Calicut, India
Download PDF Download RIS Download Bibtex

Abstract

Parallel realizations of discrete transforms (DTs) computation algorithms (DTCAs) performed on graphics processing units (GPUs) play a significant role in many modern data processing methods utilized in numerous areas of human activity. In this paper the authors propose a novel execution time prediction model, which allows for accurate and rapid estimation of execution times of various kinds of structurally different DTCAs performed on GPUs of distinct architectures, without the necessity of conducting the actual experiments on physical hardware. The model can serve as a guide for the system analyst in making the optimal choice of the GPU hardware solution for a given computational task involving particular DT calculation, or can help in choosing the best appropriate parallel implementation of the selected DT, given the limitations imposed by available hardware. Restricting the model to exhaustively adhere only to the key common features of DTCAs enables the authors to significantly simplify its structure, leading consequently to its design as a hybrid, analytically–simulational method, exploiting jointly the main advantages of both of the mentioned techniques, namely: time-effectiveness and high prediction accuracy, while, at the same time, causing mutual elimination of the major weaknesses of both of the specified approaches within the proposed solution. The model is validated experimentally on two structurally different parallel methods of discrete wavelet transform (DWT) computation, i.e. the direct convolutionbased and lattice structure-based schemes, by comparing its prediction results with the actual measurements taken for 6 different graphics cards, representing a fairly broad spectrum of GPUs compute architectures. Experimental results reveal the overall average execution time and prediction accuracy of the model to be at a level of 97.2%, with global maximum prediction error of 14.5%, recorded throughout all the conducted experiments, maintaining at the same time high average evaluation speed of 3.5 ms for single simulation duration. The results facilitate inferring the model generality and possibility of extrapolation to other DTCAs and different GPU architectures, which along with the proposed model straightforwardness, time-effectiveness and ease of practical application, makes it, in the authors’ opinion, a very interesting alternative to the related existing solutions.
Go to article

Bibliography

  1.  U.N. Ahmed and K.R. Rao, Orthogonal Transforms for Digital Signal Process. Secaucus, NJ, USA: Springer-Verlag, New York, Inc., 1974.
  2.  Y. Su and Z. Xu, “Parallel implementation of wavelet-based image denoising on programmable pc-grade graphics hardware,” Signal Process., vol. 90, pp. 2396–2411, 2010, doi: 10.1016/j.sigpro.2009.06.019.
  3.  P. Lipinski and D. Puchala, “Digital image watermarking using fast parametric transforms,” Bull. Pol. Acad. Sci. Tech. Sci., vol. 67, pp. 463–477, 2019.
  4.  K.R. Rao and P. Yip, Discrete cosine transform: algorithms, advantages, applications. San Diego, CA, USA: Academic Press Professional, Inc., 1990.
  5.  D. Salomon, A Guide to Data Compression Methods. New York: Springer-Verlag
  6. D. Puchala and M. Yatsymirskyy, “Joint compression and encryption of visual data using orthogonal parametric transforms,” Bull. Pol. Acad. Sci. Tech. Sci., vol. 64, pp. 373–382, 2016.
  7.  M. Akay, Time Frequency and Wavelets in Biomedical Signal Process., ser. IEEE Press Series in Biomed. Eng. Wiley-IEEE Press, 1998.
  8.  S. Babichev, J. Skvor, J. Fiser, and V. Lytvynenko, “Technology of gene expression profiles filtering based on wavelet analysis,” Int. J. Intell. Syst. Appl., vol. 10, pp. 1–7, 2018.
  9.  Z. Jakovljevic, R. Puzovic, and M. Pajic, “Recognition of planar segments in point cloud based on wavelet transform,” IEEE Trans. Ind. Inf., vol. 11, no. 2, pp. 342–352, 2015.
  10.  J. Cheng, M. Grossman, and T. McKercher, Professional CUDA C Programming. Indianapolis, IN 46256: John Wiley & Sons, Inc., 2014.
  11.  J. Sanders and E. Kandrot, CUDA by Example: An Introduction to General-Purpose GPU Programming. Addison-Wesley Professional, 2010.
  12.  G. Barlas, Multicore and GPU Programming: An Integrated Approach. Morgan Kaufmann Publishers, 2015.
  13.  K. Stokfiszewski and K. Wieloch, “ Time effectiveness optimization of cross correlation methods for OCR systems with the use of graphics processing units,” J. Appl. Comput. Sci., vol. 23, no. 2, pp. 79–100, 2015.
  14.  A. Wojciechowski and T. Gałaj, “GPU supported dual quaternions based skinning,” in Computer Game Innovations. A. Wojciechowski, P. Napieralski (Eds.), Lodz University of Technology Press, 2016, pp. 5–23.
  15.  M. Wawrzonowski, D. Szajerman, M. Daszuta, and P. Napieralski, “Mobile devices’ GPUs in cloth dynamics simulation,” in Proceedings of the Federated Conference on Computer Science and Information Systems. M. Ganzha, L. Maciaszek, M. Paprzycki (Eds.), 2017, pp. 1283–1290.
  16.  D. Puchala, K. Stokfiszewski, B. Szczepaniak, and M. Yatsymirskyy, “Effectiveness of fast fourier transform implementations on GPU and CPU,” Przegla˛d Elektrotechniczny, vol. 92, no. 7, pp. 69–71, 2016.
  17.  K. Stokfiszewski, K. Wieloch, and M. Yatsymirskyy, “The fast Fourier transform partitioning scheme for GPU’s computation effectiveness improvement,” in Advances in Intelligent Systems and Computing II (CSIT), N. Shakhovska and V. Stepashko (Eds.), Springer, Cham, 2017, vol. 689, no. 1, pp. 511–522.
  18.  B.H.H. Juurlink and H.A.G. Wijshoff, “A quantitive comparison of parallel computation models,” ACM Trans. Comput. Syst., vol. 16, no. 3, pp. 271–318, 1988.
  19.  S.G. Akl, Parallel computation. Models and methods. Upple Saddle River, NJ: Prentice Hall, 1997.
  20.  A. Madougou, S. Varbanescu, C. Laat, and R. van Nieuwpoort, “The landscape of GPGPU performance modeling tools,” Parallel Comput., vol. 56, pp. 18–33, 2016.
  21.  H. Sunpyo and K. Hyesoon, “An analytical model for a GPU architecture with memory-level and thread-level parallelism awareness,” ACM SIGARCH Comput. Architect. News, vol. 37, pp. 152–163, 2009.
  22.  C. Luo and R. Suda, “An execution time prediction analytical model for GPU with instruction-level and thread-level parallelism awareness,” IPSJ SIG Tech. Rep., vol. 2011-HPC-130, no. 19, pp. 1–9, 2011.
  23.  M. Amaris, D. Cordeiro, A. Goldman, and R.Y. de Camargo, “A simple BSP-based model to predict execution time in GPU applications,” in Proc. IEEE 22nd International Conference on High Performance Computing (HiPC), 2015, pp. 285–294.
  24.  L. Ma, R.D. Chamberlain, and K. Agrawal, “Performance modeling for highly-threaded many-core GPUs,” in Proc. IEEE 25th International Conference on Application-Specific Systems, Arch’s and Processors, 2014, pp. 84–91.
  25.  K. Kothapalli, R. Mukherjee, M.S. Rehman, S. Patidar, P.J. Narayanan, and K. Srinathan, “A performance prediction model for the CUDA GPGPU platform,” in Proc. International Conference on High Performance Computing (HiPC), 2009, pp. 463–472.
  26.  M. Amaris, R.Y. de Camargo, M. Dyab, A. Goldman, and D. Trystram, “A comparison of GPU execution time prediction using machine learning and analytical modeling,” in Proc. 15th IEEE International Symposium on Network Computing and Applications (NCA), 2016, pp. 326–333.
  27.  A. Karami, S.A. Mirsoleimani, and F. Khunjush, “A statistical performance prediction model for OpenCL kernels on NVIDIA GPUs,” in Proc. 17th CSI Int. Symposium on Computer Architecture & Digital Systems (CADS), 2013, pp. 15–22.
  28.  A. Kerr, E. Anger, G. Hendry, and S. Yalamanchili, “Eiger: A framework for the automated synthesis of statistical performance models,” in Proc. 19th Int. Conference on High Performance Computing, 2012, pp. 1–6.
  29.  Y. Zhang, Y. Hu, B. Li, and L. Peng, “Performance and power analysis of ATI GPU: A statistical approach,” in Proc. 6th IEEE International Conference on Networking, Architecture, and Storage, 2011, pp. 149–158.
  30.  G. Wu, J.L. Greathouse, A. Lyashevsky, N. Jayasena, and D. Chiou, “GPGPU performance and power estimation using machine learning,” in Proc. 21st IEEE Int. Symposium on High Performance Computer Architecture (HPCA), 2015, pp. 564– 576.
  31.  E. Ipek, B. Supinski, M. Schulz, and S. McKee, “An approach to performance prediction for parallel applications,” in Proc. 11th International Euro-Par Conference on Parallel Processing, 2005, pp. 196–205.
  32.  N. Ardalani, C. Lestourgeon, K. Sankaralingam, and X. Zhu, “Cross architecture performance prediction (XAPP) using CPU code to predict GPU performance,” in Proc. 48th Annual IEEE/ ACM International Symposium on Microarchitecture (MICRO), 2015, pp. 725–737.
  33.  “GPGPU-Sim project.” [Online]. Available: http://www.gpgpu-sim.org.
  34.  A. Bakhoda, W.L. Fung, H. Wong, and G.L. Yuan, “Analyzing CUDA workloads using a detailed GPU simulator,” in Proc. ISPASS International Symposium on Performance Analysis of Systems and Software, 2009, pp. 163–174.
  35.  “GPUSimPow – AES LPGPU Group Power Simulation Project.” [Online]. Available: https://www.aes.tu-berlin.de/menue/forschung/projekte/ gpusimpow_simulator/.
  36.  Z. Yu, L. Eeckhout, N. Goswami, T. Li, L.K. John, H. Jin, C. Xu, and J. Wu, “Accelerating GPGPU micro-architecture simulation,” IEEE Trans. Comput., vol. 64, no. 11, pp. 3153–3166, 2015.
  37.  R. Ubal, B. Jang, P. Mistry, D. Schaa, and D. Kaeli, “Multi2Sim: a simulation framework for CPU-GPU computing,” in Proc. 21st International Conf. on Parallel Architectures and Compilation Techniques (PACT), 2012, pp. 335–344.
  38.  G. Malhotra, S. Goel, and S. Sarangi, “GpuTejas: a parallel simulator for GPU architectures,” in Proc. 21st International Conference on High Performance Computing, 2014, pp. 1–10.
  39.  Y. Arafa, A.A. Badawy, G. Chennupati, N. Santhi, and S. Eidenbenz, “PPT-GPU: Scalable GPU performance modeling,” IEEE Comput. Archit. Lett., vol. 18, no. 1, pp. 55–58, 2019.
  40.  X. Wang, K. Huang, A. Knoll, and X. Qian, “A hybrid framework for fast and accurate GPU performance estimation through source-level analysis and trace-based simulation,” in Proc. IEEE International Symposium on High Performance Computer Architecture (HPCA), 2019, pp. 506–518.
  41.  K. Punniyamurthy, B. Boroujerdian, and A. Gerstlauer, “GATSim: Abstract timing simulation of GPUs,” in Proc. Design, Automation & Test, Europe Conf. & Exhibition (DATE), 2017, pp. 43–48.
  42.  M. Khairy, Z. Shen, T.M. Aamodt, and T.G. Rogers, “AccelSim: An extensible simulation framework for validated GPU modeling,” in Proc. 47th IEEE/ACM Int. Symposium on Computer Architecture (ISCA), 2020, pp. 473–486.
  43.  S. Collange, M. Daumas, D. Defour, and D. Parello, “Barra: A parallel functional simulator for GPGPU,” in Proc. IEEE International Symposium on Modeling, Analysis and Simulation of Computer and Telecommunication Systems, 2010, pp. 351–360.
  44.  “GPU Ocelot project: a dynamic compilation framework for GPU computing.” [Online]. Available: http://www.gpuocelot.gatech.edu/
  45.  J. Power, J. Hestness, M.S. Orr, M.D. Hill, and D.A. Wood, “gem5-gpu: A heterogeneous CPU-GPU simulator,” IEEE Comput. Archit. Lett., vol. 14, no. 1, pp. 34–36, 2015.
  46.  “FusionSim GPU simulator project.” [Online]. Available: https://sites.google.com/site/fusionsimulator/
  47.  A. Nakonechny and Z. Veres, “The wavelet based trained filter for image interpolation,” in Proc. IEEE 1st International Conference on Data Stream Mining & Processing, 2016, pp. 218–221.
  48.  G. Strang and T. Nguyen, Wavelets and Filter Banks. Welleslay, UK: Welleslay-Cambridge Press, 1996.
  49.  P. Lipiński and J. Stolarek, “Improving watermark resistance against removal attacks using orthogonal wavelet adaptation,” in Proc. 38th Conference on Current Trends in Theory and Practice of Computer Science, vol. 7147, 2012, pp. 588–599.
  50.  D. Bařina, M. Kula, and P. Zemčík, “Parallel wavelet schemes for images,” J. Real-Time Image Process., vol. 16, no. 5, pp. 1365–1381, 2019.
  51.  D. Bařina, M. Kula, M. Matýšek, and P. Zemčík, “Accelerating discrete wavelet transforms on GPUs,” in Proc. International Conference on Image Processing (ICIP), 2017, pp. 2707– 2710.
  52.  D. Bařina, M. Kula, M. Matýšek, and P. Zemčík, “Accelerating discrete wavelet transforms on parallel architectures,” J. WSCG, vol. 25, no. 2, pp. 77–85, 2017.
  53.  W. van der Laan, A. Jalba, and J. Roerdink, “Accelerating wavelet lifting on graphics hardware using CUDA,” IEEE Trans. Parallel Distrib. Syst., vol. 22, no. 1, pp. 132–146, 2011.
  54.  M. Yatsymirskyy, “A novel matrix model of two channel biorthogonal filter banks,” Metody Informatyki Stosowanej, pp. 205–212, 2011.
  55.  M. Yatsymirskyy and K. Stokfiszewski, “Effectiveness of lattice factorization of two-channel orthogonal filter banks,” in Proc. Joint Conference NTAV/SPA, 2012, pp. 275–279.
  56.  M. Yatsymirskyy, “Lattice structures for synthesis and implementation of wavelet transforms,” J. Appl. Comput. Sci., vol. 17, no. 1, pp. 133–141, 2009.
  57.  J. Stolarek, “Adaptive synthesis of a wavelet transform using fast neural network,” Bull. Pol. Acad. Sci. Tech. Sci., vol. 59, pp. 9– 13, 2011.
  58.  D. Puchala, K. Stokfiszewski, K. Wieloch, and M. Yatsymirskyy, “Comparative study of massively parallel GPU realizations of wavelet transform computation with lattice structure and matrixbased approach,” in Proc. IEEE International Conference on Data Stream Mining & Processing, 2018, pp. 88–93.
  59.  M. Harris, S. Sengupta, and J.D. Owens, “Parallel prefix sum (scan) with CUDA,” in GPU Gems 3, Part VI: GPU Computing, H. Nguyen, Ed. Addison Wesley, 2007, pp. 851–876.
  60.  S. Sengupta, A.E. Lefohn, and J.D. Owens, “A work-efficient step-efficient prefix sum algorithm,” in Proc. Workshop on Edge Computing Using New Commodity Architectures, 2006, pp. D–26–27.
  61.  J. Franco, G. Bernabe, J. Fernandez, and M.E. Acacio, “A parallel implementation of the 2d wavelet transform using CUDA,” in Proc. 17th Euromicro International Conference on Parallel, Distributed and Network-based Processing, 2009, pp. 111–118.
  62.  H. Bantikyan, “CUDA based implementation of 2-D discrete Haar wavelet transformation,” in Proc. International Conference Parallel and Distributed Computing Systems, 2014, pp. 20–26.
  63.  M.J. Flynn and S.F. Oberman, Advanced Computer Arithmetic Design. New York, NY, USA: John Wiley & Sons, Inc., 2001.
  64.  Ł. Napierała, “Effectiveness measurements of linear transforms realized on graphics processing units with the use of GPGPUSim emulator” – MSc thesis, Institute of Information Technology, Łódz´ University of Technology, Poland, 2020.
Go to article

Authors and Affiliations

Dariusz Puchala
1
ORCID: ORCID
Kamil Stokfiszewski
1
ORCID: ORCID
Kamil Wieloch
1

  1. Institute of Information Technology, Łódź University of Technology, ul. Wólczańska 215, 90-924 Łódź, Poland

This page uses 'cookies'. Learn more