Evaluating the performance of FFT library implementations on modern hybrid computing systems
Abstract Fast Fourier transform is widely used to solve numerous scientific and engineering problems. In particular, this transform is behind the software dealing with speech and image recognition, signal analysis, modeling of properties of new materials and substances, etc. Newly emerging high-perf...
Ausführliche Beschreibung
Autor*in: |
Malkovsky, Sergey I. [verfasserIn] |
---|
Format: |
Artikel |
---|---|
Sprache: |
Englisch |
Erschienen: |
2021 |
---|
Schlagwörter: |
---|
Anmerkung: |
© The Author(s), under exclusive licence to Springer Science+Business Media, LLC part of Springer Nature 2021 |
---|
Übergeordnetes Werk: |
Enthalten in: The journal of supercomputing - Springer US, 1987, 77(2021), 8 vom: 20. Jan., Seite 8326-8354 |
---|---|
Übergeordnetes Werk: |
volume:77 ; year:2021 ; number:8 ; day:20 ; month:01 ; pages:8326-8354 |
Links: |
---|
DOI / URN: |
10.1007/s11227-020-03591-6 |
---|
Katalog-ID: |
OLC2126673537 |
---|
LEADER | 01000naa a22002652 4500 | ||
---|---|---|---|
001 | OLC2126673537 | ||
003 | DE-627 | ||
005 | 20230505120523.0 | ||
007 | tu | ||
008 | 230505s2021 xx ||||| 00| ||eng c | ||
024 | 7 | |a 10.1007/s11227-020-03591-6 |2 doi | |
035 | |a (DE-627)OLC2126673537 | ||
035 | |a (DE-He213)s11227-020-03591-6-p | ||
040 | |a DE-627 |b ger |c DE-627 |e rakwb | ||
041 | |a eng | ||
082 | 0 | 4 | |a 004 |a 620 |q VZ |
100 | 1 | |a Malkovsky, Sergey I. |e verfasserin |0 (orcid)0000-0001-9731-2606 |4 aut | |
245 | 1 | 0 | |a Evaluating the performance of FFT library implementations on modern hybrid computing systems |
264 | 1 | |c 2021 | |
336 | |a Text |b txt |2 rdacontent | ||
337 | |a ohne Hilfsmittel zu benutzen |b n |2 rdamedia | ||
338 | |a Band |b nc |2 rdacarrier | ||
500 | |a © The Author(s), under exclusive licence to Springer Science+Business Media, LLC part of Springer Nature 2021 | ||
520 | |a Abstract Fast Fourier transform is widely used to solve numerous scientific and engineering problems. In particular, this transform is behind the software dealing with speech and image recognition, signal analysis, modeling of properties of new materials and substances, etc. Newly emerging high-performance hybrid computing systems, as well as systems with alternative architectures, require research on discrete Fourier transform computation efficiency on these new platforms. The results of such research allow assessing the feasibility of certain solutions for building modern computing and data processing centers. This paper presents the results of such research covering modern hybrid computing systems based on the IBM POWER and Intel Xeon processors, as well as on NVIDIA Tesla co-processors. The analysis is carried out, and conclusions are presented on their performance when executing fast Fourier transforms. The impact of the existing architectural aspects of the hardware (CPU simultaneous multithreading mode, GPU data transfer bus, etc.) on the transform performance efficiency is assessed. The obtained results are used to provide recommendations on the optimal operation modes and settings of the considered mathematical libraries. | ||
650 | 4 | |a Hybrid computing systems | |
650 | 4 | |a IBM POWER | |
650 | 4 | |a Intel Xeon | |
650 | 4 | |a NVIDIA Tesla | |
650 | 4 | |a FFT | |
650 | 4 | |a IBM ESSL | |
650 | 4 | |a FFTW | |
650 | 4 | |a cuFFT | |
650 | 4 | |a cuFFTW | |
650 | 4 | |a Intel MKL | |
700 | 1 | |a Sorokin, Aleksei A. |0 (orcid)0000-0003-3334-3440 |4 aut | |
700 | 1 | |a Tsoy, Georgiy I. |0 (orcid)0000-0002-4209-1284 |4 aut | |
700 | 1 | |a Korolev, Sergey P. |0 (orcid)0000-0002-0002-7540 |4 aut | |
700 | 1 | |a Smagin, Sergey I. |4 aut | |
700 | 1 | |a Kondrashev, Vadim A. |0 (orcid)0000-0002-1224-1392 |4 aut | |
773 | 0 | 8 | |i Enthalten in |t The journal of supercomputing |d Springer US, 1987 |g 77(2021), 8 vom: 20. Jan., Seite 8326-8354 |w (DE-627)13046466X |w (DE-600)740510-8 |w (DE-576)018667775 |x 0920-8542 |7 nnns |
773 | 1 | 8 | |g volume:77 |g year:2021 |g number:8 |g day:20 |g month:01 |g pages:8326-8354 |
856 | 4 | 1 | |u https://doi.org/10.1007/s11227-020-03591-6 |z lizenzpflichtig |3 Volltext |
912 | |a GBV_USEFLAG_A | ||
912 | |a SYSFLAG_A | ||
912 | |a GBV_OLC | ||
912 | |a SSG-OLC-TEC | ||
912 | |a SSG-OLC-MAT | ||
951 | |a AR | ||
952 | |d 77 |j 2021 |e 8 |b 20 |c 01 |h 8326-8354 |
author_variant |
s i m si sim a a s aa aas g i t gi git s p k sp spk s i s si sis v a k va vak |
---|---|
matchkey_str |
article:09208542:2021----::vlaighpromnefflbaymlmnainomdrh |
hierarchy_sort_str |
2021 |
publishDate |
2021 |
allfields |
10.1007/s11227-020-03591-6 doi (DE-627)OLC2126673537 (DE-He213)s11227-020-03591-6-p DE-627 ger DE-627 rakwb eng 004 620 VZ Malkovsky, Sergey I. verfasserin (orcid)0000-0001-9731-2606 aut Evaluating the performance of FFT library implementations on modern hybrid computing systems 2021 Text txt rdacontent ohne Hilfsmittel zu benutzen n rdamedia Band nc rdacarrier © The Author(s), under exclusive licence to Springer Science+Business Media, LLC part of Springer Nature 2021 Abstract Fast Fourier transform is widely used to solve numerous scientific and engineering problems. In particular, this transform is behind the software dealing with speech and image recognition, signal analysis, modeling of properties of new materials and substances, etc. Newly emerging high-performance hybrid computing systems, as well as systems with alternative architectures, require research on discrete Fourier transform computation efficiency on these new platforms. The results of such research allow assessing the feasibility of certain solutions for building modern computing and data processing centers. This paper presents the results of such research covering modern hybrid computing systems based on the IBM POWER and Intel Xeon processors, as well as on NVIDIA Tesla co-processors. The analysis is carried out, and conclusions are presented on their performance when executing fast Fourier transforms. The impact of the existing architectural aspects of the hardware (CPU simultaneous multithreading mode, GPU data transfer bus, etc.) on the transform performance efficiency is assessed. The obtained results are used to provide recommendations on the optimal operation modes and settings of the considered mathematical libraries. Hybrid computing systems IBM POWER Intel Xeon NVIDIA Tesla FFT IBM ESSL FFTW cuFFT cuFFTW Intel MKL Sorokin, Aleksei A. (orcid)0000-0003-3334-3440 aut Tsoy, Georgiy I. (orcid)0000-0002-4209-1284 aut Korolev, Sergey P. (orcid)0000-0002-0002-7540 aut Smagin, Sergey I. aut Kondrashev, Vadim A. (orcid)0000-0002-1224-1392 aut Enthalten in The journal of supercomputing Springer US, 1987 77(2021), 8 vom: 20. Jan., Seite 8326-8354 (DE-627)13046466X (DE-600)740510-8 (DE-576)018667775 0920-8542 nnns volume:77 year:2021 number:8 day:20 month:01 pages:8326-8354 https://doi.org/10.1007/s11227-020-03591-6 lizenzpflichtig Volltext GBV_USEFLAG_A SYSFLAG_A GBV_OLC SSG-OLC-TEC SSG-OLC-MAT AR 77 2021 8 20 01 8326-8354 |
spelling |
10.1007/s11227-020-03591-6 doi (DE-627)OLC2126673537 (DE-He213)s11227-020-03591-6-p DE-627 ger DE-627 rakwb eng 004 620 VZ Malkovsky, Sergey I. verfasserin (orcid)0000-0001-9731-2606 aut Evaluating the performance of FFT library implementations on modern hybrid computing systems 2021 Text txt rdacontent ohne Hilfsmittel zu benutzen n rdamedia Band nc rdacarrier © The Author(s), under exclusive licence to Springer Science+Business Media, LLC part of Springer Nature 2021 Abstract Fast Fourier transform is widely used to solve numerous scientific and engineering problems. In particular, this transform is behind the software dealing with speech and image recognition, signal analysis, modeling of properties of new materials and substances, etc. Newly emerging high-performance hybrid computing systems, as well as systems with alternative architectures, require research on discrete Fourier transform computation efficiency on these new platforms. The results of such research allow assessing the feasibility of certain solutions for building modern computing and data processing centers. This paper presents the results of such research covering modern hybrid computing systems based on the IBM POWER and Intel Xeon processors, as well as on NVIDIA Tesla co-processors. The analysis is carried out, and conclusions are presented on their performance when executing fast Fourier transforms. The impact of the existing architectural aspects of the hardware (CPU simultaneous multithreading mode, GPU data transfer bus, etc.) on the transform performance efficiency is assessed. The obtained results are used to provide recommendations on the optimal operation modes and settings of the considered mathematical libraries. Hybrid computing systems IBM POWER Intel Xeon NVIDIA Tesla FFT IBM ESSL FFTW cuFFT cuFFTW Intel MKL Sorokin, Aleksei A. (orcid)0000-0003-3334-3440 aut Tsoy, Georgiy I. (orcid)0000-0002-4209-1284 aut Korolev, Sergey P. (orcid)0000-0002-0002-7540 aut Smagin, Sergey I. aut Kondrashev, Vadim A. (orcid)0000-0002-1224-1392 aut Enthalten in The journal of supercomputing Springer US, 1987 77(2021), 8 vom: 20. Jan., Seite 8326-8354 (DE-627)13046466X (DE-600)740510-8 (DE-576)018667775 0920-8542 nnns volume:77 year:2021 number:8 day:20 month:01 pages:8326-8354 https://doi.org/10.1007/s11227-020-03591-6 lizenzpflichtig Volltext GBV_USEFLAG_A SYSFLAG_A GBV_OLC SSG-OLC-TEC SSG-OLC-MAT AR 77 2021 8 20 01 8326-8354 |
allfields_unstemmed |
10.1007/s11227-020-03591-6 doi (DE-627)OLC2126673537 (DE-He213)s11227-020-03591-6-p DE-627 ger DE-627 rakwb eng 004 620 VZ Malkovsky, Sergey I. verfasserin (orcid)0000-0001-9731-2606 aut Evaluating the performance of FFT library implementations on modern hybrid computing systems 2021 Text txt rdacontent ohne Hilfsmittel zu benutzen n rdamedia Band nc rdacarrier © The Author(s), under exclusive licence to Springer Science+Business Media, LLC part of Springer Nature 2021 Abstract Fast Fourier transform is widely used to solve numerous scientific and engineering problems. In particular, this transform is behind the software dealing with speech and image recognition, signal analysis, modeling of properties of new materials and substances, etc. Newly emerging high-performance hybrid computing systems, as well as systems with alternative architectures, require research on discrete Fourier transform computation efficiency on these new platforms. The results of such research allow assessing the feasibility of certain solutions for building modern computing and data processing centers. This paper presents the results of such research covering modern hybrid computing systems based on the IBM POWER and Intel Xeon processors, as well as on NVIDIA Tesla co-processors. The analysis is carried out, and conclusions are presented on their performance when executing fast Fourier transforms. The impact of the existing architectural aspects of the hardware (CPU simultaneous multithreading mode, GPU data transfer bus, etc.) on the transform performance efficiency is assessed. The obtained results are used to provide recommendations on the optimal operation modes and settings of the considered mathematical libraries. Hybrid computing systems IBM POWER Intel Xeon NVIDIA Tesla FFT IBM ESSL FFTW cuFFT cuFFTW Intel MKL Sorokin, Aleksei A. (orcid)0000-0003-3334-3440 aut Tsoy, Georgiy I. (orcid)0000-0002-4209-1284 aut Korolev, Sergey P. (orcid)0000-0002-0002-7540 aut Smagin, Sergey I. aut Kondrashev, Vadim A. (orcid)0000-0002-1224-1392 aut Enthalten in The journal of supercomputing Springer US, 1987 77(2021), 8 vom: 20. Jan., Seite 8326-8354 (DE-627)13046466X (DE-600)740510-8 (DE-576)018667775 0920-8542 nnns volume:77 year:2021 number:8 day:20 month:01 pages:8326-8354 https://doi.org/10.1007/s11227-020-03591-6 lizenzpflichtig Volltext GBV_USEFLAG_A SYSFLAG_A GBV_OLC SSG-OLC-TEC SSG-OLC-MAT AR 77 2021 8 20 01 8326-8354 |
allfieldsGer |
10.1007/s11227-020-03591-6 doi (DE-627)OLC2126673537 (DE-He213)s11227-020-03591-6-p DE-627 ger DE-627 rakwb eng 004 620 VZ Malkovsky, Sergey I. verfasserin (orcid)0000-0001-9731-2606 aut Evaluating the performance of FFT library implementations on modern hybrid computing systems 2021 Text txt rdacontent ohne Hilfsmittel zu benutzen n rdamedia Band nc rdacarrier © The Author(s), under exclusive licence to Springer Science+Business Media, LLC part of Springer Nature 2021 Abstract Fast Fourier transform is widely used to solve numerous scientific and engineering problems. In particular, this transform is behind the software dealing with speech and image recognition, signal analysis, modeling of properties of new materials and substances, etc. Newly emerging high-performance hybrid computing systems, as well as systems with alternative architectures, require research on discrete Fourier transform computation efficiency on these new platforms. The results of such research allow assessing the feasibility of certain solutions for building modern computing and data processing centers. This paper presents the results of such research covering modern hybrid computing systems based on the IBM POWER and Intel Xeon processors, as well as on NVIDIA Tesla co-processors. The analysis is carried out, and conclusions are presented on their performance when executing fast Fourier transforms. The impact of the existing architectural aspects of the hardware (CPU simultaneous multithreading mode, GPU data transfer bus, etc.) on the transform performance efficiency is assessed. The obtained results are used to provide recommendations on the optimal operation modes and settings of the considered mathematical libraries. Hybrid computing systems IBM POWER Intel Xeon NVIDIA Tesla FFT IBM ESSL FFTW cuFFT cuFFTW Intel MKL Sorokin, Aleksei A. (orcid)0000-0003-3334-3440 aut Tsoy, Georgiy I. (orcid)0000-0002-4209-1284 aut Korolev, Sergey P. (orcid)0000-0002-0002-7540 aut Smagin, Sergey I. aut Kondrashev, Vadim A. (orcid)0000-0002-1224-1392 aut Enthalten in The journal of supercomputing Springer US, 1987 77(2021), 8 vom: 20. Jan., Seite 8326-8354 (DE-627)13046466X (DE-600)740510-8 (DE-576)018667775 0920-8542 nnns volume:77 year:2021 number:8 day:20 month:01 pages:8326-8354 https://doi.org/10.1007/s11227-020-03591-6 lizenzpflichtig Volltext GBV_USEFLAG_A SYSFLAG_A GBV_OLC SSG-OLC-TEC SSG-OLC-MAT AR 77 2021 8 20 01 8326-8354 |
allfieldsSound |
10.1007/s11227-020-03591-6 doi (DE-627)OLC2126673537 (DE-He213)s11227-020-03591-6-p DE-627 ger DE-627 rakwb eng 004 620 VZ Malkovsky, Sergey I. verfasserin (orcid)0000-0001-9731-2606 aut Evaluating the performance of FFT library implementations on modern hybrid computing systems 2021 Text txt rdacontent ohne Hilfsmittel zu benutzen n rdamedia Band nc rdacarrier © The Author(s), under exclusive licence to Springer Science+Business Media, LLC part of Springer Nature 2021 Abstract Fast Fourier transform is widely used to solve numerous scientific and engineering problems. In particular, this transform is behind the software dealing with speech and image recognition, signal analysis, modeling of properties of new materials and substances, etc. Newly emerging high-performance hybrid computing systems, as well as systems with alternative architectures, require research on discrete Fourier transform computation efficiency on these new platforms. The results of such research allow assessing the feasibility of certain solutions for building modern computing and data processing centers. This paper presents the results of such research covering modern hybrid computing systems based on the IBM POWER and Intel Xeon processors, as well as on NVIDIA Tesla co-processors. The analysis is carried out, and conclusions are presented on their performance when executing fast Fourier transforms. The impact of the existing architectural aspects of the hardware (CPU simultaneous multithreading mode, GPU data transfer bus, etc.) on the transform performance efficiency is assessed. The obtained results are used to provide recommendations on the optimal operation modes and settings of the considered mathematical libraries. Hybrid computing systems IBM POWER Intel Xeon NVIDIA Tesla FFT IBM ESSL FFTW cuFFT cuFFTW Intel MKL Sorokin, Aleksei A. (orcid)0000-0003-3334-3440 aut Tsoy, Georgiy I. (orcid)0000-0002-4209-1284 aut Korolev, Sergey P. (orcid)0000-0002-0002-7540 aut Smagin, Sergey I. aut Kondrashev, Vadim A. (orcid)0000-0002-1224-1392 aut Enthalten in The journal of supercomputing Springer US, 1987 77(2021), 8 vom: 20. Jan., Seite 8326-8354 (DE-627)13046466X (DE-600)740510-8 (DE-576)018667775 0920-8542 nnns volume:77 year:2021 number:8 day:20 month:01 pages:8326-8354 https://doi.org/10.1007/s11227-020-03591-6 lizenzpflichtig Volltext GBV_USEFLAG_A SYSFLAG_A GBV_OLC SSG-OLC-TEC SSG-OLC-MAT AR 77 2021 8 20 01 8326-8354 |
language |
English |
source |
Enthalten in The journal of supercomputing 77(2021), 8 vom: 20. Jan., Seite 8326-8354 volume:77 year:2021 number:8 day:20 month:01 pages:8326-8354 |
sourceStr |
Enthalten in The journal of supercomputing 77(2021), 8 vom: 20. Jan., Seite 8326-8354 volume:77 year:2021 number:8 day:20 month:01 pages:8326-8354 |
format_phy_str_mv |
Article |
institution |
findex.gbv.de |
topic_facet |
Hybrid computing systems IBM POWER Intel Xeon NVIDIA Tesla FFT IBM ESSL FFTW cuFFT cuFFTW Intel MKL |
dewey-raw |
004 |
isfreeaccess_bool |
false |
container_title |
The journal of supercomputing |
authorswithroles_txt_mv |
Malkovsky, Sergey I. @@aut@@ Sorokin, Aleksei A. @@aut@@ Tsoy, Georgiy I. @@aut@@ Korolev, Sergey P. @@aut@@ Smagin, Sergey I. @@aut@@ Kondrashev, Vadim A. @@aut@@ |
publishDateDaySort_date |
2021-01-20T00:00:00Z |
hierarchy_top_id |
13046466X |
dewey-sort |
14 |
id |
OLC2126673537 |
language_de |
englisch |
fullrecord |
<?xml version="1.0" encoding="UTF-8"?><collection xmlns="http://www.loc.gov/MARC21/slim"><record><leader>01000naa a22002652 4500</leader><controlfield tag="001">OLC2126673537</controlfield><controlfield tag="003">DE-627</controlfield><controlfield tag="005">20230505120523.0</controlfield><controlfield tag="007">tu</controlfield><controlfield tag="008">230505s2021 xx ||||| 00| ||eng c</controlfield><datafield tag="024" ind1="7" ind2=" "><subfield code="a">10.1007/s11227-020-03591-6</subfield><subfield code="2">doi</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(DE-627)OLC2126673537</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(DE-He213)s11227-020-03591-6-p</subfield></datafield><datafield tag="040" ind1=" " ind2=" "><subfield code="a">DE-627</subfield><subfield code="b">ger</subfield><subfield code="c">DE-627</subfield><subfield code="e">rakwb</subfield></datafield><datafield tag="041" ind1=" " ind2=" "><subfield code="a">eng</subfield></datafield><datafield tag="082" ind1="0" ind2="4"><subfield code="a">004</subfield><subfield code="a">620</subfield><subfield code="q">VZ</subfield></datafield><datafield tag="100" ind1="1" ind2=" "><subfield code="a">Malkovsky, Sergey I.</subfield><subfield code="e">verfasserin</subfield><subfield code="0">(orcid)0000-0001-9731-2606</subfield><subfield code="4">aut</subfield></datafield><datafield tag="245" ind1="1" ind2="0"><subfield code="a">Evaluating the performance of FFT library implementations on modern hybrid computing systems</subfield></datafield><datafield tag="264" ind1=" " ind2="1"><subfield code="c">2021</subfield></datafield><datafield tag="336" ind1=" " ind2=" "><subfield code="a">Text</subfield><subfield code="b">txt</subfield><subfield code="2">rdacontent</subfield></datafield><datafield tag="337" ind1=" " ind2=" "><subfield code="a">ohne Hilfsmittel zu benutzen</subfield><subfield code="b">n</subfield><subfield code="2">rdamedia</subfield></datafield><datafield tag="338" ind1=" " ind2=" "><subfield code="a">Band</subfield><subfield code="b">nc</subfield><subfield code="2">rdacarrier</subfield></datafield><datafield tag="500" ind1=" " ind2=" "><subfield code="a">© The Author(s), under exclusive licence to Springer Science+Business Media, LLC part of Springer Nature 2021</subfield></datafield><datafield tag="520" ind1=" " ind2=" "><subfield code="a">Abstract Fast Fourier transform is widely used to solve numerous scientific and engineering problems. In particular, this transform is behind the software dealing with speech and image recognition, signal analysis, modeling of properties of new materials and substances, etc. Newly emerging high-performance hybrid computing systems, as well as systems with alternative architectures, require research on discrete Fourier transform computation efficiency on these new platforms. The results of such research allow assessing the feasibility of certain solutions for building modern computing and data processing centers. This paper presents the results of such research covering modern hybrid computing systems based on the IBM POWER and Intel Xeon processors, as well as on NVIDIA Tesla co-processors. The analysis is carried out, and conclusions are presented on their performance when executing fast Fourier transforms. The impact of the existing architectural aspects of the hardware (CPU simultaneous multithreading mode, GPU data transfer bus, etc.) on the transform performance efficiency is assessed. The obtained results are used to provide recommendations on the optimal operation modes and settings of the considered mathematical libraries.</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Hybrid computing systems</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">IBM POWER</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Intel Xeon</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">NVIDIA Tesla</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">FFT</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">IBM ESSL</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">FFTW</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">cuFFT</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">cuFFTW</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Intel MKL</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">Sorokin, Aleksei A.</subfield><subfield code="0">(orcid)0000-0003-3334-3440</subfield><subfield code="4">aut</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">Tsoy, Georgiy I.</subfield><subfield code="0">(orcid)0000-0002-4209-1284</subfield><subfield code="4">aut</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">Korolev, Sergey P.</subfield><subfield code="0">(orcid)0000-0002-0002-7540</subfield><subfield code="4">aut</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">Smagin, Sergey I.</subfield><subfield code="4">aut</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">Kondrashev, Vadim A.</subfield><subfield code="0">(orcid)0000-0002-1224-1392</subfield><subfield code="4">aut</subfield></datafield><datafield tag="773" ind1="0" ind2="8"><subfield code="i">Enthalten in</subfield><subfield code="t">The journal of supercomputing</subfield><subfield code="d">Springer US, 1987</subfield><subfield code="g">77(2021), 8 vom: 20. Jan., Seite 8326-8354</subfield><subfield code="w">(DE-627)13046466X</subfield><subfield code="w">(DE-600)740510-8</subfield><subfield code="w">(DE-576)018667775</subfield><subfield code="x">0920-8542</subfield><subfield code="7">nnns</subfield></datafield><datafield tag="773" ind1="1" ind2="8"><subfield code="g">volume:77</subfield><subfield code="g">year:2021</subfield><subfield code="g">number:8</subfield><subfield code="g">day:20</subfield><subfield code="g">month:01</subfield><subfield code="g">pages:8326-8354</subfield></datafield><datafield tag="856" ind1="4" ind2="1"><subfield code="u">https://doi.org/10.1007/s11227-020-03591-6</subfield><subfield code="z">lizenzpflichtig</subfield><subfield code="3">Volltext</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_USEFLAG_A</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">SYSFLAG_A</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_OLC</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">SSG-OLC-TEC</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">SSG-OLC-MAT</subfield></datafield><datafield tag="951" ind1=" " ind2=" "><subfield code="a">AR</subfield></datafield><datafield tag="952" ind1=" " ind2=" "><subfield code="d">77</subfield><subfield code="j">2021</subfield><subfield code="e">8</subfield><subfield code="b">20</subfield><subfield code="c">01</subfield><subfield code="h">8326-8354</subfield></datafield></record></collection>
|
author |
Malkovsky, Sergey I. |
spellingShingle |
Malkovsky, Sergey I. ddc 004 misc Hybrid computing systems misc IBM POWER misc Intel Xeon misc NVIDIA Tesla misc FFT misc IBM ESSL misc FFTW misc cuFFT misc cuFFTW misc Intel MKL Evaluating the performance of FFT library implementations on modern hybrid computing systems |
authorStr |
Malkovsky, Sergey I. |
ppnlink_with_tag_str_mv |
@@773@@(DE-627)13046466X |
format |
Article |
dewey-ones |
004 - Data processing & computer science 620 - Engineering & allied operations |
delete_txt_mv |
keep |
author_role |
aut aut aut aut aut aut |
collection |
OLC |
remote_str |
false |
illustrated |
Not Illustrated |
issn |
0920-8542 |
topic_title |
004 620 VZ Evaluating the performance of FFT library implementations on modern hybrid computing systems Hybrid computing systems IBM POWER Intel Xeon NVIDIA Tesla FFT IBM ESSL FFTW cuFFT cuFFTW Intel MKL |
topic |
ddc 004 misc Hybrid computing systems misc IBM POWER misc Intel Xeon misc NVIDIA Tesla misc FFT misc IBM ESSL misc FFTW misc cuFFT misc cuFFTW misc Intel MKL |
topic_unstemmed |
ddc 004 misc Hybrid computing systems misc IBM POWER misc Intel Xeon misc NVIDIA Tesla misc FFT misc IBM ESSL misc FFTW misc cuFFT misc cuFFTW misc Intel MKL |
topic_browse |
ddc 004 misc Hybrid computing systems misc IBM POWER misc Intel Xeon misc NVIDIA Tesla misc FFT misc IBM ESSL misc FFTW misc cuFFT misc cuFFTW misc Intel MKL |
format_facet |
Aufsätze Gedruckte Aufsätze |
format_main_str_mv |
Text Zeitschrift/Artikel |
carriertype_str_mv |
nc |
hierarchy_parent_title |
The journal of supercomputing |
hierarchy_parent_id |
13046466X |
dewey-tens |
000 - Computer science, knowledge & systems 620 - Engineering |
hierarchy_top_title |
The journal of supercomputing |
isfreeaccess_txt |
false |
familylinks_str_mv |
(DE-627)13046466X (DE-600)740510-8 (DE-576)018667775 |
title |
Evaluating the performance of FFT library implementations on modern hybrid computing systems |
ctrlnum |
(DE-627)OLC2126673537 (DE-He213)s11227-020-03591-6-p |
title_full |
Evaluating the performance of FFT library implementations on modern hybrid computing systems |
author_sort |
Malkovsky, Sergey I. |
journal |
The journal of supercomputing |
journalStr |
The journal of supercomputing |
lang_code |
eng |
isOA_bool |
false |
dewey-hundreds |
000 - Computer science, information & general works 600 - Technology |
recordtype |
marc |
publishDateSort |
2021 |
contenttype_str_mv |
txt |
container_start_page |
8326 |
author_browse |
Malkovsky, Sergey I. Sorokin, Aleksei A. Tsoy, Georgiy I. Korolev, Sergey P. Smagin, Sergey I. Kondrashev, Vadim A. |
container_volume |
77 |
class |
004 620 VZ |
format_se |
Aufsätze |
author-letter |
Malkovsky, Sergey I. |
doi_str_mv |
10.1007/s11227-020-03591-6 |
normlink |
(ORCID)0000-0001-9731-2606 (ORCID)0000-0003-3334-3440 (ORCID)0000-0002-4209-1284 (ORCID)0000-0002-0002-7540 (ORCID)0000-0002-1224-1392 |
normlink_prefix_str_mv |
(orcid)0000-0001-9731-2606 (orcid)0000-0003-3334-3440 (orcid)0000-0002-4209-1284 (orcid)0000-0002-0002-7540 (orcid)0000-0002-1224-1392 |
dewey-full |
004 620 |
title_sort |
evaluating the performance of fft library implementations on modern hybrid computing systems |
title_auth |
Evaluating the performance of FFT library implementations on modern hybrid computing systems |
abstract |
Abstract Fast Fourier transform is widely used to solve numerous scientific and engineering problems. In particular, this transform is behind the software dealing with speech and image recognition, signal analysis, modeling of properties of new materials and substances, etc. Newly emerging high-performance hybrid computing systems, as well as systems with alternative architectures, require research on discrete Fourier transform computation efficiency on these new platforms. The results of such research allow assessing the feasibility of certain solutions for building modern computing and data processing centers. This paper presents the results of such research covering modern hybrid computing systems based on the IBM POWER and Intel Xeon processors, as well as on NVIDIA Tesla co-processors. The analysis is carried out, and conclusions are presented on their performance when executing fast Fourier transforms. The impact of the existing architectural aspects of the hardware (CPU simultaneous multithreading mode, GPU data transfer bus, etc.) on the transform performance efficiency is assessed. The obtained results are used to provide recommendations on the optimal operation modes and settings of the considered mathematical libraries. © The Author(s), under exclusive licence to Springer Science+Business Media, LLC part of Springer Nature 2021 |
abstractGer |
Abstract Fast Fourier transform is widely used to solve numerous scientific and engineering problems. In particular, this transform is behind the software dealing with speech and image recognition, signal analysis, modeling of properties of new materials and substances, etc. Newly emerging high-performance hybrid computing systems, as well as systems with alternative architectures, require research on discrete Fourier transform computation efficiency on these new platforms. The results of such research allow assessing the feasibility of certain solutions for building modern computing and data processing centers. This paper presents the results of such research covering modern hybrid computing systems based on the IBM POWER and Intel Xeon processors, as well as on NVIDIA Tesla co-processors. The analysis is carried out, and conclusions are presented on their performance when executing fast Fourier transforms. The impact of the existing architectural aspects of the hardware (CPU simultaneous multithreading mode, GPU data transfer bus, etc.) on the transform performance efficiency is assessed. The obtained results are used to provide recommendations on the optimal operation modes and settings of the considered mathematical libraries. © The Author(s), under exclusive licence to Springer Science+Business Media, LLC part of Springer Nature 2021 |
abstract_unstemmed |
Abstract Fast Fourier transform is widely used to solve numerous scientific and engineering problems. In particular, this transform is behind the software dealing with speech and image recognition, signal analysis, modeling of properties of new materials and substances, etc. Newly emerging high-performance hybrid computing systems, as well as systems with alternative architectures, require research on discrete Fourier transform computation efficiency on these new platforms. The results of such research allow assessing the feasibility of certain solutions for building modern computing and data processing centers. This paper presents the results of such research covering modern hybrid computing systems based on the IBM POWER and Intel Xeon processors, as well as on NVIDIA Tesla co-processors. The analysis is carried out, and conclusions are presented on their performance when executing fast Fourier transforms. The impact of the existing architectural aspects of the hardware (CPU simultaneous multithreading mode, GPU data transfer bus, etc.) on the transform performance efficiency is assessed. The obtained results are used to provide recommendations on the optimal operation modes and settings of the considered mathematical libraries. © The Author(s), under exclusive licence to Springer Science+Business Media, LLC part of Springer Nature 2021 |
collection_details |
GBV_USEFLAG_A SYSFLAG_A GBV_OLC SSG-OLC-TEC SSG-OLC-MAT |
container_issue |
8 |
title_short |
Evaluating the performance of FFT library implementations on modern hybrid computing systems |
url |
https://doi.org/10.1007/s11227-020-03591-6 |
remote_bool |
false |
author2 |
Sorokin, Aleksei A. Tsoy, Georgiy I. Korolev, Sergey P. Smagin, Sergey I. Kondrashev, Vadim A. |
author2Str |
Sorokin, Aleksei A. Tsoy, Georgiy I. Korolev, Sergey P. Smagin, Sergey I. Kondrashev, Vadim A. |
ppnlink |
13046466X |
mediatype_str_mv |
n |
isOA_txt |
false |
hochschulschrift_bool |
false |
doi_str |
10.1007/s11227-020-03591-6 |
up_date |
2024-07-04T07:57:45.242Z |
_version_ |
1803634464773898240 |
fullrecord_marcxml |
<?xml version="1.0" encoding="UTF-8"?><collection xmlns="http://www.loc.gov/MARC21/slim"><record><leader>01000naa a22002652 4500</leader><controlfield tag="001">OLC2126673537</controlfield><controlfield tag="003">DE-627</controlfield><controlfield tag="005">20230505120523.0</controlfield><controlfield tag="007">tu</controlfield><controlfield tag="008">230505s2021 xx ||||| 00| ||eng c</controlfield><datafield tag="024" ind1="7" ind2=" "><subfield code="a">10.1007/s11227-020-03591-6</subfield><subfield code="2">doi</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(DE-627)OLC2126673537</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(DE-He213)s11227-020-03591-6-p</subfield></datafield><datafield tag="040" ind1=" " ind2=" "><subfield code="a">DE-627</subfield><subfield code="b">ger</subfield><subfield code="c">DE-627</subfield><subfield code="e">rakwb</subfield></datafield><datafield tag="041" ind1=" " ind2=" "><subfield code="a">eng</subfield></datafield><datafield tag="082" ind1="0" ind2="4"><subfield code="a">004</subfield><subfield code="a">620</subfield><subfield code="q">VZ</subfield></datafield><datafield tag="100" ind1="1" ind2=" "><subfield code="a">Malkovsky, Sergey I.</subfield><subfield code="e">verfasserin</subfield><subfield code="0">(orcid)0000-0001-9731-2606</subfield><subfield code="4">aut</subfield></datafield><datafield tag="245" ind1="1" ind2="0"><subfield code="a">Evaluating the performance of FFT library implementations on modern hybrid computing systems</subfield></datafield><datafield tag="264" ind1=" " ind2="1"><subfield code="c">2021</subfield></datafield><datafield tag="336" ind1=" " ind2=" "><subfield code="a">Text</subfield><subfield code="b">txt</subfield><subfield code="2">rdacontent</subfield></datafield><datafield tag="337" ind1=" " ind2=" "><subfield code="a">ohne Hilfsmittel zu benutzen</subfield><subfield code="b">n</subfield><subfield code="2">rdamedia</subfield></datafield><datafield tag="338" ind1=" " ind2=" "><subfield code="a">Band</subfield><subfield code="b">nc</subfield><subfield code="2">rdacarrier</subfield></datafield><datafield tag="500" ind1=" " ind2=" "><subfield code="a">© The Author(s), under exclusive licence to Springer Science+Business Media, LLC part of Springer Nature 2021</subfield></datafield><datafield tag="520" ind1=" " ind2=" "><subfield code="a">Abstract Fast Fourier transform is widely used to solve numerous scientific and engineering problems. In particular, this transform is behind the software dealing with speech and image recognition, signal analysis, modeling of properties of new materials and substances, etc. Newly emerging high-performance hybrid computing systems, as well as systems with alternative architectures, require research on discrete Fourier transform computation efficiency on these new platforms. The results of such research allow assessing the feasibility of certain solutions for building modern computing and data processing centers. This paper presents the results of such research covering modern hybrid computing systems based on the IBM POWER and Intel Xeon processors, as well as on NVIDIA Tesla co-processors. The analysis is carried out, and conclusions are presented on their performance when executing fast Fourier transforms. The impact of the existing architectural aspects of the hardware (CPU simultaneous multithreading mode, GPU data transfer bus, etc.) on the transform performance efficiency is assessed. The obtained results are used to provide recommendations on the optimal operation modes and settings of the considered mathematical libraries.</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Hybrid computing systems</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">IBM POWER</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Intel Xeon</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">NVIDIA Tesla</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">FFT</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">IBM ESSL</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">FFTW</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">cuFFT</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">cuFFTW</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Intel MKL</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">Sorokin, Aleksei A.</subfield><subfield code="0">(orcid)0000-0003-3334-3440</subfield><subfield code="4">aut</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">Tsoy, Georgiy I.</subfield><subfield code="0">(orcid)0000-0002-4209-1284</subfield><subfield code="4">aut</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">Korolev, Sergey P.</subfield><subfield code="0">(orcid)0000-0002-0002-7540</subfield><subfield code="4">aut</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">Smagin, Sergey I.</subfield><subfield code="4">aut</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">Kondrashev, Vadim A.</subfield><subfield code="0">(orcid)0000-0002-1224-1392</subfield><subfield code="4">aut</subfield></datafield><datafield tag="773" ind1="0" ind2="8"><subfield code="i">Enthalten in</subfield><subfield code="t">The journal of supercomputing</subfield><subfield code="d">Springer US, 1987</subfield><subfield code="g">77(2021), 8 vom: 20. Jan., Seite 8326-8354</subfield><subfield code="w">(DE-627)13046466X</subfield><subfield code="w">(DE-600)740510-8</subfield><subfield code="w">(DE-576)018667775</subfield><subfield code="x">0920-8542</subfield><subfield code="7">nnns</subfield></datafield><datafield tag="773" ind1="1" ind2="8"><subfield code="g">volume:77</subfield><subfield code="g">year:2021</subfield><subfield code="g">number:8</subfield><subfield code="g">day:20</subfield><subfield code="g">month:01</subfield><subfield code="g">pages:8326-8354</subfield></datafield><datafield tag="856" ind1="4" ind2="1"><subfield code="u">https://doi.org/10.1007/s11227-020-03591-6</subfield><subfield code="z">lizenzpflichtig</subfield><subfield code="3">Volltext</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_USEFLAG_A</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">SYSFLAG_A</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_OLC</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">SSG-OLC-TEC</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">SSG-OLC-MAT</subfield></datafield><datafield tag="951" ind1=" " ind2=" "><subfield code="a">AR</subfield></datafield><datafield tag="952" ind1=" " ind2=" "><subfield code="d">77</subfield><subfield code="j">2021</subfield><subfield code="e">8</subfield><subfield code="b">20</subfield><subfield code="c">01</subfield><subfield code="h">8326-8354</subfield></datafield></record></collection>
|
score |
7.3991175 |