Scene optimization of GPU-based back-projection algorithm
Abstract The back-projection algorithm can adapt to synthetic aperture radar imaging in any mode because of its time-domain imaging characteristics, so it has received increasing attention in scientific research and engineering practice. However, the large enormous of computation of the back-project...
Ausführliche Beschreibung
Autor*in: |
Gong, Hao [verfasserIn] |
---|
Format: |
Artikel |
---|---|
Sprache: |
Englisch |
Erschienen: |
2022 |
---|
Schlagwörter: |
---|
Anmerkung: |
© The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2022. Springer Nature or its licensor holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law. |
---|
Übergeordnetes Werk: |
Enthalten in: The journal of supercomputing - Springer US, 1987, 79(2022), 4 vom: 22. Sept., Seite 4192-4214 |
---|---|
Übergeordnetes Werk: |
volume:79 ; year:2022 ; number:4 ; day:22 ; month:09 ; pages:4192-4214 |
Links: |
---|
DOI / URN: |
10.1007/s11227-022-04785-w |
---|
Katalog-ID: |
OLC2133608893 |
---|
LEADER | 01000naa a22002652 4500 | ||
---|---|---|---|
001 | OLC2133608893 | ||
003 | DE-627 | ||
005 | 20230506152103.0 | ||
007 | tu | ||
008 | 230506s2022 xx ||||| 00| ||eng c | ||
024 | 7 | |a 10.1007/s11227-022-04785-w |2 doi | |
035 | |a (DE-627)OLC2133608893 | ||
035 | |a (DE-He213)s11227-022-04785-w-p | ||
040 | |a DE-627 |b ger |c DE-627 |e rakwb | ||
041 | |a eng | ||
082 | 0 | 4 | |a 004 |a 620 |q VZ |
100 | 1 | |a Gong, Hao |e verfasserin |4 aut | |
245 | 1 | 0 | |a Scene optimization of GPU-based back-projection algorithm |
264 | 1 | |c 2022 | |
336 | |a Text |b txt |2 rdacontent | ||
337 | |a ohne Hilfsmittel zu benutzen |b n |2 rdamedia | ||
338 | |a Band |b nc |2 rdacarrier | ||
500 | |a © The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2022. Springer Nature or its licensor holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law. | ||
520 | |a Abstract The back-projection algorithm can adapt to synthetic aperture radar imaging in any mode because of its time-domain imaging characteristics, so it has received increasing attention in scientific research and engineering practice. However, the large enormous of computation of the back-projection algorithm limits its application. Since the back-projection algorithm has a high degree of data parallelism, it is very suitable for GPU parallel processing. The data scale and hardware environment in different application scenarios will result in different bottlenecks which may prevent the back-projection algorithm from performing optimally. Thus, this paper proposes a series of strategies to aid the back-projection algorithm in achieving the best peak performance with any given problem scale on any given platform. A heuristic block strategy is proposed to optimize the peak performance of the back-projection algorithm on servers and miniaturized GPU devices, which can handle the differences in hardware platforms as well as the differences in data scales. To optimize the peak performance of the back-projection algorithm on miniaturized GPU devices, a memory management strategy is proposed by using unified memory and pinned host memory. The method proposed in this paper has achieved an acceleration ratio close to the device’s peak performance on servers and miniaturized GPU devices. By using the proposed methods, any algorithm with data-independent chunking and any GPU device with a uniform memory architecture can achieve higher peak performance. | ||
650 | 4 | |a Back-projection | |
650 | 4 | |a GPU | |
650 | 4 | |a Heuristic block strategy | |
650 | 4 | |a Pinned host memory | |
650 | 4 | |a Unified memory | |
700 | 1 | |a Liu, Ying |4 aut | |
700 | 1 | |a Chen, Xiaoying |4 aut | |
700 | 1 | |a Wang, Cheng |4 aut | |
773 | 0 | 8 | |i Enthalten in |t The journal of supercomputing |d Springer US, 1987 |g 79(2022), 4 vom: 22. Sept., Seite 4192-4214 |w (DE-627)13046466X |w (DE-600)740510-8 |w (DE-576)018667775 |x 0920-8542 |7 nnns |
773 | 1 | 8 | |g volume:79 |g year:2022 |g number:4 |g day:22 |g month:09 |g pages:4192-4214 |
856 | 4 | 1 | |u https://doi.org/10.1007/s11227-022-04785-w |z lizenzpflichtig |3 Volltext |
912 | |a GBV_USEFLAG_A | ||
912 | |a SYSFLAG_A | ||
912 | |a GBV_OLC | ||
912 | |a SSG-OLC-TEC | ||
912 | |a SSG-OLC-MAT | ||
951 | |a AR | ||
952 | |d 79 |j 2022 |e 4 |b 22 |c 09 |h 4192-4214 |
author_variant |
h g hg y l yl x c xc c w cw |
---|---|
matchkey_str |
article:09208542:2022----::cnotmztooguaebcpoe |
hierarchy_sort_str |
2022 |
publishDate |
2022 |
allfields |
10.1007/s11227-022-04785-w doi (DE-627)OLC2133608893 (DE-He213)s11227-022-04785-w-p DE-627 ger DE-627 rakwb eng 004 620 VZ Gong, Hao verfasserin aut Scene optimization of GPU-based back-projection algorithm 2022 Text txt rdacontent ohne Hilfsmittel zu benutzen n rdamedia Band nc rdacarrier © The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2022. Springer Nature or its licensor holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law. Abstract The back-projection algorithm can adapt to synthetic aperture radar imaging in any mode because of its time-domain imaging characteristics, so it has received increasing attention in scientific research and engineering practice. However, the large enormous of computation of the back-projection algorithm limits its application. Since the back-projection algorithm has a high degree of data parallelism, it is very suitable for GPU parallel processing. The data scale and hardware environment in different application scenarios will result in different bottlenecks which may prevent the back-projection algorithm from performing optimally. Thus, this paper proposes a series of strategies to aid the back-projection algorithm in achieving the best peak performance with any given problem scale on any given platform. A heuristic block strategy is proposed to optimize the peak performance of the back-projection algorithm on servers and miniaturized GPU devices, which can handle the differences in hardware platforms as well as the differences in data scales. To optimize the peak performance of the back-projection algorithm on miniaturized GPU devices, a memory management strategy is proposed by using unified memory and pinned host memory. The method proposed in this paper has achieved an acceleration ratio close to the device’s peak performance on servers and miniaturized GPU devices. By using the proposed methods, any algorithm with data-independent chunking and any GPU device with a uniform memory architecture can achieve higher peak performance. Back-projection GPU Heuristic block strategy Pinned host memory Unified memory Liu, Ying aut Chen, Xiaoying aut Wang, Cheng aut Enthalten in The journal of supercomputing Springer US, 1987 79(2022), 4 vom: 22. Sept., Seite 4192-4214 (DE-627)13046466X (DE-600)740510-8 (DE-576)018667775 0920-8542 nnns volume:79 year:2022 number:4 day:22 month:09 pages:4192-4214 https://doi.org/10.1007/s11227-022-04785-w lizenzpflichtig Volltext GBV_USEFLAG_A SYSFLAG_A GBV_OLC SSG-OLC-TEC SSG-OLC-MAT AR 79 2022 4 22 09 4192-4214 |
spelling |
10.1007/s11227-022-04785-w doi (DE-627)OLC2133608893 (DE-He213)s11227-022-04785-w-p DE-627 ger DE-627 rakwb eng 004 620 VZ Gong, Hao verfasserin aut Scene optimization of GPU-based back-projection algorithm 2022 Text txt rdacontent ohne Hilfsmittel zu benutzen n rdamedia Band nc rdacarrier © The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2022. Springer Nature or its licensor holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law. Abstract The back-projection algorithm can adapt to synthetic aperture radar imaging in any mode because of its time-domain imaging characteristics, so it has received increasing attention in scientific research and engineering practice. However, the large enormous of computation of the back-projection algorithm limits its application. Since the back-projection algorithm has a high degree of data parallelism, it is very suitable for GPU parallel processing. The data scale and hardware environment in different application scenarios will result in different bottlenecks which may prevent the back-projection algorithm from performing optimally. Thus, this paper proposes a series of strategies to aid the back-projection algorithm in achieving the best peak performance with any given problem scale on any given platform. A heuristic block strategy is proposed to optimize the peak performance of the back-projection algorithm on servers and miniaturized GPU devices, which can handle the differences in hardware platforms as well as the differences in data scales. To optimize the peak performance of the back-projection algorithm on miniaturized GPU devices, a memory management strategy is proposed by using unified memory and pinned host memory. The method proposed in this paper has achieved an acceleration ratio close to the device’s peak performance on servers and miniaturized GPU devices. By using the proposed methods, any algorithm with data-independent chunking and any GPU device with a uniform memory architecture can achieve higher peak performance. Back-projection GPU Heuristic block strategy Pinned host memory Unified memory Liu, Ying aut Chen, Xiaoying aut Wang, Cheng aut Enthalten in The journal of supercomputing Springer US, 1987 79(2022), 4 vom: 22. Sept., Seite 4192-4214 (DE-627)13046466X (DE-600)740510-8 (DE-576)018667775 0920-8542 nnns volume:79 year:2022 number:4 day:22 month:09 pages:4192-4214 https://doi.org/10.1007/s11227-022-04785-w lizenzpflichtig Volltext GBV_USEFLAG_A SYSFLAG_A GBV_OLC SSG-OLC-TEC SSG-OLC-MAT AR 79 2022 4 22 09 4192-4214 |
allfields_unstemmed |
10.1007/s11227-022-04785-w doi (DE-627)OLC2133608893 (DE-He213)s11227-022-04785-w-p DE-627 ger DE-627 rakwb eng 004 620 VZ Gong, Hao verfasserin aut Scene optimization of GPU-based back-projection algorithm 2022 Text txt rdacontent ohne Hilfsmittel zu benutzen n rdamedia Band nc rdacarrier © The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2022. Springer Nature or its licensor holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law. Abstract The back-projection algorithm can adapt to synthetic aperture radar imaging in any mode because of its time-domain imaging characteristics, so it has received increasing attention in scientific research and engineering practice. However, the large enormous of computation of the back-projection algorithm limits its application. Since the back-projection algorithm has a high degree of data parallelism, it is very suitable for GPU parallel processing. The data scale and hardware environment in different application scenarios will result in different bottlenecks which may prevent the back-projection algorithm from performing optimally. Thus, this paper proposes a series of strategies to aid the back-projection algorithm in achieving the best peak performance with any given problem scale on any given platform. A heuristic block strategy is proposed to optimize the peak performance of the back-projection algorithm on servers and miniaturized GPU devices, which can handle the differences in hardware platforms as well as the differences in data scales. To optimize the peak performance of the back-projection algorithm on miniaturized GPU devices, a memory management strategy is proposed by using unified memory and pinned host memory. The method proposed in this paper has achieved an acceleration ratio close to the device’s peak performance on servers and miniaturized GPU devices. By using the proposed methods, any algorithm with data-independent chunking and any GPU device with a uniform memory architecture can achieve higher peak performance. Back-projection GPU Heuristic block strategy Pinned host memory Unified memory Liu, Ying aut Chen, Xiaoying aut Wang, Cheng aut Enthalten in The journal of supercomputing Springer US, 1987 79(2022), 4 vom: 22. Sept., Seite 4192-4214 (DE-627)13046466X (DE-600)740510-8 (DE-576)018667775 0920-8542 nnns volume:79 year:2022 number:4 day:22 month:09 pages:4192-4214 https://doi.org/10.1007/s11227-022-04785-w lizenzpflichtig Volltext GBV_USEFLAG_A SYSFLAG_A GBV_OLC SSG-OLC-TEC SSG-OLC-MAT AR 79 2022 4 22 09 4192-4214 |
allfieldsGer |
10.1007/s11227-022-04785-w doi (DE-627)OLC2133608893 (DE-He213)s11227-022-04785-w-p DE-627 ger DE-627 rakwb eng 004 620 VZ Gong, Hao verfasserin aut Scene optimization of GPU-based back-projection algorithm 2022 Text txt rdacontent ohne Hilfsmittel zu benutzen n rdamedia Band nc rdacarrier © The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2022. Springer Nature or its licensor holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law. Abstract The back-projection algorithm can adapt to synthetic aperture radar imaging in any mode because of its time-domain imaging characteristics, so it has received increasing attention in scientific research and engineering practice. However, the large enormous of computation of the back-projection algorithm limits its application. Since the back-projection algorithm has a high degree of data parallelism, it is very suitable for GPU parallel processing. The data scale and hardware environment in different application scenarios will result in different bottlenecks which may prevent the back-projection algorithm from performing optimally. Thus, this paper proposes a series of strategies to aid the back-projection algorithm in achieving the best peak performance with any given problem scale on any given platform. A heuristic block strategy is proposed to optimize the peak performance of the back-projection algorithm on servers and miniaturized GPU devices, which can handle the differences in hardware platforms as well as the differences in data scales. To optimize the peak performance of the back-projection algorithm on miniaturized GPU devices, a memory management strategy is proposed by using unified memory and pinned host memory. The method proposed in this paper has achieved an acceleration ratio close to the device’s peak performance on servers and miniaturized GPU devices. By using the proposed methods, any algorithm with data-independent chunking and any GPU device with a uniform memory architecture can achieve higher peak performance. Back-projection GPU Heuristic block strategy Pinned host memory Unified memory Liu, Ying aut Chen, Xiaoying aut Wang, Cheng aut Enthalten in The journal of supercomputing Springer US, 1987 79(2022), 4 vom: 22. Sept., Seite 4192-4214 (DE-627)13046466X (DE-600)740510-8 (DE-576)018667775 0920-8542 nnns volume:79 year:2022 number:4 day:22 month:09 pages:4192-4214 https://doi.org/10.1007/s11227-022-04785-w lizenzpflichtig Volltext GBV_USEFLAG_A SYSFLAG_A GBV_OLC SSG-OLC-TEC SSG-OLC-MAT AR 79 2022 4 22 09 4192-4214 |
allfieldsSound |
10.1007/s11227-022-04785-w doi (DE-627)OLC2133608893 (DE-He213)s11227-022-04785-w-p DE-627 ger DE-627 rakwb eng 004 620 VZ Gong, Hao verfasserin aut Scene optimization of GPU-based back-projection algorithm 2022 Text txt rdacontent ohne Hilfsmittel zu benutzen n rdamedia Band nc rdacarrier © The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2022. Springer Nature or its licensor holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law. Abstract The back-projection algorithm can adapt to synthetic aperture radar imaging in any mode because of its time-domain imaging characteristics, so it has received increasing attention in scientific research and engineering practice. However, the large enormous of computation of the back-projection algorithm limits its application. Since the back-projection algorithm has a high degree of data parallelism, it is very suitable for GPU parallel processing. The data scale and hardware environment in different application scenarios will result in different bottlenecks which may prevent the back-projection algorithm from performing optimally. Thus, this paper proposes a series of strategies to aid the back-projection algorithm in achieving the best peak performance with any given problem scale on any given platform. A heuristic block strategy is proposed to optimize the peak performance of the back-projection algorithm on servers and miniaturized GPU devices, which can handle the differences in hardware platforms as well as the differences in data scales. To optimize the peak performance of the back-projection algorithm on miniaturized GPU devices, a memory management strategy is proposed by using unified memory and pinned host memory. The method proposed in this paper has achieved an acceleration ratio close to the device’s peak performance on servers and miniaturized GPU devices. By using the proposed methods, any algorithm with data-independent chunking and any GPU device with a uniform memory architecture can achieve higher peak performance. Back-projection GPU Heuristic block strategy Pinned host memory Unified memory Liu, Ying aut Chen, Xiaoying aut Wang, Cheng aut Enthalten in The journal of supercomputing Springer US, 1987 79(2022), 4 vom: 22. Sept., Seite 4192-4214 (DE-627)13046466X (DE-600)740510-8 (DE-576)018667775 0920-8542 nnns volume:79 year:2022 number:4 day:22 month:09 pages:4192-4214 https://doi.org/10.1007/s11227-022-04785-w lizenzpflichtig Volltext GBV_USEFLAG_A SYSFLAG_A GBV_OLC SSG-OLC-TEC SSG-OLC-MAT AR 79 2022 4 22 09 4192-4214 |
language |
English |
source |
Enthalten in The journal of supercomputing 79(2022), 4 vom: 22. Sept., Seite 4192-4214 volume:79 year:2022 number:4 day:22 month:09 pages:4192-4214 |
sourceStr |
Enthalten in The journal of supercomputing 79(2022), 4 vom: 22. Sept., Seite 4192-4214 volume:79 year:2022 number:4 day:22 month:09 pages:4192-4214 |
format_phy_str_mv |
Article |
institution |
findex.gbv.de |
topic_facet |
Back-projection GPU Heuristic block strategy Pinned host memory Unified memory |
dewey-raw |
004 |
isfreeaccess_bool |
false |
container_title |
The journal of supercomputing |
authorswithroles_txt_mv |
Gong, Hao @@aut@@ Liu, Ying @@aut@@ Chen, Xiaoying @@aut@@ Wang, Cheng @@aut@@ |
publishDateDaySort_date |
2022-09-22T00:00:00Z |
hierarchy_top_id |
13046466X |
dewey-sort |
14 |
id |
OLC2133608893 |
language_de |
englisch |
fullrecord |
<?xml version="1.0" encoding="UTF-8"?><collection xmlns="http://www.loc.gov/MARC21/slim"><record><leader>01000naa a22002652 4500</leader><controlfield tag="001">OLC2133608893</controlfield><controlfield tag="003">DE-627</controlfield><controlfield tag="005">20230506152103.0</controlfield><controlfield tag="007">tu</controlfield><controlfield tag="008">230506s2022 xx ||||| 00| ||eng c</controlfield><datafield tag="024" ind1="7" ind2=" "><subfield code="a">10.1007/s11227-022-04785-w</subfield><subfield code="2">doi</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(DE-627)OLC2133608893</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(DE-He213)s11227-022-04785-w-p</subfield></datafield><datafield tag="040" ind1=" " ind2=" "><subfield code="a">DE-627</subfield><subfield code="b">ger</subfield><subfield code="c">DE-627</subfield><subfield code="e">rakwb</subfield></datafield><datafield tag="041" ind1=" " ind2=" "><subfield code="a">eng</subfield></datafield><datafield tag="082" ind1="0" ind2="4"><subfield code="a">004</subfield><subfield code="a">620</subfield><subfield code="q">VZ</subfield></datafield><datafield tag="100" ind1="1" ind2=" "><subfield code="a">Gong, Hao</subfield><subfield code="e">verfasserin</subfield><subfield code="4">aut</subfield></datafield><datafield tag="245" ind1="1" ind2="0"><subfield code="a">Scene optimization of GPU-based back-projection algorithm</subfield></datafield><datafield tag="264" ind1=" " ind2="1"><subfield code="c">2022</subfield></datafield><datafield tag="336" ind1=" " ind2=" "><subfield code="a">Text</subfield><subfield code="b">txt</subfield><subfield code="2">rdacontent</subfield></datafield><datafield tag="337" ind1=" " ind2=" "><subfield code="a">ohne Hilfsmittel zu benutzen</subfield><subfield code="b">n</subfield><subfield code="2">rdamedia</subfield></datafield><datafield tag="338" ind1=" " ind2=" "><subfield code="a">Band</subfield><subfield code="b">nc</subfield><subfield code="2">rdacarrier</subfield></datafield><datafield tag="500" ind1=" " ind2=" "><subfield code="a">© The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2022. Springer Nature or its licensor holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.</subfield></datafield><datafield tag="520" ind1=" " ind2=" "><subfield code="a">Abstract The back-projection algorithm can adapt to synthetic aperture radar imaging in any mode because of its time-domain imaging characteristics, so it has received increasing attention in scientific research and engineering practice. However, the large enormous of computation of the back-projection algorithm limits its application. Since the back-projection algorithm has a high degree of data parallelism, it is very suitable for GPU parallel processing. The data scale and hardware environment in different application scenarios will result in different bottlenecks which may prevent the back-projection algorithm from performing optimally. Thus, this paper proposes a series of strategies to aid the back-projection algorithm in achieving the best peak performance with any given problem scale on any given platform. A heuristic block strategy is proposed to optimize the peak performance of the back-projection algorithm on servers and miniaturized GPU devices, which can handle the differences in hardware platforms as well as the differences in data scales. To optimize the peak performance of the back-projection algorithm on miniaturized GPU devices, a memory management strategy is proposed by using unified memory and pinned host memory. The method proposed in this paper has achieved an acceleration ratio close to the device’s peak performance on servers and miniaturized GPU devices. By using the proposed methods, any algorithm with data-independent chunking and any GPU device with a uniform memory architecture can achieve higher peak performance.</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Back-projection</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">GPU</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Heuristic block strategy</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Pinned host memory</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Unified memory</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">Liu, Ying</subfield><subfield code="4">aut</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">Chen, Xiaoying</subfield><subfield code="4">aut</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">Wang, Cheng</subfield><subfield code="4">aut</subfield></datafield><datafield tag="773" ind1="0" ind2="8"><subfield code="i">Enthalten in</subfield><subfield code="t">The journal of supercomputing</subfield><subfield code="d">Springer US, 1987</subfield><subfield code="g">79(2022), 4 vom: 22. Sept., Seite 4192-4214</subfield><subfield code="w">(DE-627)13046466X</subfield><subfield code="w">(DE-600)740510-8</subfield><subfield code="w">(DE-576)018667775</subfield><subfield code="x">0920-8542</subfield><subfield code="7">nnns</subfield></datafield><datafield tag="773" ind1="1" ind2="8"><subfield code="g">volume:79</subfield><subfield code="g">year:2022</subfield><subfield code="g">number:4</subfield><subfield code="g">day:22</subfield><subfield code="g">month:09</subfield><subfield code="g">pages:4192-4214</subfield></datafield><datafield tag="856" ind1="4" ind2="1"><subfield code="u">https://doi.org/10.1007/s11227-022-04785-w</subfield><subfield code="z">lizenzpflichtig</subfield><subfield code="3">Volltext</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_USEFLAG_A</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">SYSFLAG_A</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_OLC</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">SSG-OLC-TEC</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">SSG-OLC-MAT</subfield></datafield><datafield tag="951" ind1=" " ind2=" "><subfield code="a">AR</subfield></datafield><datafield tag="952" ind1=" " ind2=" "><subfield code="d">79</subfield><subfield code="j">2022</subfield><subfield code="e">4</subfield><subfield code="b">22</subfield><subfield code="c">09</subfield><subfield code="h">4192-4214</subfield></datafield></record></collection>
|
author |
Gong, Hao |
spellingShingle |
Gong, Hao ddc 004 misc Back-projection misc GPU misc Heuristic block strategy misc Pinned host memory misc Unified memory Scene optimization of GPU-based back-projection algorithm |
authorStr |
Gong, Hao |
ppnlink_with_tag_str_mv |
@@773@@(DE-627)13046466X |
format |
Article |
dewey-ones |
004 - Data processing & computer science 620 - Engineering & allied operations |
delete_txt_mv |
keep |
author_role |
aut aut aut aut |
collection |
OLC |
remote_str |
false |
illustrated |
Not Illustrated |
issn |
0920-8542 |
topic_title |
004 620 VZ Scene optimization of GPU-based back-projection algorithm Back-projection GPU Heuristic block strategy Pinned host memory Unified memory |
topic |
ddc 004 misc Back-projection misc GPU misc Heuristic block strategy misc Pinned host memory misc Unified memory |
topic_unstemmed |
ddc 004 misc Back-projection misc GPU misc Heuristic block strategy misc Pinned host memory misc Unified memory |
topic_browse |
ddc 004 misc Back-projection misc GPU misc Heuristic block strategy misc Pinned host memory misc Unified memory |
format_facet |
Aufsätze Gedruckte Aufsätze |
format_main_str_mv |
Text Zeitschrift/Artikel |
carriertype_str_mv |
nc |
hierarchy_parent_title |
The journal of supercomputing |
hierarchy_parent_id |
13046466X |
dewey-tens |
000 - Computer science, knowledge & systems 620 - Engineering |
hierarchy_top_title |
The journal of supercomputing |
isfreeaccess_txt |
false |
familylinks_str_mv |
(DE-627)13046466X (DE-600)740510-8 (DE-576)018667775 |
title |
Scene optimization of GPU-based back-projection algorithm |
ctrlnum |
(DE-627)OLC2133608893 (DE-He213)s11227-022-04785-w-p |
title_full |
Scene optimization of GPU-based back-projection algorithm |
author_sort |
Gong, Hao |
journal |
The journal of supercomputing |
journalStr |
The journal of supercomputing |
lang_code |
eng |
isOA_bool |
false |
dewey-hundreds |
000 - Computer science, information & general works 600 - Technology |
recordtype |
marc |
publishDateSort |
2022 |
contenttype_str_mv |
txt |
container_start_page |
4192 |
author_browse |
Gong, Hao Liu, Ying Chen, Xiaoying Wang, Cheng |
container_volume |
79 |
class |
004 620 VZ |
format_se |
Aufsätze |
author-letter |
Gong, Hao |
doi_str_mv |
10.1007/s11227-022-04785-w |
dewey-full |
004 620 |
title_sort |
scene optimization of gpu-based back-projection algorithm |
title_auth |
Scene optimization of GPU-based back-projection algorithm |
abstract |
Abstract The back-projection algorithm can adapt to synthetic aperture radar imaging in any mode because of its time-domain imaging characteristics, so it has received increasing attention in scientific research and engineering practice. However, the large enormous of computation of the back-projection algorithm limits its application. Since the back-projection algorithm has a high degree of data parallelism, it is very suitable for GPU parallel processing. The data scale and hardware environment in different application scenarios will result in different bottlenecks which may prevent the back-projection algorithm from performing optimally. Thus, this paper proposes a series of strategies to aid the back-projection algorithm in achieving the best peak performance with any given problem scale on any given platform. A heuristic block strategy is proposed to optimize the peak performance of the back-projection algorithm on servers and miniaturized GPU devices, which can handle the differences in hardware platforms as well as the differences in data scales. To optimize the peak performance of the back-projection algorithm on miniaturized GPU devices, a memory management strategy is proposed by using unified memory and pinned host memory. The method proposed in this paper has achieved an acceleration ratio close to the device’s peak performance on servers and miniaturized GPU devices. By using the proposed methods, any algorithm with data-independent chunking and any GPU device with a uniform memory architecture can achieve higher peak performance. © The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2022. Springer Nature or its licensor holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law. |
abstractGer |
Abstract The back-projection algorithm can adapt to synthetic aperture radar imaging in any mode because of its time-domain imaging characteristics, so it has received increasing attention in scientific research and engineering practice. However, the large enormous of computation of the back-projection algorithm limits its application. Since the back-projection algorithm has a high degree of data parallelism, it is very suitable for GPU parallel processing. The data scale and hardware environment in different application scenarios will result in different bottlenecks which may prevent the back-projection algorithm from performing optimally. Thus, this paper proposes a series of strategies to aid the back-projection algorithm in achieving the best peak performance with any given problem scale on any given platform. A heuristic block strategy is proposed to optimize the peak performance of the back-projection algorithm on servers and miniaturized GPU devices, which can handle the differences in hardware platforms as well as the differences in data scales. To optimize the peak performance of the back-projection algorithm on miniaturized GPU devices, a memory management strategy is proposed by using unified memory and pinned host memory. The method proposed in this paper has achieved an acceleration ratio close to the device’s peak performance on servers and miniaturized GPU devices. By using the proposed methods, any algorithm with data-independent chunking and any GPU device with a uniform memory architecture can achieve higher peak performance. © The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2022. Springer Nature or its licensor holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law. |
abstract_unstemmed |
Abstract The back-projection algorithm can adapt to synthetic aperture radar imaging in any mode because of its time-domain imaging characteristics, so it has received increasing attention in scientific research and engineering practice. However, the large enormous of computation of the back-projection algorithm limits its application. Since the back-projection algorithm has a high degree of data parallelism, it is very suitable for GPU parallel processing. The data scale and hardware environment in different application scenarios will result in different bottlenecks which may prevent the back-projection algorithm from performing optimally. Thus, this paper proposes a series of strategies to aid the back-projection algorithm in achieving the best peak performance with any given problem scale on any given platform. A heuristic block strategy is proposed to optimize the peak performance of the back-projection algorithm on servers and miniaturized GPU devices, which can handle the differences in hardware platforms as well as the differences in data scales. To optimize the peak performance of the back-projection algorithm on miniaturized GPU devices, a memory management strategy is proposed by using unified memory and pinned host memory. The method proposed in this paper has achieved an acceleration ratio close to the device’s peak performance on servers and miniaturized GPU devices. By using the proposed methods, any algorithm with data-independent chunking and any GPU device with a uniform memory architecture can achieve higher peak performance. © The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2022. Springer Nature or its licensor holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law. |
collection_details |
GBV_USEFLAG_A SYSFLAG_A GBV_OLC SSG-OLC-TEC SSG-OLC-MAT |
container_issue |
4 |
title_short |
Scene optimization of GPU-based back-projection algorithm |
url |
https://doi.org/10.1007/s11227-022-04785-w |
remote_bool |
false |
author2 |
Liu, Ying Chen, Xiaoying Wang, Cheng |
author2Str |
Liu, Ying Chen, Xiaoying Wang, Cheng |
ppnlink |
13046466X |
mediatype_str_mv |
n |
isOA_txt |
false |
hochschulschrift_bool |
false |
doi_str |
10.1007/s11227-022-04785-w |
up_date |
2024-07-03T20:28:43.881Z |
_version_ |
1803591115218092032 |
fullrecord_marcxml |
<?xml version="1.0" encoding="UTF-8"?><collection xmlns="http://www.loc.gov/MARC21/slim"><record><leader>01000naa a22002652 4500</leader><controlfield tag="001">OLC2133608893</controlfield><controlfield tag="003">DE-627</controlfield><controlfield tag="005">20230506152103.0</controlfield><controlfield tag="007">tu</controlfield><controlfield tag="008">230506s2022 xx ||||| 00| ||eng c</controlfield><datafield tag="024" ind1="7" ind2=" "><subfield code="a">10.1007/s11227-022-04785-w</subfield><subfield code="2">doi</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(DE-627)OLC2133608893</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(DE-He213)s11227-022-04785-w-p</subfield></datafield><datafield tag="040" ind1=" " ind2=" "><subfield code="a">DE-627</subfield><subfield code="b">ger</subfield><subfield code="c">DE-627</subfield><subfield code="e">rakwb</subfield></datafield><datafield tag="041" ind1=" " ind2=" "><subfield code="a">eng</subfield></datafield><datafield tag="082" ind1="0" ind2="4"><subfield code="a">004</subfield><subfield code="a">620</subfield><subfield code="q">VZ</subfield></datafield><datafield tag="100" ind1="1" ind2=" "><subfield code="a">Gong, Hao</subfield><subfield code="e">verfasserin</subfield><subfield code="4">aut</subfield></datafield><datafield tag="245" ind1="1" ind2="0"><subfield code="a">Scene optimization of GPU-based back-projection algorithm</subfield></datafield><datafield tag="264" ind1=" " ind2="1"><subfield code="c">2022</subfield></datafield><datafield tag="336" ind1=" " ind2=" "><subfield code="a">Text</subfield><subfield code="b">txt</subfield><subfield code="2">rdacontent</subfield></datafield><datafield tag="337" ind1=" " ind2=" "><subfield code="a">ohne Hilfsmittel zu benutzen</subfield><subfield code="b">n</subfield><subfield code="2">rdamedia</subfield></datafield><datafield tag="338" ind1=" " ind2=" "><subfield code="a">Band</subfield><subfield code="b">nc</subfield><subfield code="2">rdacarrier</subfield></datafield><datafield tag="500" ind1=" " ind2=" "><subfield code="a">© The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2022. Springer Nature or its licensor holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.</subfield></datafield><datafield tag="520" ind1=" " ind2=" "><subfield code="a">Abstract The back-projection algorithm can adapt to synthetic aperture radar imaging in any mode because of its time-domain imaging characteristics, so it has received increasing attention in scientific research and engineering practice. However, the large enormous of computation of the back-projection algorithm limits its application. Since the back-projection algorithm has a high degree of data parallelism, it is very suitable for GPU parallel processing. The data scale and hardware environment in different application scenarios will result in different bottlenecks which may prevent the back-projection algorithm from performing optimally. Thus, this paper proposes a series of strategies to aid the back-projection algorithm in achieving the best peak performance with any given problem scale on any given platform. A heuristic block strategy is proposed to optimize the peak performance of the back-projection algorithm on servers and miniaturized GPU devices, which can handle the differences in hardware platforms as well as the differences in data scales. To optimize the peak performance of the back-projection algorithm on miniaturized GPU devices, a memory management strategy is proposed by using unified memory and pinned host memory. The method proposed in this paper has achieved an acceleration ratio close to the device’s peak performance on servers and miniaturized GPU devices. By using the proposed methods, any algorithm with data-independent chunking and any GPU device with a uniform memory architecture can achieve higher peak performance.</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Back-projection</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">GPU</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Heuristic block strategy</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Pinned host memory</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Unified memory</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">Liu, Ying</subfield><subfield code="4">aut</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">Chen, Xiaoying</subfield><subfield code="4">aut</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">Wang, Cheng</subfield><subfield code="4">aut</subfield></datafield><datafield tag="773" ind1="0" ind2="8"><subfield code="i">Enthalten in</subfield><subfield code="t">The journal of supercomputing</subfield><subfield code="d">Springer US, 1987</subfield><subfield code="g">79(2022), 4 vom: 22. Sept., Seite 4192-4214</subfield><subfield code="w">(DE-627)13046466X</subfield><subfield code="w">(DE-600)740510-8</subfield><subfield code="w">(DE-576)018667775</subfield><subfield code="x">0920-8542</subfield><subfield code="7">nnns</subfield></datafield><datafield tag="773" ind1="1" ind2="8"><subfield code="g">volume:79</subfield><subfield code="g">year:2022</subfield><subfield code="g">number:4</subfield><subfield code="g">day:22</subfield><subfield code="g">month:09</subfield><subfield code="g">pages:4192-4214</subfield></datafield><datafield tag="856" ind1="4" ind2="1"><subfield code="u">https://doi.org/10.1007/s11227-022-04785-w</subfield><subfield code="z">lizenzpflichtig</subfield><subfield code="3">Volltext</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_USEFLAG_A</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">SYSFLAG_A</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_OLC</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">SSG-OLC-TEC</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">SSG-OLC-MAT</subfield></datafield><datafield tag="951" ind1=" " ind2=" "><subfield code="a">AR</subfield></datafield><datafield tag="952" ind1=" " ind2=" "><subfield code="d">79</subfield><subfield code="j">2022</subfield><subfield code="e">4</subfield><subfield code="b">22</subfield><subfield code="c">09</subfield><subfield code="h">4192-4214</subfield></datafield></record></collection>
|
score |
7.3974237 |