Scene optimization of GPU-based back-projection algorithm

Abstract The back-projection algorithm can adapt to synthetic aperture radar imaging in any mode because of its time-domain imaging characteristics, so it has received increasing attention in scientific research and engineering practice. However, the large enormous of computation of the back-project...
Ausführliche Beschreibung

Gespeichert in:

Autor*in:	Gong, Hao [verfasserIn] Liu, Ying Chen, Xiaoying Wang, Cheng

Format:	Artikel
Sprache:	Englisch

Erschienen:	2022

Schlagwörter:	Back-projection GPU Heuristic block strategy Pinned host memory Unified memory

Anmerkung:	© The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2022. Springer Nature or its licensor holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Übergeordnetes Werk:	Enthalten in: The journal of supercomputing - Springer US, 1987, 79(2022), 4 vom: 22. Sept., Seite 4192-4214
Übergeordnetes Werk:	volume:79 ; year:2022 ; number:4 ; day:22 ; month:09 ; pages:4192-4214

Links:	Volltext

DOI / URN:	10.1007/s11227-022-04785-w

Katalog-ID:	OLC2133608893

Internformat


LEADER	01000naa a22002652 4500
001	OLC2133608893
003	DE-627
005	20230506152103.0
007	tu
008	230506s2022 xx \|\|\|\|\| 00\| \|\|eng c
024	7		\|a 10.1007/s11227-022-04785-w \|2 doi
035			\|a (DE-627)OLC2133608893
035			\|a (DE-He213)s11227-022-04785-w-p
040			\|a DE-627 \|b ger \|c DE-627 \|e rakwb
041			\|a eng
082	0	4	\|a 004 \|a 620 \|q VZ
100	1		\|a Gong, Hao \|e verfasserin \|4 aut
245	1	0	\|a Scene optimization of GPU-based back-projection algorithm
264		1	\|c 2022
336			\|a Text \|b txt \|2 rdacontent
337			\|a ohne Hilfsmittel zu benutzen \|b n \|2 rdamedia
338			\|a Band \|b nc \|2 rdacarrier
500			\|a © The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2022. Springer Nature or its licensor holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
520			\|a Abstract The back-projection algorithm can adapt to synthetic aperture radar imaging in any mode because of its time-domain imaging characteristics, so it has received increasing attention in scientific research and engineering practice. However, the large enormous of computation of the back-projection algorithm limits its application. Since the back-projection algorithm has a high degree of data parallelism, it is very suitable for GPU parallel processing. The data scale and hardware environment in different application scenarios will result in different bottlenecks which may prevent the back-projection algorithm from performing optimally. Thus, this paper proposes a series of strategies to aid the back-projection algorithm in achieving the best peak performance with any given problem scale on any given platform. A heuristic block strategy is proposed to optimize the peak performance of the back-projection algorithm on servers and miniaturized GPU devices, which can handle the differences in hardware platforms as well as the differences in data scales. To optimize the peak performance of the back-projection algorithm on miniaturized GPU devices, a memory management strategy is proposed by using unified memory and pinned host memory. The method proposed in this paper has achieved an acceleration ratio close to the device’s peak performance on servers and miniaturized GPU devices. By using the proposed methods, any algorithm with data-independent chunking and any GPU device with a uniform memory architecture can achieve higher peak performance.
650		4	\|a Back-projection
650		4	\|a GPU
650		4	\|a Heuristic block strategy
650		4	\|a Pinned host memory
650		4	\|a Unified memory
700	1		\|a Liu, Ying \|4 aut
700	1		\|a Chen, Xiaoying \|4 aut
700	1		\|a Wang, Cheng \|4 aut
773	0	8	\|i Enthalten in \|t The journal of supercomputing \|d Springer US, 1987 \|g 79(2022), 4 vom: 22. Sept., Seite 4192-4214 \|w (DE-627)13046466X \|w (DE-600)740510-8 \|w (DE-576)018667775 \|x 0920-8542 \|7 nnns
773	1	8	\|g volume:79 \|g year:2022 \|g number:4 \|g day:22 \|g month:09 \|g pages:4192-4214
856	4	1	\|u https://doi.org/10.1007/s11227-022-04785-w \|z lizenzpflichtig \|3 Volltext
912			\|a GBV_USEFLAG_A
912			\|a SYSFLAG_A
912			\|a GBV_OLC
912			\|a SSG-OLC-TEC
912			\|a SSG-OLC-MAT
951			\|a AR
952			\|d 79 \|j 2022 \|e 4 \|b 22 \|c 09 \|h 4192-4214

Indexfelder

author_variant	h g hg y l yl x c xc c w cw
matchkey_str	article:09208542:2022----::cnotmztooguaebcpoe
hierarchy_sort_str	2022
publishDate	2022
allfields	10.1007/s11227-022-04785-w doi (DE-627)OLC2133608893 (DE-He213)s11227-022-04785-w-p DE-627 ger DE-627 rakwb eng 004 620 VZ Gong, Hao verfasserin aut Scene optimization of GPU-based back-projection algorithm 2022 Text txt rdacontent ohne Hilfsmittel zu benutzen n rdamedia Band nc rdacarrier © The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2022. Springer Nature or its licensor holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law. Abstract The back-projection algorithm can adapt to synthetic aperture radar imaging in any mode because of its time-domain imaging characteristics, so it has received increasing attention in scientific research and engineering practice. However, the large enormous of computation of the back-projection algorithm limits its application. Since the back-projection algorithm has a high degree of data parallelism, it is very suitable for GPU parallel processing. The data scale and hardware environment in different application scenarios will result in different bottlenecks which may prevent the back-projection algorithm from performing optimally. Thus, this paper proposes a series of strategies to aid the back-projection algorithm in achieving the best peak performance with any given problem scale on any given platform. A heuristic block strategy is proposed to optimize the peak performance of the back-projection algorithm on servers and miniaturized GPU devices, which can handle the differences in hardware platforms as well as the differences in data scales. To optimize the peak performance of the back-projection algorithm on miniaturized GPU devices, a memory management strategy is proposed by using unified memory and pinned host memory. The method proposed in this paper has achieved an acceleration ratio close to the device’s peak performance on servers and miniaturized GPU devices. By using the proposed methods, any algorithm with data-independent chunking and any GPU device with a uniform memory architecture can achieve higher peak performance. Back-projection GPU Heuristic block strategy Pinned host memory Unified memory Liu, Ying aut Chen, Xiaoying aut Wang, Cheng aut Enthalten in The journal of supercomputing Springer US, 1987 79(2022), 4 vom: 22. Sept., Seite 4192-4214 (DE-627)13046466X (DE-600)740510-8 (DE-576)018667775 0920-8542 nnns volume:79 year:2022 number:4 day:22 month:09 pages:4192-4214 https://doi.org/10.1007/s11227-022-04785-w lizenzpflichtig Volltext GBV_USEFLAG_A SYSFLAG_A GBV_OLC SSG-OLC-TEC SSG-OLC-MAT AR 79 2022 4 22 09 4192-4214
spelling	10.1007/s11227-022-04785-w doi (DE-627)OLC2133608893 (DE-He213)s11227-022-04785-w-p DE-627 ger DE-627 rakwb eng 004 620 VZ Gong, Hao verfasserin aut Scene optimization of GPU-based back-projection algorithm 2022 Text txt rdacontent ohne Hilfsmittel zu benutzen n rdamedia Band nc rdacarrier © The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2022. Springer Nature or its licensor holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law. Abstract The back-projection algorithm can adapt to synthetic aperture radar imaging in any mode because of its time-domain imaging characteristics, so it has received increasing attention in scientific research and engineering practice. However, the large enormous of computation of the back-projection algorithm limits its application. Since the back-projection algorithm has a high degree of data parallelism, it is very suitable for GPU parallel processing. The data scale and hardware environment in different application scenarios will result in different bottlenecks which may prevent the back-projection algorithm from performing optimally. Thus, this paper proposes a series of strategies to aid the back-projection algorithm in achieving the best peak performance with any given problem scale on any given platform. A heuristic block strategy is proposed to optimize the peak performance of the back-projection algorithm on servers and miniaturized GPU devices, which can handle the differences in hardware platforms as well as the differences in data scales. To optimize the peak performance of the back-projection algorithm on miniaturized GPU devices, a memory management strategy is proposed by using unified memory and pinned host memory. The method proposed in this paper has achieved an acceleration ratio close to the device’s peak performance on servers and miniaturized GPU devices. By using the proposed methods, any algorithm with data-independent chunking and any GPU device with a uniform memory architecture can achieve higher peak performance. Back-projection GPU Heuristic block strategy Pinned host memory Unified memory Liu, Ying aut Chen, Xiaoying aut Wang, Cheng aut Enthalten in The journal of supercomputing Springer US, 1987 79(2022), 4 vom: 22. Sept., Seite 4192-4214 (DE-627)13046466X (DE-600)740510-8 (DE-576)018667775 0920-8542 nnns volume:79 year:2022 number:4 day:22 month:09 pages:4192-4214 https://doi.org/10.1007/s11227-022-04785-w lizenzpflichtig Volltext GBV_USEFLAG_A SYSFLAG_A GBV_OLC SSG-OLC-TEC SSG-OLC-MAT AR 79 2022 4 22 09 4192-4214
allfields_unstemmed	10.1007/s11227-022-04785-w doi (DE-627)OLC2133608893 (DE-He213)s11227-022-04785-w-p DE-627 ger DE-627 rakwb eng 004 620 VZ Gong, Hao verfasserin aut Scene optimization of GPU-based back-projection algorithm 2022 Text txt rdacontent ohne Hilfsmittel zu benutzen n rdamedia Band nc rdacarrier © The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2022. Springer Nature or its licensor holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law. Abstract The back-projection algorithm can adapt to synthetic aperture radar imaging in any mode because of its time-domain imaging characteristics, so it has received increasing attention in scientific research and engineering practice. However, the large enormous of computation of the back-projection algorithm limits its application. Since the back-projection algorithm has a high degree of data parallelism, it is very suitable for GPU parallel processing. The data scale and hardware environment in different application scenarios will result in different bottlenecks which may prevent the back-projection algorithm from performing optimally. Thus, this paper proposes a series of strategies to aid the back-projection algorithm in achieving the best peak performance with any given problem scale on any given platform. A heuristic block strategy is proposed to optimize the peak performance of the back-projection algorithm on servers and miniaturized GPU devices, which can handle the differences in hardware platforms as well as the differences in data scales. To optimize the peak performance of the back-projection algorithm on miniaturized GPU devices, a memory management strategy is proposed by using unified memory and pinned host memory. The method proposed in this paper has achieved an acceleration ratio close to the device’s peak performance on servers and miniaturized GPU devices. By using the proposed methods, any algorithm with data-independent chunking and any GPU device with a uniform memory architecture can achieve higher peak performance. Back-projection GPU Heuristic block strategy Pinned host memory Unified memory Liu, Ying aut Chen, Xiaoying aut Wang, Cheng aut Enthalten in The journal of supercomputing Springer US, 1987 79(2022), 4 vom: 22. Sept., Seite 4192-4214 (DE-627)13046466X (DE-600)740510-8 (DE-576)018667775 0920-8542 nnns volume:79 year:2022 number:4 day:22 month:09 pages:4192-4214 https://doi.org/10.1007/s11227-022-04785-w lizenzpflichtig Volltext GBV_USEFLAG_A SYSFLAG_A GBV_OLC SSG-OLC-TEC SSG-OLC-MAT AR 79 2022 4 22 09 4192-4214
allfieldsGer	10.1007/s11227-022-04785-w doi (DE-627)OLC2133608893 (DE-He213)s11227-022-04785-w-p DE-627 ger DE-627 rakwb eng 004 620 VZ Gong, Hao verfasserin aut Scene optimization of GPU-based back-projection algorithm 2022 Text txt rdacontent ohne Hilfsmittel zu benutzen n rdamedia Band nc rdacarrier © The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2022. Springer Nature or its licensor holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law. Abstract The back-projection algorithm can adapt to synthetic aperture radar imaging in any mode because of its time-domain imaging characteristics, so it has received increasing attention in scientific research and engineering practice. However, the large enormous of computation of the back-projection algorithm limits its application. Since the back-projection algorithm has a high degree of data parallelism, it is very suitable for GPU parallel processing. The data scale and hardware environment in different application scenarios will result in different bottlenecks which may prevent the back-projection algorithm from performing optimally. Thus, this paper proposes a series of strategies to aid the back-projection algorithm in achieving the best peak performance with any given problem scale on any given platform. A heuristic block strategy is proposed to optimize the peak performance of the back-projection algorithm on servers and miniaturized GPU devices, which can handle the differences in hardware platforms as well as the differences in data scales. To optimize the peak performance of the back-projection algorithm on miniaturized GPU devices, a memory management strategy is proposed by using unified memory and pinned host memory. The method proposed in this paper has achieved an acceleration ratio close to the device’s peak performance on servers and miniaturized GPU devices. By using the proposed methods, any algorithm with data-independent chunking and any GPU device with a uniform memory architecture can achieve higher peak performance. Back-projection GPU Heuristic block strategy Pinned host memory Unified memory Liu, Ying aut Chen, Xiaoying aut Wang, Cheng aut Enthalten in The journal of supercomputing Springer US, 1987 79(2022), 4 vom: 22. Sept., Seite 4192-4214 (DE-627)13046466X (DE-600)740510-8 (DE-576)018667775 0920-8542 nnns volume:79 year:2022 number:4 day:22 month:09 pages:4192-4214 https://doi.org/10.1007/s11227-022-04785-w lizenzpflichtig Volltext GBV_USEFLAG_A SYSFLAG_A GBV_OLC SSG-OLC-TEC SSG-OLC-MAT AR 79 2022 4 22 09 4192-4214
allfieldsSound	10.1007/s11227-022-04785-w doi (DE-627)OLC2133608893 (DE-He213)s11227-022-04785-w-p DE-627 ger DE-627 rakwb eng 004 620 VZ Gong, Hao verfasserin aut Scene optimization of GPU-based back-projection algorithm 2022 Text txt rdacontent ohne Hilfsmittel zu benutzen n rdamedia Band nc rdacarrier © The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2022. Springer Nature or its licensor holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law. Abstract The back-projection algorithm can adapt to synthetic aperture radar imaging in any mode because of its time-domain imaging characteristics, so it has received increasing attention in scientific research and engineering practice. However, the large enormous of computation of the back-projection algorithm limits its application. Since the back-projection algorithm has a high degree of data parallelism, it is very suitable for GPU parallel processing. The data scale and hardware environment in different application scenarios will result in different bottlenecks which may prevent the back-projection algorithm from performing optimally. Thus, this paper proposes a series of strategies to aid the back-projection algorithm in achieving the best peak performance with any given problem scale on any given platform. A heuristic block strategy is proposed to optimize the peak performance of the back-projection algorithm on servers and miniaturized GPU devices, which can handle the differences in hardware platforms as well as the differences in data scales. To optimize the peak performance of the back-projection algorithm on miniaturized GPU devices, a memory management strategy is proposed by using unified memory and pinned host memory. The method proposed in this paper has achieved an acceleration ratio close to the device’s peak performance on servers and miniaturized GPU devices. By using the proposed methods, any algorithm with data-independent chunking and any GPU device with a uniform memory architecture can achieve higher peak performance. Back-projection GPU Heuristic block strategy Pinned host memory Unified memory Liu, Ying aut Chen, Xiaoying aut Wang, Cheng aut Enthalten in The journal of supercomputing Springer US, 1987 79(2022), 4 vom: 22. Sept., Seite 4192-4214 (DE-627)13046466X (DE-600)740510-8 (DE-576)018667775 0920-8542 nnns volume:79 year:2022 number:4 day:22 month:09 pages:4192-4214 https://doi.org/10.1007/s11227-022-04785-w lizenzpflichtig Volltext GBV_USEFLAG_A SYSFLAG_A GBV_OLC SSG-OLC-TEC SSG-OLC-MAT AR 79 2022 4 22 09 4192-4214
language	English
source	Enthalten in The journal of supercomputing 79(2022), 4 vom: 22. Sept., Seite 4192-4214 volume:79 year:2022 number:4 day:22 month:09 pages:4192-4214
sourceStr	Enthalten in The journal of supercomputing 79(2022), 4 vom: 22. Sept., Seite 4192-4214 volume:79 year:2022 number:4 day:22 month:09 pages:4192-4214
format_phy_str_mv	Article
institution	findex.gbv.de
topic_facet	Back-projection GPU Heuristic block strategy Pinned host memory Unified memory
dewey-raw	004
isfreeaccess_bool	false
container_title	The journal of supercomputing
authorswithroles_txt_mv	Gong, Hao @@aut@@ Liu, Ying @@aut@@ Chen, Xiaoying @@aut@@ Wang, Cheng @@aut@@
publishDateDaySort_date	2022-09-22T00:00:00Z
hierarchy_top_id	13046466X
dewey-sort	14
id	OLC2133608893
language_de	englisch
fullrecord	<?xml version="1.0" encoding="UTF-8"?><collection xmlns="http://www.loc.gov/MARC21/slim"><record><leader>01000naa a22002652 4500</leader><controlfield tag="001">OLC2133608893</controlfield><controlfield tag="003">DE-627</controlfield><controlfield tag="005">20230506152103.0</controlfield><controlfield tag="007">tu</controlfield><controlfield tag="008">230506s2022 xx \|\|\|\|\| 00\| \|\|eng c</controlfield><datafield tag="024" ind1="7" ind2=" "><subfield code="a">10.1007/s11227-022-04785-w</subfield><subfield code="2">doi</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(DE-627)OLC2133608893</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(DE-He213)s11227-022-04785-w-p</subfield></datafield><datafield tag="040" ind1=" " ind2=" "><subfield code="a">DE-627</subfield><subfield code="b">ger</subfield><subfield code="c">DE-627</subfield><subfield code="e">rakwb</subfield></datafield><datafield tag="041" ind1=" " ind2=" "><subfield code="a">eng</subfield></datafield><datafield tag="082" ind1="0" ind2="4"><subfield code="a">004</subfield><subfield code="a">620</subfield><subfield code="q">VZ</subfield></datafield><datafield tag="100" ind1="1" ind2=" "><subfield code="a">Gong, Hao</subfield><subfield code="e">verfasserin</subfield><subfield code="4">aut</subfield></datafield><datafield tag="245" ind1="1" ind2="0"><subfield code="a">Scene optimization of GPU-based back-projection algorithm</subfield></datafield><datafield tag="264" ind1=" " ind2="1"><subfield code="c">2022</subfield></datafield><datafield tag="336" ind1=" " ind2=" "><subfield code="a">Text</subfield><subfield code="b">txt</subfield><subfield code="2">rdacontent</subfield></datafield><datafield tag="337" ind1=" " ind2=" "><subfield code="a">ohne Hilfsmittel zu benutzen</subfield><subfield code="b">n</subfield><subfield code="2">rdamedia</subfield></datafield><datafield tag="338" ind1=" " ind2=" "><subfield code="a">Band</subfield><subfield code="b">nc</subfield><subfield code="2">rdacarrier</subfield></datafield><datafield tag="500" ind1=" " ind2=" "><subfield code="a">© The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2022. Springer Nature or its licensor holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.</subfield></datafield><datafield tag="520" ind1=" " ind2=" "><subfield code="a">Abstract The back-projection algorithm can adapt to synthetic aperture radar imaging in any mode because of its time-domain imaging characteristics, so it has received increasing attention in scientific research and engineering practice. However, the large enormous of computation of the back-projection algorithm limits its application. Since the back-projection algorithm has a high degree of data parallelism, it is very suitable for GPU parallel processing. The data scale and hardware environment in different application scenarios will result in different bottlenecks which may prevent the back-projection algorithm from performing optimally. Thus, this paper proposes a series of strategies to aid the back-projection algorithm in achieving the best peak performance with any given problem scale on any given platform. A heuristic block strategy is proposed to optimize the peak performance of the back-projection algorithm on servers and miniaturized GPU devices, which can handle the differences in hardware platforms as well as the differences in data scales. To optimize the peak performance of the back-projection algorithm on miniaturized GPU devices, a memory management strategy is proposed by using unified memory and pinned host memory. The method proposed in this paper has achieved an acceleration ratio close to the device’s peak performance on servers and miniaturized GPU devices. By using the proposed methods, any algorithm with data-independent chunking and any GPU device with a uniform memory architecture can achieve higher peak performance.</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Back-projection</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">GPU</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Heuristic block strategy</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Pinned host memory</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Unified memory</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">Liu, Ying</subfield><subfield code="4">aut</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">Chen, Xiaoying</subfield><subfield code="4">aut</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">Wang, Cheng</subfield><subfield code="4">aut</subfield></datafield><datafield tag="773" ind1="0" ind2="8"><subfield code="i">Enthalten in</subfield><subfield code="t">The journal of supercomputing</subfield><subfield code="d">Springer US, 1987</subfield><subfield code="g">79(2022), 4 vom: 22. Sept., Seite 4192-4214</subfield><subfield code="w">(DE-627)13046466X</subfield><subfield code="w">(DE-600)740510-8</subfield><subfield code="w">(DE-576)018667775</subfield><subfield code="x">0920-8542</subfield><subfield code="7">nnns</subfield></datafield><datafield tag="773" ind1="1" ind2="8"><subfield code="g">volume:79</subfield><subfield code="g">year:2022</subfield><subfield code="g">number:4</subfield><subfield code="g">day:22</subfield><subfield code="g">month:09</subfield><subfield code="g">pages:4192-4214</subfield></datafield><datafield tag="856" ind1="4" ind2="1"><subfield code="u">https://doi.org/10.1007/s11227-022-04785-w</subfield><subfield code="z">lizenzpflichtig</subfield><subfield code="3">Volltext</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_USEFLAG_A</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">SYSFLAG_A</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_OLC</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">SSG-OLC-TEC</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">SSG-OLC-MAT</subfield></datafield><datafield tag="951" ind1=" " ind2=" "><subfield code="a">AR</subfield></datafield><datafield tag="952" ind1=" " ind2=" "><subfield code="d">79</subfield><subfield code="j">2022</subfield><subfield code="e">4</subfield><subfield code="b">22</subfield><subfield code="c">09</subfield><subfield code="h">4192-4214</subfield></datafield></record></collection>
author	Gong, Hao
spellingShingle	Gong, Hao ddc 004 misc Back-projection misc GPU misc Heuristic block strategy misc Pinned host memory misc Unified memory Scene optimization of GPU-based back-projection algorithm
authorStr	Gong, Hao
ppnlink_with_tag_str_mv	@@773@@(DE-627)13046466X
format	Article
dewey-ones	004 - Data processing & computer science 620 - Engineering & allied operations
delete_txt_mv	keep
author_role	aut aut aut aut
collection	OLC
remote_str	false
illustrated	Not Illustrated
issn	0920-8542
topic_title	004 620 VZ Scene optimization of GPU-based back-projection algorithm Back-projection GPU Heuristic block strategy Pinned host memory Unified memory
topic	ddc 004 misc Back-projection misc GPU misc Heuristic block strategy misc Pinned host memory misc Unified memory
topic_unstemmed	ddc 004 misc Back-projection misc GPU misc Heuristic block strategy misc Pinned host memory misc Unified memory
topic_browse	ddc 004 misc Back-projection misc GPU misc Heuristic block strategy misc Pinned host memory misc Unified memory
format_facet	Aufsätze Gedruckte Aufsätze
format_main_str_mv	Text Zeitschrift/Artikel
carriertype_str_mv	nc
hierarchy_parent_title	The journal of supercomputing
hierarchy_parent_id	13046466X
dewey-tens	000 - Computer science, knowledge & systems 620 - Engineering
hierarchy_top_title	The journal of supercomputing
isfreeaccess_txt	false
familylinks_str_mv	(DE-627)13046466X (DE-600)740510-8 (DE-576)018667775
title	Scene optimization of GPU-based back-projection algorithm
ctrlnum	(DE-627)OLC2133608893 (DE-He213)s11227-022-04785-w-p
title_full	Scene optimization of GPU-based back-projection algorithm
author_sort	Gong, Hao
journal	The journal of supercomputing
journalStr	The journal of supercomputing
lang_code	eng
isOA_bool	false
dewey-hundreds	000 - Computer science, information & general works 600 - Technology
recordtype	marc
publishDateSort	2022
contenttype_str_mv	txt
container_start_page	4192
author_browse	Gong, Hao Liu, Ying Chen, Xiaoying Wang, Cheng
container_volume	79
class	004 620 VZ
format_se	Aufsätze
author-letter	Gong, Hao
doi_str_mv	10.1007/s11227-022-04785-w
dewey-full	004 620
title_sort	scene optimization of gpu-based back-projection algorithm
title_auth	Scene optimization of GPU-based back-projection algorithm
abstract	Abstract The back-projection algorithm can adapt to synthetic aperture radar imaging in any mode because of its time-domain imaging characteristics, so it has received increasing attention in scientific research and engineering practice. However, the large enormous of computation of the back-projection algorithm limits its application. Since the back-projection algorithm has a high degree of data parallelism, it is very suitable for GPU parallel processing. The data scale and hardware environment in different application scenarios will result in different bottlenecks which may prevent the back-projection algorithm from performing optimally. Thus, this paper proposes a series of strategies to aid the back-projection algorithm in achieving the best peak performance with any given problem scale on any given platform. A heuristic block strategy is proposed to optimize the peak performance of the back-projection algorithm on servers and miniaturized GPU devices, which can handle the differences in hardware platforms as well as the differences in data scales. To optimize the peak performance of the back-projection algorithm on miniaturized GPU devices, a memory management strategy is proposed by using unified memory and pinned host memory. The method proposed in this paper has achieved an acceleration ratio close to the device’s peak performance on servers and miniaturized GPU devices. By using the proposed methods, any algorithm with data-independent chunking and any GPU device with a uniform memory architecture can achieve higher peak performance. © The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2022. Springer Nature or its licensor holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
abstractGer	Abstract The back-projection algorithm can adapt to synthetic aperture radar imaging in any mode because of its time-domain imaging characteristics, so it has received increasing attention in scientific research and engineering practice. However, the large enormous of computation of the back-projection algorithm limits its application. Since the back-projection algorithm has a high degree of data parallelism, it is very suitable for GPU parallel processing. The data scale and hardware environment in different application scenarios will result in different bottlenecks which may prevent the back-projection algorithm from performing optimally. Thus, this paper proposes a series of strategies to aid the back-projection algorithm in achieving the best peak performance with any given problem scale on any given platform. A heuristic block strategy is proposed to optimize the peak performance of the back-projection algorithm on servers and miniaturized GPU devices, which can handle the differences in hardware platforms as well as the differences in data scales. To optimize the peak performance of the back-projection algorithm on miniaturized GPU devices, a memory management strategy is proposed by using unified memory and pinned host memory. The method proposed in this paper has achieved an acceleration ratio close to the device’s peak performance on servers and miniaturized GPU devices. By using the proposed methods, any algorithm with data-independent chunking and any GPU device with a uniform memory architecture can achieve higher peak performance. © The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2022. Springer Nature or its licensor holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
abstract_unstemmed	Abstract The back-projection algorithm can adapt to synthetic aperture radar imaging in any mode because of its time-domain imaging characteristics, so it has received increasing attention in scientific research and engineering practice. However, the large enormous of computation of the back-projection algorithm limits its application. Since the back-projection algorithm has a high degree of data parallelism, it is very suitable for GPU parallel processing. The data scale and hardware environment in different application scenarios will result in different bottlenecks which may prevent the back-projection algorithm from performing optimally. Thus, this paper proposes a series of strategies to aid the back-projection algorithm in achieving the best peak performance with any given problem scale on any given platform. A heuristic block strategy is proposed to optimize the peak performance of the back-projection algorithm on servers and miniaturized GPU devices, which can handle the differences in hardware platforms as well as the differences in data scales. To optimize the peak performance of the back-projection algorithm on miniaturized GPU devices, a memory management strategy is proposed by using unified memory and pinned host memory. The method proposed in this paper has achieved an acceleration ratio close to the device’s peak performance on servers and miniaturized GPU devices. By using the proposed methods, any algorithm with data-independent chunking and any GPU device with a uniform memory architecture can achieve higher peak performance. © The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2022. Springer Nature or its licensor holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
collection_details	GBV_USEFLAG_A SYSFLAG_A GBV_OLC SSG-OLC-TEC SSG-OLC-MAT
container_issue	4
title_short	Scene optimization of GPU-based back-projection algorithm
url	https://doi.org/10.1007/s11227-022-04785-w
remote_bool	false
author2	Liu, Ying Chen, Xiaoying Wang, Cheng
author2Str	Liu, Ying Chen, Xiaoying Wang, Cheng
ppnlink	13046466X
mediatype_str_mv	n
isOA_txt	false
hochschulschrift_bool	false
doi_str	10.1007/s11227-022-04785-w
up_date	2024-07-03T20:28:43.881Z
_version_	1803591115218092032
fullrecord_marcxml	<?xml version="1.0" encoding="UTF-8"?><collection xmlns="http://www.loc.gov/MARC21/slim"><record><leader>01000naa a22002652 4500</leader><controlfield tag="001">OLC2133608893</controlfield><controlfield tag="003">DE-627</controlfield><controlfield tag="005">20230506152103.0</controlfield><controlfield tag="007">tu</controlfield><controlfield tag="008">230506s2022 xx \|\|\|\|\| 00\| \|\|eng c</controlfield><datafield tag="024" ind1="7" ind2=" "><subfield code="a">10.1007/s11227-022-04785-w</subfield><subfield code="2">doi</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(DE-627)OLC2133608893</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(DE-He213)s11227-022-04785-w-p</subfield></datafield><datafield tag="040" ind1=" " ind2=" "><subfield code="a">DE-627</subfield><subfield code="b">ger</subfield><subfield code="c">DE-627</subfield><subfield code="e">rakwb</subfield></datafield><datafield tag="041" ind1=" " ind2=" "><subfield code="a">eng</subfield></datafield><datafield tag="082" ind1="0" ind2="4"><subfield code="a">004</subfield><subfield code="a">620</subfield><subfield code="q">VZ</subfield></datafield><datafield tag="100" ind1="1" ind2=" "><subfield code="a">Gong, Hao</subfield><subfield code="e">verfasserin</subfield><subfield code="4">aut</subfield></datafield><datafield tag="245" ind1="1" ind2="0"><subfield code="a">Scene optimization of GPU-based back-projection algorithm</subfield></datafield><datafield tag="264" ind1=" " ind2="1"><subfield code="c">2022</subfield></datafield><datafield tag="336" ind1=" " ind2=" "><subfield code="a">Text</subfield><subfield code="b">txt</subfield><subfield code="2">rdacontent</subfield></datafield><datafield tag="337" ind1=" " ind2=" "><subfield code="a">ohne Hilfsmittel zu benutzen</subfield><subfield code="b">n</subfield><subfield code="2">rdamedia</subfield></datafield><datafield tag="338" ind1=" " ind2=" "><subfield code="a">Band</subfield><subfield code="b">nc</subfield><subfield code="2">rdacarrier</subfield></datafield><datafield tag="500" ind1=" " ind2=" "><subfield code="a">© The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2022. Springer Nature or its licensor holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.</subfield></datafield><datafield tag="520" ind1=" " ind2=" "><subfield code="a">Abstract The back-projection algorithm can adapt to synthetic aperture radar imaging in any mode because of its time-domain imaging characteristics, so it has received increasing attention in scientific research and engineering practice. However, the large enormous of computation of the back-projection algorithm limits its application. Since the back-projection algorithm has a high degree of data parallelism, it is very suitable for GPU parallel processing. The data scale and hardware environment in different application scenarios will result in different bottlenecks which may prevent the back-projection algorithm from performing optimally. Thus, this paper proposes a series of strategies to aid the back-projection algorithm in achieving the best peak performance with any given problem scale on any given platform. A heuristic block strategy is proposed to optimize the peak performance of the back-projection algorithm on servers and miniaturized GPU devices, which can handle the differences in hardware platforms as well as the differences in data scales. To optimize the peak performance of the back-projection algorithm on miniaturized GPU devices, a memory management strategy is proposed by using unified memory and pinned host memory. The method proposed in this paper has achieved an acceleration ratio close to the device’s peak performance on servers and miniaturized GPU devices. By using the proposed methods, any algorithm with data-independent chunking and any GPU device with a uniform memory architecture can achieve higher peak performance.</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Back-projection</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">GPU</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Heuristic block strategy</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Pinned host memory</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Unified memory</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">Liu, Ying</subfield><subfield code="4">aut</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">Chen, Xiaoying</subfield><subfield code="4">aut</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">Wang, Cheng</subfield><subfield code="4">aut</subfield></datafield><datafield tag="773" ind1="0" ind2="8"><subfield code="i">Enthalten in</subfield><subfield code="t">The journal of supercomputing</subfield><subfield code="d">Springer US, 1987</subfield><subfield code="g">79(2022), 4 vom: 22. Sept., Seite 4192-4214</subfield><subfield code="w">(DE-627)13046466X</subfield><subfield code="w">(DE-600)740510-8</subfield><subfield code="w">(DE-576)018667775</subfield><subfield code="x">0920-8542</subfield><subfield code="7">nnns</subfield></datafield><datafield tag="773" ind1="1" ind2="8"><subfield code="g">volume:79</subfield><subfield code="g">year:2022</subfield><subfield code="g">number:4</subfield><subfield code="g">day:22</subfield><subfield code="g">month:09</subfield><subfield code="g">pages:4192-4214</subfield></datafield><datafield tag="856" ind1="4" ind2="1"><subfield code="u">https://doi.org/10.1007/s11227-022-04785-w</subfield><subfield code="z">lizenzpflichtig</subfield><subfield code="3">Volltext</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_USEFLAG_A</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">SYSFLAG_A</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_OLC</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">SSG-OLC-TEC</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">SSG-OLC-MAT</subfield></datafield><datafield tag="951" ind1=" " ind2=" "><subfield code="a">AR</subfield></datafield><datafield tag="952" ind1=" " ind2=" "><subfield code="d">79</subfield><subfield code="j">2022</subfield><subfield code="e">4</subfield><subfield code="b">22</subfield><subfield code="c">09</subfield><subfield code="h">4192-4214</subfield></datafield></record></collection>
score	7.3974237

Nicht das Richtige dabei?

Schreiben Sie uns!

Scene optimization of GPU-based back-projection algorithm

Nicht das Richtige dabei?

Zugang & Verfügbarkeit

Vorhandene Bände

Nicht das Richtige dabei?