A Systolic Design Methodology with Application to Full-Search Block-Matching Architectures

Abstract We present a systematic methodology to support the design tradeoffs of array processors in several emerging issues, such as (1) high performance and high flexibility, (2) low cost, low power, (3) efficient memory usage, and (4) system-on-a-chip or the ease of system integration. This method...
Ausführliche Beschreibung

Gespeichert in:

Autor*in:	Chen, Yen-Kuang [verfasserIn] Kung, S.Y.

Format:	Artikel
Sprache:	Englisch

Erschienen:	1998

Schlagwörter:	Systolic Array Search Window Current Block Array Processor VLSI Signal Processing

Anmerkung:	© Kluwer Academic Publishers 1998

Übergeordnetes Werk:	Enthalten in: Journal of VLSI signal processing systems for signal, image and video technology - Kluwer Academic Publishers, 1989, 19(1998), 1 vom: 01. Mai, Seite 51-77
Übergeordnetes Werk:	volume:19 ; year:1998 ; number:1 ; day:01 ; month:05 ; pages:51-77

Links:	Volltext

DOI / URN:	10.1023/A:1008012332212

Katalog-ID:	OLC2062082878

Internformat


LEADER	01000caa a22002652 4500
001	OLC2062082878
003	DE-627
005	20230504072253.0
007	tu
008	200819s1998 xx \|\|\|\|\| 00\| \|\|eng c
024	7		\|a 10.1023/A:1008012332212 \|2 doi
035			\|a (DE-627)OLC2062082878
035			\|a (DE-He213)A:1008012332212-p
040			\|a DE-627 \|b ger \|c DE-627 \|e rakwb
041			\|a eng
082	0	4	\|a 620 \|q VZ
100	1		\|a Chen, Yen-Kuang \|e verfasserin \|4 aut
245	1	0	\|a A Systolic Design Methodology with Application to Full-Search Block-Matching Architectures
264		1	\|c 1998
336			\|a Text \|b txt \|2 rdacontent
337			\|a ohne Hilfsmittel zu benutzen \|b n \|2 rdamedia
338			\|a Band \|b nc \|2 rdacarrier
500			\|a © Kluwer Academic Publishers 1998
520			\|a Abstract We present a systematic methodology to support the design tradeoffs of array processors in several emerging issues, such as (1) high performance and high flexibility, (2) low cost, low power, (3) efficient memory usage, and (4) system-on-a-chip or the ease of system integration. This methodology is algebraic based, so it can cope with high-dimensional data dependence. The methodology consists of some transformation rules of data dependency graphs for facilitating flexible array designs. For example, two common partitioning approaches, LPGS and LSGP, could be unified under the methodology. It supports the design of high-speed and massively parallel processor arrays with efficient memory usage. More specifically, it leads to a novel systolic cache architecture comprising of shift registers only (cache without tags). To demonstrate how the methodology works, we have presented several systolic design examples based on the block-matching motion estimation algorithm (BMA). By multiprojecting a 4D DG of the BMA to 2D mesh, we can reconstruct several existing array processors. By multiprojecting a 6D DG of the BMA, a novel 2D systolic array can be derived that features significantly improved rates in data reusability (96%) and processor utilization (99%).
650		4	\|a Systolic Array
650		4	\|a Search Window
650		4	\|a Current Block
650		4	\|a Array Processor
650		4	\|a VLSI Signal Processing
700	1		\|a Kung, S.Y. \|4 aut
773	0	8	\|i Enthalten in \|t Journal of VLSI signal processing systems for signal, image and video technology \|d Kluwer Academic Publishers, 1989 \|g 19(1998), 1 vom: 01. Mai, Seite 51-77 \|w (DE-627)130761508 \|w (DE-600)1000618-7 \|w (DE-576)02508416X \|x 0922-5773 \|7 nnns
773	1	8	\|g volume:19 \|g year:1998 \|g number:1 \|g day:01 \|g month:05 \|g pages:51-77
856	4	1	\|u https://doi.org/10.1023/A:1008012332212 \|z lizenzpflichtig \|3 Volltext
912			\|a GBV_USEFLAG_A
912			\|a SYSFLAG_A
912			\|a GBV_OLC
912			\|a SSG-OLC-TEC
912			\|a SSG-OLC-MAT
912			\|a GBV_ILN_70
912			\|a GBV_ILN_2006
912			\|a GBV_ILN_2020
912			\|a GBV_ILN_2244
912			\|a GBV_ILN_4318
912			\|a GBV_ILN_4319
951			\|a AR
952			\|d 19 \|j 1998 \|e 1 \|b 01 \|c 05 \|h 51-77

Indexfelder

author_variant	y k c ykc s k sk
matchkey_str	article:09225773:1998----::ssoidsgmtoooyihplctotflsacbo
hierarchy_sort_str	1998
publishDate	1998
allfields	10.1023/A:1008012332212 doi (DE-627)OLC2062082878 (DE-He213)A:1008012332212-p DE-627 ger DE-627 rakwb eng 620 VZ Chen, Yen-Kuang verfasserin aut A Systolic Design Methodology with Application to Full-Search Block-Matching Architectures 1998 Text txt rdacontent ohne Hilfsmittel zu benutzen n rdamedia Band nc rdacarrier © Kluwer Academic Publishers 1998 Abstract We present a systematic methodology to support the design tradeoffs of array processors in several emerging issues, such as (1) high performance and high flexibility, (2) low cost, low power, (3) efficient memory usage, and (4) system-on-a-chip or the ease of system integration. This methodology is algebraic based, so it can cope with high-dimensional data dependence. The methodology consists of some transformation rules of data dependency graphs for facilitating flexible array designs. For example, two common partitioning approaches, LPGS and LSGP, could be unified under the methodology. It supports the design of high-speed and massively parallel processor arrays with efficient memory usage. More specifically, it leads to a novel systolic cache architecture comprising of shift registers only (cache without tags). To demonstrate how the methodology works, we have presented several systolic design examples based on the block-matching motion estimation algorithm (BMA). By multiprojecting a 4D DG of the BMA to 2D mesh, we can reconstruct several existing array processors. By multiprojecting a 6D DG of the BMA, a novel 2D systolic array can be derived that features significantly improved rates in data reusability (96%) and processor utilization (99%). Systolic Array Search Window Current Block Array Processor VLSI Signal Processing Kung, S.Y. aut Enthalten in Journal of VLSI signal processing systems for signal, image and video technology Kluwer Academic Publishers, 1989 19(1998), 1 vom: 01. Mai, Seite 51-77 (DE-627)130761508 (DE-600)1000618-7 (DE-576)02508416X 0922-5773 nnns volume:19 year:1998 number:1 day:01 month:05 pages:51-77 https://doi.org/10.1023/A:1008012332212 lizenzpflichtig Volltext GBV_USEFLAG_A SYSFLAG_A GBV_OLC SSG-OLC-TEC SSG-OLC-MAT GBV_ILN_70 GBV_ILN_2006 GBV_ILN_2020 GBV_ILN_2244 GBV_ILN_4318 GBV_ILN_4319 AR 19 1998 1 01 05 51-77
spelling	10.1023/A:1008012332212 doi (DE-627)OLC2062082878 (DE-He213)A:1008012332212-p DE-627 ger DE-627 rakwb eng 620 VZ Chen, Yen-Kuang verfasserin aut A Systolic Design Methodology with Application to Full-Search Block-Matching Architectures 1998 Text txt rdacontent ohne Hilfsmittel zu benutzen n rdamedia Band nc rdacarrier © Kluwer Academic Publishers 1998 Abstract We present a systematic methodology to support the design tradeoffs of array processors in several emerging issues, such as (1) high performance and high flexibility, (2) low cost, low power, (3) efficient memory usage, and (4) system-on-a-chip or the ease of system integration. This methodology is algebraic based, so it can cope with high-dimensional data dependence. The methodology consists of some transformation rules of data dependency graphs for facilitating flexible array designs. For example, two common partitioning approaches, LPGS and LSGP, could be unified under the methodology. It supports the design of high-speed and massively parallel processor arrays with efficient memory usage. More specifically, it leads to a novel systolic cache architecture comprising of shift registers only (cache without tags). To demonstrate how the methodology works, we have presented several systolic design examples based on the block-matching motion estimation algorithm (BMA). By multiprojecting a 4D DG of the BMA to 2D mesh, we can reconstruct several existing array processors. By multiprojecting a 6D DG of the BMA, a novel 2D systolic array can be derived that features significantly improved rates in data reusability (96%) and processor utilization (99%). Systolic Array Search Window Current Block Array Processor VLSI Signal Processing Kung, S.Y. aut Enthalten in Journal of VLSI signal processing systems for signal, image and video technology Kluwer Academic Publishers, 1989 19(1998), 1 vom: 01. Mai, Seite 51-77 (DE-627)130761508 (DE-600)1000618-7 (DE-576)02508416X 0922-5773 nnns volume:19 year:1998 number:1 day:01 month:05 pages:51-77 https://doi.org/10.1023/A:1008012332212 lizenzpflichtig Volltext GBV_USEFLAG_A SYSFLAG_A GBV_OLC SSG-OLC-TEC SSG-OLC-MAT GBV_ILN_70 GBV_ILN_2006 GBV_ILN_2020 GBV_ILN_2244 GBV_ILN_4318 GBV_ILN_4319 AR 19 1998 1 01 05 51-77
allfields_unstemmed	10.1023/A:1008012332212 doi (DE-627)OLC2062082878 (DE-He213)A:1008012332212-p DE-627 ger DE-627 rakwb eng 620 VZ Chen, Yen-Kuang verfasserin aut A Systolic Design Methodology with Application to Full-Search Block-Matching Architectures 1998 Text txt rdacontent ohne Hilfsmittel zu benutzen n rdamedia Band nc rdacarrier © Kluwer Academic Publishers 1998 Abstract We present a systematic methodology to support the design tradeoffs of array processors in several emerging issues, such as (1) high performance and high flexibility, (2) low cost, low power, (3) efficient memory usage, and (4) system-on-a-chip or the ease of system integration. This methodology is algebraic based, so it can cope with high-dimensional data dependence. The methodology consists of some transformation rules of data dependency graphs for facilitating flexible array designs. For example, two common partitioning approaches, LPGS and LSGP, could be unified under the methodology. It supports the design of high-speed and massively parallel processor arrays with efficient memory usage. More specifically, it leads to a novel systolic cache architecture comprising of shift registers only (cache without tags). To demonstrate how the methodology works, we have presented several systolic design examples based on the block-matching motion estimation algorithm (BMA). By multiprojecting a 4D DG of the BMA to 2D mesh, we can reconstruct several existing array processors. By multiprojecting a 6D DG of the BMA, a novel 2D systolic array can be derived that features significantly improved rates in data reusability (96%) and processor utilization (99%). Systolic Array Search Window Current Block Array Processor VLSI Signal Processing Kung, S.Y. aut Enthalten in Journal of VLSI signal processing systems for signal, image and video technology Kluwer Academic Publishers, 1989 19(1998), 1 vom: 01. Mai, Seite 51-77 (DE-627)130761508 (DE-600)1000618-7 (DE-576)02508416X 0922-5773 nnns volume:19 year:1998 number:1 day:01 month:05 pages:51-77 https://doi.org/10.1023/A:1008012332212 lizenzpflichtig Volltext GBV_USEFLAG_A SYSFLAG_A GBV_OLC SSG-OLC-TEC SSG-OLC-MAT GBV_ILN_70 GBV_ILN_2006 GBV_ILN_2020 GBV_ILN_2244 GBV_ILN_4318 GBV_ILN_4319 AR 19 1998 1 01 05 51-77
allfieldsGer	10.1023/A:1008012332212 doi (DE-627)OLC2062082878 (DE-He213)A:1008012332212-p DE-627 ger DE-627 rakwb eng 620 VZ Chen, Yen-Kuang verfasserin aut A Systolic Design Methodology with Application to Full-Search Block-Matching Architectures 1998 Text txt rdacontent ohne Hilfsmittel zu benutzen n rdamedia Band nc rdacarrier © Kluwer Academic Publishers 1998 Abstract We present a systematic methodology to support the design tradeoffs of array processors in several emerging issues, such as (1) high performance and high flexibility, (2) low cost, low power, (3) efficient memory usage, and (4) system-on-a-chip or the ease of system integration. This methodology is algebraic based, so it can cope with high-dimensional data dependence. The methodology consists of some transformation rules of data dependency graphs for facilitating flexible array designs. For example, two common partitioning approaches, LPGS and LSGP, could be unified under the methodology. It supports the design of high-speed and massively parallel processor arrays with efficient memory usage. More specifically, it leads to a novel systolic cache architecture comprising of shift registers only (cache without tags). To demonstrate how the methodology works, we have presented several systolic design examples based on the block-matching motion estimation algorithm (BMA). By multiprojecting a 4D DG of the BMA to 2D mesh, we can reconstruct several existing array processors. By multiprojecting a 6D DG of the BMA, a novel 2D systolic array can be derived that features significantly improved rates in data reusability (96%) and processor utilization (99%). Systolic Array Search Window Current Block Array Processor VLSI Signal Processing Kung, S.Y. aut Enthalten in Journal of VLSI signal processing systems for signal, image and video technology Kluwer Academic Publishers, 1989 19(1998), 1 vom: 01. Mai, Seite 51-77 (DE-627)130761508 (DE-600)1000618-7 (DE-576)02508416X 0922-5773 nnns volume:19 year:1998 number:1 day:01 month:05 pages:51-77 https://doi.org/10.1023/A:1008012332212 lizenzpflichtig Volltext GBV_USEFLAG_A SYSFLAG_A GBV_OLC SSG-OLC-TEC SSG-OLC-MAT GBV_ILN_70 GBV_ILN_2006 GBV_ILN_2020 GBV_ILN_2244 GBV_ILN_4318 GBV_ILN_4319 AR 19 1998 1 01 05 51-77
allfieldsSound	10.1023/A:1008012332212 doi (DE-627)OLC2062082878 (DE-He213)A:1008012332212-p DE-627 ger DE-627 rakwb eng 620 VZ Chen, Yen-Kuang verfasserin aut A Systolic Design Methodology with Application to Full-Search Block-Matching Architectures 1998 Text txt rdacontent ohne Hilfsmittel zu benutzen n rdamedia Band nc rdacarrier © Kluwer Academic Publishers 1998 Abstract We present a systematic methodology to support the design tradeoffs of array processors in several emerging issues, such as (1) high performance and high flexibility, (2) low cost, low power, (3) efficient memory usage, and (4) system-on-a-chip or the ease of system integration. This methodology is algebraic based, so it can cope with high-dimensional data dependence. The methodology consists of some transformation rules of data dependency graphs for facilitating flexible array designs. For example, two common partitioning approaches, LPGS and LSGP, could be unified under the methodology. It supports the design of high-speed and massively parallel processor arrays with efficient memory usage. More specifically, it leads to a novel systolic cache architecture comprising of shift registers only (cache without tags). To demonstrate how the methodology works, we have presented several systolic design examples based on the block-matching motion estimation algorithm (BMA). By multiprojecting a 4D DG of the BMA to 2D mesh, we can reconstruct several existing array processors. By multiprojecting a 6D DG of the BMA, a novel 2D systolic array can be derived that features significantly improved rates in data reusability (96%) and processor utilization (99%). Systolic Array Search Window Current Block Array Processor VLSI Signal Processing Kung, S.Y. aut Enthalten in Journal of VLSI signal processing systems for signal, image and video technology Kluwer Academic Publishers, 1989 19(1998), 1 vom: 01. Mai, Seite 51-77 (DE-627)130761508 (DE-600)1000618-7 (DE-576)02508416X 0922-5773 nnns volume:19 year:1998 number:1 day:01 month:05 pages:51-77 https://doi.org/10.1023/A:1008012332212 lizenzpflichtig Volltext GBV_USEFLAG_A SYSFLAG_A GBV_OLC SSG-OLC-TEC SSG-OLC-MAT GBV_ILN_70 GBV_ILN_2006 GBV_ILN_2020 GBV_ILN_2244 GBV_ILN_4318 GBV_ILN_4319 AR 19 1998 1 01 05 51-77
language	English
source	Enthalten in Journal of VLSI signal processing systems for signal, image and video technology 19(1998), 1 vom: 01. Mai, Seite 51-77 volume:19 year:1998 number:1 day:01 month:05 pages:51-77
sourceStr	Enthalten in Journal of VLSI signal processing systems for signal, image and video technology 19(1998), 1 vom: 01. Mai, Seite 51-77 volume:19 year:1998 number:1 day:01 month:05 pages:51-77
format_phy_str_mv	Article
institution	findex.gbv.de
topic_facet	Systolic Array Search Window Current Block Array Processor VLSI Signal Processing
dewey-raw	620
isfreeaccess_bool	false
container_title	Journal of VLSI signal processing systems for signal, image and video technology
authorswithroles_txt_mv	Chen, Yen-Kuang @@aut@@ Kung, S.Y. @@aut@@
publishDateDaySort_date	1998-05-01T00:00:00Z
hierarchy_top_id	130761508
dewey-sort	3620
id	OLC2062082878
language_de	englisch
fullrecord	<?xml version="1.0" encoding="UTF-8"?><collection xmlns="http://www.loc.gov/MARC21/slim"><record><leader>01000caa a22002652 4500</leader><controlfield tag="001">OLC2062082878</controlfield><controlfield tag="003">DE-627</controlfield><controlfield tag="005">20230504072253.0</controlfield><controlfield tag="007">tu</controlfield><controlfield tag="008">200819s1998 xx \|\|\|\|\| 00\| \|\|eng c</controlfield><datafield tag="024" ind1="7" ind2=" "><subfield code="a">10.1023/A:1008012332212</subfield><subfield code="2">doi</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(DE-627)OLC2062082878</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(DE-He213)A:1008012332212-p</subfield></datafield><datafield tag="040" ind1=" " ind2=" "><subfield code="a">DE-627</subfield><subfield code="b">ger</subfield><subfield code="c">DE-627</subfield><subfield code="e">rakwb</subfield></datafield><datafield tag="041" ind1=" " ind2=" "><subfield code="a">eng</subfield></datafield><datafield tag="082" ind1="0" ind2="4"><subfield code="a">620</subfield><subfield code="q">VZ</subfield></datafield><datafield tag="100" ind1="1" ind2=" "><subfield code="a">Chen, Yen-Kuang</subfield><subfield code="e">verfasserin</subfield><subfield code="4">aut</subfield></datafield><datafield tag="245" ind1="1" ind2="0"><subfield code="a">A Systolic Design Methodology with Application to Full-Search Block-Matching Architectures</subfield></datafield><datafield tag="264" ind1=" " ind2="1"><subfield code="c">1998</subfield></datafield><datafield tag="336" ind1=" " ind2=" "><subfield code="a">Text</subfield><subfield code="b">txt</subfield><subfield code="2">rdacontent</subfield></datafield><datafield tag="337" ind1=" " ind2=" "><subfield code="a">ohne Hilfsmittel zu benutzen</subfield><subfield code="b">n</subfield><subfield code="2">rdamedia</subfield></datafield><datafield tag="338" ind1=" " ind2=" "><subfield code="a">Band</subfield><subfield code="b">nc</subfield><subfield code="2">rdacarrier</subfield></datafield><datafield tag="500" ind1=" " ind2=" "><subfield code="a">© Kluwer Academic Publishers 1998</subfield></datafield><datafield tag="520" ind1=" " ind2=" "><subfield code="a">Abstract We present a systematic methodology to support the design tradeoffs of array processors in several emerging issues, such as (1) high performance and high flexibility, (2) low cost, low power, (3) efficient memory usage, and (4) system-on-a-chip or the ease of system integration. This methodology is algebraic based, so it can cope with high-dimensional data dependence. The methodology consists of some transformation rules of data dependency graphs for facilitating flexible array designs. For example, two common partitioning approaches, LPGS and LSGP, could be unified under the methodology. It supports the design of high-speed and massively parallel processor arrays with efficient memory usage. More specifically, it leads to a novel systolic cache architecture comprising of shift registers only (cache without tags). To demonstrate how the methodology works, we have presented several systolic design examples based on the block-matching motion estimation algorithm (BMA). By multiprojecting a 4D DG of the BMA to 2D mesh, we can reconstruct several existing array processors. By multiprojecting a 6D DG of the BMA, a novel 2D systolic array can be derived that features significantly improved rates in data reusability (96%) and processor utilization (99%).</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Systolic Array</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Search Window</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Current Block</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Array Processor</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">VLSI Signal Processing</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">Kung, S.Y.</subfield><subfield code="4">aut</subfield></datafield><datafield tag="773" ind1="0" ind2="8"><subfield code="i">Enthalten in</subfield><subfield code="t">Journal of VLSI signal processing systems for signal, image and video technology</subfield><subfield code="d">Kluwer Academic Publishers, 1989</subfield><subfield code="g">19(1998), 1 vom: 01. Mai, Seite 51-77</subfield><subfield code="w">(DE-627)130761508</subfield><subfield code="w">(DE-600)1000618-7</subfield><subfield code="w">(DE-576)02508416X</subfield><subfield code="x">0922-5773</subfield><subfield code="7">nnns</subfield></datafield><datafield tag="773" ind1="1" ind2="8"><subfield code="g">volume:19</subfield><subfield code="g">year:1998</subfield><subfield code="g">number:1</subfield><subfield code="g">day:01</subfield><subfield code="g">month:05</subfield><subfield code="g">pages:51-77</subfield></datafield><datafield tag="856" ind1="4" ind2="1"><subfield code="u">https://doi.org/10.1023/A:1008012332212</subfield><subfield code="z">lizenzpflichtig</subfield><subfield code="3">Volltext</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_USEFLAG_A</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">SYSFLAG_A</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_OLC</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">SSG-OLC-TEC</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">SSG-OLC-MAT</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_ILN_70</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_ILN_2006</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_ILN_2020</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_ILN_2244</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_ILN_4318</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_ILN_4319</subfield></datafield><datafield tag="951" ind1=" " ind2=" "><subfield code="a">AR</subfield></datafield><datafield tag="952" ind1=" " ind2=" "><subfield code="d">19</subfield><subfield code="j">1998</subfield><subfield code="e">1</subfield><subfield code="b">01</subfield><subfield code="c">05</subfield><subfield code="h">51-77</subfield></datafield></record></collection>
author	Chen, Yen-Kuang
spellingShingle	Chen, Yen-Kuang ddc 620 misc Systolic Array misc Search Window misc Current Block misc Array Processor misc VLSI Signal Processing A Systolic Design Methodology with Application to Full-Search Block-Matching Architectures
authorStr	Chen, Yen-Kuang
ppnlink_with_tag_str_mv	@@773@@(DE-627)130761508
format	Article
dewey-ones	620 - Engineering & allied operations
delete_txt_mv	keep
author_role	aut aut
collection	OLC
remote_str	false
illustrated	Not Illustrated
issn	0922-5773
topic_title	620 VZ A Systolic Design Methodology with Application to Full-Search Block-Matching Architectures Systolic Array Search Window Current Block Array Processor VLSI Signal Processing
topic	ddc 620 misc Systolic Array misc Search Window misc Current Block misc Array Processor misc VLSI Signal Processing
topic_unstemmed	ddc 620 misc Systolic Array misc Search Window misc Current Block misc Array Processor misc VLSI Signal Processing
topic_browse	ddc 620 misc Systolic Array misc Search Window misc Current Block misc Array Processor misc VLSI Signal Processing
format_facet	Aufsätze Gedruckte Aufsätze
format_main_str_mv	Text Zeitschrift/Artikel
carriertype_str_mv	nc
hierarchy_parent_title	Journal of VLSI signal processing systems for signal, image and video technology
hierarchy_parent_id	130761508
dewey-tens	620 - Engineering
hierarchy_top_title	Journal of VLSI signal processing systems for signal, image and video technology
isfreeaccess_txt	false
familylinks_str_mv	(DE-627)130761508 (DE-600)1000618-7 (DE-576)02508416X
title	A Systolic Design Methodology with Application to Full-Search Block-Matching Architectures
ctrlnum	(DE-627)OLC2062082878 (DE-He213)A:1008012332212-p
title_full	A Systolic Design Methodology with Application to Full-Search Block-Matching Architectures
author_sort	Chen, Yen-Kuang
journal	Journal of VLSI signal processing systems for signal, image and video technology
journalStr	Journal of VLSI signal processing systems for signal, image and video technology
lang_code	eng
isOA_bool	false
dewey-hundreds	600 - Technology
recordtype	marc
publishDateSort	1998
contenttype_str_mv	txt
container_start_page	51
author_browse	Chen, Yen-Kuang Kung, S.Y.
container_volume	19
class	620 VZ
format_se	Aufsätze
author-letter	Chen, Yen-Kuang
doi_str_mv	10.1023/A:1008012332212
dewey-full	620
title_sort	a systolic design methodology with application to full-search block-matching architectures
title_auth	A Systolic Design Methodology with Application to Full-Search Block-Matching Architectures
abstract	Abstract We present a systematic methodology to support the design tradeoffs of array processors in several emerging issues, such as (1) high performance and high flexibility, (2) low cost, low power, (3) efficient memory usage, and (4) system-on-a-chip or the ease of system integration. This methodology is algebraic based, so it can cope with high-dimensional data dependence. The methodology consists of some transformation rules of data dependency graphs for facilitating flexible array designs. For example, two common partitioning approaches, LPGS and LSGP, could be unified under the methodology. It supports the design of high-speed and massively parallel processor arrays with efficient memory usage. More specifically, it leads to a novel systolic cache architecture comprising of shift registers only (cache without tags). To demonstrate how the methodology works, we have presented several systolic design examples based on the block-matching motion estimation algorithm (BMA). By multiprojecting a 4D DG of the BMA to 2D mesh, we can reconstruct several existing array processors. By multiprojecting a 6D DG of the BMA, a novel 2D systolic array can be derived that features significantly improved rates in data reusability (96%) and processor utilization (99%). © Kluwer Academic Publishers 1998
abstractGer	Abstract We present a systematic methodology to support the design tradeoffs of array processors in several emerging issues, such as (1) high performance and high flexibility, (2) low cost, low power, (3) efficient memory usage, and (4) system-on-a-chip or the ease of system integration. This methodology is algebraic based, so it can cope with high-dimensional data dependence. The methodology consists of some transformation rules of data dependency graphs for facilitating flexible array designs. For example, two common partitioning approaches, LPGS and LSGP, could be unified under the methodology. It supports the design of high-speed and massively parallel processor arrays with efficient memory usage. More specifically, it leads to a novel systolic cache architecture comprising of shift registers only (cache without tags). To demonstrate how the methodology works, we have presented several systolic design examples based on the block-matching motion estimation algorithm (BMA). By multiprojecting a 4D DG of the BMA to 2D mesh, we can reconstruct several existing array processors. By multiprojecting a 6D DG of the BMA, a novel 2D systolic array can be derived that features significantly improved rates in data reusability (96%) and processor utilization (99%). © Kluwer Academic Publishers 1998
abstract_unstemmed	Abstract We present a systematic methodology to support the design tradeoffs of array processors in several emerging issues, such as (1) high performance and high flexibility, (2) low cost, low power, (3) efficient memory usage, and (4) system-on-a-chip or the ease of system integration. This methodology is algebraic based, so it can cope with high-dimensional data dependence. The methodology consists of some transformation rules of data dependency graphs for facilitating flexible array designs. For example, two common partitioning approaches, LPGS and LSGP, could be unified under the methodology. It supports the design of high-speed and massively parallel processor arrays with efficient memory usage. More specifically, it leads to a novel systolic cache architecture comprising of shift registers only (cache without tags). To demonstrate how the methodology works, we have presented several systolic design examples based on the block-matching motion estimation algorithm (BMA). By multiprojecting a 4D DG of the BMA to 2D mesh, we can reconstruct several existing array processors. By multiprojecting a 6D DG of the BMA, a novel 2D systolic array can be derived that features significantly improved rates in data reusability (96%) and processor utilization (99%). © Kluwer Academic Publishers 1998
collection_details	GBV_USEFLAG_A SYSFLAG_A GBV_OLC SSG-OLC-TEC SSG-OLC-MAT GBV_ILN_70 GBV_ILN_2006 GBV_ILN_2020 GBV_ILN_2244 GBV_ILN_4318 GBV_ILN_4319
container_issue	1
title_short	A Systolic Design Methodology with Application to Full-Search Block-Matching Architectures
url	https://doi.org/10.1023/A:1008012332212
remote_bool	false
author2	Kung, S.Y.
author2Str	Kung, S.Y.
ppnlink	130761508
mediatype_str_mv	n
isOA_txt	false
hochschulschrift_bool	false
doi_str	10.1023/A:1008012332212
up_date	2024-07-03T13:40:26.730Z
_version_	1803565428094533632
fullrecord_marcxml	<?xml version="1.0" encoding="UTF-8"?><collection xmlns="http://www.loc.gov/MARC21/slim"><record><leader>01000caa a22002652 4500</leader><controlfield tag="001">OLC2062082878</controlfield><controlfield tag="003">DE-627</controlfield><controlfield tag="005">20230504072253.0</controlfield><controlfield tag="007">tu</controlfield><controlfield tag="008">200819s1998 xx \|\|\|\|\| 00\| \|\|eng c</controlfield><datafield tag="024" ind1="7" ind2=" "><subfield code="a">10.1023/A:1008012332212</subfield><subfield code="2">doi</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(DE-627)OLC2062082878</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(DE-He213)A:1008012332212-p</subfield></datafield><datafield tag="040" ind1=" " ind2=" "><subfield code="a">DE-627</subfield><subfield code="b">ger</subfield><subfield code="c">DE-627</subfield><subfield code="e">rakwb</subfield></datafield><datafield tag="041" ind1=" " ind2=" "><subfield code="a">eng</subfield></datafield><datafield tag="082" ind1="0" ind2="4"><subfield code="a">620</subfield><subfield code="q">VZ</subfield></datafield><datafield tag="100" ind1="1" ind2=" "><subfield code="a">Chen, Yen-Kuang</subfield><subfield code="e">verfasserin</subfield><subfield code="4">aut</subfield></datafield><datafield tag="245" ind1="1" ind2="0"><subfield code="a">A Systolic Design Methodology with Application to Full-Search Block-Matching Architectures</subfield></datafield><datafield tag="264" ind1=" " ind2="1"><subfield code="c">1998</subfield></datafield><datafield tag="336" ind1=" " ind2=" "><subfield code="a">Text</subfield><subfield code="b">txt</subfield><subfield code="2">rdacontent</subfield></datafield><datafield tag="337" ind1=" " ind2=" "><subfield code="a">ohne Hilfsmittel zu benutzen</subfield><subfield code="b">n</subfield><subfield code="2">rdamedia</subfield></datafield><datafield tag="338" ind1=" " ind2=" "><subfield code="a">Band</subfield><subfield code="b">nc</subfield><subfield code="2">rdacarrier</subfield></datafield><datafield tag="500" ind1=" " ind2=" "><subfield code="a">© Kluwer Academic Publishers 1998</subfield></datafield><datafield tag="520" ind1=" " ind2=" "><subfield code="a">Abstract We present a systematic methodology to support the design tradeoffs of array processors in several emerging issues, such as (1) high performance and high flexibility, (2) low cost, low power, (3) efficient memory usage, and (4) system-on-a-chip or the ease of system integration. This methodology is algebraic based, so it can cope with high-dimensional data dependence. The methodology consists of some transformation rules of data dependency graphs for facilitating flexible array designs. For example, two common partitioning approaches, LPGS and LSGP, could be unified under the methodology. It supports the design of high-speed and massively parallel processor arrays with efficient memory usage. More specifically, it leads to a novel systolic cache architecture comprising of shift registers only (cache without tags). To demonstrate how the methodology works, we have presented several systolic design examples based on the block-matching motion estimation algorithm (BMA). By multiprojecting a 4D DG of the BMA to 2D mesh, we can reconstruct several existing array processors. By multiprojecting a 6D DG of the BMA, a novel 2D systolic array can be derived that features significantly improved rates in data reusability (96%) and processor utilization (99%).</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Systolic Array</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Search Window</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Current Block</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Array Processor</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">VLSI Signal Processing</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">Kung, S.Y.</subfield><subfield code="4">aut</subfield></datafield><datafield tag="773" ind1="0" ind2="8"><subfield code="i">Enthalten in</subfield><subfield code="t">Journal of VLSI signal processing systems for signal, image and video technology</subfield><subfield code="d">Kluwer Academic Publishers, 1989</subfield><subfield code="g">19(1998), 1 vom: 01. Mai, Seite 51-77</subfield><subfield code="w">(DE-627)130761508</subfield><subfield code="w">(DE-600)1000618-7</subfield><subfield code="w">(DE-576)02508416X</subfield><subfield code="x">0922-5773</subfield><subfield code="7">nnns</subfield></datafield><datafield tag="773" ind1="1" ind2="8"><subfield code="g">volume:19</subfield><subfield code="g">year:1998</subfield><subfield code="g">number:1</subfield><subfield code="g">day:01</subfield><subfield code="g">month:05</subfield><subfield code="g">pages:51-77</subfield></datafield><datafield tag="856" ind1="4" ind2="1"><subfield code="u">https://doi.org/10.1023/A:1008012332212</subfield><subfield code="z">lizenzpflichtig</subfield><subfield code="3">Volltext</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_USEFLAG_A</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">SYSFLAG_A</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_OLC</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">SSG-OLC-TEC</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">SSG-OLC-MAT</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_ILN_70</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_ILN_2006</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_ILN_2020</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_ILN_2244</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_ILN_4318</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_ILN_4319</subfield></datafield><datafield tag="951" ind1=" " ind2=" "><subfield code="a">AR</subfield></datafield><datafield tag="952" ind1=" " ind2=" "><subfield code="d">19</subfield><subfield code="j">1998</subfield><subfield code="e">1</subfield><subfield code="b">01</subfield><subfield code="c">05</subfield><subfield code="h">51-77</subfield></datafield></record></collection>
score	7.399208

Nicht das Richtige dabei?

Schreiben Sie uns!

A Systolic Design Methodology with Application to Full-Search Block-Matching Architectures

Nicht das Richtige dabei?

Zugang & Verfügbarkeit

Vorhandene Bände

Nicht das Richtige dabei?