A Systolic Design Methodology with Application to Full-Search Block-Matching Architectures
Abstract We present a systematic methodology to support the design tradeoffs of array processors in several emerging issues, such as (1) high performance and high flexibility, (2) low cost, low power, (3) efficient memory usage, and (4) system-on-a-chip or the ease of system integration. This method...
Ausführliche Beschreibung
Autor*in: |
Chen, Yen-Kuang [verfasserIn] |
---|
Format: |
Artikel |
---|---|
Sprache: |
Englisch |
Erschienen: |
1998 |
---|
Schlagwörter: |
---|
Anmerkung: |
© Kluwer Academic Publishers 1998 |
---|
Übergeordnetes Werk: |
Enthalten in: Journal of VLSI signal processing systems for signal, image and video technology - Kluwer Academic Publishers, 1989, 19(1998), 1 vom: 01. Mai, Seite 51-77 |
---|---|
Übergeordnetes Werk: |
volume:19 ; year:1998 ; number:1 ; day:01 ; month:05 ; pages:51-77 |
Links: |
---|
DOI / URN: |
10.1023/A:1008012332212 |
---|
Katalog-ID: |
OLC2062082878 |
---|
LEADER | 01000caa a22002652 4500 | ||
---|---|---|---|
001 | OLC2062082878 | ||
003 | DE-627 | ||
005 | 20230504072253.0 | ||
007 | tu | ||
008 | 200819s1998 xx ||||| 00| ||eng c | ||
024 | 7 | |a 10.1023/A:1008012332212 |2 doi | |
035 | |a (DE-627)OLC2062082878 | ||
035 | |a (DE-He213)A:1008012332212-p | ||
040 | |a DE-627 |b ger |c DE-627 |e rakwb | ||
041 | |a eng | ||
082 | 0 | 4 | |a 620 |q VZ |
100 | 1 | |a Chen, Yen-Kuang |e verfasserin |4 aut | |
245 | 1 | 0 | |a A Systolic Design Methodology with Application to Full-Search Block-Matching Architectures |
264 | 1 | |c 1998 | |
336 | |a Text |b txt |2 rdacontent | ||
337 | |a ohne Hilfsmittel zu benutzen |b n |2 rdamedia | ||
338 | |a Band |b nc |2 rdacarrier | ||
500 | |a © Kluwer Academic Publishers 1998 | ||
520 | |a Abstract We present a systematic methodology to support the design tradeoffs of array processors in several emerging issues, such as (1) high performance and high flexibility, (2) low cost, low power, (3) efficient memory usage, and (4) system-on-a-chip or the ease of system integration. This methodology is algebraic based, so it can cope with high-dimensional data dependence. The methodology consists of some transformation rules of data dependency graphs for facilitating flexible array designs. For example, two common partitioning approaches, LPGS and LSGP, could be unified under the methodology. It supports the design of high-speed and massively parallel processor arrays with efficient memory usage. More specifically, it leads to a novel systolic cache architecture comprising of shift registers only (cache without tags). To demonstrate how the methodology works, we have presented several systolic design examples based on the block-matching motion estimation algorithm (BMA). By multiprojecting a 4D DG of the BMA to 2D mesh, we can reconstruct several existing array processors. By multiprojecting a 6D DG of the BMA, a novel 2D systolic array can be derived that features significantly improved rates in data reusability (96%) and processor utilization (99%). | ||
650 | 4 | |a Systolic Array | |
650 | 4 | |a Search Window | |
650 | 4 | |a Current Block | |
650 | 4 | |a Array Processor | |
650 | 4 | |a VLSI Signal Processing | |
700 | 1 | |a Kung, S.Y. |4 aut | |
773 | 0 | 8 | |i Enthalten in |t Journal of VLSI signal processing systems for signal, image and video technology |d Kluwer Academic Publishers, 1989 |g 19(1998), 1 vom: 01. Mai, Seite 51-77 |w (DE-627)130761508 |w (DE-600)1000618-7 |w (DE-576)02508416X |x 0922-5773 |7 nnns |
773 | 1 | 8 | |g volume:19 |g year:1998 |g number:1 |g day:01 |g month:05 |g pages:51-77 |
856 | 4 | 1 | |u https://doi.org/10.1023/A:1008012332212 |z lizenzpflichtig |3 Volltext |
912 | |a GBV_USEFLAG_A | ||
912 | |a SYSFLAG_A | ||
912 | |a GBV_OLC | ||
912 | |a SSG-OLC-TEC | ||
912 | |a SSG-OLC-MAT | ||
912 | |a GBV_ILN_70 | ||
912 | |a GBV_ILN_2006 | ||
912 | |a GBV_ILN_2020 | ||
912 | |a GBV_ILN_2244 | ||
912 | |a GBV_ILN_4318 | ||
912 | |a GBV_ILN_4319 | ||
951 | |a AR | ||
952 | |d 19 |j 1998 |e 1 |b 01 |c 05 |h 51-77 |
author_variant |
y k c ykc s k sk |
---|---|
matchkey_str |
article:09225773:1998----::ssoidsgmtoooyihplctotflsacbo |
hierarchy_sort_str |
1998 |
publishDate |
1998 |
allfields |
10.1023/A:1008012332212 doi (DE-627)OLC2062082878 (DE-He213)A:1008012332212-p DE-627 ger DE-627 rakwb eng 620 VZ Chen, Yen-Kuang verfasserin aut A Systolic Design Methodology with Application to Full-Search Block-Matching Architectures 1998 Text txt rdacontent ohne Hilfsmittel zu benutzen n rdamedia Band nc rdacarrier © Kluwer Academic Publishers 1998 Abstract We present a systematic methodology to support the design tradeoffs of array processors in several emerging issues, such as (1) high performance and high flexibility, (2) low cost, low power, (3) efficient memory usage, and (4) system-on-a-chip or the ease of system integration. This methodology is algebraic based, so it can cope with high-dimensional data dependence. The methodology consists of some transformation rules of data dependency graphs for facilitating flexible array designs. For example, two common partitioning approaches, LPGS and LSGP, could be unified under the methodology. It supports the design of high-speed and massively parallel processor arrays with efficient memory usage. More specifically, it leads to a novel systolic cache architecture comprising of shift registers only (cache without tags). To demonstrate how the methodology works, we have presented several systolic design examples based on the block-matching motion estimation algorithm (BMA). By multiprojecting a 4D DG of the BMA to 2D mesh, we can reconstruct several existing array processors. By multiprojecting a 6D DG of the BMA, a novel 2D systolic array can be derived that features significantly improved rates in data reusability (96%) and processor utilization (99%). Systolic Array Search Window Current Block Array Processor VLSI Signal Processing Kung, S.Y. aut Enthalten in Journal of VLSI signal processing systems for signal, image and video technology Kluwer Academic Publishers, 1989 19(1998), 1 vom: 01. Mai, Seite 51-77 (DE-627)130761508 (DE-600)1000618-7 (DE-576)02508416X 0922-5773 nnns volume:19 year:1998 number:1 day:01 month:05 pages:51-77 https://doi.org/10.1023/A:1008012332212 lizenzpflichtig Volltext GBV_USEFLAG_A SYSFLAG_A GBV_OLC SSG-OLC-TEC SSG-OLC-MAT GBV_ILN_70 GBV_ILN_2006 GBV_ILN_2020 GBV_ILN_2244 GBV_ILN_4318 GBV_ILN_4319 AR 19 1998 1 01 05 51-77 |
spelling |
10.1023/A:1008012332212 doi (DE-627)OLC2062082878 (DE-He213)A:1008012332212-p DE-627 ger DE-627 rakwb eng 620 VZ Chen, Yen-Kuang verfasserin aut A Systolic Design Methodology with Application to Full-Search Block-Matching Architectures 1998 Text txt rdacontent ohne Hilfsmittel zu benutzen n rdamedia Band nc rdacarrier © Kluwer Academic Publishers 1998 Abstract We present a systematic methodology to support the design tradeoffs of array processors in several emerging issues, such as (1) high performance and high flexibility, (2) low cost, low power, (3) efficient memory usage, and (4) system-on-a-chip or the ease of system integration. This methodology is algebraic based, so it can cope with high-dimensional data dependence. The methodology consists of some transformation rules of data dependency graphs for facilitating flexible array designs. For example, two common partitioning approaches, LPGS and LSGP, could be unified under the methodology. It supports the design of high-speed and massively parallel processor arrays with efficient memory usage. More specifically, it leads to a novel systolic cache architecture comprising of shift registers only (cache without tags). To demonstrate how the methodology works, we have presented several systolic design examples based on the block-matching motion estimation algorithm (BMA). By multiprojecting a 4D DG of the BMA to 2D mesh, we can reconstruct several existing array processors. By multiprojecting a 6D DG of the BMA, a novel 2D systolic array can be derived that features significantly improved rates in data reusability (96%) and processor utilization (99%). Systolic Array Search Window Current Block Array Processor VLSI Signal Processing Kung, S.Y. aut Enthalten in Journal of VLSI signal processing systems for signal, image and video technology Kluwer Academic Publishers, 1989 19(1998), 1 vom: 01. Mai, Seite 51-77 (DE-627)130761508 (DE-600)1000618-7 (DE-576)02508416X 0922-5773 nnns volume:19 year:1998 number:1 day:01 month:05 pages:51-77 https://doi.org/10.1023/A:1008012332212 lizenzpflichtig Volltext GBV_USEFLAG_A SYSFLAG_A GBV_OLC SSG-OLC-TEC SSG-OLC-MAT GBV_ILN_70 GBV_ILN_2006 GBV_ILN_2020 GBV_ILN_2244 GBV_ILN_4318 GBV_ILN_4319 AR 19 1998 1 01 05 51-77 |
allfields_unstemmed |
10.1023/A:1008012332212 doi (DE-627)OLC2062082878 (DE-He213)A:1008012332212-p DE-627 ger DE-627 rakwb eng 620 VZ Chen, Yen-Kuang verfasserin aut A Systolic Design Methodology with Application to Full-Search Block-Matching Architectures 1998 Text txt rdacontent ohne Hilfsmittel zu benutzen n rdamedia Band nc rdacarrier © Kluwer Academic Publishers 1998 Abstract We present a systematic methodology to support the design tradeoffs of array processors in several emerging issues, such as (1) high performance and high flexibility, (2) low cost, low power, (3) efficient memory usage, and (4) system-on-a-chip or the ease of system integration. This methodology is algebraic based, so it can cope with high-dimensional data dependence. The methodology consists of some transformation rules of data dependency graphs for facilitating flexible array designs. For example, two common partitioning approaches, LPGS and LSGP, could be unified under the methodology. It supports the design of high-speed and massively parallel processor arrays with efficient memory usage. More specifically, it leads to a novel systolic cache architecture comprising of shift registers only (cache without tags). To demonstrate how the methodology works, we have presented several systolic design examples based on the block-matching motion estimation algorithm (BMA). By multiprojecting a 4D DG of the BMA to 2D mesh, we can reconstruct several existing array processors. By multiprojecting a 6D DG of the BMA, a novel 2D systolic array can be derived that features significantly improved rates in data reusability (96%) and processor utilization (99%). Systolic Array Search Window Current Block Array Processor VLSI Signal Processing Kung, S.Y. aut Enthalten in Journal of VLSI signal processing systems for signal, image and video technology Kluwer Academic Publishers, 1989 19(1998), 1 vom: 01. Mai, Seite 51-77 (DE-627)130761508 (DE-600)1000618-7 (DE-576)02508416X 0922-5773 nnns volume:19 year:1998 number:1 day:01 month:05 pages:51-77 https://doi.org/10.1023/A:1008012332212 lizenzpflichtig Volltext GBV_USEFLAG_A SYSFLAG_A GBV_OLC SSG-OLC-TEC SSG-OLC-MAT GBV_ILN_70 GBV_ILN_2006 GBV_ILN_2020 GBV_ILN_2244 GBV_ILN_4318 GBV_ILN_4319 AR 19 1998 1 01 05 51-77 |
allfieldsGer |
10.1023/A:1008012332212 doi (DE-627)OLC2062082878 (DE-He213)A:1008012332212-p DE-627 ger DE-627 rakwb eng 620 VZ Chen, Yen-Kuang verfasserin aut A Systolic Design Methodology with Application to Full-Search Block-Matching Architectures 1998 Text txt rdacontent ohne Hilfsmittel zu benutzen n rdamedia Band nc rdacarrier © Kluwer Academic Publishers 1998 Abstract We present a systematic methodology to support the design tradeoffs of array processors in several emerging issues, such as (1) high performance and high flexibility, (2) low cost, low power, (3) efficient memory usage, and (4) system-on-a-chip or the ease of system integration. This methodology is algebraic based, so it can cope with high-dimensional data dependence. The methodology consists of some transformation rules of data dependency graphs for facilitating flexible array designs. For example, two common partitioning approaches, LPGS and LSGP, could be unified under the methodology. It supports the design of high-speed and massively parallel processor arrays with efficient memory usage. More specifically, it leads to a novel systolic cache architecture comprising of shift registers only (cache without tags). To demonstrate how the methodology works, we have presented several systolic design examples based on the block-matching motion estimation algorithm (BMA). By multiprojecting a 4D DG of the BMA to 2D mesh, we can reconstruct several existing array processors. By multiprojecting a 6D DG of the BMA, a novel 2D systolic array can be derived that features significantly improved rates in data reusability (96%) and processor utilization (99%). Systolic Array Search Window Current Block Array Processor VLSI Signal Processing Kung, S.Y. aut Enthalten in Journal of VLSI signal processing systems for signal, image and video technology Kluwer Academic Publishers, 1989 19(1998), 1 vom: 01. Mai, Seite 51-77 (DE-627)130761508 (DE-600)1000618-7 (DE-576)02508416X 0922-5773 nnns volume:19 year:1998 number:1 day:01 month:05 pages:51-77 https://doi.org/10.1023/A:1008012332212 lizenzpflichtig Volltext GBV_USEFLAG_A SYSFLAG_A GBV_OLC SSG-OLC-TEC SSG-OLC-MAT GBV_ILN_70 GBV_ILN_2006 GBV_ILN_2020 GBV_ILN_2244 GBV_ILN_4318 GBV_ILN_4319 AR 19 1998 1 01 05 51-77 |
allfieldsSound |
10.1023/A:1008012332212 doi (DE-627)OLC2062082878 (DE-He213)A:1008012332212-p DE-627 ger DE-627 rakwb eng 620 VZ Chen, Yen-Kuang verfasserin aut A Systolic Design Methodology with Application to Full-Search Block-Matching Architectures 1998 Text txt rdacontent ohne Hilfsmittel zu benutzen n rdamedia Band nc rdacarrier © Kluwer Academic Publishers 1998 Abstract We present a systematic methodology to support the design tradeoffs of array processors in several emerging issues, such as (1) high performance and high flexibility, (2) low cost, low power, (3) efficient memory usage, and (4) system-on-a-chip or the ease of system integration. This methodology is algebraic based, so it can cope with high-dimensional data dependence. The methodology consists of some transformation rules of data dependency graphs for facilitating flexible array designs. For example, two common partitioning approaches, LPGS and LSGP, could be unified under the methodology. It supports the design of high-speed and massively parallel processor arrays with efficient memory usage. More specifically, it leads to a novel systolic cache architecture comprising of shift registers only (cache without tags). To demonstrate how the methodology works, we have presented several systolic design examples based on the block-matching motion estimation algorithm (BMA). By multiprojecting a 4D DG of the BMA to 2D mesh, we can reconstruct several existing array processors. By multiprojecting a 6D DG of the BMA, a novel 2D systolic array can be derived that features significantly improved rates in data reusability (96%) and processor utilization (99%). Systolic Array Search Window Current Block Array Processor VLSI Signal Processing Kung, S.Y. aut Enthalten in Journal of VLSI signal processing systems for signal, image and video technology Kluwer Academic Publishers, 1989 19(1998), 1 vom: 01. Mai, Seite 51-77 (DE-627)130761508 (DE-600)1000618-7 (DE-576)02508416X 0922-5773 nnns volume:19 year:1998 number:1 day:01 month:05 pages:51-77 https://doi.org/10.1023/A:1008012332212 lizenzpflichtig Volltext GBV_USEFLAG_A SYSFLAG_A GBV_OLC SSG-OLC-TEC SSG-OLC-MAT GBV_ILN_70 GBV_ILN_2006 GBV_ILN_2020 GBV_ILN_2244 GBV_ILN_4318 GBV_ILN_4319 AR 19 1998 1 01 05 51-77 |
language |
English |
source |
Enthalten in Journal of VLSI signal processing systems for signal, image and video technology 19(1998), 1 vom: 01. Mai, Seite 51-77 volume:19 year:1998 number:1 day:01 month:05 pages:51-77 |
sourceStr |
Enthalten in Journal of VLSI signal processing systems for signal, image and video technology 19(1998), 1 vom: 01. Mai, Seite 51-77 volume:19 year:1998 number:1 day:01 month:05 pages:51-77 |
format_phy_str_mv |
Article |
institution |
findex.gbv.de |
topic_facet |
Systolic Array Search Window Current Block Array Processor VLSI Signal Processing |
dewey-raw |
620 |
isfreeaccess_bool |
false |
container_title |
Journal of VLSI signal processing systems for signal, image and video technology |
authorswithroles_txt_mv |
Chen, Yen-Kuang @@aut@@ Kung, S.Y. @@aut@@ |
publishDateDaySort_date |
1998-05-01T00:00:00Z |
hierarchy_top_id |
130761508 |
dewey-sort |
3620 |
id |
OLC2062082878 |
language_de |
englisch |
fullrecord |
<?xml version="1.0" encoding="UTF-8"?><collection xmlns="http://www.loc.gov/MARC21/slim"><record><leader>01000caa a22002652 4500</leader><controlfield tag="001">OLC2062082878</controlfield><controlfield tag="003">DE-627</controlfield><controlfield tag="005">20230504072253.0</controlfield><controlfield tag="007">tu</controlfield><controlfield tag="008">200819s1998 xx ||||| 00| ||eng c</controlfield><datafield tag="024" ind1="7" ind2=" "><subfield code="a">10.1023/A:1008012332212</subfield><subfield code="2">doi</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(DE-627)OLC2062082878</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(DE-He213)A:1008012332212-p</subfield></datafield><datafield tag="040" ind1=" " ind2=" "><subfield code="a">DE-627</subfield><subfield code="b">ger</subfield><subfield code="c">DE-627</subfield><subfield code="e">rakwb</subfield></datafield><datafield tag="041" ind1=" " ind2=" "><subfield code="a">eng</subfield></datafield><datafield tag="082" ind1="0" ind2="4"><subfield code="a">620</subfield><subfield code="q">VZ</subfield></datafield><datafield tag="100" ind1="1" ind2=" "><subfield code="a">Chen, Yen-Kuang</subfield><subfield code="e">verfasserin</subfield><subfield code="4">aut</subfield></datafield><datafield tag="245" ind1="1" ind2="0"><subfield code="a">A Systolic Design Methodology with Application to Full-Search Block-Matching Architectures</subfield></datafield><datafield tag="264" ind1=" " ind2="1"><subfield code="c">1998</subfield></datafield><datafield tag="336" ind1=" " ind2=" "><subfield code="a">Text</subfield><subfield code="b">txt</subfield><subfield code="2">rdacontent</subfield></datafield><datafield tag="337" ind1=" " ind2=" "><subfield code="a">ohne Hilfsmittel zu benutzen</subfield><subfield code="b">n</subfield><subfield code="2">rdamedia</subfield></datafield><datafield tag="338" ind1=" " ind2=" "><subfield code="a">Band</subfield><subfield code="b">nc</subfield><subfield code="2">rdacarrier</subfield></datafield><datafield tag="500" ind1=" " ind2=" "><subfield code="a">© Kluwer Academic Publishers 1998</subfield></datafield><datafield tag="520" ind1=" " ind2=" "><subfield code="a">Abstract We present a systematic methodology to support the design tradeoffs of array processors in several emerging issues, such as (1) high performance and high flexibility, (2) low cost, low power, (3) efficient memory usage, and (4) system-on-a-chip or the ease of system integration. This methodology is algebraic based, so it can cope with high-dimensional data dependence. The methodology consists of some transformation rules of data dependency graphs for facilitating flexible array designs. For example, two common partitioning approaches, LPGS and LSGP, could be unified under the methodology. It supports the design of high-speed and massively parallel processor arrays with efficient memory usage. More specifically, it leads to a novel systolic cache architecture comprising of shift registers only (cache without tags). To demonstrate how the methodology works, we have presented several systolic design examples based on the block-matching motion estimation algorithm (BMA). By multiprojecting a 4D DG of the BMA to 2D mesh, we can reconstruct several existing array processors. By multiprojecting a 6D DG of the BMA, a novel 2D systolic array can be derived that features significantly improved rates in data reusability (96%) and processor utilization (99%).</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Systolic Array</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Search Window</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Current Block</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Array Processor</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">VLSI Signal Processing</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">Kung, S.Y.</subfield><subfield code="4">aut</subfield></datafield><datafield tag="773" ind1="0" ind2="8"><subfield code="i">Enthalten in</subfield><subfield code="t">Journal of VLSI signal processing systems for signal, image and video technology</subfield><subfield code="d">Kluwer Academic Publishers, 1989</subfield><subfield code="g">19(1998), 1 vom: 01. Mai, Seite 51-77</subfield><subfield code="w">(DE-627)130761508</subfield><subfield code="w">(DE-600)1000618-7</subfield><subfield code="w">(DE-576)02508416X</subfield><subfield code="x">0922-5773</subfield><subfield code="7">nnns</subfield></datafield><datafield tag="773" ind1="1" ind2="8"><subfield code="g">volume:19</subfield><subfield code="g">year:1998</subfield><subfield code="g">number:1</subfield><subfield code="g">day:01</subfield><subfield code="g">month:05</subfield><subfield code="g">pages:51-77</subfield></datafield><datafield tag="856" ind1="4" ind2="1"><subfield code="u">https://doi.org/10.1023/A:1008012332212</subfield><subfield code="z">lizenzpflichtig</subfield><subfield code="3">Volltext</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_USEFLAG_A</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">SYSFLAG_A</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_OLC</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">SSG-OLC-TEC</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">SSG-OLC-MAT</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_ILN_70</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_ILN_2006</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_ILN_2020</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_ILN_2244</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_ILN_4318</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_ILN_4319</subfield></datafield><datafield tag="951" ind1=" " ind2=" "><subfield code="a">AR</subfield></datafield><datafield tag="952" ind1=" " ind2=" "><subfield code="d">19</subfield><subfield code="j">1998</subfield><subfield code="e">1</subfield><subfield code="b">01</subfield><subfield code="c">05</subfield><subfield code="h">51-77</subfield></datafield></record></collection>
|
author |
Chen, Yen-Kuang |
spellingShingle |
Chen, Yen-Kuang ddc 620 misc Systolic Array misc Search Window misc Current Block misc Array Processor misc VLSI Signal Processing A Systolic Design Methodology with Application to Full-Search Block-Matching Architectures |
authorStr |
Chen, Yen-Kuang |
ppnlink_with_tag_str_mv |
@@773@@(DE-627)130761508 |
format |
Article |
dewey-ones |
620 - Engineering & allied operations |
delete_txt_mv |
keep |
author_role |
aut aut |
collection |
OLC |
remote_str |
false |
illustrated |
Not Illustrated |
issn |
0922-5773 |
topic_title |
620 VZ A Systolic Design Methodology with Application to Full-Search Block-Matching Architectures Systolic Array Search Window Current Block Array Processor VLSI Signal Processing |
topic |
ddc 620 misc Systolic Array misc Search Window misc Current Block misc Array Processor misc VLSI Signal Processing |
topic_unstemmed |
ddc 620 misc Systolic Array misc Search Window misc Current Block misc Array Processor misc VLSI Signal Processing |
topic_browse |
ddc 620 misc Systolic Array misc Search Window misc Current Block misc Array Processor misc VLSI Signal Processing |
format_facet |
Aufsätze Gedruckte Aufsätze |
format_main_str_mv |
Text Zeitschrift/Artikel |
carriertype_str_mv |
nc |
hierarchy_parent_title |
Journal of VLSI signal processing systems for signal, image and video technology |
hierarchy_parent_id |
130761508 |
dewey-tens |
620 - Engineering |
hierarchy_top_title |
Journal of VLSI signal processing systems for signal, image and video technology |
isfreeaccess_txt |
false |
familylinks_str_mv |
(DE-627)130761508 (DE-600)1000618-7 (DE-576)02508416X |
title |
A Systolic Design Methodology with Application to Full-Search Block-Matching Architectures |
ctrlnum |
(DE-627)OLC2062082878 (DE-He213)A:1008012332212-p |
title_full |
A Systolic Design Methodology with Application to Full-Search Block-Matching Architectures |
author_sort |
Chen, Yen-Kuang |
journal |
Journal of VLSI signal processing systems for signal, image and video technology |
journalStr |
Journal of VLSI signal processing systems for signal, image and video technology |
lang_code |
eng |
isOA_bool |
false |
dewey-hundreds |
600 - Technology |
recordtype |
marc |
publishDateSort |
1998 |
contenttype_str_mv |
txt |
container_start_page |
51 |
author_browse |
Chen, Yen-Kuang Kung, S.Y. |
container_volume |
19 |
class |
620 VZ |
format_se |
Aufsätze |
author-letter |
Chen, Yen-Kuang |
doi_str_mv |
10.1023/A:1008012332212 |
dewey-full |
620 |
title_sort |
a systolic design methodology with application to full-search block-matching architectures |
title_auth |
A Systolic Design Methodology with Application to Full-Search Block-Matching Architectures |
abstract |
Abstract We present a systematic methodology to support the design tradeoffs of array processors in several emerging issues, such as (1) high performance and high flexibility, (2) low cost, low power, (3) efficient memory usage, and (4) system-on-a-chip or the ease of system integration. This methodology is algebraic based, so it can cope with high-dimensional data dependence. The methodology consists of some transformation rules of data dependency graphs for facilitating flexible array designs. For example, two common partitioning approaches, LPGS and LSGP, could be unified under the methodology. It supports the design of high-speed and massively parallel processor arrays with efficient memory usage. More specifically, it leads to a novel systolic cache architecture comprising of shift registers only (cache without tags). To demonstrate how the methodology works, we have presented several systolic design examples based on the block-matching motion estimation algorithm (BMA). By multiprojecting a 4D DG of the BMA to 2D mesh, we can reconstruct several existing array processors. By multiprojecting a 6D DG of the BMA, a novel 2D systolic array can be derived that features significantly improved rates in data reusability (96%) and processor utilization (99%). © Kluwer Academic Publishers 1998 |
abstractGer |
Abstract We present a systematic methodology to support the design tradeoffs of array processors in several emerging issues, such as (1) high performance and high flexibility, (2) low cost, low power, (3) efficient memory usage, and (4) system-on-a-chip or the ease of system integration. This methodology is algebraic based, so it can cope with high-dimensional data dependence. The methodology consists of some transformation rules of data dependency graphs for facilitating flexible array designs. For example, two common partitioning approaches, LPGS and LSGP, could be unified under the methodology. It supports the design of high-speed and massively parallel processor arrays with efficient memory usage. More specifically, it leads to a novel systolic cache architecture comprising of shift registers only (cache without tags). To demonstrate how the methodology works, we have presented several systolic design examples based on the block-matching motion estimation algorithm (BMA). By multiprojecting a 4D DG of the BMA to 2D mesh, we can reconstruct several existing array processors. By multiprojecting a 6D DG of the BMA, a novel 2D systolic array can be derived that features significantly improved rates in data reusability (96%) and processor utilization (99%). © Kluwer Academic Publishers 1998 |
abstract_unstemmed |
Abstract We present a systematic methodology to support the design tradeoffs of array processors in several emerging issues, such as (1) high performance and high flexibility, (2) low cost, low power, (3) efficient memory usage, and (4) system-on-a-chip or the ease of system integration. This methodology is algebraic based, so it can cope with high-dimensional data dependence. The methodology consists of some transformation rules of data dependency graphs for facilitating flexible array designs. For example, two common partitioning approaches, LPGS and LSGP, could be unified under the methodology. It supports the design of high-speed and massively parallel processor arrays with efficient memory usage. More specifically, it leads to a novel systolic cache architecture comprising of shift registers only (cache without tags). To demonstrate how the methodology works, we have presented several systolic design examples based on the block-matching motion estimation algorithm (BMA). By multiprojecting a 4D DG of the BMA to 2D mesh, we can reconstruct several existing array processors. By multiprojecting a 6D DG of the BMA, a novel 2D systolic array can be derived that features significantly improved rates in data reusability (96%) and processor utilization (99%). © Kluwer Academic Publishers 1998 |
collection_details |
GBV_USEFLAG_A SYSFLAG_A GBV_OLC SSG-OLC-TEC SSG-OLC-MAT GBV_ILN_70 GBV_ILN_2006 GBV_ILN_2020 GBV_ILN_2244 GBV_ILN_4318 GBV_ILN_4319 |
container_issue |
1 |
title_short |
A Systolic Design Methodology with Application to Full-Search Block-Matching Architectures |
url |
https://doi.org/10.1023/A:1008012332212 |
remote_bool |
false |
author2 |
Kung, S.Y. |
author2Str |
Kung, S.Y. |
ppnlink |
130761508 |
mediatype_str_mv |
n |
isOA_txt |
false |
hochschulschrift_bool |
false |
doi_str |
10.1023/A:1008012332212 |
up_date |
2024-07-03T13:40:26.730Z |
_version_ |
1803565428094533632 |
fullrecord_marcxml |
<?xml version="1.0" encoding="UTF-8"?><collection xmlns="http://www.loc.gov/MARC21/slim"><record><leader>01000caa a22002652 4500</leader><controlfield tag="001">OLC2062082878</controlfield><controlfield tag="003">DE-627</controlfield><controlfield tag="005">20230504072253.0</controlfield><controlfield tag="007">tu</controlfield><controlfield tag="008">200819s1998 xx ||||| 00| ||eng c</controlfield><datafield tag="024" ind1="7" ind2=" "><subfield code="a">10.1023/A:1008012332212</subfield><subfield code="2">doi</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(DE-627)OLC2062082878</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(DE-He213)A:1008012332212-p</subfield></datafield><datafield tag="040" ind1=" " ind2=" "><subfield code="a">DE-627</subfield><subfield code="b">ger</subfield><subfield code="c">DE-627</subfield><subfield code="e">rakwb</subfield></datafield><datafield tag="041" ind1=" " ind2=" "><subfield code="a">eng</subfield></datafield><datafield tag="082" ind1="0" ind2="4"><subfield code="a">620</subfield><subfield code="q">VZ</subfield></datafield><datafield tag="100" ind1="1" ind2=" "><subfield code="a">Chen, Yen-Kuang</subfield><subfield code="e">verfasserin</subfield><subfield code="4">aut</subfield></datafield><datafield tag="245" ind1="1" ind2="0"><subfield code="a">A Systolic Design Methodology with Application to Full-Search Block-Matching Architectures</subfield></datafield><datafield tag="264" ind1=" " ind2="1"><subfield code="c">1998</subfield></datafield><datafield tag="336" ind1=" " ind2=" "><subfield code="a">Text</subfield><subfield code="b">txt</subfield><subfield code="2">rdacontent</subfield></datafield><datafield tag="337" ind1=" " ind2=" "><subfield code="a">ohne Hilfsmittel zu benutzen</subfield><subfield code="b">n</subfield><subfield code="2">rdamedia</subfield></datafield><datafield tag="338" ind1=" " ind2=" "><subfield code="a">Band</subfield><subfield code="b">nc</subfield><subfield code="2">rdacarrier</subfield></datafield><datafield tag="500" ind1=" " ind2=" "><subfield code="a">© Kluwer Academic Publishers 1998</subfield></datafield><datafield tag="520" ind1=" " ind2=" "><subfield code="a">Abstract We present a systematic methodology to support the design tradeoffs of array processors in several emerging issues, such as (1) high performance and high flexibility, (2) low cost, low power, (3) efficient memory usage, and (4) system-on-a-chip or the ease of system integration. This methodology is algebraic based, so it can cope with high-dimensional data dependence. The methodology consists of some transformation rules of data dependency graphs for facilitating flexible array designs. For example, two common partitioning approaches, LPGS and LSGP, could be unified under the methodology. It supports the design of high-speed and massively parallel processor arrays with efficient memory usage. More specifically, it leads to a novel systolic cache architecture comprising of shift registers only (cache without tags). To demonstrate how the methodology works, we have presented several systolic design examples based on the block-matching motion estimation algorithm (BMA). By multiprojecting a 4D DG of the BMA to 2D mesh, we can reconstruct several existing array processors. By multiprojecting a 6D DG of the BMA, a novel 2D systolic array can be derived that features significantly improved rates in data reusability (96%) and processor utilization (99%).</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Systolic Array</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Search Window</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Current Block</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Array Processor</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">VLSI Signal Processing</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">Kung, S.Y.</subfield><subfield code="4">aut</subfield></datafield><datafield tag="773" ind1="0" ind2="8"><subfield code="i">Enthalten in</subfield><subfield code="t">Journal of VLSI signal processing systems for signal, image and video technology</subfield><subfield code="d">Kluwer Academic Publishers, 1989</subfield><subfield code="g">19(1998), 1 vom: 01. Mai, Seite 51-77</subfield><subfield code="w">(DE-627)130761508</subfield><subfield code="w">(DE-600)1000618-7</subfield><subfield code="w">(DE-576)02508416X</subfield><subfield code="x">0922-5773</subfield><subfield code="7">nnns</subfield></datafield><datafield tag="773" ind1="1" ind2="8"><subfield code="g">volume:19</subfield><subfield code="g">year:1998</subfield><subfield code="g">number:1</subfield><subfield code="g">day:01</subfield><subfield code="g">month:05</subfield><subfield code="g">pages:51-77</subfield></datafield><datafield tag="856" ind1="4" ind2="1"><subfield code="u">https://doi.org/10.1023/A:1008012332212</subfield><subfield code="z">lizenzpflichtig</subfield><subfield code="3">Volltext</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_USEFLAG_A</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">SYSFLAG_A</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_OLC</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">SSG-OLC-TEC</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">SSG-OLC-MAT</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_ILN_70</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_ILN_2006</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_ILN_2020</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_ILN_2244</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_ILN_4318</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_ILN_4319</subfield></datafield><datafield tag="951" ind1=" " ind2=" "><subfield code="a">AR</subfield></datafield><datafield tag="952" ind1=" " ind2=" "><subfield code="d">19</subfield><subfield code="j">1998</subfield><subfield code="e">1</subfield><subfield code="b">01</subfield><subfield code="c">05</subfield><subfield code="h">51-77</subfield></datafield></record></collection>
|
score |
7.399208 |