Large Alphabet Source Coding Using Independent Component Analysis
Large alphabet source coding is a basic and well-studied problem in data compression. It has many applications, such as compression of natural language text, speech, and images. The classic perception of most commonly used methods is that a source is best described over an alphabet, which is at leas...
Ausführliche Beschreibung
Autor*in: |
Painsky, Amichai [verfasserIn] |
---|
Format: |
Artikel |
---|---|
Sprache: |
Englisch |
Erschienen: |
2017 |
---|
Schlagwörter: |
---|
Systematik: |
|
---|
Übergeordnetes Werk: |
Enthalten in: IEEE transactions on information theory - Piscataway, NJ : IEEE, 1963, 63(2017), 10, Seite 6514-6529 |
---|---|
Übergeordnetes Werk: |
volume:63 ; year:2017 ; number:10 ; pages:6514-6529 |
Links: |
---|
DOI / URN: |
10.1109/TIT.2017.2728017 |
---|
Katalog-ID: |
OLC1996944088 |
---|
LEADER | 01000caa a2200265 4500 | ||
---|---|---|---|
001 | OLC1996944088 | ||
003 | DE-627 | ||
005 | 20220221163316.0 | ||
007 | tu | ||
008 | 171125s2017 xx ||||| 00| ||eng c | ||
024 | 7 | |a 10.1109/TIT.2017.2728017 |2 doi | |
028 | 5 | 2 | |a PQ20171228 |
035 | |a (DE-627)OLC1996944088 | ||
035 | |a (DE-599)GBVOLC1996944088 | ||
035 | |a (PRQ)i657-fd2c304c29599d7b084a47a4126702c653d332b174d5829971ea83d0b7f3c8f0 | ||
035 | |a (KEY)0023448620170000063001006514largealphabetsourcecodingusingindependentcomponent | ||
040 | |a DE-627 |b ger |c DE-627 |e rakwb | ||
041 | |a eng | ||
082 | 0 | 4 | |a 070 |a 620 |q DE-600 |
084 | |a SA 5570 |q AVZ |2 rvk | ||
100 | 1 | |a Painsky, Amichai |e verfasserin |4 aut | |
245 | 1 | 0 | |a Large Alphabet Source Coding Using Independent Component Analysis |
264 | 1 | |c 2017 | |
336 | |a Text |b txt |2 rdacontent | ||
337 | |a ohne Hilfsmittel zu benutzen |b n |2 rdamedia | ||
338 | |a Band |b nc |2 rdacarrier | ||
520 | |a Large alphabet source coding is a basic and well-studied problem in data compression. It has many applications, such as compression of natural language text, speech, and images. The classic perception of most commonly used methods is that a source is best described over an alphabet, which is at least as large as the observed alphabet. In this paper, we challenge this approach and introduce a conceptual framework in which a large alphabet source is decomposed into "as statistically independent as possible" components. This decomposition allows us to apply entropy encoding to each component separately, while benefiting from their reduced alphabet size. We show that in many cases, such decomposition results in a sum of marginal entropies which is only slightly greater than the entropy of the source. Our suggested algorithm, based on a generalization of the binary independent component analysis, is applicable for a variety of large alphabet source coding setups. This includes the classical lossless compression, universal compression, and high-dimensional vector quantization. In each of these setups, our suggested approach outperforms most commonly used methods. Moreover, our proposed framework is significantly easier to implement in most of these cases. | ||
650 | 4 | |a Independent component analysis | |
650 | 4 | |a Source coding | |
650 | 4 | |a Entropy Coding | |
650 | 4 | |a Decoding | |
650 | 4 | |a Redundancy | |
650 | 4 | |a Data Compression | |
650 | 4 | |a Entropy | |
700 | 1 | |a Rosset, Saharon |4 oth | |
700 | 1 | |a Feder, Meir |4 oth | |
773 | 0 | 8 | |i Enthalten in |t IEEE transactions on information theory |d Piscataway, NJ : IEEE, 1963 |g 63(2017), 10, Seite 6514-6529 |w (DE-627)12954731X |w (DE-600)218505-2 |w (DE-576)01499819X |x 0018-9448 |7 nnns |
773 | 1 | 8 | |g volume:63 |g year:2017 |g number:10 |g pages:6514-6529 |
856 | 4 | 1 | |u http://dx.doi.org/10.1109/TIT.2017.2728017 |3 Volltext |
856 | 4 | 2 | |u http://ieeexplore.ieee.org/document/7983009 |
912 | |a GBV_USEFLAG_A | ||
912 | |a SYSFLAG_A | ||
912 | |a GBV_OLC | ||
912 | |a SSG-OLC-TEC | ||
912 | |a SSG-OLC-MAT | ||
912 | |a SSG-OLC-BUB | ||
912 | |a SSG-OPC-BBI | ||
912 | |a GBV_ILN_65 | ||
912 | |a GBV_ILN_70 | ||
912 | |a GBV_ILN_2002 | ||
912 | |a GBV_ILN_2088 | ||
936 | r | v | |a SA 5570 |
951 | |a AR | ||
952 | |d 63 |j 2017 |e 10 |h 6514-6529 |
author_variant |
a p ap |
---|---|
matchkey_str |
article:00189448:2017----::agapaesuccdnuigneednc |
hierarchy_sort_str |
2017 |
publishDate |
2017 |
allfields |
10.1109/TIT.2017.2728017 doi PQ20171228 (DE-627)OLC1996944088 (DE-599)GBVOLC1996944088 (PRQ)i657-fd2c304c29599d7b084a47a4126702c653d332b174d5829971ea83d0b7f3c8f0 (KEY)0023448620170000063001006514largealphabetsourcecodingusingindependentcomponent DE-627 ger DE-627 rakwb eng 070 620 DE-600 SA 5570 AVZ rvk Painsky, Amichai verfasserin aut Large Alphabet Source Coding Using Independent Component Analysis 2017 Text txt rdacontent ohne Hilfsmittel zu benutzen n rdamedia Band nc rdacarrier Large alphabet source coding is a basic and well-studied problem in data compression. It has many applications, such as compression of natural language text, speech, and images. The classic perception of most commonly used methods is that a source is best described over an alphabet, which is at least as large as the observed alphabet. In this paper, we challenge this approach and introduce a conceptual framework in which a large alphabet source is decomposed into "as statistically independent as possible" components. This decomposition allows us to apply entropy encoding to each component separately, while benefiting from their reduced alphabet size. We show that in many cases, such decomposition results in a sum of marginal entropies which is only slightly greater than the entropy of the source. Our suggested algorithm, based on a generalization of the binary independent component analysis, is applicable for a variety of large alphabet source coding setups. This includes the classical lossless compression, universal compression, and high-dimensional vector quantization. In each of these setups, our suggested approach outperforms most commonly used methods. Moreover, our proposed framework is significantly easier to implement in most of these cases. Independent component analysis Source coding Entropy Coding Decoding Redundancy Data Compression Entropy Rosset, Saharon oth Feder, Meir oth Enthalten in IEEE transactions on information theory Piscataway, NJ : IEEE, 1963 63(2017), 10, Seite 6514-6529 (DE-627)12954731X (DE-600)218505-2 (DE-576)01499819X 0018-9448 nnns volume:63 year:2017 number:10 pages:6514-6529 http://dx.doi.org/10.1109/TIT.2017.2728017 Volltext http://ieeexplore.ieee.org/document/7983009 GBV_USEFLAG_A SYSFLAG_A GBV_OLC SSG-OLC-TEC SSG-OLC-MAT SSG-OLC-BUB SSG-OPC-BBI GBV_ILN_65 GBV_ILN_70 GBV_ILN_2002 GBV_ILN_2088 SA 5570 AR 63 2017 10 6514-6529 |
spelling |
10.1109/TIT.2017.2728017 doi PQ20171228 (DE-627)OLC1996944088 (DE-599)GBVOLC1996944088 (PRQ)i657-fd2c304c29599d7b084a47a4126702c653d332b174d5829971ea83d0b7f3c8f0 (KEY)0023448620170000063001006514largealphabetsourcecodingusingindependentcomponent DE-627 ger DE-627 rakwb eng 070 620 DE-600 SA 5570 AVZ rvk Painsky, Amichai verfasserin aut Large Alphabet Source Coding Using Independent Component Analysis 2017 Text txt rdacontent ohne Hilfsmittel zu benutzen n rdamedia Band nc rdacarrier Large alphabet source coding is a basic and well-studied problem in data compression. It has many applications, such as compression of natural language text, speech, and images. The classic perception of most commonly used methods is that a source is best described over an alphabet, which is at least as large as the observed alphabet. In this paper, we challenge this approach and introduce a conceptual framework in which a large alphabet source is decomposed into "as statistically independent as possible" components. This decomposition allows us to apply entropy encoding to each component separately, while benefiting from their reduced alphabet size. We show that in many cases, such decomposition results in a sum of marginal entropies which is only slightly greater than the entropy of the source. Our suggested algorithm, based on a generalization of the binary independent component analysis, is applicable for a variety of large alphabet source coding setups. This includes the classical lossless compression, universal compression, and high-dimensional vector quantization. In each of these setups, our suggested approach outperforms most commonly used methods. Moreover, our proposed framework is significantly easier to implement in most of these cases. Independent component analysis Source coding Entropy Coding Decoding Redundancy Data Compression Entropy Rosset, Saharon oth Feder, Meir oth Enthalten in IEEE transactions on information theory Piscataway, NJ : IEEE, 1963 63(2017), 10, Seite 6514-6529 (DE-627)12954731X (DE-600)218505-2 (DE-576)01499819X 0018-9448 nnns volume:63 year:2017 number:10 pages:6514-6529 http://dx.doi.org/10.1109/TIT.2017.2728017 Volltext http://ieeexplore.ieee.org/document/7983009 GBV_USEFLAG_A SYSFLAG_A GBV_OLC SSG-OLC-TEC SSG-OLC-MAT SSG-OLC-BUB SSG-OPC-BBI GBV_ILN_65 GBV_ILN_70 GBV_ILN_2002 GBV_ILN_2088 SA 5570 AR 63 2017 10 6514-6529 |
allfields_unstemmed |
10.1109/TIT.2017.2728017 doi PQ20171228 (DE-627)OLC1996944088 (DE-599)GBVOLC1996944088 (PRQ)i657-fd2c304c29599d7b084a47a4126702c653d332b174d5829971ea83d0b7f3c8f0 (KEY)0023448620170000063001006514largealphabetsourcecodingusingindependentcomponent DE-627 ger DE-627 rakwb eng 070 620 DE-600 SA 5570 AVZ rvk Painsky, Amichai verfasserin aut Large Alphabet Source Coding Using Independent Component Analysis 2017 Text txt rdacontent ohne Hilfsmittel zu benutzen n rdamedia Band nc rdacarrier Large alphabet source coding is a basic and well-studied problem in data compression. It has many applications, such as compression of natural language text, speech, and images. The classic perception of most commonly used methods is that a source is best described over an alphabet, which is at least as large as the observed alphabet. In this paper, we challenge this approach and introduce a conceptual framework in which a large alphabet source is decomposed into "as statistically independent as possible" components. This decomposition allows us to apply entropy encoding to each component separately, while benefiting from their reduced alphabet size. We show that in many cases, such decomposition results in a sum of marginal entropies which is only slightly greater than the entropy of the source. Our suggested algorithm, based on a generalization of the binary independent component analysis, is applicable for a variety of large alphabet source coding setups. This includes the classical lossless compression, universal compression, and high-dimensional vector quantization. In each of these setups, our suggested approach outperforms most commonly used methods. Moreover, our proposed framework is significantly easier to implement in most of these cases. Independent component analysis Source coding Entropy Coding Decoding Redundancy Data Compression Entropy Rosset, Saharon oth Feder, Meir oth Enthalten in IEEE transactions on information theory Piscataway, NJ : IEEE, 1963 63(2017), 10, Seite 6514-6529 (DE-627)12954731X (DE-600)218505-2 (DE-576)01499819X 0018-9448 nnns volume:63 year:2017 number:10 pages:6514-6529 http://dx.doi.org/10.1109/TIT.2017.2728017 Volltext http://ieeexplore.ieee.org/document/7983009 GBV_USEFLAG_A SYSFLAG_A GBV_OLC SSG-OLC-TEC SSG-OLC-MAT SSG-OLC-BUB SSG-OPC-BBI GBV_ILN_65 GBV_ILN_70 GBV_ILN_2002 GBV_ILN_2088 SA 5570 AR 63 2017 10 6514-6529 |
allfieldsGer |
10.1109/TIT.2017.2728017 doi PQ20171228 (DE-627)OLC1996944088 (DE-599)GBVOLC1996944088 (PRQ)i657-fd2c304c29599d7b084a47a4126702c653d332b174d5829971ea83d0b7f3c8f0 (KEY)0023448620170000063001006514largealphabetsourcecodingusingindependentcomponent DE-627 ger DE-627 rakwb eng 070 620 DE-600 SA 5570 AVZ rvk Painsky, Amichai verfasserin aut Large Alphabet Source Coding Using Independent Component Analysis 2017 Text txt rdacontent ohne Hilfsmittel zu benutzen n rdamedia Band nc rdacarrier Large alphabet source coding is a basic and well-studied problem in data compression. It has many applications, such as compression of natural language text, speech, and images. The classic perception of most commonly used methods is that a source is best described over an alphabet, which is at least as large as the observed alphabet. In this paper, we challenge this approach and introduce a conceptual framework in which a large alphabet source is decomposed into "as statistically independent as possible" components. This decomposition allows us to apply entropy encoding to each component separately, while benefiting from their reduced alphabet size. We show that in many cases, such decomposition results in a sum of marginal entropies which is only slightly greater than the entropy of the source. Our suggested algorithm, based on a generalization of the binary independent component analysis, is applicable for a variety of large alphabet source coding setups. This includes the classical lossless compression, universal compression, and high-dimensional vector quantization. In each of these setups, our suggested approach outperforms most commonly used methods. Moreover, our proposed framework is significantly easier to implement in most of these cases. Independent component analysis Source coding Entropy Coding Decoding Redundancy Data Compression Entropy Rosset, Saharon oth Feder, Meir oth Enthalten in IEEE transactions on information theory Piscataway, NJ : IEEE, 1963 63(2017), 10, Seite 6514-6529 (DE-627)12954731X (DE-600)218505-2 (DE-576)01499819X 0018-9448 nnns volume:63 year:2017 number:10 pages:6514-6529 http://dx.doi.org/10.1109/TIT.2017.2728017 Volltext http://ieeexplore.ieee.org/document/7983009 GBV_USEFLAG_A SYSFLAG_A GBV_OLC SSG-OLC-TEC SSG-OLC-MAT SSG-OLC-BUB SSG-OPC-BBI GBV_ILN_65 GBV_ILN_70 GBV_ILN_2002 GBV_ILN_2088 SA 5570 AR 63 2017 10 6514-6529 |
allfieldsSound |
10.1109/TIT.2017.2728017 doi PQ20171228 (DE-627)OLC1996944088 (DE-599)GBVOLC1996944088 (PRQ)i657-fd2c304c29599d7b084a47a4126702c653d332b174d5829971ea83d0b7f3c8f0 (KEY)0023448620170000063001006514largealphabetsourcecodingusingindependentcomponent DE-627 ger DE-627 rakwb eng 070 620 DE-600 SA 5570 AVZ rvk Painsky, Amichai verfasserin aut Large Alphabet Source Coding Using Independent Component Analysis 2017 Text txt rdacontent ohne Hilfsmittel zu benutzen n rdamedia Band nc rdacarrier Large alphabet source coding is a basic and well-studied problem in data compression. It has many applications, such as compression of natural language text, speech, and images. The classic perception of most commonly used methods is that a source is best described over an alphabet, which is at least as large as the observed alphabet. In this paper, we challenge this approach and introduce a conceptual framework in which a large alphabet source is decomposed into "as statistically independent as possible" components. This decomposition allows us to apply entropy encoding to each component separately, while benefiting from their reduced alphabet size. We show that in many cases, such decomposition results in a sum of marginal entropies which is only slightly greater than the entropy of the source. Our suggested algorithm, based on a generalization of the binary independent component analysis, is applicable for a variety of large alphabet source coding setups. This includes the classical lossless compression, universal compression, and high-dimensional vector quantization. In each of these setups, our suggested approach outperforms most commonly used methods. Moreover, our proposed framework is significantly easier to implement in most of these cases. Independent component analysis Source coding Entropy Coding Decoding Redundancy Data Compression Entropy Rosset, Saharon oth Feder, Meir oth Enthalten in IEEE transactions on information theory Piscataway, NJ : IEEE, 1963 63(2017), 10, Seite 6514-6529 (DE-627)12954731X (DE-600)218505-2 (DE-576)01499819X 0018-9448 nnns volume:63 year:2017 number:10 pages:6514-6529 http://dx.doi.org/10.1109/TIT.2017.2728017 Volltext http://ieeexplore.ieee.org/document/7983009 GBV_USEFLAG_A SYSFLAG_A GBV_OLC SSG-OLC-TEC SSG-OLC-MAT SSG-OLC-BUB SSG-OPC-BBI GBV_ILN_65 GBV_ILN_70 GBV_ILN_2002 GBV_ILN_2088 SA 5570 AR 63 2017 10 6514-6529 |
language |
English |
source |
Enthalten in IEEE transactions on information theory 63(2017), 10, Seite 6514-6529 volume:63 year:2017 number:10 pages:6514-6529 |
sourceStr |
Enthalten in IEEE transactions on information theory 63(2017), 10, Seite 6514-6529 volume:63 year:2017 number:10 pages:6514-6529 |
format_phy_str_mv |
Article |
institution |
findex.gbv.de |
topic_facet |
Independent component analysis Source coding Entropy Coding Decoding Redundancy Data Compression Entropy |
dewey-raw |
070 |
isfreeaccess_bool |
false |
container_title |
IEEE transactions on information theory |
authorswithroles_txt_mv |
Painsky, Amichai @@aut@@ Rosset, Saharon @@oth@@ Feder, Meir @@oth@@ |
publishDateDaySort_date |
2017-01-01T00:00:00Z |
hierarchy_top_id |
12954731X |
dewey-sort |
270 |
id |
OLC1996944088 |
language_de |
englisch |
fullrecord |
<?xml version="1.0" encoding="UTF-8"?><collection xmlns="http://www.loc.gov/MARC21/slim"><record><leader>01000caa a2200265 4500</leader><controlfield tag="001">OLC1996944088</controlfield><controlfield tag="003">DE-627</controlfield><controlfield tag="005">20220221163316.0</controlfield><controlfield tag="007">tu</controlfield><controlfield tag="008">171125s2017 xx ||||| 00| ||eng c</controlfield><datafield tag="024" ind1="7" ind2=" "><subfield code="a">10.1109/TIT.2017.2728017</subfield><subfield code="2">doi</subfield></datafield><datafield tag="028" ind1="5" ind2="2"><subfield code="a">PQ20171228</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(DE-627)OLC1996944088</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(DE-599)GBVOLC1996944088</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(PRQ)i657-fd2c304c29599d7b084a47a4126702c653d332b174d5829971ea83d0b7f3c8f0</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(KEY)0023448620170000063001006514largealphabetsourcecodingusingindependentcomponent</subfield></datafield><datafield tag="040" ind1=" " ind2=" "><subfield code="a">DE-627</subfield><subfield code="b">ger</subfield><subfield code="c">DE-627</subfield><subfield code="e">rakwb</subfield></datafield><datafield tag="041" ind1=" " ind2=" "><subfield code="a">eng</subfield></datafield><datafield tag="082" ind1="0" ind2="4"><subfield code="a">070</subfield><subfield code="a">620</subfield><subfield code="q">DE-600</subfield></datafield><datafield tag="084" ind1=" " ind2=" "><subfield code="a">SA 5570</subfield><subfield code="q">AVZ</subfield><subfield code="2">rvk</subfield></datafield><datafield tag="100" ind1="1" ind2=" "><subfield code="a">Painsky, Amichai</subfield><subfield code="e">verfasserin</subfield><subfield code="4">aut</subfield></datafield><datafield tag="245" ind1="1" ind2="0"><subfield code="a">Large Alphabet Source Coding Using Independent Component Analysis</subfield></datafield><datafield tag="264" ind1=" " ind2="1"><subfield code="c">2017</subfield></datafield><datafield tag="336" ind1=" " ind2=" "><subfield code="a">Text</subfield><subfield code="b">txt</subfield><subfield code="2">rdacontent</subfield></datafield><datafield tag="337" ind1=" " ind2=" "><subfield code="a">ohne Hilfsmittel zu benutzen</subfield><subfield code="b">n</subfield><subfield code="2">rdamedia</subfield></datafield><datafield tag="338" ind1=" " ind2=" "><subfield code="a">Band</subfield><subfield code="b">nc</subfield><subfield code="2">rdacarrier</subfield></datafield><datafield tag="520" ind1=" " ind2=" "><subfield code="a">Large alphabet source coding is a basic and well-studied problem in data compression. It has many applications, such as compression of natural language text, speech, and images. The classic perception of most commonly used methods is that a source is best described over an alphabet, which is at least as large as the observed alphabet. In this paper, we challenge this approach and introduce a conceptual framework in which a large alphabet source is decomposed into "as statistically independent as possible" components. This decomposition allows us to apply entropy encoding to each component separately, while benefiting from their reduced alphabet size. We show that in many cases, such decomposition results in a sum of marginal entropies which is only slightly greater than the entropy of the source. Our suggested algorithm, based on a generalization of the binary independent component analysis, is applicable for a variety of large alphabet source coding setups. This includes the classical lossless compression, universal compression, and high-dimensional vector quantization. In each of these setups, our suggested approach outperforms most commonly used methods. Moreover, our proposed framework is significantly easier to implement in most of these cases.</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Independent component analysis</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Source coding</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Entropy Coding</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Decoding</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Redundancy</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Data Compression</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Entropy</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">Rosset, Saharon</subfield><subfield code="4">oth</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">Feder, Meir</subfield><subfield code="4">oth</subfield></datafield><datafield tag="773" ind1="0" ind2="8"><subfield code="i">Enthalten in</subfield><subfield code="t">IEEE transactions on information theory</subfield><subfield code="d">Piscataway, NJ : IEEE, 1963</subfield><subfield code="g">63(2017), 10, Seite 6514-6529</subfield><subfield code="w">(DE-627)12954731X</subfield><subfield code="w">(DE-600)218505-2</subfield><subfield code="w">(DE-576)01499819X</subfield><subfield code="x">0018-9448</subfield><subfield code="7">nnns</subfield></datafield><datafield tag="773" ind1="1" ind2="8"><subfield code="g">volume:63</subfield><subfield code="g">year:2017</subfield><subfield code="g">number:10</subfield><subfield code="g">pages:6514-6529</subfield></datafield><datafield tag="856" ind1="4" ind2="1"><subfield code="u">http://dx.doi.org/10.1109/TIT.2017.2728017</subfield><subfield code="3">Volltext</subfield></datafield><datafield tag="856" ind1="4" ind2="2"><subfield code="u">http://ieeexplore.ieee.org/document/7983009</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_USEFLAG_A</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">SYSFLAG_A</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_OLC</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">SSG-OLC-TEC</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">SSG-OLC-MAT</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">SSG-OLC-BUB</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">SSG-OPC-BBI</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_ILN_65</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_ILN_70</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_ILN_2002</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_ILN_2088</subfield></datafield><datafield tag="936" ind1="r" ind2="v"><subfield code="a">SA 5570</subfield></datafield><datafield tag="951" ind1=" " ind2=" "><subfield code="a">AR</subfield></datafield><datafield tag="952" ind1=" " ind2=" "><subfield code="d">63</subfield><subfield code="j">2017</subfield><subfield code="e">10</subfield><subfield code="h">6514-6529</subfield></datafield></record></collection>
|
author |
Painsky, Amichai |
spellingShingle |
Painsky, Amichai ddc 070 rvk SA 5570 misc Independent component analysis misc Source coding misc Entropy Coding misc Decoding misc Redundancy misc Data Compression misc Entropy Large Alphabet Source Coding Using Independent Component Analysis |
authorStr |
Painsky, Amichai |
ppnlink_with_tag_str_mv |
@@773@@(DE-627)12954731X |
format |
Article |
dewey-ones |
070 - News media, journalism & publishing 620 - Engineering & allied operations |
delete_txt_mv |
keep |
author_role |
aut |
collection |
OLC |
remote_str |
false |
illustrated |
Not Illustrated |
issn |
0018-9448 |
topic_title |
070 620 DE-600 SA 5570 AVZ rvk Large Alphabet Source Coding Using Independent Component Analysis Independent component analysis Source coding Entropy Coding Decoding Redundancy Data Compression Entropy |
topic |
ddc 070 rvk SA 5570 misc Independent component analysis misc Source coding misc Entropy Coding misc Decoding misc Redundancy misc Data Compression misc Entropy |
topic_unstemmed |
ddc 070 rvk SA 5570 misc Independent component analysis misc Source coding misc Entropy Coding misc Decoding misc Redundancy misc Data Compression misc Entropy |
topic_browse |
ddc 070 rvk SA 5570 misc Independent component analysis misc Source coding misc Entropy Coding misc Decoding misc Redundancy misc Data Compression misc Entropy |
format_facet |
Aufsätze Gedruckte Aufsätze |
format_main_str_mv |
Text Zeitschrift/Artikel |
carriertype_str_mv |
nc |
author2_variant |
s r sr m f mf |
hierarchy_parent_title |
IEEE transactions on information theory |
hierarchy_parent_id |
12954731X |
dewey-tens |
070 - News media, journalism & publishing 620 - Engineering |
hierarchy_top_title |
IEEE transactions on information theory |
isfreeaccess_txt |
false |
familylinks_str_mv |
(DE-627)12954731X (DE-600)218505-2 (DE-576)01499819X |
title |
Large Alphabet Source Coding Using Independent Component Analysis |
ctrlnum |
(DE-627)OLC1996944088 (DE-599)GBVOLC1996944088 (PRQ)i657-fd2c304c29599d7b084a47a4126702c653d332b174d5829971ea83d0b7f3c8f0 (KEY)0023448620170000063001006514largealphabetsourcecodingusingindependentcomponent |
title_full |
Large Alphabet Source Coding Using Independent Component Analysis |
author_sort |
Painsky, Amichai |
journal |
IEEE transactions on information theory |
journalStr |
IEEE transactions on information theory |
lang_code |
eng |
isOA_bool |
false |
dewey-hundreds |
000 - Computer science, information & general works 600 - Technology |
recordtype |
marc |
publishDateSort |
2017 |
contenttype_str_mv |
txt |
container_start_page |
6514 |
author_browse |
Painsky, Amichai |
container_volume |
63 |
class |
070 620 DE-600 SA 5570 AVZ rvk |
format_se |
Aufsätze |
author-letter |
Painsky, Amichai |
doi_str_mv |
10.1109/TIT.2017.2728017 |
dewey-full |
070 620 |
title_sort |
large alphabet source coding using independent component analysis |
title_auth |
Large Alphabet Source Coding Using Independent Component Analysis |
abstract |
Large alphabet source coding is a basic and well-studied problem in data compression. It has many applications, such as compression of natural language text, speech, and images. The classic perception of most commonly used methods is that a source is best described over an alphabet, which is at least as large as the observed alphabet. In this paper, we challenge this approach and introduce a conceptual framework in which a large alphabet source is decomposed into "as statistically independent as possible" components. This decomposition allows us to apply entropy encoding to each component separately, while benefiting from their reduced alphabet size. We show that in many cases, such decomposition results in a sum of marginal entropies which is only slightly greater than the entropy of the source. Our suggested algorithm, based on a generalization of the binary independent component analysis, is applicable for a variety of large alphabet source coding setups. This includes the classical lossless compression, universal compression, and high-dimensional vector quantization. In each of these setups, our suggested approach outperforms most commonly used methods. Moreover, our proposed framework is significantly easier to implement in most of these cases. |
abstractGer |
Large alphabet source coding is a basic and well-studied problem in data compression. It has many applications, such as compression of natural language text, speech, and images. The classic perception of most commonly used methods is that a source is best described over an alphabet, which is at least as large as the observed alphabet. In this paper, we challenge this approach and introduce a conceptual framework in which a large alphabet source is decomposed into "as statistically independent as possible" components. This decomposition allows us to apply entropy encoding to each component separately, while benefiting from their reduced alphabet size. We show that in many cases, such decomposition results in a sum of marginal entropies which is only slightly greater than the entropy of the source. Our suggested algorithm, based on a generalization of the binary independent component analysis, is applicable for a variety of large alphabet source coding setups. This includes the classical lossless compression, universal compression, and high-dimensional vector quantization. In each of these setups, our suggested approach outperforms most commonly used methods. Moreover, our proposed framework is significantly easier to implement in most of these cases. |
abstract_unstemmed |
Large alphabet source coding is a basic and well-studied problem in data compression. It has many applications, such as compression of natural language text, speech, and images. The classic perception of most commonly used methods is that a source is best described over an alphabet, which is at least as large as the observed alphabet. In this paper, we challenge this approach and introduce a conceptual framework in which a large alphabet source is decomposed into "as statistically independent as possible" components. This decomposition allows us to apply entropy encoding to each component separately, while benefiting from their reduced alphabet size. We show that in many cases, such decomposition results in a sum of marginal entropies which is only slightly greater than the entropy of the source. Our suggested algorithm, based on a generalization of the binary independent component analysis, is applicable for a variety of large alphabet source coding setups. This includes the classical lossless compression, universal compression, and high-dimensional vector quantization. In each of these setups, our suggested approach outperforms most commonly used methods. Moreover, our proposed framework is significantly easier to implement in most of these cases. |
collection_details |
GBV_USEFLAG_A SYSFLAG_A GBV_OLC SSG-OLC-TEC SSG-OLC-MAT SSG-OLC-BUB SSG-OPC-BBI GBV_ILN_65 GBV_ILN_70 GBV_ILN_2002 GBV_ILN_2088 |
container_issue |
10 |
title_short |
Large Alphabet Source Coding Using Independent Component Analysis |
url |
http://dx.doi.org/10.1109/TIT.2017.2728017 http://ieeexplore.ieee.org/document/7983009 |
remote_bool |
false |
author2 |
Rosset, Saharon Feder, Meir |
author2Str |
Rosset, Saharon Feder, Meir |
ppnlink |
12954731X |
mediatype_str_mv |
n |
isOA_txt |
false |
hochschulschrift_bool |
false |
author2_role |
oth oth |
doi_str |
10.1109/TIT.2017.2728017 |
up_date |
2024-07-04T01:48:22.021Z |
_version_ |
1803611224993169408 |
fullrecord_marcxml |
<?xml version="1.0" encoding="UTF-8"?><collection xmlns="http://www.loc.gov/MARC21/slim"><record><leader>01000caa a2200265 4500</leader><controlfield tag="001">OLC1996944088</controlfield><controlfield tag="003">DE-627</controlfield><controlfield tag="005">20220221163316.0</controlfield><controlfield tag="007">tu</controlfield><controlfield tag="008">171125s2017 xx ||||| 00| ||eng c</controlfield><datafield tag="024" ind1="7" ind2=" "><subfield code="a">10.1109/TIT.2017.2728017</subfield><subfield code="2">doi</subfield></datafield><datafield tag="028" ind1="5" ind2="2"><subfield code="a">PQ20171228</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(DE-627)OLC1996944088</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(DE-599)GBVOLC1996944088</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(PRQ)i657-fd2c304c29599d7b084a47a4126702c653d332b174d5829971ea83d0b7f3c8f0</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(KEY)0023448620170000063001006514largealphabetsourcecodingusingindependentcomponent</subfield></datafield><datafield tag="040" ind1=" " ind2=" "><subfield code="a">DE-627</subfield><subfield code="b">ger</subfield><subfield code="c">DE-627</subfield><subfield code="e">rakwb</subfield></datafield><datafield tag="041" ind1=" " ind2=" "><subfield code="a">eng</subfield></datafield><datafield tag="082" ind1="0" ind2="4"><subfield code="a">070</subfield><subfield code="a">620</subfield><subfield code="q">DE-600</subfield></datafield><datafield tag="084" ind1=" " ind2=" "><subfield code="a">SA 5570</subfield><subfield code="q">AVZ</subfield><subfield code="2">rvk</subfield></datafield><datafield tag="100" ind1="1" ind2=" "><subfield code="a">Painsky, Amichai</subfield><subfield code="e">verfasserin</subfield><subfield code="4">aut</subfield></datafield><datafield tag="245" ind1="1" ind2="0"><subfield code="a">Large Alphabet Source Coding Using Independent Component Analysis</subfield></datafield><datafield tag="264" ind1=" " ind2="1"><subfield code="c">2017</subfield></datafield><datafield tag="336" ind1=" " ind2=" "><subfield code="a">Text</subfield><subfield code="b">txt</subfield><subfield code="2">rdacontent</subfield></datafield><datafield tag="337" ind1=" " ind2=" "><subfield code="a">ohne Hilfsmittel zu benutzen</subfield><subfield code="b">n</subfield><subfield code="2">rdamedia</subfield></datafield><datafield tag="338" ind1=" " ind2=" "><subfield code="a">Band</subfield><subfield code="b">nc</subfield><subfield code="2">rdacarrier</subfield></datafield><datafield tag="520" ind1=" " ind2=" "><subfield code="a">Large alphabet source coding is a basic and well-studied problem in data compression. It has many applications, such as compression of natural language text, speech, and images. The classic perception of most commonly used methods is that a source is best described over an alphabet, which is at least as large as the observed alphabet. In this paper, we challenge this approach and introduce a conceptual framework in which a large alphabet source is decomposed into "as statistically independent as possible" components. This decomposition allows us to apply entropy encoding to each component separately, while benefiting from their reduced alphabet size. We show that in many cases, such decomposition results in a sum of marginal entropies which is only slightly greater than the entropy of the source. Our suggested algorithm, based on a generalization of the binary independent component analysis, is applicable for a variety of large alphabet source coding setups. This includes the classical lossless compression, universal compression, and high-dimensional vector quantization. In each of these setups, our suggested approach outperforms most commonly used methods. Moreover, our proposed framework is significantly easier to implement in most of these cases.</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Independent component analysis</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Source coding</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Entropy Coding</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Decoding</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Redundancy</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Data Compression</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Entropy</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">Rosset, Saharon</subfield><subfield code="4">oth</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">Feder, Meir</subfield><subfield code="4">oth</subfield></datafield><datafield tag="773" ind1="0" ind2="8"><subfield code="i">Enthalten in</subfield><subfield code="t">IEEE transactions on information theory</subfield><subfield code="d">Piscataway, NJ : IEEE, 1963</subfield><subfield code="g">63(2017), 10, Seite 6514-6529</subfield><subfield code="w">(DE-627)12954731X</subfield><subfield code="w">(DE-600)218505-2</subfield><subfield code="w">(DE-576)01499819X</subfield><subfield code="x">0018-9448</subfield><subfield code="7">nnns</subfield></datafield><datafield tag="773" ind1="1" ind2="8"><subfield code="g">volume:63</subfield><subfield code="g">year:2017</subfield><subfield code="g">number:10</subfield><subfield code="g">pages:6514-6529</subfield></datafield><datafield tag="856" ind1="4" ind2="1"><subfield code="u">http://dx.doi.org/10.1109/TIT.2017.2728017</subfield><subfield code="3">Volltext</subfield></datafield><datafield tag="856" ind1="4" ind2="2"><subfield code="u">http://ieeexplore.ieee.org/document/7983009</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_USEFLAG_A</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">SYSFLAG_A</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_OLC</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">SSG-OLC-TEC</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">SSG-OLC-MAT</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">SSG-OLC-BUB</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">SSG-OPC-BBI</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_ILN_65</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_ILN_70</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_ILN_2002</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_ILN_2088</subfield></datafield><datafield tag="936" ind1="r" ind2="v"><subfield code="a">SA 5570</subfield></datafield><datafield tag="951" ind1=" " ind2=" "><subfield code="a">AR</subfield></datafield><datafield tag="952" ind1=" " ind2=" "><subfield code="d">63</subfield><subfield code="j">2017</subfield><subfield code="e">10</subfield><subfield code="h">6514-6529</subfield></datafield></record></collection>
|
score |
7.400482 |