Histograms based on the minimum description length principle
Abstract Histograms have been widely used for selectivity estimation in query optimization, as well as for fast approximate query answering in many OLAP, data mining, and data visualization applications. This paper presents a new family of histograms, the Hierarchical Model Fitting (HMF) histograms,...
Ausführliche Beschreibung
Autor*in: |
Wang, Hai [verfasserIn] |
---|
Format: |
Artikel |
---|---|
Sprache: |
Englisch |
Erschienen: |
2006 |
---|
Schlagwörter: |
---|
Anmerkung: |
© Springer-Verlag 2006 |
---|
Übergeordnetes Werk: |
Enthalten in: The VLDB journal - Springer-Verlag, 1992, 17(2006), 3 vom: 14. Dez., Seite 419-442 |
---|---|
Übergeordnetes Werk: |
volume:17 ; year:2006 ; number:3 ; day:14 ; month:12 ; pages:419-442 |
Links: |
---|
DOI / URN: |
10.1007/s00778-006-0015-0 |
---|
Katalog-ID: |
OLC2051357439 |
---|
LEADER | 01000caa a22002652 4500 | ||
---|---|---|---|
001 | OLC2051357439 | ||
003 | DE-627 | ||
005 | 20230502151944.0 | ||
007 | tu | ||
008 | 200819s2006 xx ||||| 00| ||eng c | ||
024 | 7 | |a 10.1007/s00778-006-0015-0 |2 doi | |
035 | |a (DE-627)OLC2051357439 | ||
035 | |a (DE-He213)s00778-006-0015-0-p | ||
040 | |a DE-627 |b ger |c DE-627 |e rakwb | ||
041 | |a eng | ||
082 | 0 | 4 | |a 004 |q VZ |
100 | 1 | |a Wang, Hai |e verfasserin |4 aut | |
245 | 1 | 0 | |a Histograms based on the minimum description length principle |
264 | 1 | |c 2006 | |
336 | |a Text |b txt |2 rdacontent | ||
337 | |a ohne Hilfsmittel zu benutzen |b n |2 rdamedia | ||
338 | |a Band |b nc |2 rdacarrier | ||
500 | |a © Springer-Verlag 2006 | ||
520 | |a Abstract Histograms have been widely used for selectivity estimation in query optimization, as well as for fast approximate query answering in many OLAP, data mining, and data visualization applications. This paper presents a new family of histograms, the Hierarchical Model Fitting (HMF) histograms, based on the Minimum Description Length principle. Rather than having each bucket of a histogram described by the same type of model, the HMF histograms employ a local optimal model for each bucket. The improved effectiveness of the locally chosen models offsets more than the overhead of keeping track of the representation of each individual bucket. Through a set of experiments, we show that the HMF histograms are capable of providing more accurate approximations than previously proposed techniques for many real and synthetic data sets across a variety of query workloads. | ||
650 | 4 | |a Query processing | |
650 | 4 | |a Approximate query answering | |
650 | 4 | |a Data summarization | |
650 | 4 | |a Histograms | |
700 | 1 | |a Sevcik, Kenneth C. |4 aut | |
773 | 0 | 8 | |i Enthalten in |t The VLDB journal |d Springer-Verlag, 1992 |g 17(2006), 3 vom: 14. Dez., Seite 419-442 |w (DE-627)170933059 |w (DE-600)1129061-4 |w (DE-576)032856466 |x 1066-8888 |7 nnns |
773 | 1 | 8 | |g volume:17 |g year:2006 |g number:3 |g day:14 |g month:12 |g pages:419-442 |
856 | 4 | 1 | |u https://doi.org/10.1007/s00778-006-0015-0 |z lizenzpflichtig |3 Volltext |
912 | |a GBV_USEFLAG_A | ||
912 | |a SYSFLAG_A | ||
912 | |a GBV_OLC | ||
912 | |a SSG-OLC-MAT | ||
912 | |a GBV_ILN_22 | ||
912 | |a GBV_ILN_24 | ||
912 | |a GBV_ILN_30 | ||
912 | |a GBV_ILN_31 | ||
912 | |a GBV_ILN_32 | ||
912 | |a GBV_ILN_62 | ||
912 | |a GBV_ILN_65 | ||
912 | |a GBV_ILN_70 | ||
912 | |a GBV_ILN_100 | ||
912 | |a GBV_ILN_2005 | ||
912 | |a GBV_ILN_2006 | ||
912 | |a GBV_ILN_2010 | ||
912 | |a GBV_ILN_2018 | ||
912 | |a GBV_ILN_2020 | ||
912 | |a GBV_ILN_2021 | ||
912 | |a GBV_ILN_4116 | ||
912 | |a GBV_ILN_4126 | ||
912 | |a GBV_ILN_4266 | ||
912 | |a GBV_ILN_4277 | ||
912 | |a GBV_ILN_4305 | ||
912 | |a GBV_ILN_4311 | ||
951 | |a AR | ||
952 | |d 17 |j 2006 |e 3 |b 14 |c 12 |h 419-442 |
author_variant |
h w hw k c s kc kcs |
---|---|
matchkey_str |
article:10668888:2006----::itgasaeoteiiudsrpin |
hierarchy_sort_str |
2006 |
publishDate |
2006 |
allfields |
10.1007/s00778-006-0015-0 doi (DE-627)OLC2051357439 (DE-He213)s00778-006-0015-0-p DE-627 ger DE-627 rakwb eng 004 VZ Wang, Hai verfasserin aut Histograms based on the minimum description length principle 2006 Text txt rdacontent ohne Hilfsmittel zu benutzen n rdamedia Band nc rdacarrier © Springer-Verlag 2006 Abstract Histograms have been widely used for selectivity estimation in query optimization, as well as for fast approximate query answering in many OLAP, data mining, and data visualization applications. This paper presents a new family of histograms, the Hierarchical Model Fitting (HMF) histograms, based on the Minimum Description Length principle. Rather than having each bucket of a histogram described by the same type of model, the HMF histograms employ a local optimal model for each bucket. The improved effectiveness of the locally chosen models offsets more than the overhead of keeping track of the representation of each individual bucket. Through a set of experiments, we show that the HMF histograms are capable of providing more accurate approximations than previously proposed techniques for many real and synthetic data sets across a variety of query workloads. Query processing Approximate query answering Data summarization Histograms Sevcik, Kenneth C. aut Enthalten in The VLDB journal Springer-Verlag, 1992 17(2006), 3 vom: 14. Dez., Seite 419-442 (DE-627)170933059 (DE-600)1129061-4 (DE-576)032856466 1066-8888 nnns volume:17 year:2006 number:3 day:14 month:12 pages:419-442 https://doi.org/10.1007/s00778-006-0015-0 lizenzpflichtig Volltext GBV_USEFLAG_A SYSFLAG_A GBV_OLC SSG-OLC-MAT GBV_ILN_22 GBV_ILN_24 GBV_ILN_30 GBV_ILN_31 GBV_ILN_32 GBV_ILN_62 GBV_ILN_65 GBV_ILN_70 GBV_ILN_100 GBV_ILN_2005 GBV_ILN_2006 GBV_ILN_2010 GBV_ILN_2018 GBV_ILN_2020 GBV_ILN_2021 GBV_ILN_4116 GBV_ILN_4126 GBV_ILN_4266 GBV_ILN_4277 GBV_ILN_4305 GBV_ILN_4311 AR 17 2006 3 14 12 419-442 |
spelling |
10.1007/s00778-006-0015-0 doi (DE-627)OLC2051357439 (DE-He213)s00778-006-0015-0-p DE-627 ger DE-627 rakwb eng 004 VZ Wang, Hai verfasserin aut Histograms based on the minimum description length principle 2006 Text txt rdacontent ohne Hilfsmittel zu benutzen n rdamedia Band nc rdacarrier © Springer-Verlag 2006 Abstract Histograms have been widely used for selectivity estimation in query optimization, as well as for fast approximate query answering in many OLAP, data mining, and data visualization applications. This paper presents a new family of histograms, the Hierarchical Model Fitting (HMF) histograms, based on the Minimum Description Length principle. Rather than having each bucket of a histogram described by the same type of model, the HMF histograms employ a local optimal model for each bucket. The improved effectiveness of the locally chosen models offsets more than the overhead of keeping track of the representation of each individual bucket. Through a set of experiments, we show that the HMF histograms are capable of providing more accurate approximations than previously proposed techniques for many real and synthetic data sets across a variety of query workloads. Query processing Approximate query answering Data summarization Histograms Sevcik, Kenneth C. aut Enthalten in The VLDB journal Springer-Verlag, 1992 17(2006), 3 vom: 14. Dez., Seite 419-442 (DE-627)170933059 (DE-600)1129061-4 (DE-576)032856466 1066-8888 nnns volume:17 year:2006 number:3 day:14 month:12 pages:419-442 https://doi.org/10.1007/s00778-006-0015-0 lizenzpflichtig Volltext GBV_USEFLAG_A SYSFLAG_A GBV_OLC SSG-OLC-MAT GBV_ILN_22 GBV_ILN_24 GBV_ILN_30 GBV_ILN_31 GBV_ILN_32 GBV_ILN_62 GBV_ILN_65 GBV_ILN_70 GBV_ILN_100 GBV_ILN_2005 GBV_ILN_2006 GBV_ILN_2010 GBV_ILN_2018 GBV_ILN_2020 GBV_ILN_2021 GBV_ILN_4116 GBV_ILN_4126 GBV_ILN_4266 GBV_ILN_4277 GBV_ILN_4305 GBV_ILN_4311 AR 17 2006 3 14 12 419-442 |
allfields_unstemmed |
10.1007/s00778-006-0015-0 doi (DE-627)OLC2051357439 (DE-He213)s00778-006-0015-0-p DE-627 ger DE-627 rakwb eng 004 VZ Wang, Hai verfasserin aut Histograms based on the minimum description length principle 2006 Text txt rdacontent ohne Hilfsmittel zu benutzen n rdamedia Band nc rdacarrier © Springer-Verlag 2006 Abstract Histograms have been widely used for selectivity estimation in query optimization, as well as for fast approximate query answering in many OLAP, data mining, and data visualization applications. This paper presents a new family of histograms, the Hierarchical Model Fitting (HMF) histograms, based on the Minimum Description Length principle. Rather than having each bucket of a histogram described by the same type of model, the HMF histograms employ a local optimal model for each bucket. The improved effectiveness of the locally chosen models offsets more than the overhead of keeping track of the representation of each individual bucket. Through a set of experiments, we show that the HMF histograms are capable of providing more accurate approximations than previously proposed techniques for many real and synthetic data sets across a variety of query workloads. Query processing Approximate query answering Data summarization Histograms Sevcik, Kenneth C. aut Enthalten in The VLDB journal Springer-Verlag, 1992 17(2006), 3 vom: 14. Dez., Seite 419-442 (DE-627)170933059 (DE-600)1129061-4 (DE-576)032856466 1066-8888 nnns volume:17 year:2006 number:3 day:14 month:12 pages:419-442 https://doi.org/10.1007/s00778-006-0015-0 lizenzpflichtig Volltext GBV_USEFLAG_A SYSFLAG_A GBV_OLC SSG-OLC-MAT GBV_ILN_22 GBV_ILN_24 GBV_ILN_30 GBV_ILN_31 GBV_ILN_32 GBV_ILN_62 GBV_ILN_65 GBV_ILN_70 GBV_ILN_100 GBV_ILN_2005 GBV_ILN_2006 GBV_ILN_2010 GBV_ILN_2018 GBV_ILN_2020 GBV_ILN_2021 GBV_ILN_4116 GBV_ILN_4126 GBV_ILN_4266 GBV_ILN_4277 GBV_ILN_4305 GBV_ILN_4311 AR 17 2006 3 14 12 419-442 |
allfieldsGer |
10.1007/s00778-006-0015-0 doi (DE-627)OLC2051357439 (DE-He213)s00778-006-0015-0-p DE-627 ger DE-627 rakwb eng 004 VZ Wang, Hai verfasserin aut Histograms based on the minimum description length principle 2006 Text txt rdacontent ohne Hilfsmittel zu benutzen n rdamedia Band nc rdacarrier © Springer-Verlag 2006 Abstract Histograms have been widely used for selectivity estimation in query optimization, as well as for fast approximate query answering in many OLAP, data mining, and data visualization applications. This paper presents a new family of histograms, the Hierarchical Model Fitting (HMF) histograms, based on the Minimum Description Length principle. Rather than having each bucket of a histogram described by the same type of model, the HMF histograms employ a local optimal model for each bucket. The improved effectiveness of the locally chosen models offsets more than the overhead of keeping track of the representation of each individual bucket. Through a set of experiments, we show that the HMF histograms are capable of providing more accurate approximations than previously proposed techniques for many real and synthetic data sets across a variety of query workloads. Query processing Approximate query answering Data summarization Histograms Sevcik, Kenneth C. aut Enthalten in The VLDB journal Springer-Verlag, 1992 17(2006), 3 vom: 14. Dez., Seite 419-442 (DE-627)170933059 (DE-600)1129061-4 (DE-576)032856466 1066-8888 nnns volume:17 year:2006 number:3 day:14 month:12 pages:419-442 https://doi.org/10.1007/s00778-006-0015-0 lizenzpflichtig Volltext GBV_USEFLAG_A SYSFLAG_A GBV_OLC SSG-OLC-MAT GBV_ILN_22 GBV_ILN_24 GBV_ILN_30 GBV_ILN_31 GBV_ILN_32 GBV_ILN_62 GBV_ILN_65 GBV_ILN_70 GBV_ILN_100 GBV_ILN_2005 GBV_ILN_2006 GBV_ILN_2010 GBV_ILN_2018 GBV_ILN_2020 GBV_ILN_2021 GBV_ILN_4116 GBV_ILN_4126 GBV_ILN_4266 GBV_ILN_4277 GBV_ILN_4305 GBV_ILN_4311 AR 17 2006 3 14 12 419-442 |
allfieldsSound |
10.1007/s00778-006-0015-0 doi (DE-627)OLC2051357439 (DE-He213)s00778-006-0015-0-p DE-627 ger DE-627 rakwb eng 004 VZ Wang, Hai verfasserin aut Histograms based on the minimum description length principle 2006 Text txt rdacontent ohne Hilfsmittel zu benutzen n rdamedia Band nc rdacarrier © Springer-Verlag 2006 Abstract Histograms have been widely used for selectivity estimation in query optimization, as well as for fast approximate query answering in many OLAP, data mining, and data visualization applications. This paper presents a new family of histograms, the Hierarchical Model Fitting (HMF) histograms, based on the Minimum Description Length principle. Rather than having each bucket of a histogram described by the same type of model, the HMF histograms employ a local optimal model for each bucket. The improved effectiveness of the locally chosen models offsets more than the overhead of keeping track of the representation of each individual bucket. Through a set of experiments, we show that the HMF histograms are capable of providing more accurate approximations than previously proposed techniques for many real and synthetic data sets across a variety of query workloads. Query processing Approximate query answering Data summarization Histograms Sevcik, Kenneth C. aut Enthalten in The VLDB journal Springer-Verlag, 1992 17(2006), 3 vom: 14. Dez., Seite 419-442 (DE-627)170933059 (DE-600)1129061-4 (DE-576)032856466 1066-8888 nnns volume:17 year:2006 number:3 day:14 month:12 pages:419-442 https://doi.org/10.1007/s00778-006-0015-0 lizenzpflichtig Volltext GBV_USEFLAG_A SYSFLAG_A GBV_OLC SSG-OLC-MAT GBV_ILN_22 GBV_ILN_24 GBV_ILN_30 GBV_ILN_31 GBV_ILN_32 GBV_ILN_62 GBV_ILN_65 GBV_ILN_70 GBV_ILN_100 GBV_ILN_2005 GBV_ILN_2006 GBV_ILN_2010 GBV_ILN_2018 GBV_ILN_2020 GBV_ILN_2021 GBV_ILN_4116 GBV_ILN_4126 GBV_ILN_4266 GBV_ILN_4277 GBV_ILN_4305 GBV_ILN_4311 AR 17 2006 3 14 12 419-442 |
language |
English |
source |
Enthalten in The VLDB journal 17(2006), 3 vom: 14. Dez., Seite 419-442 volume:17 year:2006 number:3 day:14 month:12 pages:419-442 |
sourceStr |
Enthalten in The VLDB journal 17(2006), 3 vom: 14. Dez., Seite 419-442 volume:17 year:2006 number:3 day:14 month:12 pages:419-442 |
format_phy_str_mv |
Article |
institution |
findex.gbv.de |
topic_facet |
Query processing Approximate query answering Data summarization Histograms |
dewey-raw |
004 |
isfreeaccess_bool |
false |
container_title |
The VLDB journal |
authorswithroles_txt_mv |
Wang, Hai @@aut@@ Sevcik, Kenneth C. @@aut@@ |
publishDateDaySort_date |
2006-12-14T00:00:00Z |
hierarchy_top_id |
170933059 |
dewey-sort |
14 |
id |
OLC2051357439 |
language_de |
englisch |
fullrecord |
<?xml version="1.0" encoding="UTF-8"?><collection xmlns="http://www.loc.gov/MARC21/slim"><record><leader>01000caa a22002652 4500</leader><controlfield tag="001">OLC2051357439</controlfield><controlfield tag="003">DE-627</controlfield><controlfield tag="005">20230502151944.0</controlfield><controlfield tag="007">tu</controlfield><controlfield tag="008">200819s2006 xx ||||| 00| ||eng c</controlfield><datafield tag="024" ind1="7" ind2=" "><subfield code="a">10.1007/s00778-006-0015-0</subfield><subfield code="2">doi</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(DE-627)OLC2051357439</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(DE-He213)s00778-006-0015-0-p</subfield></datafield><datafield tag="040" ind1=" " ind2=" "><subfield code="a">DE-627</subfield><subfield code="b">ger</subfield><subfield code="c">DE-627</subfield><subfield code="e">rakwb</subfield></datafield><datafield tag="041" ind1=" " ind2=" "><subfield code="a">eng</subfield></datafield><datafield tag="082" ind1="0" ind2="4"><subfield code="a">004</subfield><subfield code="q">VZ</subfield></datafield><datafield tag="100" ind1="1" ind2=" "><subfield code="a">Wang, Hai</subfield><subfield code="e">verfasserin</subfield><subfield code="4">aut</subfield></datafield><datafield tag="245" ind1="1" ind2="0"><subfield code="a">Histograms based on the minimum description length principle</subfield></datafield><datafield tag="264" ind1=" " ind2="1"><subfield code="c">2006</subfield></datafield><datafield tag="336" ind1=" " ind2=" "><subfield code="a">Text</subfield><subfield code="b">txt</subfield><subfield code="2">rdacontent</subfield></datafield><datafield tag="337" ind1=" " ind2=" "><subfield code="a">ohne Hilfsmittel zu benutzen</subfield><subfield code="b">n</subfield><subfield code="2">rdamedia</subfield></datafield><datafield tag="338" ind1=" " ind2=" "><subfield code="a">Band</subfield><subfield code="b">nc</subfield><subfield code="2">rdacarrier</subfield></datafield><datafield tag="500" ind1=" " ind2=" "><subfield code="a">© Springer-Verlag 2006</subfield></datafield><datafield tag="520" ind1=" " ind2=" "><subfield code="a">Abstract Histograms have been widely used for selectivity estimation in query optimization, as well as for fast approximate query answering in many OLAP, data mining, and data visualization applications. This paper presents a new family of histograms, the Hierarchical Model Fitting (HMF) histograms, based on the Minimum Description Length principle. Rather than having each bucket of a histogram described by the same type of model, the HMF histograms employ a local optimal model for each bucket. The improved effectiveness of the locally chosen models offsets more than the overhead of keeping track of the representation of each individual bucket. Through a set of experiments, we show that the HMF histograms are capable of providing more accurate approximations than previously proposed techniques for many real and synthetic data sets across a variety of query workloads.</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Query processing</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Approximate query answering</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Data summarization</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Histograms</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">Sevcik, Kenneth C.</subfield><subfield code="4">aut</subfield></datafield><datafield tag="773" ind1="0" ind2="8"><subfield code="i">Enthalten in</subfield><subfield code="t">The VLDB journal</subfield><subfield code="d">Springer-Verlag, 1992</subfield><subfield code="g">17(2006), 3 vom: 14. Dez., Seite 419-442</subfield><subfield code="w">(DE-627)170933059</subfield><subfield code="w">(DE-600)1129061-4</subfield><subfield code="w">(DE-576)032856466</subfield><subfield code="x">1066-8888</subfield><subfield code="7">nnns</subfield></datafield><datafield tag="773" ind1="1" ind2="8"><subfield code="g">volume:17</subfield><subfield code="g">year:2006</subfield><subfield code="g">number:3</subfield><subfield code="g">day:14</subfield><subfield code="g">month:12</subfield><subfield code="g">pages:419-442</subfield></datafield><datafield tag="856" ind1="4" ind2="1"><subfield code="u">https://doi.org/10.1007/s00778-006-0015-0</subfield><subfield code="z">lizenzpflichtig</subfield><subfield code="3">Volltext</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_USEFLAG_A</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">SYSFLAG_A</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_OLC</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">SSG-OLC-MAT</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_ILN_22</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_ILN_24</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_ILN_30</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_ILN_31</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_ILN_32</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_ILN_62</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_ILN_65</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_ILN_70</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_ILN_100</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_ILN_2005</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_ILN_2006</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_ILN_2010</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_ILN_2018</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_ILN_2020</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_ILN_2021</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_ILN_4116</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_ILN_4126</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_ILN_4266</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_ILN_4277</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_ILN_4305</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_ILN_4311</subfield></datafield><datafield tag="951" ind1=" " ind2=" "><subfield code="a">AR</subfield></datafield><datafield tag="952" ind1=" " ind2=" "><subfield code="d">17</subfield><subfield code="j">2006</subfield><subfield code="e">3</subfield><subfield code="b">14</subfield><subfield code="c">12</subfield><subfield code="h">419-442</subfield></datafield></record></collection>
|
author |
Wang, Hai |
spellingShingle |
Wang, Hai ddc 004 misc Query processing misc Approximate query answering misc Data summarization misc Histograms Histograms based on the minimum description length principle |
authorStr |
Wang, Hai |
ppnlink_with_tag_str_mv |
@@773@@(DE-627)170933059 |
format |
Article |
dewey-ones |
004 - Data processing & computer science |
delete_txt_mv |
keep |
author_role |
aut aut |
collection |
OLC |
remote_str |
false |
illustrated |
Not Illustrated |
issn |
1066-8888 |
topic_title |
004 VZ Histograms based on the minimum description length principle Query processing Approximate query answering Data summarization Histograms |
topic |
ddc 004 misc Query processing misc Approximate query answering misc Data summarization misc Histograms |
topic_unstemmed |
ddc 004 misc Query processing misc Approximate query answering misc Data summarization misc Histograms |
topic_browse |
ddc 004 misc Query processing misc Approximate query answering misc Data summarization misc Histograms |
format_facet |
Aufsätze Gedruckte Aufsätze |
format_main_str_mv |
Text Zeitschrift/Artikel |
carriertype_str_mv |
nc |
hierarchy_parent_title |
The VLDB journal |
hierarchy_parent_id |
170933059 |
dewey-tens |
000 - Computer science, knowledge & systems |
hierarchy_top_title |
The VLDB journal |
isfreeaccess_txt |
false |
familylinks_str_mv |
(DE-627)170933059 (DE-600)1129061-4 (DE-576)032856466 |
title |
Histograms based on the minimum description length principle |
ctrlnum |
(DE-627)OLC2051357439 (DE-He213)s00778-006-0015-0-p |
title_full |
Histograms based on the minimum description length principle |
author_sort |
Wang, Hai |
journal |
The VLDB journal |
journalStr |
The VLDB journal |
lang_code |
eng |
isOA_bool |
false |
dewey-hundreds |
000 - Computer science, information & general works |
recordtype |
marc |
publishDateSort |
2006 |
contenttype_str_mv |
txt |
container_start_page |
419 |
author_browse |
Wang, Hai Sevcik, Kenneth C. |
container_volume |
17 |
class |
004 VZ |
format_se |
Aufsätze |
author-letter |
Wang, Hai |
doi_str_mv |
10.1007/s00778-006-0015-0 |
dewey-full |
004 |
title_sort |
histograms based on the minimum description length principle |
title_auth |
Histograms based on the minimum description length principle |
abstract |
Abstract Histograms have been widely used for selectivity estimation in query optimization, as well as for fast approximate query answering in many OLAP, data mining, and data visualization applications. This paper presents a new family of histograms, the Hierarchical Model Fitting (HMF) histograms, based on the Minimum Description Length principle. Rather than having each bucket of a histogram described by the same type of model, the HMF histograms employ a local optimal model for each bucket. The improved effectiveness of the locally chosen models offsets more than the overhead of keeping track of the representation of each individual bucket. Through a set of experiments, we show that the HMF histograms are capable of providing more accurate approximations than previously proposed techniques for many real and synthetic data sets across a variety of query workloads. © Springer-Verlag 2006 |
abstractGer |
Abstract Histograms have been widely used for selectivity estimation in query optimization, as well as for fast approximate query answering in many OLAP, data mining, and data visualization applications. This paper presents a new family of histograms, the Hierarchical Model Fitting (HMF) histograms, based on the Minimum Description Length principle. Rather than having each bucket of a histogram described by the same type of model, the HMF histograms employ a local optimal model for each bucket. The improved effectiveness of the locally chosen models offsets more than the overhead of keeping track of the representation of each individual bucket. Through a set of experiments, we show that the HMF histograms are capable of providing more accurate approximations than previously proposed techniques for many real and synthetic data sets across a variety of query workloads. © Springer-Verlag 2006 |
abstract_unstemmed |
Abstract Histograms have been widely used for selectivity estimation in query optimization, as well as for fast approximate query answering in many OLAP, data mining, and data visualization applications. This paper presents a new family of histograms, the Hierarchical Model Fitting (HMF) histograms, based on the Minimum Description Length principle. Rather than having each bucket of a histogram described by the same type of model, the HMF histograms employ a local optimal model for each bucket. The improved effectiveness of the locally chosen models offsets more than the overhead of keeping track of the representation of each individual bucket. Through a set of experiments, we show that the HMF histograms are capable of providing more accurate approximations than previously proposed techniques for many real and synthetic data sets across a variety of query workloads. © Springer-Verlag 2006 |
collection_details |
GBV_USEFLAG_A SYSFLAG_A GBV_OLC SSG-OLC-MAT GBV_ILN_22 GBV_ILN_24 GBV_ILN_30 GBV_ILN_31 GBV_ILN_32 GBV_ILN_62 GBV_ILN_65 GBV_ILN_70 GBV_ILN_100 GBV_ILN_2005 GBV_ILN_2006 GBV_ILN_2010 GBV_ILN_2018 GBV_ILN_2020 GBV_ILN_2021 GBV_ILN_4116 GBV_ILN_4126 GBV_ILN_4266 GBV_ILN_4277 GBV_ILN_4305 GBV_ILN_4311 |
container_issue |
3 |
title_short |
Histograms based on the minimum description length principle |
url |
https://doi.org/10.1007/s00778-006-0015-0 |
remote_bool |
false |
author2 |
Sevcik, Kenneth C. |
author2Str |
Sevcik, Kenneth C. |
ppnlink |
170933059 |
mediatype_str_mv |
n |
isOA_txt |
false |
hochschulschrift_bool |
false |
doi_str |
10.1007/s00778-006-0015-0 |
up_date |
2024-07-04T04:12:23.550Z |
_version_ |
1803620286263721984 |
fullrecord_marcxml |
<?xml version="1.0" encoding="UTF-8"?><collection xmlns="http://www.loc.gov/MARC21/slim"><record><leader>01000caa a22002652 4500</leader><controlfield tag="001">OLC2051357439</controlfield><controlfield tag="003">DE-627</controlfield><controlfield tag="005">20230502151944.0</controlfield><controlfield tag="007">tu</controlfield><controlfield tag="008">200819s2006 xx ||||| 00| ||eng c</controlfield><datafield tag="024" ind1="7" ind2=" "><subfield code="a">10.1007/s00778-006-0015-0</subfield><subfield code="2">doi</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(DE-627)OLC2051357439</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(DE-He213)s00778-006-0015-0-p</subfield></datafield><datafield tag="040" ind1=" " ind2=" "><subfield code="a">DE-627</subfield><subfield code="b">ger</subfield><subfield code="c">DE-627</subfield><subfield code="e">rakwb</subfield></datafield><datafield tag="041" ind1=" " ind2=" "><subfield code="a">eng</subfield></datafield><datafield tag="082" ind1="0" ind2="4"><subfield code="a">004</subfield><subfield code="q">VZ</subfield></datafield><datafield tag="100" ind1="1" ind2=" "><subfield code="a">Wang, Hai</subfield><subfield code="e">verfasserin</subfield><subfield code="4">aut</subfield></datafield><datafield tag="245" ind1="1" ind2="0"><subfield code="a">Histograms based on the minimum description length principle</subfield></datafield><datafield tag="264" ind1=" " ind2="1"><subfield code="c">2006</subfield></datafield><datafield tag="336" ind1=" " ind2=" "><subfield code="a">Text</subfield><subfield code="b">txt</subfield><subfield code="2">rdacontent</subfield></datafield><datafield tag="337" ind1=" " ind2=" "><subfield code="a">ohne Hilfsmittel zu benutzen</subfield><subfield code="b">n</subfield><subfield code="2">rdamedia</subfield></datafield><datafield tag="338" ind1=" " ind2=" "><subfield code="a">Band</subfield><subfield code="b">nc</subfield><subfield code="2">rdacarrier</subfield></datafield><datafield tag="500" ind1=" " ind2=" "><subfield code="a">© Springer-Verlag 2006</subfield></datafield><datafield tag="520" ind1=" " ind2=" "><subfield code="a">Abstract Histograms have been widely used for selectivity estimation in query optimization, as well as for fast approximate query answering in many OLAP, data mining, and data visualization applications. This paper presents a new family of histograms, the Hierarchical Model Fitting (HMF) histograms, based on the Minimum Description Length principle. Rather than having each bucket of a histogram described by the same type of model, the HMF histograms employ a local optimal model for each bucket. The improved effectiveness of the locally chosen models offsets more than the overhead of keeping track of the representation of each individual bucket. Through a set of experiments, we show that the HMF histograms are capable of providing more accurate approximations than previously proposed techniques for many real and synthetic data sets across a variety of query workloads.</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Query processing</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Approximate query answering</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Data summarization</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Histograms</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">Sevcik, Kenneth C.</subfield><subfield code="4">aut</subfield></datafield><datafield tag="773" ind1="0" ind2="8"><subfield code="i">Enthalten in</subfield><subfield code="t">The VLDB journal</subfield><subfield code="d">Springer-Verlag, 1992</subfield><subfield code="g">17(2006), 3 vom: 14. Dez., Seite 419-442</subfield><subfield code="w">(DE-627)170933059</subfield><subfield code="w">(DE-600)1129061-4</subfield><subfield code="w">(DE-576)032856466</subfield><subfield code="x">1066-8888</subfield><subfield code="7">nnns</subfield></datafield><datafield tag="773" ind1="1" ind2="8"><subfield code="g">volume:17</subfield><subfield code="g">year:2006</subfield><subfield code="g">number:3</subfield><subfield code="g">day:14</subfield><subfield code="g">month:12</subfield><subfield code="g">pages:419-442</subfield></datafield><datafield tag="856" ind1="4" ind2="1"><subfield code="u">https://doi.org/10.1007/s00778-006-0015-0</subfield><subfield code="z">lizenzpflichtig</subfield><subfield code="3">Volltext</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_USEFLAG_A</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">SYSFLAG_A</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_OLC</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">SSG-OLC-MAT</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_ILN_22</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_ILN_24</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_ILN_30</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_ILN_31</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_ILN_32</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_ILN_62</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_ILN_65</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_ILN_70</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_ILN_100</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_ILN_2005</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_ILN_2006</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_ILN_2010</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_ILN_2018</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_ILN_2020</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_ILN_2021</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_ILN_4116</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_ILN_4126</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_ILN_4266</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_ILN_4277</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_ILN_4305</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_ILN_4311</subfield></datafield><datafield tag="951" ind1=" " ind2=" "><subfield code="a">AR</subfield></datafield><datafield tag="952" ind1=" " ind2=" "><subfield code="d">17</subfield><subfield code="j">2006</subfield><subfield code="e">3</subfield><subfield code="b">14</subfield><subfield code="c">12</subfield><subfield code="h">419-442</subfield></datafield></record></collection>
|
score |
7.3988037 |