Adaptive aggregation for reinforcement learning in average reward Markov decision processes
Author: |
Ortner, Ronald [author] |
---|
Format: |
Article |
---|
Published: |
2013 |
---|
Extent: |
16 |
---|
Parent work: |
Contained in: Annals of operations research - Dordrecht, The Netherlands : Springer Nature B.V., 1984, 208(2013), no. 1, Aug., pages 321-336 |
---|---|
Parent work: |
volume:208 ; year:2013 ; number:1 ; month:08 ; pages:321-336 ; extent:16 |
Catalog ID: |
OLC1926438787 |
---|
LEADER | 01000caa a22002652 4500 | ||
---|---|---|---|
001 | OLC1926438787 | ||
003 | DE-627 | ||
005 | 20230713205654.0 | ||
007 | tu | ||
008 | 130827s2013 xx ||||| 00| ||und c | ||
028 | 5 | 2 | |a sw130826_1 |
035 | |a (DE-627)OLC1926438787 | ||
035 | |a (DE-599)GBVOLC1926438787 | ||
040 | |a DE-627 |b ger |c DE-627 |e rakwb | ||
082 | 0 | 4 | |a 004 |
100 | 1 | |a Ortner, Ronald |e verfasserin |4 aut | |
245 | 1 | 0 | |a Adaptive aggregation for reinforcement learning in average reward Markov decision processes |
264 | 1 | |c 2013 | |
300 | |a 16 | ||
336 | |a Text |b txt |2 rdacontent | ||
337 | |a ohne Hilfsmittel zu benutzen |b n |2 rdamedia | ||
338 | |a Band |b nc |2 rdacarrier | ||
773 | 0 | 8 | |i Enthalten in |t Annals of operations research |d Dordrecht, The Netherlands : Springer Nature B.V., 1984 |g 208(2013), 1 vom: Aug., Seite 321-336 |w (DE-627)12964370X |w (DE-600)252629-3 |w (DE-576)018141862 |x 0254-5330 |
773 | 1 | 8 | |g volume:208 |g year:2013 |g number:1 |g month:08 |g pages:321-336 |g extent:16 |
912 | |a GBV_USEFLAG_A | ||
912 | |a SYSFLAG_A | ||
912 | |a GBV_OLC | ||
912 | |a SSG-OLC-WIW | ||
912 | |a SSG-OLC-MAT | ||
912 | |a GBV_ILN_4029 | ||
951 | |a AR | ||
952 | |d 208 |j 2013 |e 1 |c 8 |h 321-336 |g 16 |
author_variant |
r o ro |
---|---|
matchkey_str |
article:02545330:2013----::dpiegrgtofrenocmnlanniaeaeeada |
hierarchy_sort_str |
2013 |
publishDate |
2013 |
allfields |
sw130826_1 (DE-627)OLC1926438787 (DE-599)GBVOLC1926438787 DE-627 ger DE-627 rakwb 004 Ortner, Ronald verfasserin aut Adaptive aggregation for reinforcement learning in average reward Markov decision processes 2013 16 Text txt rdacontent ohne Hilfsmittel zu benutzen n rdamedia Band nc rdacarrier Enthalten in Annals of operations research Dordrecht, The Netherlands : Springer Nature B.V., 1984 208(2013), 1 vom: Aug., Seite 321-336 (DE-627)12964370X (DE-600)252629-3 (DE-576)018141862 0254-5330 volume:208 year:2013 number:1 month:08 pages:321-336 extent:16 GBV_USEFLAG_A SYSFLAG_A GBV_OLC SSG-OLC-WIW SSG-OLC-MAT GBV_ILN_4029 AR 208 2013 1 8 321-336 16 |
source |
Enthalten in Annals of operations research 208(2013), 1 vom: Aug., Seite 321-336 volume:208 year:2013 number:1 month:08 pages:321-336 extent:16 |
format_phy_str_mv |
Article |
institution |
findex.gbv.de |
dewey-raw |
004 |
isfreeaccess_bool |
false |
container_title |
Annals of operations research |
authorswithroles_txt_mv |
Ortner, Ronald @@aut@@ |
publishDateDaySort_date |
2013-08-01T00:00:00Z |
hierarchy_top_id |
12964370X |
dewey-sort |
14 |
id |
OLC1926438787 |
fullrecord_marcxml |
<?xml version="1.0" encoding="UTF-8"?><collection xmlns="http://www.loc.gov/MARC21/slim"><record><leader>01000caa a22002652 4500</leader><controlfield tag="001">OLC1926438787</controlfield><controlfield tag="003">DE-627</controlfield><controlfield tag="005">20230713205654.0</controlfield><controlfield tag="007">tu</controlfield><controlfield tag="008">130827s2013 xx ||||| 00| ||und c</controlfield><datafield tag="028" ind1="5" ind2="2"><subfield code="a">sw130826_1</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(DE-627)OLC1926438787</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(DE-599)GBVOLC1926438787</subfield></datafield><datafield tag="040" ind1=" " ind2=" "><subfield code="a">DE-627</subfield><subfield code="b">ger</subfield><subfield code="c">DE-627</subfield><subfield code="e">rakwb</subfield></datafield><datafield tag="082" ind1="0" ind2="4"><subfield code="a">004</subfield></datafield><datafield tag="100" ind1="1" ind2=" "><subfield code="a">Ortner, Ronald</subfield><subfield code="e">verfasserin</subfield><subfield code="4">aut</subfield></datafield><datafield tag="245" ind1="1" ind2="0"><subfield code="a">Adaptive aggregation for reinforcement learning in average reward Markov decision processes</subfield></datafield><datafield tag="264" ind1=" " ind2="1"><subfield code="c">2013</subfield></datafield><datafield tag="300" ind1=" " ind2=" "><subfield code="a">16</subfield></datafield><datafield tag="336" ind1=" " ind2=" "><subfield code="a">Text</subfield><subfield code="b">txt</subfield><subfield code="2">rdacontent</subfield></datafield><datafield tag="337" ind1=" " ind2=" "><subfield code="a">ohne Hilfsmittel zu benutzen</subfield><subfield code="b">n</subfield><subfield code="2">rdamedia</subfield></datafield><datafield tag="338" ind1=" " ind2=" "><subfield code="a">Band</subfield><subfield code="b">nc</subfield><subfield code="2">rdacarrier</subfield></datafield><datafield tag="773" 
ind1="0" ind2="8"><subfield code="i">Enthalten in</subfield><subfield code="t">Annals of operations research</subfield><subfield code="d">Dordrecht, The Netherlands : Springer Nature B.V., 1984</subfield><subfield code="g">208(2013), 1 vom: Aug., Seite 321-336</subfield><subfield code="w">(DE-627)12964370X</subfield><subfield code="w">(DE-600)252629-3</subfield><subfield code="w">(DE-576)018141862</subfield><subfield code="x">0254-5330</subfield></datafield><datafield tag="773" ind1="1" ind2="8"><subfield code="g">volume:208</subfield><subfield code="g">year:2013</subfield><subfield code="g">number:1</subfield><subfield code="g">month:08</subfield><subfield code="g">pages:321-336</subfield><subfield code="g">extent:16</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_USEFLAG_A</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">SYSFLAG_A</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_OLC</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">SSG-OLC-WIW</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">SSG-OLC-MAT</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_ILN_4029</subfield></datafield><datafield tag="951" ind1=" " ind2=" "><subfield code="a">AR</subfield></datafield><datafield tag="952" ind1=" " ind2=" "><subfield code="d">208</subfield><subfield code="j">2013</subfield><subfield code="e">1</subfield><subfield code="c">8</subfield><subfield code="h">321-336</subfield><subfield code="g">16</subfield></datafield></record></collection>
|
author |
Ortner, Ronald |
spellingShingle |
Ortner, Ronald ddc 004 Adaptive aggregation for reinforcement learning in average reward Markov decision processes |
authorStr |
Ortner, Ronald |
ppnlink_with_tag_str_mv |
@@773@@(DE-627)12964370X |
format |
Article |
dewey-ones |
004 - Data processing & computer science |
delete_txt_mv |
keep |
author_role |
aut |
collection |
OLC |
remote_str |
false |
illustrated |
Not Illustrated |
issn |
0254-5330 |
topic_title |
004 Adaptive aggregation for reinforcement learning in average reward Markov decision processes |
topic |
ddc 004 |
topic_unstemmed |
ddc 004 |
topic_browse |
ddc 004 |
format_facet |
Articles Printed articles |
format_main_str_mv |
Text Journal/Article |
carriertype_str_mv |
nc |
hierarchy_parent_title |
Annals of operations research |
hierarchy_parent_id |
12964370X |
dewey-tens |
000 - Computer science, knowledge & systems |
hierarchy_top_title |
Annals of operations research |
isfreeaccess_txt |
false |
familylinks_str_mv |
(DE-627)12964370X (DE-600)252629-3 (DE-576)018141862 |
title |
Adaptive aggregation for reinforcement learning in average reward Markov decision processes |
ctrlnum |
(DE-627)OLC1926438787 (DE-599)GBVOLC1926438787 |
title_full |
Adaptive aggregation for reinforcement learning in average reward Markov decision processes |
author_sort |
Ortner, Ronald |
journal |
Annals of operations research |
journalStr |
Annals of operations research |
isOA_bool |
false |
dewey-hundreds |
000 - Computer science, information & general works |
recordtype |
marc |
publishDateSort |
2013 |
contenttype_str_mv |
txt |
container_start_page |
321 |
author_browse |
Ortner, Ronald |
container_volume |
208 |
physical |
16 |
class |
004 |
format_se |
Articles |
author-letter |
Ortner, Ronald |
dewey-full |
004 |
title_sort |
adaptive aggregation for reinforcement learning in average reward markov decision processes |
title_auth |
Adaptive aggregation for reinforcement learning in average reward Markov decision processes |
collection_details |
GBV_USEFLAG_A SYSFLAG_A GBV_OLC SSG-OLC-WIW SSG-OLC-MAT GBV_ILN_4029 |
container_issue |
1 |
title_short |
Adaptive aggregation for reinforcement learning in average reward Markov decision processes |
remote_bool |
false |
ppnlink |
12964370X |
mediatype_str_mv |
n |
isOA_txt |
false |
hochschulschrift_bool |
false |
up_date |
2024-07-03T16:12:24.989Z |
_version_ |
1803574989279985664 |
score |
7.400876 |