Scene text recognition using residual convolutional recurrent neural network
Abstract Text is a significant tool for human communication, and text recognition in scene images becomes more and more important. In this paper, we propose a residual convolutional recurrent neural network for solving the task of scene text recognition. The general convolutional recurrent neural ne...
Ausführliche Beschreibung
Autor*in: |
Lei, Zhengchao [verfasserIn] |
---|
Format: |
Artikel |
---|---|
Sprache: |
Englisch |
Erschienen: |
2018 |
---|
Schlagwörter: |
---|
Anmerkung: |
© Springer-Verlag GmbH Germany, part of Springer Nature 2018 |
---|
Übergeordnetes Werk: |
Enthalten in: Machine vision and applications - Springer Berlin Heidelberg, 1988, 29(2018), 5 vom: 16. Juni, Seite 861-871 |
---|---|
Übergeordnetes Werk: |
volume:29 ; year:2018 ; number:5 ; day:16 ; month:06 ; pages:861-871 |
Links: |
---|
DOI / URN: |
10.1007/s00138-018-0942-y |
---|
Katalog-ID: |
OLC2074632053 |
---|
LEADER | 01000caa a22002652 4500 | ||
---|---|---|---|
001 | OLC2074632053 | ||
003 | DE-627 | ||
005 | 20230401063324.0 | ||
007 | tu | ||
008 | 200820s2018 xx ||||| 00| ||eng c | ||
024 | 7 | |a 10.1007/s00138-018-0942-y |2 doi | |
035 | |a (DE-627)OLC2074632053 | ||
035 | |a (DE-He213)s00138-018-0942-y-p | ||
040 | |a DE-627 |b ger |c DE-627 |e rakwb | ||
041 | |a eng | ||
082 | 0 | 4 | |a 004 |q VZ |
084 | |a 11 |2 ssgn | ||
100 | 1 | |a Lei, Zhengchao |e verfasserin |4 aut | |
245 | 1 | 0 | |a Scene text recognition using residual convolutional recurrent neural network |
264 | 1 | |c 2018 | |
336 | |a Text |b txt |2 rdacontent | ||
337 | |a ohne Hilfsmittel zu benutzen |b n |2 rdamedia | ||
338 | |a Band |b nc |2 rdacarrier | ||
500 | |a © Springer-Verlag GmbH Germany, part of Springer Nature 2018 | ||
520 | |a Abstract Text is a significant tool for human communication, and text recognition in scene images becomes more and more important. In this paper, we propose a residual convolutional recurrent neural network for solving the task of scene text recognition. The general convolutional recurrent neural network (CRNN) is realized by combining convolutional neural network (CNN) with recurrent neural network (RNN). The CNN part extracts features and the RNN part encodes and decodes feature sequences. In order to improve the accuracy rate of scene text recognition based on CRNN, we explore different deeper CNN architectures to get feature descriptors and analyze the corresponding text recognition results. Specifically, VGG and ResNet are introduced to train these different deep models and obtain the encoding information of images. The experimental results on public datasets demonstrate the effectiveness of our method. | ||
650 | 4 | |a Residual convolutional recurrent neural network | |
650 | 4 | |a Scene text recognition | |
650 | 4 | |a Convolutional neural network | |
650 | 4 | |a Recurrent neural network | |
650 | 4 | |a Residual network | |
700 | 1 | |a Zhao, Sanyuan |4 aut | |
700 | 1 | |a Song, Hongmei |4 aut | |
700 | 1 | |a Shen, Jianbing |4 aut | |
773 | 0 | 8 | |i Enthalten in |t Machine vision and applications |d Springer Berlin Heidelberg, 1988 |g 29(2018), 5 vom: 16. Juni, Seite 861-871 |w (DE-627)129248843 |w (DE-600)59385-0 |w (DE-576)017944139 |x 0932-8092 |7 nnns |
773 | 1 | 8 | |g volume:29 |g year:2018 |g number:5 |g day:16 |g month:06 |g pages:861-871 |
856 | 4 | 1 | |u https://doi.org/10.1007/s00138-018-0942-y |z lizenzpflichtig |3 Volltext |
912 | |a GBV_USEFLAG_A | ||
912 | |a SYSFLAG_A | ||
912 | |a GBV_OLC | ||
912 | |a SSG-OLC-MAT | ||
912 | |a GBV_ILN_70 | ||
912 | |a GBV_ILN_2018 | ||
912 | |a GBV_ILN_4277 | ||
951 | |a AR | ||
952 | |d 29 |j 2018 |e 5 |b 16 |c 06 |h 861-871 |
author_variant |
z l zl s z sz h s hs j s js |
---|---|
matchkey_str |
article:09328092:2018----::cntxrcgiinsnrsdacnouinleu |
hierarchy_sort_str |
2018 |
publishDate |
2018 |
allfields |
10.1007/s00138-018-0942-y doi (DE-627)OLC2074632053 (DE-He213)s00138-018-0942-y-p DE-627 ger DE-627 rakwb eng 004 VZ 11 ssgn Lei, Zhengchao verfasserin aut Scene text recognition using residual convolutional recurrent neural network 2018 Text txt rdacontent ohne Hilfsmittel zu benutzen n rdamedia Band nc rdacarrier © Springer-Verlag GmbH Germany, part of Springer Nature 2018 Abstract Text is a significant tool for human communication, and text recognition in scene images becomes more and more important. In this paper, we propose a residual convolutional recurrent neural network for solving the task of scene text recognition. The general convolutional recurrent neural network (CRNN) is realized by combining convolutional neural network (CNN) with recurrent neural network (RNN). The CNN part extracts features and the RNN part encodes and decodes feature sequences. In order to improve the accuracy rate of scene text recognition based on CRNN, we explore different deeper CNN architectures to get feature descriptors and analyze the corresponding text recognition results. Specifically, VGG and ResNet are introduced to train these different deep models and obtain the encoding information of images. The experimental results on public datasets demonstrate the effectiveness of our method. Residual convolutional recurrent neural network Scene text recognition Convolutional neural network Recurrent neural network Residual network Zhao, Sanyuan aut Song, Hongmei aut Shen, Jianbing aut Enthalten in Machine vision and applications Springer Berlin Heidelberg, 1988 29(2018), 5 vom: 16. Juni, Seite 861-871 (DE-627)129248843 (DE-600)59385-0 (DE-576)017944139 0932-8092 nnns volume:29 year:2018 number:5 day:16 month:06 pages:861-871 https://doi.org/10.1007/s00138-018-0942-y lizenzpflichtig Volltext GBV_USEFLAG_A SYSFLAG_A GBV_OLC SSG-OLC-MAT GBV_ILN_70 GBV_ILN_2018 GBV_ILN_4277 AR 29 2018 5 16 06 861-871 |
spelling |
10.1007/s00138-018-0942-y doi (DE-627)OLC2074632053 (DE-He213)s00138-018-0942-y-p DE-627 ger DE-627 rakwb eng 004 VZ 11 ssgn Lei, Zhengchao verfasserin aut Scene text recognition using residual convolutional recurrent neural network 2018 Text txt rdacontent ohne Hilfsmittel zu benutzen n rdamedia Band nc rdacarrier © Springer-Verlag GmbH Germany, part of Springer Nature 2018 Abstract Text is a significant tool for human communication, and text recognition in scene images becomes more and more important. In this paper, we propose a residual convolutional recurrent neural network for solving the task of scene text recognition. The general convolutional recurrent neural network (CRNN) is realized by combining convolutional neural network (CNN) with recurrent neural network (RNN). The CNN part extracts features and the RNN part encodes and decodes feature sequences. In order to improve the accuracy rate of scene text recognition based on CRNN, we explore different deeper CNN architectures to get feature descriptors and analyze the corresponding text recognition results. Specifically, VGG and ResNet are introduced to train these different deep models and obtain the encoding information of images. The experimental results on public datasets demonstrate the effectiveness of our method. Residual convolutional recurrent neural network Scene text recognition Convolutional neural network Recurrent neural network Residual network Zhao, Sanyuan aut Song, Hongmei aut Shen, Jianbing aut Enthalten in Machine vision and applications Springer Berlin Heidelberg, 1988 29(2018), 5 vom: 16. Juni, Seite 861-871 (DE-627)129248843 (DE-600)59385-0 (DE-576)017944139 0932-8092 nnns volume:29 year:2018 number:5 day:16 month:06 pages:861-871 https://doi.org/10.1007/s00138-018-0942-y lizenzpflichtig Volltext GBV_USEFLAG_A SYSFLAG_A GBV_OLC SSG-OLC-MAT GBV_ILN_70 GBV_ILN_2018 GBV_ILN_4277 AR 29 2018 5 16 06 861-871 |
allfields_unstemmed |
10.1007/s00138-018-0942-y doi (DE-627)OLC2074632053 (DE-He213)s00138-018-0942-y-p DE-627 ger DE-627 rakwb eng 004 VZ 11 ssgn Lei, Zhengchao verfasserin aut Scene text recognition using residual convolutional recurrent neural network 2018 Text txt rdacontent ohne Hilfsmittel zu benutzen n rdamedia Band nc rdacarrier © Springer-Verlag GmbH Germany, part of Springer Nature 2018 Abstract Text is a significant tool for human communication, and text recognition in scene images becomes more and more important. In this paper, we propose a residual convolutional recurrent neural network for solving the task of scene text recognition. The general convolutional recurrent neural network (CRNN) is realized by combining convolutional neural network (CNN) with recurrent neural network (RNN). The CNN part extracts features and the RNN part encodes and decodes feature sequences. In order to improve the accuracy rate of scene text recognition based on CRNN, we explore different deeper CNN architectures to get feature descriptors and analyze the corresponding text recognition results. Specifically, VGG and ResNet are introduced to train these different deep models and obtain the encoding information of images. The experimental results on public datasets demonstrate the effectiveness of our method. Residual convolutional recurrent neural network Scene text recognition Convolutional neural network Recurrent neural network Residual network Zhao, Sanyuan aut Song, Hongmei aut Shen, Jianbing aut Enthalten in Machine vision and applications Springer Berlin Heidelberg, 1988 29(2018), 5 vom: 16. Juni, Seite 861-871 (DE-627)129248843 (DE-600)59385-0 (DE-576)017944139 0932-8092 nnns volume:29 year:2018 number:5 day:16 month:06 pages:861-871 https://doi.org/10.1007/s00138-018-0942-y lizenzpflichtig Volltext GBV_USEFLAG_A SYSFLAG_A GBV_OLC SSG-OLC-MAT GBV_ILN_70 GBV_ILN_2018 GBV_ILN_4277 AR 29 2018 5 16 06 861-871 |
allfieldsGer |
10.1007/s00138-018-0942-y doi (DE-627)OLC2074632053 (DE-He213)s00138-018-0942-y-p DE-627 ger DE-627 rakwb eng 004 VZ 11 ssgn Lei, Zhengchao verfasserin aut Scene text recognition using residual convolutional recurrent neural network 2018 Text txt rdacontent ohne Hilfsmittel zu benutzen n rdamedia Band nc rdacarrier © Springer-Verlag GmbH Germany, part of Springer Nature 2018 Abstract Text is a significant tool for human communication, and text recognition in scene images becomes more and more important. In this paper, we propose a residual convolutional recurrent neural network for solving the task of scene text recognition. The general convolutional recurrent neural network (CRNN) is realized by combining convolutional neural network (CNN) with recurrent neural network (RNN). The CNN part extracts features and the RNN part encodes and decodes feature sequences. In order to improve the accuracy rate of scene text recognition based on CRNN, we explore different deeper CNN architectures to get feature descriptors and analyze the corresponding text recognition results. Specifically, VGG and ResNet are introduced to train these different deep models and obtain the encoding information of images. The experimental results on public datasets demonstrate the effectiveness of our method. Residual convolutional recurrent neural network Scene text recognition Convolutional neural network Recurrent neural network Residual network Zhao, Sanyuan aut Song, Hongmei aut Shen, Jianbing aut Enthalten in Machine vision and applications Springer Berlin Heidelberg, 1988 29(2018), 5 vom: 16. Juni, Seite 861-871 (DE-627)129248843 (DE-600)59385-0 (DE-576)017944139 0932-8092 nnns volume:29 year:2018 number:5 day:16 month:06 pages:861-871 https://doi.org/10.1007/s00138-018-0942-y lizenzpflichtig Volltext GBV_USEFLAG_A SYSFLAG_A GBV_OLC SSG-OLC-MAT GBV_ILN_70 GBV_ILN_2018 GBV_ILN_4277 AR 29 2018 5 16 06 861-871 |
allfieldsSound |
10.1007/s00138-018-0942-y doi (DE-627)OLC2074632053 (DE-He213)s00138-018-0942-y-p DE-627 ger DE-627 rakwb eng 004 VZ 11 ssgn Lei, Zhengchao verfasserin aut Scene text recognition using residual convolutional recurrent neural network 2018 Text txt rdacontent ohne Hilfsmittel zu benutzen n rdamedia Band nc rdacarrier © Springer-Verlag GmbH Germany, part of Springer Nature 2018 Abstract Text is a significant tool for human communication, and text recognition in scene images becomes more and more important. In this paper, we propose a residual convolutional recurrent neural network for solving the task of scene text recognition. The general convolutional recurrent neural network (CRNN) is realized by combining convolutional neural network (CNN) with recurrent neural network (RNN). The CNN part extracts features and the RNN part encodes and decodes feature sequences. In order to improve the accuracy rate of scene text recognition based on CRNN, we explore different deeper CNN architectures to get feature descriptors and analyze the corresponding text recognition results. Specifically, VGG and ResNet are introduced to train these different deep models and obtain the encoding information of images. The experimental results on public datasets demonstrate the effectiveness of our method. Residual convolutional recurrent neural network Scene text recognition Convolutional neural network Recurrent neural network Residual network Zhao, Sanyuan aut Song, Hongmei aut Shen, Jianbing aut Enthalten in Machine vision and applications Springer Berlin Heidelberg, 1988 29(2018), 5 vom: 16. Juni, Seite 861-871 (DE-627)129248843 (DE-600)59385-0 (DE-576)017944139 0932-8092 nnns volume:29 year:2018 number:5 day:16 month:06 pages:861-871 https://doi.org/10.1007/s00138-018-0942-y lizenzpflichtig Volltext GBV_USEFLAG_A SYSFLAG_A GBV_OLC SSG-OLC-MAT GBV_ILN_70 GBV_ILN_2018 GBV_ILN_4277 AR 29 2018 5 16 06 861-871 |
language |
English |
source |
Enthalten in Machine vision and applications 29(2018), 5 vom: 16. Juni, Seite 861-871 volume:29 year:2018 number:5 day:16 month:06 pages:861-871 |
sourceStr |
Enthalten in Machine vision and applications 29(2018), 5 vom: 16. Juni, Seite 861-871 volume:29 year:2018 number:5 day:16 month:06 pages:861-871 |
format_phy_str_mv |
Article |
institution |
findex.gbv.de |
topic_facet |
Residual convolutional recurrent neural network Scene text recognition Convolutional neural network Recurrent neural network Residual network |
dewey-raw |
004 |
isfreeaccess_bool |
false |
container_title |
Machine vision and applications |
authorswithroles_txt_mv |
Lei, Zhengchao @@aut@@ Zhao, Sanyuan @@aut@@ Song, Hongmei @@aut@@ Shen, Jianbing @@aut@@ |
publishDateDaySort_date |
2018-06-16T00:00:00Z |
hierarchy_top_id |
129248843 |
dewey-sort |
14 |
id |
OLC2074632053 |
language_de |
englisch |
fullrecord |
<?xml version="1.0" encoding="UTF-8"?><collection xmlns="http://www.loc.gov/MARC21/slim"><record><leader>01000caa a22002652 4500</leader><controlfield tag="001">OLC2074632053</controlfield><controlfield tag="003">DE-627</controlfield><controlfield tag="005">20230401063324.0</controlfield><controlfield tag="007">tu</controlfield><controlfield tag="008">200820s2018 xx ||||| 00| ||eng c</controlfield><datafield tag="024" ind1="7" ind2=" "><subfield code="a">10.1007/s00138-018-0942-y</subfield><subfield code="2">doi</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(DE-627)OLC2074632053</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(DE-He213)s00138-018-0942-y-p</subfield></datafield><datafield tag="040" ind1=" " ind2=" "><subfield code="a">DE-627</subfield><subfield code="b">ger</subfield><subfield code="c">DE-627</subfield><subfield code="e">rakwb</subfield></datafield><datafield tag="041" ind1=" " ind2=" "><subfield code="a">eng</subfield></datafield><datafield tag="082" ind1="0" ind2="4"><subfield code="a">004</subfield><subfield code="q">VZ</subfield></datafield><datafield tag="084" ind1=" " ind2=" "><subfield code="a">11</subfield><subfield code="2">ssgn</subfield></datafield><datafield tag="100" ind1="1" ind2=" "><subfield code="a">Lei, Zhengchao</subfield><subfield code="e">verfasserin</subfield><subfield code="4">aut</subfield></datafield><datafield tag="245" ind1="1" ind2="0"><subfield code="a">Scene text recognition using residual convolutional recurrent neural network</subfield></datafield><datafield tag="264" ind1=" " ind2="1"><subfield code="c">2018</subfield></datafield><datafield tag="336" ind1=" " ind2=" "><subfield code="a">Text</subfield><subfield code="b">txt</subfield><subfield code="2">rdacontent</subfield></datafield><datafield tag="337" ind1=" " ind2=" "><subfield code="a">ohne Hilfsmittel zu benutzen</subfield><subfield code="b">n</subfield><subfield code="2">rdamedia</subfield></datafield><datafield tag="338" ind1=" " ind2=" "><subfield code="a">Band</subfield><subfield code="b">nc</subfield><subfield code="2">rdacarrier</subfield></datafield><datafield tag="500" ind1=" " ind2=" "><subfield code="a">© Springer-Verlag GmbH Germany, part of Springer Nature 2018</subfield></datafield><datafield tag="520" ind1=" " ind2=" "><subfield code="a">Abstract Text is a significant tool for human communication, and text recognition in scene images becomes more and more important. In this paper, we propose a residual convolutional recurrent neural network for solving the task of scene text recognition. The general convolutional recurrent neural network (CRNN) is realized by combining convolutional neural network (CNN) with recurrent neural network (RNN). The CNN part extracts features and the RNN part encodes and decodes feature sequences. In order to improve the accuracy rate of scene text recognition based on CRNN, we explore different deeper CNN architectures to get feature descriptors and analyze the corresponding text recognition results. Specifically, VGG and ResNet are introduced to train these different deep models and obtain the encoding information of images. The experimental results on public datasets demonstrate the effectiveness of our method.</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Residual convolutional recurrent neural network</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Scene text recognition</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Convolutional neural network</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Recurrent neural network</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Residual network</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">Zhao, Sanyuan</subfield><subfield code="4">aut</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">Song, Hongmei</subfield><subfield code="4">aut</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">Shen, Jianbing</subfield><subfield code="4">aut</subfield></datafield><datafield tag="773" ind1="0" ind2="8"><subfield code="i">Enthalten in</subfield><subfield code="t">Machine vision and applications</subfield><subfield code="d">Springer Berlin Heidelberg, 1988</subfield><subfield code="g">29(2018), 5 vom: 16. Juni, Seite 861-871</subfield><subfield code="w">(DE-627)129248843</subfield><subfield code="w">(DE-600)59385-0</subfield><subfield code="w">(DE-576)017944139</subfield><subfield code="x">0932-8092</subfield><subfield code="7">nnns</subfield></datafield><datafield tag="773" ind1="1" ind2="8"><subfield code="g">volume:29</subfield><subfield code="g">year:2018</subfield><subfield code="g">number:5</subfield><subfield code="g">day:16</subfield><subfield code="g">month:06</subfield><subfield code="g">pages:861-871</subfield></datafield><datafield tag="856" ind1="4" ind2="1"><subfield code="u">https://doi.org/10.1007/s00138-018-0942-y</subfield><subfield code="z">lizenzpflichtig</subfield><subfield code="3">Volltext</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_USEFLAG_A</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">SYSFLAG_A</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_OLC</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">SSG-OLC-MAT</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_ILN_70</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_ILN_2018</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_ILN_4277</subfield></datafield><datafield tag="951" ind1=" " ind2=" "><subfield code="a">AR</subfield></datafield><datafield tag="952" ind1=" " ind2=" "><subfield code="d">29</subfield><subfield code="j">2018</subfield><subfield code="e">5</subfield><subfield code="b">16</subfield><subfield code="c">06</subfield><subfield code="h">861-871</subfield></datafield></record></collection>
|
author |
Lei, Zhengchao |
spellingShingle |
Lei, Zhengchao ddc 004 ssgn 11 misc Residual convolutional recurrent neural network misc Scene text recognition misc Convolutional neural network misc Recurrent neural network misc Residual network Scene text recognition using residual convolutional recurrent neural network |
authorStr |
Lei, Zhengchao |
ppnlink_with_tag_str_mv |
@@773@@(DE-627)129248843 |
format |
Article |
dewey-ones |
004 - Data processing & computer science |
delete_txt_mv |
keep |
author_role |
aut aut aut aut |
collection |
OLC |
remote_str |
false |
illustrated |
Not Illustrated |
issn |
0932-8092 |
topic_title |
004 VZ 11 ssgn Scene text recognition using residual convolutional recurrent neural network Residual convolutional recurrent neural network Scene text recognition Convolutional neural network Recurrent neural network Residual network |
topic |
ddc 004 ssgn 11 misc Residual convolutional recurrent neural network misc Scene text recognition misc Convolutional neural network misc Recurrent neural network misc Residual network |
topic_unstemmed |
ddc 004 ssgn 11 misc Residual convolutional recurrent neural network misc Scene text recognition misc Convolutional neural network misc Recurrent neural network misc Residual network |
topic_browse |
ddc 004 ssgn 11 misc Residual convolutional recurrent neural network misc Scene text recognition misc Convolutional neural network misc Recurrent neural network misc Residual network |
format_facet |
Aufsätze Gedruckte Aufsätze |
format_main_str_mv |
Text Zeitschrift/Artikel |
carriertype_str_mv |
nc |
hierarchy_parent_title |
Machine vision and applications |
hierarchy_parent_id |
129248843 |
dewey-tens |
000 - Computer science, knowledge & systems |
hierarchy_top_title |
Machine vision and applications |
isfreeaccess_txt |
false |
familylinks_str_mv |
(DE-627)129248843 (DE-600)59385-0 (DE-576)017944139 |
title |
Scene text recognition using residual convolutional recurrent neural network |
ctrlnum |
(DE-627)OLC2074632053 (DE-He213)s00138-018-0942-y-p |
title_full |
Scene text recognition using residual convolutional recurrent neural network |
author_sort |
Lei, Zhengchao |
journal |
Machine vision and applications |
journalStr |
Machine vision and applications |
lang_code |
eng |
isOA_bool |
false |
dewey-hundreds |
000 - Computer science, information & general works |
recordtype |
marc |
publishDateSort |
2018 |
contenttype_str_mv |
txt |
container_start_page |
861 |
author_browse |
Lei, Zhengchao Zhao, Sanyuan Song, Hongmei Shen, Jianbing |
container_volume |
29 |
class |
004 VZ 11 ssgn |
format_se |
Aufsätze |
author-letter |
Lei, Zhengchao |
doi_str_mv |
10.1007/s00138-018-0942-y |
dewey-full |
004 |
title_sort |
scene text recognition using residual convolutional recurrent neural network |
title_auth |
Scene text recognition using residual convolutional recurrent neural network |
abstract |
Abstract Text is a significant tool for human communication, and text recognition in scene images becomes more and more important. In this paper, we propose a residual convolutional recurrent neural network for solving the task of scene text recognition. The general convolutional recurrent neural network (CRNN) is realized by combining convolutional neural network (CNN) with recurrent neural network (RNN). The CNN part extracts features and the RNN part encodes and decodes feature sequences. In order to improve the accuracy rate of scene text recognition based on CRNN, we explore different deeper CNN architectures to get feature descriptors and analyze the corresponding text recognition results. Specifically, VGG and ResNet are introduced to train these different deep models and obtain the encoding information of images. The experimental results on public datasets demonstrate the effectiveness of our method. © Springer-Verlag GmbH Germany, part of Springer Nature 2018 |
abstractGer |
Abstract Text is a significant tool for human communication, and text recognition in scene images becomes more and more important. In this paper, we propose a residual convolutional recurrent neural network for solving the task of scene text recognition. The general convolutional recurrent neural network (CRNN) is realized by combining convolutional neural network (CNN) with recurrent neural network (RNN). The CNN part extracts features and the RNN part encodes and decodes feature sequences. In order to improve the accuracy rate of scene text recognition based on CRNN, we explore different deeper CNN architectures to get feature descriptors and analyze the corresponding text recognition results. Specifically, VGG and ResNet are introduced to train these different deep models and obtain the encoding information of images. The experimental results on public datasets demonstrate the effectiveness of our method. © Springer-Verlag GmbH Germany, part of Springer Nature 2018 |
abstract_unstemmed |
Abstract Text is a significant tool for human communication, and text recognition in scene images becomes more and more important. In this paper, we propose a residual convolutional recurrent neural network for solving the task of scene text recognition. The general convolutional recurrent neural network (CRNN) is realized by combining convolutional neural network (CNN) with recurrent neural network (RNN). The CNN part extracts features and the RNN part encodes and decodes feature sequences. In order to improve the accuracy rate of scene text recognition based on CRNN, we explore different deeper CNN architectures to get feature descriptors and analyze the corresponding text recognition results. Specifically, VGG and ResNet are introduced to train these different deep models and obtain the encoding information of images. The experimental results on public datasets demonstrate the effectiveness of our method. © Springer-Verlag GmbH Germany, part of Springer Nature 2018 |
collection_details |
GBV_USEFLAG_A SYSFLAG_A GBV_OLC SSG-OLC-MAT GBV_ILN_70 GBV_ILN_2018 GBV_ILN_4277 |
container_issue |
5 |
title_short |
Scene text recognition using residual convolutional recurrent neural network |
url |
https://doi.org/10.1007/s00138-018-0942-y |
remote_bool |
false |
author2 |
Zhao, Sanyuan Song, Hongmei Shen, Jianbing |
author2Str |
Zhao, Sanyuan Song, Hongmei Shen, Jianbing |
ppnlink |
129248843 |
mediatype_str_mv |
n |
isOA_txt |
false |
hochschulschrift_bool |
false |
doi_str |
10.1007/s00138-018-0942-y |
up_date |
2024-07-03T22:54:42.375Z |
_version_ |
1803600299167842304 |
fullrecord_marcxml |
<?xml version="1.0" encoding="UTF-8"?><collection xmlns="http://www.loc.gov/MARC21/slim"><record><leader>01000caa a22002652 4500</leader><controlfield tag="001">OLC2074632053</controlfield><controlfield tag="003">DE-627</controlfield><controlfield tag="005">20230401063324.0</controlfield><controlfield tag="007">tu</controlfield><controlfield tag="008">200820s2018 xx ||||| 00| ||eng c</controlfield><datafield tag="024" ind1="7" ind2=" "><subfield code="a">10.1007/s00138-018-0942-y</subfield><subfield code="2">doi</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(DE-627)OLC2074632053</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(DE-He213)s00138-018-0942-y-p</subfield></datafield><datafield tag="040" ind1=" " ind2=" "><subfield code="a">DE-627</subfield><subfield code="b">ger</subfield><subfield code="c">DE-627</subfield><subfield code="e">rakwb</subfield></datafield><datafield tag="041" ind1=" " ind2=" "><subfield code="a">eng</subfield></datafield><datafield tag="082" ind1="0" ind2="4"><subfield code="a">004</subfield><subfield code="q">VZ</subfield></datafield><datafield tag="084" ind1=" " ind2=" "><subfield code="a">11</subfield><subfield code="2">ssgn</subfield></datafield><datafield tag="100" ind1="1" ind2=" "><subfield code="a">Lei, Zhengchao</subfield><subfield code="e">verfasserin</subfield><subfield code="4">aut</subfield></datafield><datafield tag="245" ind1="1" ind2="0"><subfield code="a">Scene text recognition using residual convolutional recurrent neural network</subfield></datafield><datafield tag="264" ind1=" " ind2="1"><subfield code="c">2018</subfield></datafield><datafield tag="336" ind1=" " ind2=" "><subfield code="a">Text</subfield><subfield code="b">txt</subfield><subfield code="2">rdacontent</subfield></datafield><datafield tag="337" ind1=" " ind2=" "><subfield code="a">ohne Hilfsmittel zu benutzen</subfield><subfield code="b">n</subfield><subfield code="2">rdamedia</subfield></datafield><datafield tag="338" ind1=" " ind2=" "><subfield code="a">Band</subfield><subfield code="b">nc</subfield><subfield code="2">rdacarrier</subfield></datafield><datafield tag="500" ind1=" " ind2=" "><subfield code="a">© Springer-Verlag GmbH Germany, part of Springer Nature 2018</subfield></datafield><datafield tag="520" ind1=" " ind2=" "><subfield code="a">Abstract Text is a significant tool for human communication, and text recognition in scene images becomes more and more important. In this paper, we propose a residual convolutional recurrent neural network for solving the task of scene text recognition. The general convolutional recurrent neural network (CRNN) is realized by combining convolutional neural network (CNN) with recurrent neural network (RNN). The CNN part extracts features and the RNN part encodes and decodes feature sequences. In order to improve the accuracy rate of scene text recognition based on CRNN, we explore different deeper CNN architectures to get feature descriptors and analyze the corresponding text recognition results. Specifically, VGG and ResNet are introduced to train these different deep models and obtain the encoding information of images. The experimental results on public datasets demonstrate the effectiveness of our method.</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Residual convolutional recurrent neural network</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Scene text recognition</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Convolutional neural network</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Recurrent neural network</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Residual network</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">Zhao, Sanyuan</subfield><subfield code="4">aut</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">Song, Hongmei</subfield><subfield code="4">aut</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">Shen, Jianbing</subfield><subfield code="4">aut</subfield></datafield><datafield tag="773" ind1="0" ind2="8"><subfield code="i">Enthalten in</subfield><subfield code="t">Machine vision and applications</subfield><subfield code="d">Springer Berlin Heidelberg, 1988</subfield><subfield code="g">29(2018), 5 vom: 16. Juni, Seite 861-871</subfield><subfield code="w">(DE-627)129248843</subfield><subfield code="w">(DE-600)59385-0</subfield><subfield code="w">(DE-576)017944139</subfield><subfield code="x">0932-8092</subfield><subfield code="7">nnns</subfield></datafield><datafield tag="773" ind1="1" ind2="8"><subfield code="g">volume:29</subfield><subfield code="g">year:2018</subfield><subfield code="g">number:5</subfield><subfield code="g">day:16</subfield><subfield code="g">month:06</subfield><subfield code="g">pages:861-871</subfield></datafield><datafield tag="856" ind1="4" ind2="1"><subfield code="u">https://doi.org/10.1007/s00138-018-0942-y</subfield><subfield code="z">lizenzpflichtig</subfield><subfield code="3">Volltext</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_USEFLAG_A</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">SYSFLAG_A</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_OLC</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">SSG-OLC-MAT</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_ILN_70</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_ILN_2018</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_ILN_4277</subfield></datafield><datafield tag="951" ind1=" " ind2=" "><subfield code="a">AR</subfield></datafield><datafield tag="952" ind1=" " ind2=" "><subfield code="d">29</subfield><subfield code="j">2018</subfield><subfield code="e">5</subfield><subfield code="b">16</subfield><subfield code="c">06</subfield><subfield code="h">861-871</subfield></datafield></record></collection>
|
score |
7.3983088 |