Scene text recognition using residual convolutional recurrent neural network

Abstract Text is a significant tool for human communication, and text recognition in scene images becomes more and more important. In this paper, we propose a residual convolutional recurrent neural network for solving the task of scene text recognition. The general convolutional recurrent neural ne...
Ausführliche Beschreibung

Gespeichert in:

Autor*in:	Lei, Zhengchao [verfasserIn] Zhao, Sanyuan Song, Hongmei Shen, Jianbing

Format:	Artikel
Sprache:	Englisch

Erschienen:	2018

Schlagwörter:	Residual convolutional recurrent neural network Scene text recognition Convolutional neural network Recurrent neural network Residual network

Anmerkung:	© Springer-Verlag GmbH Germany, part of Springer Nature 2018

Übergeordnetes Werk:	Enthalten in: Machine vision and applications - Springer Berlin Heidelberg, 1988, 29(2018), 5 vom: 16. Juni, Seite 861-871
Übergeordnetes Werk:	volume:29 ; year:2018 ; number:5 ; day:16 ; month:06 ; pages:861-871

Links:	Volltext

DOI / URN:	10.1007/s00138-018-0942-y

Katalog-ID:	OLC2074632053

Internformat


LEADER	01000caa a22002652 4500
001	OLC2074632053
003	DE-627
005	20230401063324.0
007	tu
008	200820s2018 xx \|\|\|\|\| 00\| \|\|eng c
024	7		\|a 10.1007/s00138-018-0942-y \|2 doi
035			\|a (DE-627)OLC2074632053
035			\|a (DE-He213)s00138-018-0942-y-p
040			\|a DE-627 \|b ger \|c DE-627 \|e rakwb
041			\|a eng
082	0	4	\|a 004 \|q VZ
084			\|a 11 \|2 ssgn
100	1		\|a Lei, Zhengchao \|e verfasserin \|4 aut
245	1	0	\|a Scene text recognition using residual convolutional recurrent neural network
264		1	\|c 2018
336			\|a Text \|b txt \|2 rdacontent
337			\|a ohne Hilfsmittel zu benutzen \|b n \|2 rdamedia
338			\|a Band \|b nc \|2 rdacarrier
500			\|a © Springer-Verlag GmbH Germany, part of Springer Nature 2018
520			\|a Abstract Text is a significant tool for human communication, and text recognition in scene images becomes more and more important. In this paper, we propose a residual convolutional recurrent neural network for solving the task of scene text recognition. The general convolutional recurrent neural network (CRNN) is realized by combining convolutional neural network (CNN) with recurrent neural network (RNN). The CNN part extracts features and the RNN part encodes and decodes feature sequences. In order to improve the accuracy rate of scene text recognition based on CRNN, we explore different deeper CNN architectures to get feature descriptors and analyze the corresponding text recognition results. Specifically, VGG and ResNet are introduced to train these different deep models and obtain the encoding information of images. The experimental results on public datasets demonstrate the effectiveness of our method.
650		4	\|a Residual convolutional recurrent neural network
650		4	\|a Scene text recognition
650		4	\|a Convolutional neural network
650		4	\|a Recurrent neural network
650		4	\|a Residual network
700	1		\|a Zhao, Sanyuan \|4 aut
700	1		\|a Song, Hongmei \|4 aut
700	1		\|a Shen, Jianbing \|4 aut
773	0	8	\|i Enthalten in \|t Machine vision and applications \|d Springer Berlin Heidelberg, 1988 \|g 29(2018), 5 vom: 16. Juni, Seite 861-871 \|w (DE-627)129248843 \|w (DE-600)59385-0 \|w (DE-576)017944139 \|x 0932-8092 \|7 nnns
773	1	8	\|g volume:29 \|g year:2018 \|g number:5 \|g day:16 \|g month:06 \|g pages:861-871
856	4	1	\|u https://doi.org/10.1007/s00138-018-0942-y \|z lizenzpflichtig \|3 Volltext
912			\|a GBV_USEFLAG_A
912			\|a SYSFLAG_A
912			\|a GBV_OLC
912			\|a SSG-OLC-MAT
912			\|a GBV_ILN_70
912			\|a GBV_ILN_2018
912			\|a GBV_ILN_4277
951			\|a AR
952			\|d 29 \|j 2018 \|e 5 \|b 16 \|c 06 \|h 861-871

Indexfelder

author_variant	z l zl s z sz h s hs j s js
matchkey_str	article:09328092:2018----::cntxrcgiinsnrsdacnouinleu
hierarchy_sort_str	2018
publishDate	2018
allfields	10.1007/s00138-018-0942-y doi (DE-627)OLC2074632053 (DE-He213)s00138-018-0942-y-p DE-627 ger DE-627 rakwb eng 004 VZ 11 ssgn Lei, Zhengchao verfasserin aut Scene text recognition using residual convolutional recurrent neural network 2018 Text txt rdacontent ohne Hilfsmittel zu benutzen n rdamedia Band nc rdacarrier © Springer-Verlag GmbH Germany, part of Springer Nature 2018 Abstract Text is a significant tool for human communication, and text recognition in scene images becomes more and more important. In this paper, we propose a residual convolutional recurrent neural network for solving the task of scene text recognition. The general convolutional recurrent neural network (CRNN) is realized by combining convolutional neural network (CNN) with recurrent neural network (RNN). The CNN part extracts features and the RNN part encodes and decodes feature sequences. In order to improve the accuracy rate of scene text recognition based on CRNN, we explore different deeper CNN architectures to get feature descriptors and analyze the corresponding text recognition results. Specifically, VGG and ResNet are introduced to train these different deep models and obtain the encoding information of images. The experimental results on public datasets demonstrate the effectiveness of our method. Residual convolutional recurrent neural network Scene text recognition Convolutional neural network Recurrent neural network Residual network Zhao, Sanyuan aut Song, Hongmei aut Shen, Jianbing aut Enthalten in Machine vision and applications Springer Berlin Heidelberg, 1988 29(2018), 5 vom: 16. Juni, Seite 861-871 (DE-627)129248843 (DE-600)59385-0 (DE-576)017944139 0932-8092 nnns volume:29 year:2018 number:5 day:16 month:06 pages:861-871 https://doi.org/10.1007/s00138-018-0942-y lizenzpflichtig Volltext GBV_USEFLAG_A SYSFLAG_A GBV_OLC SSG-OLC-MAT GBV_ILN_70 GBV_ILN_2018 GBV_ILN_4277 AR 29 2018 5 16 06 861-871
spelling	10.1007/s00138-018-0942-y doi (DE-627)OLC2074632053 (DE-He213)s00138-018-0942-y-p DE-627 ger DE-627 rakwb eng 004 VZ 11 ssgn Lei, Zhengchao verfasserin aut Scene text recognition using residual convolutional recurrent neural network 2018 Text txt rdacontent ohne Hilfsmittel zu benutzen n rdamedia Band nc rdacarrier © Springer-Verlag GmbH Germany, part of Springer Nature 2018 Abstract Text is a significant tool for human communication, and text recognition in scene images becomes more and more important. In this paper, we propose a residual convolutional recurrent neural network for solving the task of scene text recognition. The general convolutional recurrent neural network (CRNN) is realized by combining convolutional neural network (CNN) with recurrent neural network (RNN). The CNN part extracts features and the RNN part encodes and decodes feature sequences. In order to improve the accuracy rate of scene text recognition based on CRNN, we explore different deeper CNN architectures to get feature descriptors and analyze the corresponding text recognition results. Specifically, VGG and ResNet are introduced to train these different deep models and obtain the encoding information of images. The experimental results on public datasets demonstrate the effectiveness of our method. Residual convolutional recurrent neural network Scene text recognition Convolutional neural network Recurrent neural network Residual network Zhao, Sanyuan aut Song, Hongmei aut Shen, Jianbing aut Enthalten in Machine vision and applications Springer Berlin Heidelberg, 1988 29(2018), 5 vom: 16. Juni, Seite 861-871 (DE-627)129248843 (DE-600)59385-0 (DE-576)017944139 0932-8092 nnns volume:29 year:2018 number:5 day:16 month:06 pages:861-871 https://doi.org/10.1007/s00138-018-0942-y lizenzpflichtig Volltext GBV_USEFLAG_A SYSFLAG_A GBV_OLC SSG-OLC-MAT GBV_ILN_70 GBV_ILN_2018 GBV_ILN_4277 AR 29 2018 5 16 06 861-871
allfields_unstemmed	10.1007/s00138-018-0942-y doi (DE-627)OLC2074632053 (DE-He213)s00138-018-0942-y-p DE-627 ger DE-627 rakwb eng 004 VZ 11 ssgn Lei, Zhengchao verfasserin aut Scene text recognition using residual convolutional recurrent neural network 2018 Text txt rdacontent ohne Hilfsmittel zu benutzen n rdamedia Band nc rdacarrier © Springer-Verlag GmbH Germany, part of Springer Nature 2018 Abstract Text is a significant tool for human communication, and text recognition in scene images becomes more and more important. In this paper, we propose a residual convolutional recurrent neural network for solving the task of scene text recognition. The general convolutional recurrent neural network (CRNN) is realized by combining convolutional neural network (CNN) with recurrent neural network (RNN). The CNN part extracts features and the RNN part encodes and decodes feature sequences. In order to improve the accuracy rate of scene text recognition based on CRNN, we explore different deeper CNN architectures to get feature descriptors and analyze the corresponding text recognition results. Specifically, VGG and ResNet are introduced to train these different deep models and obtain the encoding information of images. The experimental results on public datasets demonstrate the effectiveness of our method. Residual convolutional recurrent neural network Scene text recognition Convolutional neural network Recurrent neural network Residual network Zhao, Sanyuan aut Song, Hongmei aut Shen, Jianbing aut Enthalten in Machine vision and applications Springer Berlin Heidelberg, 1988 29(2018), 5 vom: 16. Juni, Seite 861-871 (DE-627)129248843 (DE-600)59385-0 (DE-576)017944139 0932-8092 nnns volume:29 year:2018 number:5 day:16 month:06 pages:861-871 https://doi.org/10.1007/s00138-018-0942-y lizenzpflichtig Volltext GBV_USEFLAG_A SYSFLAG_A GBV_OLC SSG-OLC-MAT GBV_ILN_70 GBV_ILN_2018 GBV_ILN_4277 AR 29 2018 5 16 06 861-871
allfieldsGer	10.1007/s00138-018-0942-y doi (DE-627)OLC2074632053 (DE-He213)s00138-018-0942-y-p DE-627 ger DE-627 rakwb eng 004 VZ 11 ssgn Lei, Zhengchao verfasserin aut Scene text recognition using residual convolutional recurrent neural network 2018 Text txt rdacontent ohne Hilfsmittel zu benutzen n rdamedia Band nc rdacarrier © Springer-Verlag GmbH Germany, part of Springer Nature 2018 Abstract Text is a significant tool for human communication, and text recognition in scene images becomes more and more important. In this paper, we propose a residual convolutional recurrent neural network for solving the task of scene text recognition. The general convolutional recurrent neural network (CRNN) is realized by combining convolutional neural network (CNN) with recurrent neural network (RNN). The CNN part extracts features and the RNN part encodes and decodes feature sequences. In order to improve the accuracy rate of scene text recognition based on CRNN, we explore different deeper CNN architectures to get feature descriptors and analyze the corresponding text recognition results. Specifically, VGG and ResNet are introduced to train these different deep models and obtain the encoding information of images. The experimental results on public datasets demonstrate the effectiveness of our method. Residual convolutional recurrent neural network Scene text recognition Convolutional neural network Recurrent neural network Residual network Zhao, Sanyuan aut Song, Hongmei aut Shen, Jianbing aut Enthalten in Machine vision and applications Springer Berlin Heidelberg, 1988 29(2018), 5 vom: 16. Juni, Seite 861-871 (DE-627)129248843 (DE-600)59385-0 (DE-576)017944139 0932-8092 nnns volume:29 year:2018 number:5 day:16 month:06 pages:861-871 https://doi.org/10.1007/s00138-018-0942-y lizenzpflichtig Volltext GBV_USEFLAG_A SYSFLAG_A GBV_OLC SSG-OLC-MAT GBV_ILN_70 GBV_ILN_2018 GBV_ILN_4277 AR 29 2018 5 16 06 861-871
allfieldsSound	10.1007/s00138-018-0942-y doi (DE-627)OLC2074632053 (DE-He213)s00138-018-0942-y-p DE-627 ger DE-627 rakwb eng 004 VZ 11 ssgn Lei, Zhengchao verfasserin aut Scene text recognition using residual convolutional recurrent neural network 2018 Text txt rdacontent ohne Hilfsmittel zu benutzen n rdamedia Band nc rdacarrier © Springer-Verlag GmbH Germany, part of Springer Nature 2018 Abstract Text is a significant tool for human communication, and text recognition in scene images becomes more and more important. In this paper, we propose a residual convolutional recurrent neural network for solving the task of scene text recognition. The general convolutional recurrent neural network (CRNN) is realized by combining convolutional neural network (CNN) with recurrent neural network (RNN). The CNN part extracts features and the RNN part encodes and decodes feature sequences. In order to improve the accuracy rate of scene text recognition based on CRNN, we explore different deeper CNN architectures to get feature descriptors and analyze the corresponding text recognition results. Specifically, VGG and ResNet are introduced to train these different deep models and obtain the encoding information of images. The experimental results on public datasets demonstrate the effectiveness of our method. Residual convolutional recurrent neural network Scene text recognition Convolutional neural network Recurrent neural network Residual network Zhao, Sanyuan aut Song, Hongmei aut Shen, Jianbing aut Enthalten in Machine vision and applications Springer Berlin Heidelberg, 1988 29(2018), 5 vom: 16. Juni, Seite 861-871 (DE-627)129248843 (DE-600)59385-0 (DE-576)017944139 0932-8092 nnns volume:29 year:2018 number:5 day:16 month:06 pages:861-871 https://doi.org/10.1007/s00138-018-0942-y lizenzpflichtig Volltext GBV_USEFLAG_A SYSFLAG_A GBV_OLC SSG-OLC-MAT GBV_ILN_70 GBV_ILN_2018 GBV_ILN_4277 AR 29 2018 5 16 06 861-871
language	English
source	Enthalten in Machine vision and applications 29(2018), 5 vom: 16. Juni, Seite 861-871 volume:29 year:2018 number:5 day:16 month:06 pages:861-871
sourceStr	Enthalten in Machine vision and applications 29(2018), 5 vom: 16. Juni, Seite 861-871 volume:29 year:2018 number:5 day:16 month:06 pages:861-871
format_phy_str_mv	Article
institution	findex.gbv.de
topic_facet	Residual convolutional recurrent neural network Scene text recognition Convolutional neural network Recurrent neural network Residual network
dewey-raw	004
isfreeaccess_bool	false
container_title	Machine vision and applications
authorswithroles_txt_mv	Lei, Zhengchao @@aut@@ Zhao, Sanyuan @@aut@@ Song, Hongmei @@aut@@ Shen, Jianbing @@aut@@
publishDateDaySort_date	2018-06-16T00:00:00Z
hierarchy_top_id	129248843
dewey-sort	14
id	OLC2074632053
language_de	englisch
fullrecord	<?xml version="1.0" encoding="UTF-8"?><collection xmlns="http://www.loc.gov/MARC21/slim"><record><leader>01000caa a22002652 4500</leader><controlfield tag="001">OLC2074632053</controlfield><controlfield tag="003">DE-627</controlfield><controlfield tag="005">20230401063324.0</controlfield><controlfield tag="007">tu</controlfield><controlfield tag="008">200820s2018 xx \|\|\|\|\| 00\| \|\|eng c</controlfield><datafield tag="024" ind1="7" ind2=" "><subfield code="a">10.1007/s00138-018-0942-y</subfield><subfield code="2">doi</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(DE-627)OLC2074632053</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(DE-He213)s00138-018-0942-y-p</subfield></datafield><datafield tag="040" ind1=" " ind2=" "><subfield code="a">DE-627</subfield><subfield code="b">ger</subfield><subfield code="c">DE-627</subfield><subfield code="e">rakwb</subfield></datafield><datafield tag="041" ind1=" " ind2=" "><subfield code="a">eng</subfield></datafield><datafield tag="082" ind1="0" ind2="4"><subfield code="a">004</subfield><subfield code="q">VZ</subfield></datafield><datafield tag="084" ind1=" " ind2=" "><subfield code="a">11</subfield><subfield code="2">ssgn</subfield></datafield><datafield tag="100" ind1="1" ind2=" "><subfield code="a">Lei, Zhengchao</subfield><subfield code="e">verfasserin</subfield><subfield code="4">aut</subfield></datafield><datafield tag="245" ind1="1" ind2="0"><subfield code="a">Scene text recognition using residual convolutional recurrent neural network</subfield></datafield><datafield tag="264" ind1=" " ind2="1"><subfield code="c">2018</subfield></datafield><datafield tag="336" ind1=" " ind2=" "><subfield code="a">Text</subfield><subfield code="b">txt</subfield><subfield code="2">rdacontent</subfield></datafield><datafield tag="337" ind1=" " ind2=" "><subfield code="a">ohne Hilfsmittel zu benutzen</subfield><subfield code="b">n</subfield><subfield code="2">rdamedia</subfield></datafield><datafield tag="338" ind1=" " ind2=" "><subfield code="a">Band</subfield><subfield code="b">nc</subfield><subfield code="2">rdacarrier</subfield></datafield><datafield tag="500" ind1=" " ind2=" "><subfield code="a">© Springer-Verlag GmbH Germany, part of Springer Nature 2018</subfield></datafield><datafield tag="520" ind1=" " ind2=" "><subfield code="a">Abstract Text is a significant tool for human communication, and text recognition in scene images becomes more and more important. In this paper, we propose a residual convolutional recurrent neural network for solving the task of scene text recognition. The general convolutional recurrent neural network (CRNN) is realized by combining convolutional neural network (CNN) with recurrent neural network (RNN). The CNN part extracts features and the RNN part encodes and decodes feature sequences. In order to improve the accuracy rate of scene text recognition based on CRNN, we explore different deeper CNN architectures to get feature descriptors and analyze the corresponding text recognition results. Specifically, VGG and ResNet are introduced to train these different deep models and obtain the encoding information of images. The experimental results on public datasets demonstrate the effectiveness of our method.</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Residual convolutional recurrent neural network</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Scene text recognition</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Convolutional neural network</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Recurrent neural network</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Residual network</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">Zhao, Sanyuan</subfield><subfield code="4">aut</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">Song, Hongmei</subfield><subfield code="4">aut</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">Shen, Jianbing</subfield><subfield code="4">aut</subfield></datafield><datafield tag="773" ind1="0" ind2="8"><subfield code="i">Enthalten in</subfield><subfield code="t">Machine vision and applications</subfield><subfield code="d">Springer Berlin Heidelberg, 1988</subfield><subfield code="g">29(2018), 5 vom: 16. Juni, Seite 861-871</subfield><subfield code="w">(DE-627)129248843</subfield><subfield code="w">(DE-600)59385-0</subfield><subfield code="w">(DE-576)017944139</subfield><subfield code="x">0932-8092</subfield><subfield code="7">nnns</subfield></datafield><datafield tag="773" ind1="1" ind2="8"><subfield code="g">volume:29</subfield><subfield code="g">year:2018</subfield><subfield code="g">number:5</subfield><subfield code="g">day:16</subfield><subfield code="g">month:06</subfield><subfield code="g">pages:861-871</subfield></datafield><datafield tag="856" ind1="4" ind2="1"><subfield code="u">https://doi.org/10.1007/s00138-018-0942-y</subfield><subfield code="z">lizenzpflichtig</subfield><subfield code="3">Volltext</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_USEFLAG_A</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">SYSFLAG_A</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_OLC</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">SSG-OLC-MAT</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_ILN_70</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_ILN_2018</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_ILN_4277</subfield></datafield><datafield tag="951" ind1=" " ind2=" "><subfield code="a">AR</subfield></datafield><datafield tag="952" ind1=" " ind2=" "><subfield code="d">29</subfield><subfield code="j">2018</subfield><subfield code="e">5</subfield><subfield code="b">16</subfield><subfield code="c">06</subfield><subfield code="h">861-871</subfield></datafield></record></collection>
author	Lei, Zhengchao
spellingShingle	Lei, Zhengchao ddc 004 ssgn 11 misc Residual convolutional recurrent neural network misc Scene text recognition misc Convolutional neural network misc Recurrent neural network misc Residual network Scene text recognition using residual convolutional recurrent neural network
authorStr	Lei, Zhengchao
ppnlink_with_tag_str_mv	@@773@@(DE-627)129248843
format	Article
dewey-ones	004 - Data processing & computer science
delete_txt_mv	keep
author_role	aut aut aut aut
collection	OLC
remote_str	false
illustrated	Not Illustrated
issn	0932-8092
topic_title	004 VZ 11 ssgn Scene text recognition using residual convolutional recurrent neural network Residual convolutional recurrent neural network Scene text recognition Convolutional neural network Recurrent neural network Residual network
topic	ddc 004 ssgn 11 misc Residual convolutional recurrent neural network misc Scene text recognition misc Convolutional neural network misc Recurrent neural network misc Residual network
topic_unstemmed	ddc 004 ssgn 11 misc Residual convolutional recurrent neural network misc Scene text recognition misc Convolutional neural network misc Recurrent neural network misc Residual network
topic_browse	ddc 004 ssgn 11 misc Residual convolutional recurrent neural network misc Scene text recognition misc Convolutional neural network misc Recurrent neural network misc Residual network
format_facet	Aufsätze Gedruckte Aufsätze
format_main_str_mv	Text Zeitschrift/Artikel
carriertype_str_mv	nc
hierarchy_parent_title	Machine vision and applications
hierarchy_parent_id	129248843
dewey-tens	000 - Computer science, knowledge & systems
hierarchy_top_title	Machine vision and applications
isfreeaccess_txt	false
familylinks_str_mv	(DE-627)129248843 (DE-600)59385-0 (DE-576)017944139
title	Scene text recognition using residual convolutional recurrent neural network
ctrlnum	(DE-627)OLC2074632053 (DE-He213)s00138-018-0942-y-p
title_full	Scene text recognition using residual convolutional recurrent neural network
author_sort	Lei, Zhengchao
journal	Machine vision and applications
journalStr	Machine vision and applications
lang_code	eng
isOA_bool	false
dewey-hundreds	000 - Computer science, information & general works
recordtype	marc
publishDateSort	2018
contenttype_str_mv	txt
container_start_page	861
author_browse	Lei, Zhengchao Zhao, Sanyuan Song, Hongmei Shen, Jianbing
container_volume	29
class	004 VZ 11 ssgn
format_se	Aufsätze
author-letter	Lei, Zhengchao
doi_str_mv	10.1007/s00138-018-0942-y
dewey-full	004
title_sort	scene text recognition using residual convolutional recurrent neural network
title_auth	Scene text recognition using residual convolutional recurrent neural network
abstract	Abstract Text is a significant tool for human communication, and text recognition in scene images becomes more and more important. In this paper, we propose a residual convolutional recurrent neural network for solving the task of scene text recognition. The general convolutional recurrent neural network (CRNN) is realized by combining convolutional neural network (CNN) with recurrent neural network (RNN). The CNN part extracts features and the RNN part encodes and decodes feature sequences. In order to improve the accuracy rate of scene text recognition based on CRNN, we explore different deeper CNN architectures to get feature descriptors and analyze the corresponding text recognition results. Specifically, VGG and ResNet are introduced to train these different deep models and obtain the encoding information of images. The experimental results on public datasets demonstrate the effectiveness of our method. © Springer-Verlag GmbH Germany, part of Springer Nature 2018
abstractGer	Abstract Text is a significant tool for human communication, and text recognition in scene images becomes more and more important. In this paper, we propose a residual convolutional recurrent neural network for solving the task of scene text recognition. The general convolutional recurrent neural network (CRNN) is realized by combining convolutional neural network (CNN) with recurrent neural network (RNN). The CNN part extracts features and the RNN part encodes and decodes feature sequences. In order to improve the accuracy rate of scene text recognition based on CRNN, we explore different deeper CNN architectures to get feature descriptors and analyze the corresponding text recognition results. Specifically, VGG and ResNet are introduced to train these different deep models and obtain the encoding information of images. The experimental results on public datasets demonstrate the effectiveness of our method. © Springer-Verlag GmbH Germany, part of Springer Nature 2018
abstract_unstemmed	Abstract Text is a significant tool for human communication, and text recognition in scene images becomes more and more important. In this paper, we propose a residual convolutional recurrent neural network for solving the task of scene text recognition. The general convolutional recurrent neural network (CRNN) is realized by combining convolutional neural network (CNN) with recurrent neural network (RNN). The CNN part extracts features and the RNN part encodes and decodes feature sequences. In order to improve the accuracy rate of scene text recognition based on CRNN, we explore different deeper CNN architectures to get feature descriptors and analyze the corresponding text recognition results. Specifically, VGG and ResNet are introduced to train these different deep models and obtain the encoding information of images. The experimental results on public datasets demonstrate the effectiveness of our method. © Springer-Verlag GmbH Germany, part of Springer Nature 2018
collection_details	GBV_USEFLAG_A SYSFLAG_A GBV_OLC SSG-OLC-MAT GBV_ILN_70 GBV_ILN_2018 GBV_ILN_4277
container_issue	5
title_short	Scene text recognition using residual convolutional recurrent neural network
url	https://doi.org/10.1007/s00138-018-0942-y
remote_bool	false
author2	Zhao, Sanyuan Song, Hongmei Shen, Jianbing
author2Str	Zhao, Sanyuan Song, Hongmei Shen, Jianbing
ppnlink	129248843
mediatype_str_mv	n
isOA_txt	false
hochschulschrift_bool	false
doi_str	10.1007/s00138-018-0942-y
up_date	2024-07-03T22:54:42.375Z
_version_	1803600299167842304
fullrecord_marcxml	<?xml version="1.0" encoding="UTF-8"?><collection xmlns="http://www.loc.gov/MARC21/slim"><record><leader>01000caa a22002652 4500</leader><controlfield tag="001">OLC2074632053</controlfield><controlfield tag="003">DE-627</controlfield><controlfield tag="005">20230401063324.0</controlfield><controlfield tag="007">tu</controlfield><controlfield tag="008">200820s2018 xx \|\|\|\|\| 00\| \|\|eng c</controlfield><datafield tag="024" ind1="7" ind2=" "><subfield code="a">10.1007/s00138-018-0942-y</subfield><subfield code="2">doi</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(DE-627)OLC2074632053</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(DE-He213)s00138-018-0942-y-p</subfield></datafield><datafield tag="040" ind1=" " ind2=" "><subfield code="a">DE-627</subfield><subfield code="b">ger</subfield><subfield code="c">DE-627</subfield><subfield code="e">rakwb</subfield></datafield><datafield tag="041" ind1=" " ind2=" "><subfield code="a">eng</subfield></datafield><datafield tag="082" ind1="0" ind2="4"><subfield code="a">004</subfield><subfield code="q">VZ</subfield></datafield><datafield tag="084" ind1=" " ind2=" "><subfield code="a">11</subfield><subfield code="2">ssgn</subfield></datafield><datafield tag="100" ind1="1" ind2=" "><subfield code="a">Lei, Zhengchao</subfield><subfield code="e">verfasserin</subfield><subfield code="4">aut</subfield></datafield><datafield tag="245" ind1="1" ind2="0"><subfield code="a">Scene text recognition using residual convolutional recurrent neural network</subfield></datafield><datafield tag="264" ind1=" " ind2="1"><subfield code="c">2018</subfield></datafield><datafield tag="336" ind1=" " ind2=" "><subfield code="a">Text</subfield><subfield code="b">txt</subfield><subfield code="2">rdacontent</subfield></datafield><datafield tag="337" ind1=" " ind2=" "><subfield code="a">ohne Hilfsmittel zu benutzen</subfield><subfield code="b">n</subfield><subfield code="2">rdamedia</subfield></datafield><datafield tag="338" ind1=" " ind2=" "><subfield code="a">Band</subfield><subfield code="b">nc</subfield><subfield code="2">rdacarrier</subfield></datafield><datafield tag="500" ind1=" " ind2=" "><subfield code="a">© Springer-Verlag GmbH Germany, part of Springer Nature 2018</subfield></datafield><datafield tag="520" ind1=" " ind2=" "><subfield code="a">Abstract Text is a significant tool for human communication, and text recognition in scene images becomes more and more important. In this paper, we propose a residual convolutional recurrent neural network for solving the task of scene text recognition. The general convolutional recurrent neural network (CRNN) is realized by combining convolutional neural network (CNN) with recurrent neural network (RNN). The CNN part extracts features and the RNN part encodes and decodes feature sequences. In order to improve the accuracy rate of scene text recognition based on CRNN, we explore different deeper CNN architectures to get feature descriptors and analyze the corresponding text recognition results. Specifically, VGG and ResNet are introduced to train these different deep models and obtain the encoding information of images. The experimental results on public datasets demonstrate the effectiveness of our method.</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Residual convolutional recurrent neural network</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Scene text recognition</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Convolutional neural network</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Recurrent neural network</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Residual network</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">Zhao, Sanyuan</subfield><subfield code="4">aut</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">Song, Hongmei</subfield><subfield code="4">aut</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">Shen, Jianbing</subfield><subfield code="4">aut</subfield></datafield><datafield tag="773" ind1="0" ind2="8"><subfield code="i">Enthalten in</subfield><subfield code="t">Machine vision and applications</subfield><subfield code="d">Springer Berlin Heidelberg, 1988</subfield><subfield code="g">29(2018), 5 vom: 16. Juni, Seite 861-871</subfield><subfield code="w">(DE-627)129248843</subfield><subfield code="w">(DE-600)59385-0</subfield><subfield code="w">(DE-576)017944139</subfield><subfield code="x">0932-8092</subfield><subfield code="7">nnns</subfield></datafield><datafield tag="773" ind1="1" ind2="8"><subfield code="g">volume:29</subfield><subfield code="g">year:2018</subfield><subfield code="g">number:5</subfield><subfield code="g">day:16</subfield><subfield code="g">month:06</subfield><subfield code="g">pages:861-871</subfield></datafield><datafield tag="856" ind1="4" ind2="1"><subfield code="u">https://doi.org/10.1007/s00138-018-0942-y</subfield><subfield code="z">lizenzpflichtig</subfield><subfield code="3">Volltext</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_USEFLAG_A</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">SYSFLAG_A</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_OLC</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">SSG-OLC-MAT</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_ILN_70</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_ILN_2018</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_ILN_4277</subfield></datafield><datafield tag="951" ind1=" " ind2=" "><subfield code="a">AR</subfield></datafield><datafield tag="952" ind1=" " ind2=" "><subfield code="d">29</subfield><subfield code="j">2018</subfield><subfield code="e">5</subfield><subfield code="b">16</subfield><subfield code="c">06</subfield><subfield code="h">861-871</subfield></datafield></record></collection>
score	7.3983088

Nicht das Richtige dabei?

Schreiben Sie uns!

Scene text recognition using residual convolutional recurrent neural network

Nicht das Richtige dabei?

Zugang & Verfügbarkeit

Vorhandene Bände

Nicht das Richtige dabei?