A spatial feature adaptive network for text detection

Abstract Because they can detect text of arbitrary shape and are robust in practical applications, segmentation-based scene text detection methods have attracted increasing attention. More accurate segmentation and better feature extraction are the core of segmentation-based detection. In...

Author:	Tang, Qingsong [author] Feng, Xiaoxu Zhang, Xiangde

Format:	Article
Language:	English

Published:	2022

Keywords:	Text detection Desubpixel convolution Convolution neural network Spatial adaptive convolutional network

Note:	© The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2022

Parent work:	Contained in: Multimedia tools and applications - Springer US, 1995, 81(2022), 11, 28 Feb., pages 15285-15302
Parent work:	volume:81 ; year:2022 ; number:11 ; day:28 ; month:02 ; pages:15285-15302

Links:	Full text

DOI / URN:	10.1007/s11042-022-12619-3

Catalog ID:	OLC2078570826

Internal format


LEADER	01000caa a22002652 4500
001	OLC2078570826
003	DE-627
005	20230506012652.0
007	tu
008	221220s2022 xx \|\|\|\|\| 00\| \|\|eng c
024	7		\|a 10.1007/s11042-022-12619-3 \|2 doi
035			\|a (DE-627)OLC2078570826
035			\|a (DE-He213)s11042-022-12619-3-p
040			\|a DE-627 \|b ger \|c DE-627 \|e rakwb
041			\|a eng
082	0	4	\|a 070 \|a 004 \|q VZ
100	1		\|a Tang, Qingsong \|e verfasserin \|4 aut
245	1	0	\|a A spatial feature adaptive network for text detection
264		1	\|c 2022
336			\|a Text \|b txt \|2 rdacontent
337			\|a ohne Hilfsmittel zu benutzen \|b n \|2 rdamedia
338			\|a Band \|b nc \|2 rdacarrier
500			\|a © The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2022
520			\|a Abstract Because they can detect text of arbitrary shape and are robust in practical applications, segmentation-based scene text detection methods have attracted increasing attention. More accurate segmentation and better feature extraction are the core of segmentation-based detection. To refine the segmentation result, we replace the convolution in the first block of ResNet50 with desubpixel convolution to enhance the feature extraction capabilities of the network. We also propose a spatial adaptive convolutional network to adjust the features extracted by the backbone so that the network can extract features more suitable for natural scene text detection. We implement the presented network based on PSENet. The results on ICDAR2015 and SCUT-CTW1500 demonstrate that our module can improve the performance of text detection. Precision, recall and F-measure reach 87.27%, 84.88% and 86.06% on ICDAR2015, and 81.99%, 82.63% and 82.31% on CTW1500. Our code will be available at https://github.com/fengdashuai/Ada-PSENet.
650		4	\|a Text detection
650		4	\|a Desubpixel convolution
650		4	\|a Convolution neural network
650		4	\|a Spatial adaptive convolutional network
700	1		\|a Feng, Xiaoxu \|4 aut
700	1		\|a Zhang, Xiangde \|4 aut
773	0	8	\|i Enthalten in \|t Multimedia tools and applications \|d Springer US, 1995 \|g 81(2022), 11 vom: 28. Feb., Seite 15285-15302 \|w (DE-627)189064145 \|w (DE-600)1287642-2 \|w (DE-576)052842126 \|x 1380-7501 \|7 nnns
773	1	8	\|g volume:81 \|g year:2022 \|g number:11 \|g day:28 \|g month:02 \|g pages:15285-15302
856	4	1	\|u https://doi.org/10.1007/s11042-022-12619-3 \|z lizenzpflichtig \|3 Volltext
912			\|a GBV_USEFLAG_A
912			\|a SYSFLAG_A
912			\|a GBV_OLC
912			\|a SSG-OLC-MAT
912			\|a SSG-OLC-BUB
912			\|a SSG-OLC-MKW
951			\|a AR
952			\|d 81 \|j 2022 \|e 11 \|b 28 \|c 02 \|h 15285-15302
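
The F-measure values quoted in the abstract (field 520) can be cross-checked against the reported precision and recall. The sketch below assumes the F-measure is the usual harmonic mean of precision and recall (the record itself does not state the formula); the function name `f_measure` and the `reported` dictionary are illustrative only.

```python
def f_measure(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall (assumed definition of F-measure)."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# (precision %, recall %, reported F-measure %) as quoted in the abstract.
reported = {
    "ICDAR2015": (87.27, 84.88, 86.06),
    "CTW1500": (81.99, 82.63, 82.31),
}

for dataset, (p, r, f_reported) in reported.items():
    f_computed = f_measure(p, r)
    # Compare at two decimals, the precision used in the abstract.
    print(dataset, round(f_computed, 2), f_reported)
```

Under that assumption, both recomputed values agree with the reported ones to two decimal places.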

Index fields

author_variant	q t qt x f xf x z xz
matchkey_str	article:13807501:2022----::sailetraatvntoko
hierarchy_sort_str	2022
publishDate	2022
allfields	10.1007/s11042-022-12619-3 doi (DE-627)OLC2078570826 (DE-He213)s11042-022-12619-3-p DE-627 ger DE-627 rakwb eng 070 004 VZ Tang, Qingsong verfasserin aut A spatial feature adaptive network for text detection 2022 Text txt rdacontent ohne Hilfsmittel zu benutzen n rdamedia Band nc rdacarrier © The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2022 Abstract Because they can detect text of arbitrary shape and are robust in practical applications, segmentation-based scene text detection methods have attracted increasing attention. More accurate segmentation and better feature extraction are the core of segmentation-based detection. To refine the segmentation result, we replace the convolution in the first block of ResNet50 with desubpixel convolution to enhance the feature extraction capabilities of the network. We also propose a spatial adaptive convolutional network to adjust the features extracted by the backbone so that the network can extract features more suitable for natural scene text detection. We implement the presented network based on PSENet. The results on ICDAR2015 and SCUT-CTW1500 demonstrate that our module can improve the performance of text detection. Precision, recall and F-measure reach 87.27%, 84.88% and 86.06% on ICDAR2015, and 81.99%, 82.63% and 82.31% on CTW1500. Our code will be available at https://github.com/fengdashuai/Ada-PSENet. Text detection Desubpixel convolution Convolution neural network Spatial adaptive convolutional network Feng, Xiaoxu aut Zhang, Xiangde aut Enthalten in Multimedia tools and applications Springer US, 1995 81(2022), 11 vom: 28. Feb., Seite 15285-15302 (DE-627)189064145 (DE-600)1287642-2 (DE-576)052842126 1380-7501 nnns volume:81 year:2022 number:11 day:28 month:02 pages:15285-15302 https://doi.org/10.1007/s11042-022-12619-3 lizenzpflichtig Volltext GBV_USEFLAG_A SYSFLAG_A GBV_OLC SSG-OLC-MAT SSG-OLC-BUB SSG-OLC-MKW AR 81 2022 11 28 02 15285-15302
language	English
source	Enthalten in Multimedia tools and applications 81(2022), 11 vom: 28. Feb., Seite 15285-15302 volume:81 year:2022 number:11 day:28 month:02 pages:15285-15302
sourceStr	Enthalten in Multimedia tools and applications 81(2022), 11 vom: 28. Feb., Seite 15285-15302 volume:81 year:2022 number:11 day:28 month:02 pages:15285-15302
format_phy_str_mv	Article
institution	findex.gbv.de
topic_facet	Text detection Desubpixel convolution Convolution neural network Spatial adaptive convolutional network
dewey-raw	070
isfreeaccess_bool	false
container_title	Multimedia tools and applications
authorswithroles_txt_mv	Tang, Qingsong @@aut@@ Feng, Xiaoxu @@aut@@ Zhang, Xiangde @@aut@@
publishDateDaySort_date	2022-02-28T00:00:00Z
hierarchy_top_id	189064145
dewey-sort	270
id	OLC2078570826
language_de	englisch
fullrecord	<?xml version="1.0" encoding="UTF-8"?><collection xmlns="http://www.loc.gov/MARC21/slim"><record><leader>01000caa a22002652 4500</leader><controlfield tag="001">OLC2078570826</controlfield><controlfield tag="003">DE-627</controlfield><controlfield tag="005">20230506012652.0</controlfield><controlfield tag="007">tu</controlfield><controlfield tag="008">221220s2022 xx \|\|\|\|\| 00\| \|\|eng c</controlfield><datafield tag="024" ind1="7" ind2=" "><subfield code="a">10.1007/s11042-022-12619-3</subfield><subfield code="2">doi</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(DE-627)OLC2078570826</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(DE-He213)s11042-022-12619-3-p</subfield></datafield><datafield tag="040" ind1=" " ind2=" "><subfield code="a">DE-627</subfield><subfield code="b">ger</subfield><subfield code="c">DE-627</subfield><subfield code="e">rakwb</subfield></datafield><datafield tag="041" ind1=" " ind2=" "><subfield code="a">eng</subfield></datafield><datafield tag="082" ind1="0" ind2="4"><subfield code="a">070</subfield><subfield code="a">004</subfield><subfield code="q">VZ</subfield></datafield><datafield tag="100" ind1="1" ind2=" "><subfield code="a">Tang, Qingsong</subfield><subfield code="e">verfasserin</subfield><subfield code="4">aut</subfield></datafield><datafield tag="245" ind1="1" ind2="0"><subfield code="a">A spatial feature adaptive network for text detection</subfield></datafield><datafield tag="264" ind1=" " ind2="1"><subfield code="c">2022</subfield></datafield><datafield tag="336" ind1=" " ind2=" "><subfield code="a">Text</subfield><subfield code="b">txt</subfield><subfield code="2">rdacontent</subfield></datafield><datafield tag="337" ind1=" " ind2=" "><subfield code="a">ohne Hilfsmittel zu benutzen</subfield><subfield code="b">n</subfield><subfield code="2">rdamedia</subfield></datafield><datafield tag="338" ind1=" " ind2=" "><subfield 
code="a">Band</subfield><subfield code="b">nc</subfield><subfield code="2">rdacarrier</subfield></datafield><datafield tag="500" ind1=" " ind2=" "><subfield code="a">© The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2022</subfield></datafield><datafield tag="520" ind1=" " ind2=" "><subfield code="a">Abstract Due to the capacity of detection arbitrary shapes of text and the robustness in practical applications, scene text detection methods based on segmentation have attached more attention. More accurate segmentation and better feature extraction are the core of segmentation-based detection. In order to refine the result of segmentation, we replace the convolution in the first block of the ResNet50 by desubpixel convolution to enhance the feature extraction capabilities of the network. We also propose a spatial adaptive convolutional network to adjust the features extracted by the backbone so that the network can extract features more suitable for natural scene text detection. We implement the presented network based on PSENet. The results on ICDAR2015 and SCUT-CTW1500 demonstrate that our module can improve the performance of text detection. The precision, recall and F-measure have reached 87.27%, 84.88% and 86.06% on ICDAR2015. And they have reached 81.99%, 82.63% and 82.31% on CTW1500. 
Our code will be available at https://github.com/fengdashuai/Ada-PSENet.</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Text detection</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Desubpixel convolution</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Convolution neural network</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Spatial adaptive convolutional network</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">Feng, Xiaoxu</subfield><subfield code="4">aut</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">Zhang, Xiangde</subfield><subfield code="4">aut</subfield></datafield><datafield tag="773" ind1="0" ind2="8"><subfield code="i">Enthalten in</subfield><subfield code="t">Multimedia tools and applications</subfield><subfield code="d">Springer US, 1995</subfield><subfield code="g">81(2022), 11 vom: 28. 
Feb., Seite 15285-15302</subfield><subfield code="w">(DE-627)189064145</subfield><subfield code="w">(DE-600)1287642-2</subfield><subfield code="w">(DE-576)052842126</subfield><subfield code="x">1380-7501</subfield><subfield code="7">nnns</subfield></datafield><datafield tag="773" ind1="1" ind2="8"><subfield code="g">volume:81</subfield><subfield code="g">year:2022</subfield><subfield code="g">number:11</subfield><subfield code="g">day:28</subfield><subfield code="g">month:02</subfield><subfield code="g">pages:15285-15302</subfield></datafield><datafield tag="856" ind1="4" ind2="1"><subfield code="u">https://doi.org/10.1007/s11042-022-12619-3</subfield><subfield code="z">lizenzpflichtig</subfield><subfield code="3">Volltext</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_USEFLAG_A</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">SYSFLAG_A</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_OLC</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">SSG-OLC-MAT</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">SSG-OLC-BUB</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">SSG-OLC-MKW</subfield></datafield><datafield tag="951" ind1=" " ind2=" "><subfield code="a">AR</subfield></datafield><datafield tag="952" ind1=" " ind2=" "><subfield code="d">81</subfield><subfield code="j">2022</subfield><subfield code="e">11</subfield><subfield code="b">28</subfield><subfield code="c">02</subfield><subfield code="h">15285-15302</subfield></datafield></record></collection>
author	Tang, Qingsong
spellingShingle	Tang, Qingsong ddc 070 misc Text detection misc Desubpixel convolution misc Convolution neural network misc Spatial adaptive convolutional network A spatial feature adaptive network for text detection
authorStr	Tang, Qingsong
ppnlink_with_tag_str_mv	@@773@@(DE-627)189064145
format	Article
dewey-ones	070 - News media, journalism & publishing 004 - Data processing & computer science
delete_txt_mv	keep
author_role	aut aut aut
collection	OLC
remote_str	false
illustrated	Not Illustrated
issn	1380-7501
topic_title	070 004 VZ A spatial feature adaptive network for text detection Text detection Desubpixel convolution Convolution neural network Spatial adaptive convolutional network
topic	ddc 070 misc Text detection misc Desubpixel convolution misc Convolution neural network misc Spatial adaptive convolutional network
topic_unstemmed	ddc 070 misc Text detection misc Desubpixel convolution misc Convolution neural network misc Spatial adaptive convolutional network
topic_browse	ddc 070 misc Text detection misc Desubpixel convolution misc Convolution neural network misc Spatial adaptive convolutional network
format_facet	Aufsätze Gedruckte Aufsätze
format_main_str_mv	Text Zeitschrift/Artikel
carriertype_str_mv	nc
hierarchy_parent_title	Multimedia tools and applications
hierarchy_parent_id	189064145
dewey-tens	070 - News media, journalism & publishing 000 - Computer science, knowledge & systems
hierarchy_top_title	Multimedia tools and applications
isfreeaccess_txt	false
familylinks_str_mv	(DE-627)189064145 (DE-600)1287642-2 (DE-576)052842126
title	A spatial feature adaptive network for text detection
ctrlnum	(DE-627)OLC2078570826 (DE-He213)s11042-022-12619-3-p
title_full	A spatial feature adaptive network for text detection
author_sort	Tang, Qingsong
journal	Multimedia tools and applications
journalStr	Multimedia tools and applications
lang_code	eng
isOA_bool	false
dewey-hundreds	000 - Computer science, information & general works
recordtype	marc
publishDateSort	2022
contenttype_str_mv	txt
container_start_page	15285
author_browse	Tang, Qingsong Feng, Xiaoxu Zhang, Xiangde
container_volume	81
class	070 004 VZ
format_se	Aufsätze
author-letter	Tang, Qingsong
doi_str_mv	10.1007/s11042-022-12619-3
dewey-full	070 004
title_sort	a spatial feature adaptive network for text detection
title_auth	A spatial feature adaptive network for text detection
abstract	Abstract Because they can detect text of arbitrary shape and are robust in practical applications, segmentation-based scene text detection methods have attracted increasing attention. More accurate segmentation and better feature extraction are the core of segmentation-based detection. To refine the segmentation result, we replace the convolution in the first block of ResNet50 with desubpixel convolution to enhance the feature extraction capabilities of the network. We also propose a spatial adaptive convolutional network to adjust the features extracted by the backbone so that the network can extract features more suitable for natural scene text detection. We implement the presented network based on PSENet. The results on ICDAR2015 and SCUT-CTW1500 demonstrate that our module can improve the performance of text detection. Precision, recall and F-measure reach 87.27%, 84.88% and 86.06% on ICDAR2015, and 81.99%, 82.63% and 82.31% on CTW1500. Our code will be available at https://github.com/fengdashuai/Ada-PSENet. © The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2022
collection_details	GBV_USEFLAG_A SYSFLAG_A GBV_OLC SSG-OLC-MAT SSG-OLC-BUB SSG-OLC-MKW
container_issue	11
title_short	A spatial feature adaptive network for text detection
url	https://doi.org/10.1007/s11042-022-12619-3
remote_bool	false
author2	Feng, Xiaoxu Zhang, Xiangde
author2Str	Feng, Xiaoxu Zhang, Xiangde
ppnlink	189064145
mediatype_str_mv	n
isOA_txt	false
hochschulschrift_bool	false
doi_str	10.1007/s11042-022-12619-3
up_date	2024-07-03T21:05:33.576Z
_version_	1803593432251236352
score	7.3986187
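
The `fullrecord` field above stores the record as MARCXML, so the bibliographic details can be read back programmatically. The following is a minimal sketch using Python's standard `xml.etree.ElementTree`; the `SAMPLE` string is an abbreviated excerpt of the record (only fields 024, 100, 245 and 700 are kept), and the helper name `subfields` is hypothetical.

```python
import xml.etree.ElementTree as ET

# Namespace used by MARCXML, as declared on the <collection> element.
NS = {"marc": "http://www.loc.gov/MARC21/slim"}

# Abbreviated excerpt of the record, for illustration only.
SAMPLE = (
    '<collection xmlns="http://www.loc.gov/MARC21/slim"><record>'
    '<datafield tag="024" ind1="7" ind2=" ">'
    '<subfield code="a">10.1007/s11042-022-12619-3</subfield>'
    '<subfield code="2">doi</subfield></datafield>'
    '<datafield tag="100" ind1="1" ind2=" ">'
    '<subfield code="a">Tang, Qingsong</subfield></datafield>'
    '<datafield tag="245" ind1="1" ind2="0">'
    '<subfield code="a">A spatial feature adaptive network for text detection'
    '</subfield></datafield>'
    '<datafield tag="700" ind1="1" ind2=" ">'
    '<subfield code="a">Feng, Xiaoxu</subfield></datafield>'
    '<datafield tag="700" ind1="1" ind2=" ">'
    '<subfield code="a">Zhang, Xiangde</subfield></datafield>'
    '</record></collection>'
)


def subfields(record, tag, code):
    """Return every value of subfield `code` in datafields with the given tag."""
    path = f"marc:datafield[@tag='{tag}']/marc:subfield[@code='{code}']"
    return [el.text for el in record.findall(path, NS)]


record = ET.fromstring(SAMPLE).find("marc:record", NS)
doi = subfields(record, "024", "a")[0]
title = subfields(record, "245", "a")[0]
# Field 100 holds the first author, repeated 700 fields hold the others.
authors = subfields(record, "100", "a") + subfields(record, "700", "a")

print(doi)
print(title)
print("; ".join(authors))
```

The same helper should work on the complete `fullrecord` value, since it only relies on the tag and subfield codes shown in the internal format listing.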
