Investigation of Acoustic and Visual Features for Frog Call Classification
Abstract Rapid decreases in frog populations have been spotted worldwide, which are regarded as one of the most critical threats to the global biodiversity. Recent advances in acoustic sensors provide a novel way to assess frog vocalizations and further optimize the global protection policy. Specifi...
Ausführliche Beschreibung
Autor*in: |
Xie, Jie [verfasserIn] Towsey, Michael [verfasserIn] Zhang, Jinglan [verfasserIn] Roe, Paul [verfasserIn] |
---|
Format: |
E-Artikel |
---|---|
Sprache: |
Englisch |
Erschienen: |
2019 |
---|
Schlagwörter: |
---|
Übergeordnetes Werk: |
Enthalten in: Journal of VLSI signal processing systems for signal, image and video technology - Springer Netherlands, 1989, 92(2019), 1 vom: 26. Feb., Seite 23-36 |
---|---|
Übergeordnetes Werk: |
volume:92 ; year:2019 ; number:1 ; day:26 ; month:02 ; pages:23-36 |
Links: |
---|
DOI / URN: |
10.1007/s11265-019-1445-4 |
---|
Katalog-ID: |
SPR018334326 |
---|
LEADER | 01000caa a22002652 4500 | ||
---|---|---|---|
001 | SPR018334326 | ||
003 | DE-627 | ||
005 | 20201124222422.0 | ||
007 | cr uuu---uuuuu | ||
008 | 201006s2019 xx |||||o 00| ||eng c | ||
024 | 7 | |a 10.1007/s11265-019-1445-4 |2 doi | |
035 | |a (DE-627)SPR018334326 | ||
035 | |a (SPR)s11265-019-1445-4-e | ||
040 | |a DE-627 |b ger |c DE-627 |e rakwb | ||
041 | |a eng | ||
100 | 1 | |a Xie, Jie |e verfasserin |4 aut | |
245 | 1 | 0 | |a Investigation of Acoustic and Visual Features for Frog Call Classification |
264 | 1 | |c 2019 | |
336 | |a Text |b txt |2 rdacontent | ||
337 | |a Computermedien |b c |2 rdamedia | ||
338 | |a Online-Ressource |b cr |2 rdacarrier | ||
520 | |a Abstract Rapid decreases in frog populations have been spotted worldwide, which are regarded as one of the most critical threats to the global biodiversity. Recent advances in acoustic sensors provide a novel way to assess frog vocalizations and further optimize the global protection policy. Specifically, frog populations can be reflected by detecting frog species using collected recordings. Previous studies have explored various acoustic features for classifying frog calls. However, few studies investigate visual features for frog call classification, which have been successfully used in acoustic event detection, speech/speaker recognition. In this study, various acoustic and visual features are proposed for frog call classification: MPEG-7 audio descriptor, syllable duration, oscillation rate, entropy related features, linear prediction codings, Mel-frequency Cepstral coefficients, local binary patterns, and histogram of oriented gradients. After segmenting continuous frog calls into individual syllables, different constructed feature sets are evaluated with a k-nearest neighbor classifier and support vector machines. Comprehensive results on 16 frog species demonstrate the effectiveness of both acoustic and visual features for classifying frog calls. | ||
650 | 4 | |a Bioacoustics |7 (dpeaa)DE-He213 | |
650 | 4 | |a Ecoacoustics |7 (dpeaa)DE-He213 | |
650 | 4 | |a Acoustic feature extraction |7 (dpeaa)DE-He213 | |
650 | 4 | |a Visual feature extraction |7 (dpeaa)DE-He213 | |
650 | 4 | |a Frog call classification |7 (dpeaa)DE-He213 | |
700 | 1 | |a Towsey, Michael |e verfasserin |4 aut | |
700 | 1 | |a Zhang, Jinglan |e verfasserin |4 aut | |
700 | 1 | |a Roe, Paul |e verfasserin |4 aut | |
773 | 0 | 8 | |i Enthalten in |t Journal of VLSI signal processing systems for signal, image and video technology |d Springer Netherlands, 1989 |g 92(2019), 1 vom: 26. Feb., Seite 23-36 |w (DE-627)SPR018308090 |7 nnns |
773 | 1 | 8 | |g volume:92 |g year:2019 |g number:1 |g day:26 |g month:02 |g pages:23-36 |
856 | 4 | 0 | |u https://dx.doi.org/10.1007/s11265-019-1445-4 |z lizenzpflichtig |3 Volltext |
912 | |a GBV_USEFLAG_A | ||
912 | |a SYSFLAG_A | ||
912 | |a GBV_SPRINGER | ||
912 | |a GBV_ILN_40 | ||
912 | |a GBV_ILN_2006 | ||
912 | |a GBV_ILN_2027 | ||
951 | |a AR | ||
952 | |d 92 |j 2019 |e 1 |b 26 |c 02 |h 23-36 |
author_variant |
j x jx m t mt j z jz p r pr |
---|---|
matchkey_str |
xiejietowseymichaelzhangjinglanroepaul:2019----:netgtooaosiadiuletrsofoc |
hierarchy_sort_str |
2019 |
publishDate |
2019 |
allfields |
10.1007/s11265-019-1445-4 doi (DE-627)SPR018334326 (SPR)s11265-019-1445-4-e DE-627 ger DE-627 rakwb eng Xie, Jie verfasserin aut Investigation of Acoustic and Visual Features for Frog Call Classification 2019 Text txt rdacontent Computermedien c rdamedia Online-Ressource cr rdacarrier Abstract Rapid decreases in frog populations have been spotted worldwide, which are regarded as one of the most critical threats to the global biodiversity. Recent advances in acoustic sensors provide a novel way to assess frog vocalizations and further optimize the global protection policy. Specifically, frog populations can be reflected by detecting frog species using collected recordings. Previous studies have explored various acoustic features for classifying frog calls. However, few studies investigate visual features for frog call classification, which have been successfully used in acoustic event detection, speech/speaker recognition. In this study, various acoustic and visual features are proposed for frog call classification: MPEG-7 audio descriptor, syllable duration, oscillation rate, entropy related features, linear prediction codings, Mel-frequency Cepstral coefficients, local binary patterns, and histogram of oriented gradients. After segmenting continuous frog calls into individual syllables, different constructed feature sets are evaluated with a k-nearest neighbor classifier and support vector machines. Comprehensive results on 16 frog species demonstrate the effectiveness of both acoustic and visual features for classifying frog calls. Bioacoustics (dpeaa)DE-He213 Ecoacoustics (dpeaa)DE-He213 Acoustic feature extraction (dpeaa)DE-He213 Visual feature extraction (dpeaa)DE-He213 Frog call classification (dpeaa)DE-He213 Towsey, Michael verfasserin aut Zhang, Jinglan verfasserin aut Roe, Paul verfasserin aut Enthalten in Journal of VLSI signal processing systems for signal, image and video technology Springer Netherlands, 1989 92(2019), 1 vom: 26. Feb., Seite 23-36 (DE-627)SPR018308090 nnns volume:92 year:2019 number:1 day:26 month:02 pages:23-36 https://dx.doi.org/10.1007/s11265-019-1445-4 lizenzpflichtig Volltext GBV_USEFLAG_A SYSFLAG_A GBV_SPRINGER GBV_ILN_40 GBV_ILN_2006 GBV_ILN_2027 AR 92 2019 1 26 02 23-36 |
spelling |
10.1007/s11265-019-1445-4 doi (DE-627)SPR018334326 (SPR)s11265-019-1445-4-e DE-627 ger DE-627 rakwb eng Xie, Jie verfasserin aut Investigation of Acoustic and Visual Features for Frog Call Classification 2019 Text txt rdacontent Computermedien c rdamedia Online-Ressource cr rdacarrier Abstract Rapid decreases in frog populations have been spotted worldwide, which are regarded as one of the most critical threats to the global biodiversity. Recent advances in acoustic sensors provide a novel way to assess frog vocalizations and further optimize the global protection policy. Specifically, frog populations can be reflected by detecting frog species using collected recordings. Previous studies have explored various acoustic features for classifying frog calls. However, few studies investigate visual features for frog call classification, which have been successfully used in acoustic event detection, speech/speaker recognition. In this study, various acoustic and visual features are proposed for frog call classification: MPEG-7 audio descriptor, syllable duration, oscillation rate, entropy related features, linear prediction codings, Mel-frequency Cepstral coefficients, local binary patterns, and histogram of oriented gradients. After segmenting continuous frog calls into individual syllables, different constructed feature sets are evaluated with a k-nearest neighbor classifier and support vector machines. Comprehensive results on 16 frog species demonstrate the effectiveness of both acoustic and visual features for classifying frog calls. Bioacoustics (dpeaa)DE-He213 Ecoacoustics (dpeaa)DE-He213 Acoustic feature extraction (dpeaa)DE-He213 Visual feature extraction (dpeaa)DE-He213 Frog call classification (dpeaa)DE-He213 Towsey, Michael verfasserin aut Zhang, Jinglan verfasserin aut Roe, Paul verfasserin aut Enthalten in Journal of VLSI signal processing systems for signal, image and video technology Springer Netherlands, 1989 92(2019), 1 vom: 26. Feb., Seite 23-36 (DE-627)SPR018308090 nnns volume:92 year:2019 number:1 day:26 month:02 pages:23-36 https://dx.doi.org/10.1007/s11265-019-1445-4 lizenzpflichtig Volltext GBV_USEFLAG_A SYSFLAG_A GBV_SPRINGER GBV_ILN_40 GBV_ILN_2006 GBV_ILN_2027 AR 92 2019 1 26 02 23-36 |
allfields_unstemmed |
10.1007/s11265-019-1445-4 doi (DE-627)SPR018334326 (SPR)s11265-019-1445-4-e DE-627 ger DE-627 rakwb eng Xie, Jie verfasserin aut Investigation of Acoustic and Visual Features for Frog Call Classification 2019 Text txt rdacontent Computermedien c rdamedia Online-Ressource cr rdacarrier Abstract Rapid decreases in frog populations have been spotted worldwide, which are regarded as one of the most critical threats to the global biodiversity. Recent advances in acoustic sensors provide a novel way to assess frog vocalizations and further optimize the global protection policy. Specifically, frog populations can be reflected by detecting frog species using collected recordings. Previous studies have explored various acoustic features for classifying frog calls. However, few studies investigate visual features for frog call classification, which have been successfully used in acoustic event detection, speech/speaker recognition. In this study, various acoustic and visual features are proposed for frog call classification: MPEG-7 audio descriptor, syllable duration, oscillation rate, entropy related features, linear prediction codings, Mel-frequency Cepstral coefficients, local binary patterns, and histogram of oriented gradients. After segmenting continuous frog calls into individual syllables, different constructed feature sets are evaluated with a k-nearest neighbor classifier and support vector machines. Comprehensive results on 16 frog species demonstrate the effectiveness of both acoustic and visual features for classifying frog calls. Bioacoustics (dpeaa)DE-He213 Ecoacoustics (dpeaa)DE-He213 Acoustic feature extraction (dpeaa)DE-He213 Visual feature extraction (dpeaa)DE-He213 Frog call classification (dpeaa)DE-He213 Towsey, Michael verfasserin aut Zhang, Jinglan verfasserin aut Roe, Paul verfasserin aut Enthalten in Journal of VLSI signal processing systems for signal, image and video technology Springer Netherlands, 1989 92(2019), 1 vom: 26. Feb., Seite 23-36 (DE-627)SPR018308090 nnns volume:92 year:2019 number:1 day:26 month:02 pages:23-36 https://dx.doi.org/10.1007/s11265-019-1445-4 lizenzpflichtig Volltext GBV_USEFLAG_A SYSFLAG_A GBV_SPRINGER GBV_ILN_40 GBV_ILN_2006 GBV_ILN_2027 AR 92 2019 1 26 02 23-36 |
allfieldsGer |
10.1007/s11265-019-1445-4 doi (DE-627)SPR018334326 (SPR)s11265-019-1445-4-e DE-627 ger DE-627 rakwb eng Xie, Jie verfasserin aut Investigation of Acoustic and Visual Features for Frog Call Classification 2019 Text txt rdacontent Computermedien c rdamedia Online-Ressource cr rdacarrier Abstract Rapid decreases in frog populations have been spotted worldwide, which are regarded as one of the most critical threats to the global biodiversity. Recent advances in acoustic sensors provide a novel way to assess frog vocalizations and further optimize the global protection policy. Specifically, frog populations can be reflected by detecting frog species using collected recordings. Previous studies have explored various acoustic features for classifying frog calls. However, few studies investigate visual features for frog call classification, which have been successfully used in acoustic event detection, speech/speaker recognition. In this study, various acoustic and visual features are proposed for frog call classification: MPEG-7 audio descriptor, syllable duration, oscillation rate, entropy related features, linear prediction codings, Mel-frequency Cepstral coefficients, local binary patterns, and histogram of oriented gradients. After segmenting continuous frog calls into individual syllables, different constructed feature sets are evaluated with a k-nearest neighbor classifier and support vector machines. Comprehensive results on 16 frog species demonstrate the effectiveness of both acoustic and visual features for classifying frog calls. Bioacoustics (dpeaa)DE-He213 Ecoacoustics (dpeaa)DE-He213 Acoustic feature extraction (dpeaa)DE-He213 Visual feature extraction (dpeaa)DE-He213 Frog call classification (dpeaa)DE-He213 Towsey, Michael verfasserin aut Zhang, Jinglan verfasserin aut Roe, Paul verfasserin aut Enthalten in Journal of VLSI signal processing systems for signal, image and video technology Springer Netherlands, 1989 92(2019), 1 vom: 26. Feb., Seite 23-36 (DE-627)SPR018308090 nnns volume:92 year:2019 number:1 day:26 month:02 pages:23-36 https://dx.doi.org/10.1007/s11265-019-1445-4 lizenzpflichtig Volltext GBV_USEFLAG_A SYSFLAG_A GBV_SPRINGER GBV_ILN_40 GBV_ILN_2006 GBV_ILN_2027 AR 92 2019 1 26 02 23-36 |
allfieldsSound |
10.1007/s11265-019-1445-4 doi (DE-627)SPR018334326 (SPR)s11265-019-1445-4-e DE-627 ger DE-627 rakwb eng Xie, Jie verfasserin aut Investigation of Acoustic and Visual Features for Frog Call Classification 2019 Text txt rdacontent Computermedien c rdamedia Online-Ressource cr rdacarrier Abstract Rapid decreases in frog populations have been spotted worldwide, which are regarded as one of the most critical threats to the global biodiversity. Recent advances in acoustic sensors provide a novel way to assess frog vocalizations and further optimize the global protection policy. Specifically, frog populations can be reflected by detecting frog species using collected recordings. Previous studies have explored various acoustic features for classifying frog calls. However, few studies investigate visual features for frog call classification, which have been successfully used in acoustic event detection, speech/speaker recognition. In this study, various acoustic and visual features are proposed for frog call classification: MPEG-7 audio descriptor, syllable duration, oscillation rate, entropy related features, linear prediction codings, Mel-frequency Cepstral coefficients, local binary patterns, and histogram of oriented gradients. After segmenting continuous frog calls into individual syllables, different constructed feature sets are evaluated with a k-nearest neighbor classifier and support vector machines. Comprehensive results on 16 frog species demonstrate the effectiveness of both acoustic and visual features for classifying frog calls. Bioacoustics (dpeaa)DE-He213 Ecoacoustics (dpeaa)DE-He213 Acoustic feature extraction (dpeaa)DE-He213 Visual feature extraction (dpeaa)DE-He213 Frog call classification (dpeaa)DE-He213 Towsey, Michael verfasserin aut Zhang, Jinglan verfasserin aut Roe, Paul verfasserin aut Enthalten in Journal of VLSI signal processing systems for signal, image and video technology Springer Netherlands, 1989 92(2019), 1 vom: 26. Feb., Seite 23-36 (DE-627)SPR018308090 nnns volume:92 year:2019 number:1 day:26 month:02 pages:23-36 https://dx.doi.org/10.1007/s11265-019-1445-4 lizenzpflichtig Volltext GBV_USEFLAG_A SYSFLAG_A GBV_SPRINGER GBV_ILN_40 GBV_ILN_2006 GBV_ILN_2027 AR 92 2019 1 26 02 23-36 |
language |
English |
source |
Enthalten in Journal of VLSI signal processing systems for signal, image and video technology 92(2019), 1 vom: 26. Feb., Seite 23-36 volume:92 year:2019 number:1 day:26 month:02 pages:23-36 |
sourceStr |
Enthalten in Journal of VLSI signal processing systems for signal, image and video technology 92(2019), 1 vom: 26. Feb., Seite 23-36 volume:92 year:2019 number:1 day:26 month:02 pages:23-36 |
format_phy_str_mv |
Article |
institution |
findex.gbv.de |
topic_facet |
Bioacoustics Ecoacoustics Acoustic feature extraction Visual feature extraction Frog call classification |
isfreeaccess_bool |
false |
container_title |
Journal of VLSI signal processing systems for signal, image and video technology |
authorswithroles_txt_mv |
Xie, Jie @@aut@@ Towsey, Michael @@aut@@ Zhang, Jinglan @@aut@@ Roe, Paul @@aut@@ |
publishDateDaySort_date |
2019-02-26T00:00:00Z |
hierarchy_top_id |
SPR018308090 |
id |
SPR018334326 |
language_de |
englisch |
fullrecord |
<?xml version="1.0" encoding="UTF-8"?><collection xmlns="http://www.loc.gov/MARC21/slim"><record><leader>01000caa a22002652 4500</leader><controlfield tag="001">SPR018334326</controlfield><controlfield tag="003">DE-627</controlfield><controlfield tag="005">20201124222422.0</controlfield><controlfield tag="007">cr uuu---uuuuu</controlfield><controlfield tag="008">201006s2019 xx |||||o 00| ||eng c</controlfield><datafield tag="024" ind1="7" ind2=" "><subfield code="a">10.1007/s11265-019-1445-4</subfield><subfield code="2">doi</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(DE-627)SPR018334326</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(SPR)s11265-019-1445-4-e</subfield></datafield><datafield tag="040" ind1=" " ind2=" "><subfield code="a">DE-627</subfield><subfield code="b">ger</subfield><subfield code="c">DE-627</subfield><subfield code="e">rakwb</subfield></datafield><datafield tag="041" ind1=" " ind2=" "><subfield code="a">eng</subfield></datafield><datafield tag="100" ind1="1" ind2=" "><subfield code="a">Xie, Jie</subfield><subfield code="e">verfasserin</subfield><subfield code="4">aut</subfield></datafield><datafield tag="245" ind1="1" ind2="0"><subfield code="a">Investigation of Acoustic and Visual Features for Frog Call Classification</subfield></datafield><datafield tag="264" ind1=" " ind2="1"><subfield code="c">2019</subfield></datafield><datafield tag="336" ind1=" " ind2=" "><subfield code="a">Text</subfield><subfield code="b">txt</subfield><subfield code="2">rdacontent</subfield></datafield><datafield tag="337" ind1=" " ind2=" "><subfield code="a">Computermedien</subfield><subfield code="b">c</subfield><subfield code="2">rdamedia</subfield></datafield><datafield tag="338" ind1=" " ind2=" "><subfield code="a">Online-Ressource</subfield><subfield code="b">cr</subfield><subfield code="2">rdacarrier</subfield></datafield><datafield tag="520" ind1=" " ind2=" "><subfield code="a">Abstract Rapid decreases in frog populations have been spotted worldwide, which are regarded as one of the most critical threats to the global biodiversity. Recent advances in acoustic sensors provide a novel way to assess frog vocalizations and further optimize the global protection policy. Specifically, frog populations can be reflected by detecting frog species using collected recordings. Previous studies have explored various acoustic features for classifying frog calls. However, few studies investigate visual features for frog call classification, which have been successfully used in acoustic event detection, speech/speaker recognition. In this study, various acoustic and visual features are proposed for frog call classification: MPEG-7 audio descriptor, syllable duration, oscillation rate, entropy related features, linear prediction codings, Mel-frequency Cepstral coefficients, local binary patterns, and histogram of oriented gradients. After segmenting continuous frog calls into individual syllables, different constructed feature sets are evaluated with a k-nearest neighbor classifier and support vector machines. Comprehensive results on 16 frog species demonstrate the effectiveness of both acoustic and visual features for classifying frog calls.</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Bioacoustics</subfield><subfield code="7">(dpeaa)DE-He213</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Ecoacoustics</subfield><subfield code="7">(dpeaa)DE-He213</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Acoustic feature extraction</subfield><subfield code="7">(dpeaa)DE-He213</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Visual feature extraction</subfield><subfield code="7">(dpeaa)DE-He213</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Frog call classification</subfield><subfield code="7">(dpeaa)DE-He213</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">Towsey, Michael</subfield><subfield code="e">verfasserin</subfield><subfield code="4">aut</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">Zhang, Jinglan</subfield><subfield code="e">verfasserin</subfield><subfield code="4">aut</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">Roe, Paul</subfield><subfield code="e">verfasserin</subfield><subfield code="4">aut</subfield></datafield><datafield tag="773" ind1="0" ind2="8"><subfield code="i">Enthalten in</subfield><subfield code="t">Journal of VLSI signal processing systems for signal, image and video technology</subfield><subfield code="d">Springer Netherlands, 1989</subfield><subfield code="g">92(2019), 1 vom: 26. Feb., Seite 23-36</subfield><subfield code="w">(DE-627)SPR018308090</subfield><subfield code="7">nnns</subfield></datafield><datafield tag="773" ind1="1" ind2="8"><subfield code="g">volume:92</subfield><subfield code="g">year:2019</subfield><subfield code="g">number:1</subfield><subfield code="g">day:26</subfield><subfield code="g">month:02</subfield><subfield code="g">pages:23-36</subfield></datafield><datafield tag="856" ind1="4" ind2="0"><subfield code="u">https://dx.doi.org/10.1007/s11265-019-1445-4</subfield><subfield code="z">lizenzpflichtig</subfield><subfield code="3">Volltext</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_USEFLAG_A</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">SYSFLAG_A</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_SPRINGER</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_ILN_40</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_ILN_2006</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_ILN_2027</subfield></datafield><datafield tag="951" ind1=" " ind2=" "><subfield code="a">AR</subfield></datafield><datafield tag="952" ind1=" " ind2=" "><subfield code="d">92</subfield><subfield code="j">2019</subfield><subfield code="e">1</subfield><subfield code="b">26</subfield><subfield code="c">02</subfield><subfield code="h">23-36</subfield></datafield></record></collection>
|
author |
Xie, Jie |
spellingShingle |
Xie, Jie misc Bioacoustics misc Ecoacoustics misc Acoustic feature extraction misc Visual feature extraction misc Frog call classification Investigation of Acoustic and Visual Features for Frog Call Classification |
authorStr |
Xie, Jie |
ppnlink_with_tag_str_mv |
@@773@@(DE-627)SPR018308090 |
format |
electronic Article |
delete_txt_mv |
keep |
author_role |
aut aut aut aut |
collection |
springer |
remote_str |
true |
illustrated |
Not Illustrated |
topic_title |
Investigation of Acoustic and Visual Features for Frog Call Classification Bioacoustics (dpeaa)DE-He213 Ecoacoustics (dpeaa)DE-He213 Acoustic feature extraction (dpeaa)DE-He213 Visual feature extraction (dpeaa)DE-He213 Frog call classification (dpeaa)DE-He213 |
topic |
misc Bioacoustics misc Ecoacoustics misc Acoustic feature extraction misc Visual feature extraction misc Frog call classification |
topic_unstemmed |
misc Bioacoustics misc Ecoacoustics misc Acoustic feature extraction misc Visual feature extraction misc Frog call classification |
topic_browse |
misc Bioacoustics misc Ecoacoustics misc Acoustic feature extraction misc Visual feature extraction misc Frog call classification |
format_facet |
Elektronische Aufsätze Aufsätze Elektronische Ressource |
format_main_str_mv |
Text Zeitschrift/Artikel |
carriertype_str_mv |
cr |
hierarchy_parent_title |
Journal of VLSI signal processing systems for signal, image and video technology |
hierarchy_parent_id |
SPR018308090 |
hierarchy_top_title |
Journal of VLSI signal processing systems for signal, image and video technology |
isfreeaccess_txt |
false |
familylinks_str_mv |
(DE-627)SPR018308090 |
title |
Investigation of Acoustic and Visual Features for Frog Call Classification |
ctrlnum |
(DE-627)SPR018334326 (SPR)s11265-019-1445-4-e |
title_full |
Investigation of Acoustic and Visual Features for Frog Call Classification |
author_sort |
Xie, Jie |
journal |
Journal of VLSI signal processing systems for signal, image and video technology |
journalStr |
Journal of VLSI signal processing systems for signal, image and video technology |
lang_code |
eng |
isOA_bool |
false |
recordtype |
marc |
publishDateSort |
2019 |
contenttype_str_mv |
txt |
container_start_page |
23 |
author_browse |
Xie, Jie Towsey, Michael Zhang, Jinglan Roe, Paul |
container_volume |
92 |
format_se |
Elektronische Aufsätze |
author-letter |
Xie, Jie |
doi_str_mv |
10.1007/s11265-019-1445-4 |
author2-role |
verfasserin |
title_sort |
investigation of acoustic and visual features for frog call classification |
title_auth |
Investigation of Acoustic and Visual Features for Frog Call Classification |
abstract |
Abstract Rapid decreases in frog populations have been spotted worldwide, which are regarded as one of the most critical threats to the global biodiversity. Recent advances in acoustic sensors provide a novel way to assess frog vocalizations and further optimize the global protection policy. Specifically, frog populations can be reflected by detecting frog species using collected recordings. Previous studies have explored various acoustic features for classifying frog calls. However, few studies investigate visual features for frog call classification, which have been successfully used in acoustic event detection, speech/speaker recognition. In this study, various acoustic and visual features are proposed for frog call classification: MPEG-7 audio descriptor, syllable duration, oscillation rate, entropy related features, linear prediction codings, Mel-frequency Cepstral coefficients, local binary patterns, and histogram of oriented gradients. After segmenting continuous frog calls into individual syllables, different constructed feature sets are evaluated with a k-nearest neighbor classifier and support vector machines. Comprehensive results on 16 frog species demonstrate the effectiveness of both acoustic and visual features for classifying frog calls. |
abstractGer |
Abstract Rapid decreases in frog populations have been spotted worldwide, which are regarded as one of the most critical threats to the global biodiversity. Recent advances in acoustic sensors provide a novel way to assess frog vocalizations and further optimize the global protection policy. Specifically, frog populations can be reflected by detecting frog species using collected recordings. Previous studies have explored various acoustic features for classifying frog calls. However, few studies investigate visual features for frog call classification, which have been successfully used in acoustic event detection, speech/speaker recognition. In this study, various acoustic and visual features are proposed for frog call classification: MPEG-7 audio descriptor, syllable duration, oscillation rate, entropy related features, linear prediction codings, Mel-frequency Cepstral coefficients, local binary patterns, and histogram of oriented gradients. After segmenting continuous frog calls into individual syllables, different constructed feature sets are evaluated with a k-nearest neighbor classifier and support vector machines. Comprehensive results on 16 frog species demonstrate the effectiveness of both acoustic and visual features for classifying frog calls. |
abstract_unstemmed |
Abstract Rapid decreases in frog populations have been spotted worldwide, which are regarded as one of the most critical threats to the global biodiversity. Recent advances in acoustic sensors provide a novel way to assess frog vocalizations and further optimize the global protection policy. Specifically, frog populations can be reflected by detecting frog species using collected recordings. Previous studies have explored various acoustic features for classifying frog calls. However, few studies investigate visual features for frog call classification, which have been successfully used in acoustic event detection, speech/speaker recognition. In this study, various acoustic and visual features are proposed for frog call classification: MPEG-7 audio descriptor, syllable duration, oscillation rate, entropy related features, linear prediction codings, Mel-frequency Cepstral coefficients, local binary patterns, and histogram of oriented gradients. After segmenting continuous frog calls into individual syllables, different constructed feature sets are evaluated with a k-nearest neighbor classifier and support vector machines. Comprehensive results on 16 frog species demonstrate the effectiveness of both acoustic and visual features for classifying frog calls. |
collection_details |
GBV_USEFLAG_A SYSFLAG_A GBV_SPRINGER GBV_ILN_40 GBV_ILN_2006 GBV_ILN_2027 |
container_issue |
1 |
title_short |
Investigation of Acoustic and Visual Features for Frog Call Classification |
url |
https://dx.doi.org/10.1007/s11265-019-1445-4 |
remote_bool |
true |
author2 |
Towsey, Michael Zhang, Jinglan Roe, Paul |
author2Str |
Towsey, Michael Zhang, Jinglan Roe, Paul |
ppnlink |
SPR018308090 |
mediatype_str_mv |
c |
isOA_txt |
false |
hochschulschrift_bool |
false |
doi_str |
10.1007/s11265-019-1445-4 |
up_date |
2024-07-03T18:57:40.551Z |
_version_ |
1803585386500325376 |
fullrecord_marcxml |
<?xml version="1.0" encoding="UTF-8"?><collection xmlns="http://www.loc.gov/MARC21/slim"><record><leader>01000caa a22002652 4500</leader><controlfield tag="001">SPR018334326</controlfield><controlfield tag="003">DE-627</controlfield><controlfield tag="005">20201124222422.0</controlfield><controlfield tag="007">cr uuu---uuuuu</controlfield><controlfield tag="008">201006s2019 xx |||||o 00| ||eng c</controlfield><datafield tag="024" ind1="7" ind2=" "><subfield code="a">10.1007/s11265-019-1445-4</subfield><subfield code="2">doi</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(DE-627)SPR018334326</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(SPR)s11265-019-1445-4-e</subfield></datafield><datafield tag="040" ind1=" " ind2=" "><subfield code="a">DE-627</subfield><subfield code="b">ger</subfield><subfield code="c">DE-627</subfield><subfield code="e">rakwb</subfield></datafield><datafield tag="041" ind1=" " ind2=" "><subfield code="a">eng</subfield></datafield><datafield tag="100" ind1="1" ind2=" "><subfield code="a">Xie, Jie</subfield><subfield code="e">verfasserin</subfield><subfield code="4">aut</subfield></datafield><datafield tag="245" ind1="1" ind2="0"><subfield code="a">Investigation of Acoustic and Visual Features for Frog Call Classification</subfield></datafield><datafield tag="264" ind1=" " ind2="1"><subfield code="c">2019</subfield></datafield><datafield tag="336" ind1=" " ind2=" "><subfield code="a">Text</subfield><subfield code="b">txt</subfield><subfield code="2">rdacontent</subfield></datafield><datafield tag="337" ind1=" " ind2=" "><subfield code="a">Computermedien</subfield><subfield code="b">c</subfield><subfield code="2">rdamedia</subfield></datafield><datafield tag="338" ind1=" " ind2=" "><subfield code="a">Online-Ressource</subfield><subfield code="b">cr</subfield><subfield code="2">rdacarrier</subfield></datafield><datafield tag="520" ind1=" " ind2=" "><subfield code="a">Abstract Rapid decreases in frog populations have been spotted worldwide, which are regarded as one of the most critical threats to the global biodiversity. Recent advances in acoustic sensors provide a novel way to assess frog vocalizations and further optimize the global protection policy. Specifically, frog populations can be reflected by detecting frog species using collected recordings. Previous studies have explored various acoustic features for classifying frog calls. However, few studies investigate visual features for frog call classification, which have been successfully used in acoustic event detection, speech/speaker recognition. In this study, various acoustic and visual features are proposed for frog call classification: MPEG-7 audio descriptor, syllable duration, oscillation rate, entropy related features, linear prediction codings, Mel-frequency Cepstral coefficients, local binary patterns, and histogram of oriented gradients. After segmenting continuous frog calls into individual syllables, different constructed feature sets are evaluated with a k-nearest neighbor classifier and support vector machines. Comprehensive results on 16 frog species demonstrate the effectiveness of both acoustic and visual features for classifying frog calls.</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Bioacoustics</subfield><subfield code="7">(dpeaa)DE-He213</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Ecoacoustics</subfield><subfield code="7">(dpeaa)DE-He213</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Acoustic feature extraction</subfield><subfield code="7">(dpeaa)DE-He213</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Visual feature extraction</subfield><subfield code="7">(dpeaa)DE-He213</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Frog call classification</subfield><subfield code="7">(dpeaa)DE-He213</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">Towsey, Michael</subfield><subfield code="e">verfasserin</subfield><subfield code="4">aut</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">Zhang, Jinglan</subfield><subfield code="e">verfasserin</subfield><subfield code="4">aut</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">Roe, Paul</subfield><subfield code="e">verfasserin</subfield><subfield code="4">aut</subfield></datafield><datafield tag="773" ind1="0" ind2="8"><subfield code="i">Enthalten in</subfield><subfield code="t">Journal of VLSI signal processing systems for signal, image and video technology</subfield><subfield code="d">Springer Netherlands, 1989</subfield><subfield code="g">92(2019), 1 vom: 26. Feb., Seite 23-36</subfield><subfield code="w">(DE-627)SPR018308090</subfield><subfield code="7">nnns</subfield></datafield><datafield tag="773" ind1="1" ind2="8"><subfield code="g">volume:92</subfield><subfield code="g">year:2019</subfield><subfield code="g">number:1</subfield><subfield code="g">day:26</subfield><subfield code="g">month:02</subfield><subfield code="g">pages:23-36</subfield></datafield><datafield tag="856" ind1="4" ind2="0"><subfield code="u">https://dx.doi.org/10.1007/s11265-019-1445-4</subfield><subfield code="z">lizenzpflichtig</subfield><subfield code="3">Volltext</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_USEFLAG_A</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">SYSFLAG_A</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_SPRINGER</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_ILN_40</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_ILN_2006</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_ILN_2027</subfield></datafield><datafield tag="951" ind1=" " ind2=" "><subfield code="a">AR</subfield></datafield><datafield tag="952" ind1=" " ind2=" "><subfield code="d">92</subfield><subfield code="j">2019</subfield><subfield code="e">1</subfield><subfield code="b">26</subfield><subfield code="c">02</subfield><subfield code="h">23-36</subfield></datafield></record></collection>
|
score |
7.400193 |