SimAndro: an effective method to compute similarity of Android applications
Abstract As the number of Android applications (apps) is increasing dramatically, users face a serious problem to find relevant apps to their needs. Therefore, there is an important demand for app search engines or recommendation services where developing an accurate similarity method is a challengi...
Ausführliche Beschreibung
Autor*in: |
Hamednai, Masoud Reyhani [verfasserIn] Kim, Gyoosik [verfasserIn] Cho, Seong-je [verfasserIn] |
---|
Format: |
E-Artikel |
---|---|
Sprache: |
Englisch |
Erschienen: |
2019 |
---|
Schlagwörter: |
---|
Übergeordnetes Werk: |
Enthalten in: Soft Computing - Springer-Verlag, 2003, 23(2019), 17 vom: 12. Jan., Seite 7569-7590 |
---|---|
Übergeordnetes Werk: |
volume:23 ; year:2019 ; number:17 ; day:12 ; month:01 ; pages:7569-7590 |
Links: |
---|
DOI / URN: |
10.1007/s00500-019-03755-4 |
---|
Katalog-ID: |
SPR006506291 |
---|
LEADER | 01000caa a22002652 4500 | ||
---|---|---|---|
001 | SPR006506291 | ||
003 | DE-627 | ||
005 | 20201124002909.0 | ||
007 | cr uuu---uuuuu | ||
008 | 201005s2019 xx |||||o 00| ||eng c | ||
024 | 7 | |a 10.1007/s00500-019-03755-4 |2 doi | |
035 | |a (DE-627)SPR006506291 | ||
035 | |a (SPR)s00500-019-03755-4-e | ||
040 | |a DE-627 |b ger |c DE-627 |e rakwb | ||
041 | |a eng | ||
100 | 1 | |a Hamednai, Masoud Reyhani |e verfasserin |4 aut | |
245 | 1 | 0 | |a SimAndro: an effective method to compute similarity of Android applications |
264 | 1 | |c 2019 | |
336 | |a Text |b txt |2 rdacontent | ||
337 | |a Computermedien |b c |2 rdamedia | ||
338 | |a Online-Ressource |b cr |2 rdacarrier | ||
520 | |a Abstract As the number of Android applications (apps) is increasing dramatically, users face a serious problem to find relevant apps to their needs. Therefore, there is an important demand for app search engines or recommendation services where developing an accurate similarity method is a challenging issue. Contrary to malware detection, very fewer efforts have been devoted to similarity computation of apps. Furthermore, all the existing methods use the features obtained only from the app stores such as description and rating, which could be inaccurate, varied in different stores, and affected by language barrier; they totally neglect useful information clearly capturing the app’s functionalities and behaviors that can be mined from the apps themselves such as the API calls and manifest information. In this paper, we propose an effective method called SimAndro to compute the similarity of apps, which extracts the features based on the information obtained only from apps themselves and the Android platform without using information obtained from third-party sources such as app stores. SimAndro performs both feature extraction and similarity computation where the API calls, manifest information, package name, and strings are used as features. To compute the similarity score of an app-pair, a separate similarity score is computed based on each feature, and a weighted linear combination of these four scores is regarded as the final similarity score by utilizing an automatic weighting scheme based on TreeRankSVM. The results of extensive experiments with three real-world datasets and a dataset constructed by human experts demonstrate the effectiveness of SimAndro. | ||
650 | 4 | |a Similarity |7 (dpeaa)DE-He213 | |
650 | 4 | |a Android apps |7 (dpeaa)DE-He213 | |
650 | 4 | |a Feature extraction |7 (dpeaa)DE-He213 | |
650 | 4 | |a Automatic weighting |7 (dpeaa)DE-He213 | |
700 | 1 | |a Kim, Gyoosik |e verfasserin |4 aut | |
700 | 1 | |a Cho, Seong-je |e verfasserin |4 aut | |
773 | 0 | 8 | |i Enthalten in |t Soft Computing |d Springer-Verlag, 2003 |g 23(2019), 17 vom: 12. Jan., Seite 7569-7590 |w (DE-627)SPR006469531 |7 nnns |
773 | 1 | 8 | |g volume:23 |g year:2019 |g number:17 |g day:12 |g month:01 |g pages:7569-7590 |
856 | 4 | 0 | |u https://dx.doi.org/10.1007/s00500-019-03755-4 |z lizenzpflichtig |3 Volltext |
912 | |a GBV_USEFLAG_A | ||
912 | |a SYSFLAG_A | ||
912 | |a GBV_SPRINGER | ||
951 | |a AR | ||
952 | |d 23 |j 2019 |e 17 |b 12 |c 01 |h 7569-7590 |
author_variant |
m r h mr mrh g k gk s j c sjc |
---|---|
matchkey_str |
hamednaimasoudreyhanikimgyoosikchoseongj:2019----:iadonfetvmtotcmueiiaiyfn |
hierarchy_sort_str |
2019 |
publishDate |
2019 |
allfields |
10.1007/s00500-019-03755-4 doi (DE-627)SPR006506291 (SPR)s00500-019-03755-4-e DE-627 ger DE-627 rakwb eng Hamednai, Masoud Reyhani verfasserin aut SimAndro: an effective method to compute similarity of Android applications 2019 Text txt rdacontent Computermedien c rdamedia Online-Ressource cr rdacarrier Abstract As the number of Android applications (apps) is increasing dramatically, users face a serious problem to find relevant apps to their needs. Therefore, there is an important demand for app search engines or recommendation services where developing an accurate similarity method is a challenging issue. Contrary to malware detection, very fewer efforts have been devoted to similarity computation of apps. Furthermore, all the existing methods use the features obtained only from the app stores such as description and rating, which could be inaccurate, varied in different stores, and affected by language barrier; they totally neglect useful information clearly capturing the app’s functionalities and behaviors that can be mined from the apps themselves such as the API calls and manifest information. In this paper, we propose an effective method called SimAndro to compute the similarity of apps, which extracts the features based on the information obtained only from apps themselves and the Android platform without using information obtained from third-party sources such as app stores. SimAndro performs both feature extraction and similarity computation where the API calls, manifest information, package name, and strings are used as features. To compute the similarity score of an app-pair, a separate similarity score is computed based on each feature, and a weighted linear combination of these four scores is regarded as the final similarity score by utilizing an automatic weighting scheme based on TreeRankSVM. The results of extensive experiments with three real-world datasets and a dataset constructed by human experts demonstrate the effectiveness of SimAndro. Similarity (dpeaa)DE-He213 Android apps (dpeaa)DE-He213 Feature extraction (dpeaa)DE-He213 Automatic weighting (dpeaa)DE-He213 Kim, Gyoosik verfasserin aut Cho, Seong-je verfasserin aut Enthalten in Soft Computing Springer-Verlag, 2003 23(2019), 17 vom: 12. Jan., Seite 7569-7590 (DE-627)SPR006469531 nnns volume:23 year:2019 number:17 day:12 month:01 pages:7569-7590 https://dx.doi.org/10.1007/s00500-019-03755-4 lizenzpflichtig Volltext GBV_USEFLAG_A SYSFLAG_A GBV_SPRINGER AR 23 2019 17 12 01 7569-7590 |
spelling |
10.1007/s00500-019-03755-4 doi (DE-627)SPR006506291 (SPR)s00500-019-03755-4-e DE-627 ger DE-627 rakwb eng Hamednai, Masoud Reyhani verfasserin aut SimAndro: an effective method to compute similarity of Android applications 2019 Text txt rdacontent Computermedien c rdamedia Online-Ressource cr rdacarrier Abstract As the number of Android applications (apps) is increasing dramatically, users face a serious problem to find relevant apps to their needs. Therefore, there is an important demand for app search engines or recommendation services where developing an accurate similarity method is a challenging issue. Contrary to malware detection, very fewer efforts have been devoted to similarity computation of apps. Furthermore, all the existing methods use the features obtained only from the app stores such as description and rating, which could be inaccurate, varied in different stores, and affected by language barrier; they totally neglect useful information clearly capturing the app’s functionalities and behaviors that can be mined from the apps themselves such as the API calls and manifest information. In this paper, we propose an effective method called SimAndro to compute the similarity of apps, which extracts the features based on the information obtained only from apps themselves and the Android platform without using information obtained from third-party sources such as app stores. SimAndro performs both feature extraction and similarity computation where the API calls, manifest information, package name, and strings are used as features. To compute the similarity score of an app-pair, a separate similarity score is computed based on each feature, and a weighted linear combination of these four scores is regarded as the final similarity score by utilizing an automatic weighting scheme based on TreeRankSVM. The results of extensive experiments with three real-world datasets and a dataset constructed by human experts demonstrate the effectiveness of SimAndro. Similarity (dpeaa)DE-He213 Android apps (dpeaa)DE-He213 Feature extraction (dpeaa)DE-He213 Automatic weighting (dpeaa)DE-He213 Kim, Gyoosik verfasserin aut Cho, Seong-je verfasserin aut Enthalten in Soft Computing Springer-Verlag, 2003 23(2019), 17 vom: 12. Jan., Seite 7569-7590 (DE-627)SPR006469531 nnns volume:23 year:2019 number:17 day:12 month:01 pages:7569-7590 https://dx.doi.org/10.1007/s00500-019-03755-4 lizenzpflichtig Volltext GBV_USEFLAG_A SYSFLAG_A GBV_SPRINGER AR 23 2019 17 12 01 7569-7590 |
allfields_unstemmed |
10.1007/s00500-019-03755-4 doi (DE-627)SPR006506291 (SPR)s00500-019-03755-4-e DE-627 ger DE-627 rakwb eng Hamednai, Masoud Reyhani verfasserin aut SimAndro: an effective method to compute similarity of Android applications 2019 Text txt rdacontent Computermedien c rdamedia Online-Ressource cr rdacarrier Abstract As the number of Android applications (apps) is increasing dramatically, users face a serious problem to find relevant apps to their needs. Therefore, there is an important demand for app search engines or recommendation services where developing an accurate similarity method is a challenging issue. Contrary to malware detection, very fewer efforts have been devoted to similarity computation of apps. Furthermore, all the existing methods use the features obtained only from the app stores such as description and rating, which could be inaccurate, varied in different stores, and affected by language barrier; they totally neglect useful information clearly capturing the app’s functionalities and behaviors that can be mined from the apps themselves such as the API calls and manifest information. In this paper, we propose an effective method called SimAndro to compute the similarity of apps, which extracts the features based on the information obtained only from apps themselves and the Android platform without using information obtained from third-party sources such as app stores. SimAndro performs both feature extraction and similarity computation where the API calls, manifest information, package name, and strings are used as features. To compute the similarity score of an app-pair, a separate similarity score is computed based on each feature, and a weighted linear combination of these four scores is regarded as the final similarity score by utilizing an automatic weighting scheme based on TreeRankSVM. The results of extensive experiments with three real-world datasets and a dataset constructed by human experts demonstrate the effectiveness of SimAndro. Similarity (dpeaa)DE-He213 Android apps (dpeaa)DE-He213 Feature extraction (dpeaa)DE-He213 Automatic weighting (dpeaa)DE-He213 Kim, Gyoosik verfasserin aut Cho, Seong-je verfasserin aut Enthalten in Soft Computing Springer-Verlag, 2003 23(2019), 17 vom: 12. Jan., Seite 7569-7590 (DE-627)SPR006469531 nnns volume:23 year:2019 number:17 day:12 month:01 pages:7569-7590 https://dx.doi.org/10.1007/s00500-019-03755-4 lizenzpflichtig Volltext GBV_USEFLAG_A SYSFLAG_A GBV_SPRINGER AR 23 2019 17 12 01 7569-7590 |
allfieldsGer |
10.1007/s00500-019-03755-4 doi (DE-627)SPR006506291 (SPR)s00500-019-03755-4-e DE-627 ger DE-627 rakwb eng Hamednai, Masoud Reyhani verfasserin aut SimAndro: an effective method to compute similarity of Android applications 2019 Text txt rdacontent Computermedien c rdamedia Online-Ressource cr rdacarrier Abstract As the number of Android applications (apps) is increasing dramatically, users face a serious problem to find relevant apps to their needs. Therefore, there is an important demand for app search engines or recommendation services where developing an accurate similarity method is a challenging issue. Contrary to malware detection, very fewer efforts have been devoted to similarity computation of apps. Furthermore, all the existing methods use the features obtained only from the app stores such as description and rating, which could be inaccurate, varied in different stores, and affected by language barrier; they totally neglect useful information clearly capturing the app’s functionalities and behaviors that can be mined from the apps themselves such as the API calls and manifest information. In this paper, we propose an effective method called SimAndro to compute the similarity of apps, which extracts the features based on the information obtained only from apps themselves and the Android platform without using information obtained from third-party sources such as app stores. SimAndro performs both feature extraction and similarity computation where the API calls, manifest information, package name, and strings are used as features. To compute the similarity score of an app-pair, a separate similarity score is computed based on each feature, and a weighted linear combination of these four scores is regarded as the final similarity score by utilizing an automatic weighting scheme based on TreeRankSVM. The results of extensive experiments with three real-world datasets and a dataset constructed by human experts demonstrate the effectiveness of SimAndro. Similarity (dpeaa)DE-He213 Android apps (dpeaa)DE-He213 Feature extraction (dpeaa)DE-He213 Automatic weighting (dpeaa)DE-He213 Kim, Gyoosik verfasserin aut Cho, Seong-je verfasserin aut Enthalten in Soft Computing Springer-Verlag, 2003 23(2019), 17 vom: 12. Jan., Seite 7569-7590 (DE-627)SPR006469531 nnns volume:23 year:2019 number:17 day:12 month:01 pages:7569-7590 https://dx.doi.org/10.1007/s00500-019-03755-4 lizenzpflichtig Volltext GBV_USEFLAG_A SYSFLAG_A GBV_SPRINGER AR 23 2019 17 12 01 7569-7590 |
allfieldsSound |
10.1007/s00500-019-03755-4 doi (DE-627)SPR006506291 (SPR)s00500-019-03755-4-e DE-627 ger DE-627 rakwb eng Hamednai, Masoud Reyhani verfasserin aut SimAndro: an effective method to compute similarity of Android applications 2019 Text txt rdacontent Computermedien c rdamedia Online-Ressource cr rdacarrier Abstract As the number of Android applications (apps) is increasing dramatically, users face a serious problem to find relevant apps to their needs. Therefore, there is an important demand for app search engines or recommendation services where developing an accurate similarity method is a challenging issue. Contrary to malware detection, very fewer efforts have been devoted to similarity computation of apps. Furthermore, all the existing methods use the features obtained only from the app stores such as description and rating, which could be inaccurate, varied in different stores, and affected by language barrier; they totally neglect useful information clearly capturing the app’s functionalities and behaviors that can be mined from the apps themselves such as the API calls and manifest information. In this paper, we propose an effective method called SimAndro to compute the similarity of apps, which extracts the features based on the information obtained only from apps themselves and the Android platform without using information obtained from third-party sources such as app stores. SimAndro performs both feature extraction and similarity computation where the API calls, manifest information, package name, and strings are used as features. To compute the similarity score of an app-pair, a separate similarity score is computed based on each feature, and a weighted linear combination of these four scores is regarded as the final similarity score by utilizing an automatic weighting scheme based on TreeRankSVM. The results of extensive experiments with three real-world datasets and a dataset constructed by human experts demonstrate the effectiveness of SimAndro. Similarity (dpeaa)DE-He213 Android apps (dpeaa)DE-He213 Feature extraction (dpeaa)DE-He213 Automatic weighting (dpeaa)DE-He213 Kim, Gyoosik verfasserin aut Cho, Seong-je verfasserin aut Enthalten in Soft Computing Springer-Verlag, 2003 23(2019), 17 vom: 12. Jan., Seite 7569-7590 (DE-627)SPR006469531 nnns volume:23 year:2019 number:17 day:12 month:01 pages:7569-7590 https://dx.doi.org/10.1007/s00500-019-03755-4 lizenzpflichtig Volltext GBV_USEFLAG_A SYSFLAG_A GBV_SPRINGER AR 23 2019 17 12 01 7569-7590 |
language |
English |
source |
Enthalten in Soft Computing 23(2019), 17 vom: 12. Jan., Seite 7569-7590 volume:23 year:2019 number:17 day:12 month:01 pages:7569-7590 |
sourceStr |
Enthalten in Soft Computing 23(2019), 17 vom: 12. Jan., Seite 7569-7590 volume:23 year:2019 number:17 day:12 month:01 pages:7569-7590 |
format_phy_str_mv |
Article |
institution |
findex.gbv.de |
topic_facet |
Similarity Android apps Feature extraction Automatic weighting |
isfreeaccess_bool |
false |
container_title |
Soft Computing |
authorswithroles_txt_mv |
Hamednai, Masoud Reyhani @@aut@@ Kim, Gyoosik @@aut@@ Cho, Seong-je @@aut@@ |
publishDateDaySort_date |
2019-01-12T00:00:00Z |
hierarchy_top_id |
SPR006469531 |
id |
SPR006506291 |
language_de |
englisch |
fullrecord |
<?xml version="1.0" encoding="UTF-8"?><collection xmlns="http://www.loc.gov/MARC21/slim"><record><leader>01000caa a22002652 4500</leader><controlfield tag="001">SPR006506291</controlfield><controlfield tag="003">DE-627</controlfield><controlfield tag="005">20201124002909.0</controlfield><controlfield tag="007">cr uuu---uuuuu</controlfield><controlfield tag="008">201005s2019 xx |||||o 00| ||eng c</controlfield><datafield tag="024" ind1="7" ind2=" "><subfield code="a">10.1007/s00500-019-03755-4</subfield><subfield code="2">doi</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(DE-627)SPR006506291</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(SPR)s00500-019-03755-4-e</subfield></datafield><datafield tag="040" ind1=" " ind2=" "><subfield code="a">DE-627</subfield><subfield code="b">ger</subfield><subfield code="c">DE-627</subfield><subfield code="e">rakwb</subfield></datafield><datafield tag="041" ind1=" " ind2=" "><subfield code="a">eng</subfield></datafield><datafield tag="100" ind1="1" ind2=" "><subfield code="a">Hamednai, Masoud Reyhani</subfield><subfield code="e">verfasserin</subfield><subfield code="4">aut</subfield></datafield><datafield tag="245" ind1="1" ind2="0"><subfield code="a">SimAndro: an effective method to compute similarity of Android applications</subfield></datafield><datafield tag="264" ind1=" " ind2="1"><subfield code="c">2019</subfield></datafield><datafield tag="336" ind1=" " ind2=" "><subfield code="a">Text</subfield><subfield code="b">txt</subfield><subfield code="2">rdacontent</subfield></datafield><datafield tag="337" ind1=" " ind2=" "><subfield code="a">Computermedien</subfield><subfield code="b">c</subfield><subfield code="2">rdamedia</subfield></datafield><datafield tag="338" ind1=" " ind2=" "><subfield code="a">Online-Ressource</subfield><subfield code="b">cr</subfield><subfield code="2">rdacarrier</subfield></datafield><datafield tag="520" ind1=" " ind2=" "><subfield code="a">Abstract As the number of Android applications (apps) is increasing dramatically, users face a serious problem to find relevant apps to their needs. Therefore, there is an important demand for app search engines or recommendation services where developing an accurate similarity method is a challenging issue. Contrary to malware detection, very fewer efforts have been devoted to similarity computation of apps. Furthermore, all the existing methods use the features obtained only from the app stores such as description and rating, which could be inaccurate, varied in different stores, and affected by language barrier; they totally neglect useful information clearly capturing the app’s functionalities and behaviors that can be mined from the apps themselves such as the API calls and manifest information. In this paper, we propose an effective method called SimAndro to compute the similarity of apps, which extracts the features based on the information obtained only from apps themselves and the Android platform without using information obtained from third-party sources such as app stores. SimAndro performs both feature extraction and similarity computation where the API calls, manifest information, package name, and strings are used as features. To compute the similarity score of an app-pair, a separate similarity score is computed based on each feature, and a weighted linear combination of these four scores is regarded as the final similarity score by utilizing an automatic weighting scheme based on TreeRankSVM. The results of extensive experiments with three real-world datasets and a dataset constructed by human experts demonstrate the effectiveness of SimAndro.</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Similarity</subfield><subfield code="7">(dpeaa)DE-He213</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Android apps</subfield><subfield code="7">(dpeaa)DE-He213</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Feature extraction</subfield><subfield code="7">(dpeaa)DE-He213</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Automatic weighting</subfield><subfield code="7">(dpeaa)DE-He213</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">Kim, Gyoosik</subfield><subfield code="e">verfasserin</subfield><subfield code="4">aut</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">Cho, Seong-je</subfield><subfield code="e">verfasserin</subfield><subfield code="4">aut</subfield></datafield><datafield tag="773" ind1="0" ind2="8"><subfield code="i">Enthalten in</subfield><subfield code="t">Soft Computing</subfield><subfield code="d">Springer-Verlag, 2003</subfield><subfield code="g">23(2019), 17 vom: 12. Jan., Seite 7569-7590</subfield><subfield code="w">(DE-627)SPR006469531</subfield><subfield code="7">nnns</subfield></datafield><datafield tag="773" ind1="1" ind2="8"><subfield code="g">volume:23</subfield><subfield code="g">year:2019</subfield><subfield code="g">number:17</subfield><subfield code="g">day:12</subfield><subfield code="g">month:01</subfield><subfield code="g">pages:7569-7590</subfield></datafield><datafield tag="856" ind1="4" ind2="0"><subfield code="u">https://dx.doi.org/10.1007/s00500-019-03755-4</subfield><subfield code="z">lizenzpflichtig</subfield><subfield code="3">Volltext</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_USEFLAG_A</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">SYSFLAG_A</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_SPRINGER</subfield></datafield><datafield tag="951" ind1=" " ind2=" "><subfield code="a">AR</subfield></datafield><datafield tag="952" ind1=" " ind2=" "><subfield code="d">23</subfield><subfield code="j">2019</subfield><subfield code="e">17</subfield><subfield code="b">12</subfield><subfield code="c">01</subfield><subfield code="h">7569-7590</subfield></datafield></record></collection>
|
author |
Hamednai, Masoud Reyhani |
spellingShingle |
Hamednai, Masoud Reyhani misc Similarity misc Android apps misc Feature extraction misc Automatic weighting SimAndro: an effective method to compute similarity of Android applications |
authorStr |
Hamednai, Masoud Reyhani |
ppnlink_with_tag_str_mv |
@@773@@(DE-627)SPR006469531 |
format |
electronic Article |
delete_txt_mv |
keep |
author_role |
aut aut aut |
collection |
springer |
remote_str |
true |
illustrated |
Not Illustrated |
topic_title |
SimAndro: an effective method to compute similarity of Android applications Similarity (dpeaa)DE-He213 Android apps (dpeaa)DE-He213 Feature extraction (dpeaa)DE-He213 Automatic weighting (dpeaa)DE-He213 |
topic |
misc Similarity misc Android apps misc Feature extraction misc Automatic weighting |
topic_unstemmed |
misc Similarity misc Android apps misc Feature extraction misc Automatic weighting |
topic_browse |
misc Similarity misc Android apps misc Feature extraction misc Automatic weighting |
format_facet |
Elektronische Aufsätze Aufsätze Elektronische Ressource |
format_main_str_mv |
Text Zeitschrift/Artikel |
carriertype_str_mv |
cr |
hierarchy_parent_title |
Soft Computing |
hierarchy_parent_id |
SPR006469531 |
hierarchy_top_title |
Soft Computing |
isfreeaccess_txt |
false |
familylinks_str_mv |
(DE-627)SPR006469531 |
title |
SimAndro: an effective method to compute similarity of Android applications |
ctrlnum |
(DE-627)SPR006506291 (SPR)s00500-019-03755-4-e |
title_full |
SimAndro: an effective method to compute similarity of Android applications |
author_sort |
Hamednai, Masoud Reyhani |
journal |
Soft Computing |
journalStr |
Soft Computing |
lang_code |
eng |
isOA_bool |
false |
recordtype |
marc |
publishDateSort |
2019 |
contenttype_str_mv |
txt |
container_start_page |
7569 |
author_browse |
Hamednai, Masoud Reyhani Kim, Gyoosik Cho, Seong-je |
container_volume |
23 |
format_se |
Elektronische Aufsätze |
author-letter |
Hamednai, Masoud Reyhani |
doi_str_mv |
10.1007/s00500-019-03755-4 |
author2-role |
verfasserin |
title_sort |
simandro: an effective method to compute similarity of android applications |
title_auth |
SimAndro: an effective method to compute similarity of Android applications |
abstract |
Abstract As the number of Android applications (apps) is increasing dramatically, users face a serious problem to find relevant apps to their needs. Therefore, there is an important demand for app search engines or recommendation services where developing an accurate similarity method is a challenging issue. Contrary to malware detection, very fewer efforts have been devoted to similarity computation of apps. Furthermore, all the existing methods use the features obtained only from the app stores such as description and rating, which could be inaccurate, varied in different stores, and affected by language barrier; they totally neglect useful information clearly capturing the app’s functionalities and behaviors that can be mined from the apps themselves such as the API calls and manifest information. In this paper, we propose an effective method called SimAndro to compute the similarity of apps, which extracts the features based on the information obtained only from apps themselves and the Android platform without using information obtained from third-party sources such as app stores. SimAndro performs both feature extraction and similarity computation where the API calls, manifest information, package name, and strings are used as features. To compute the similarity score of an app-pair, a separate similarity score is computed based on each feature, and a weighted linear combination of these four scores is regarded as the final similarity score by utilizing an automatic weighting scheme based on TreeRankSVM. The results of extensive experiments with three real-world datasets and a dataset constructed by human experts demonstrate the effectiveness of SimAndro. |
abstractGer |
Abstract As the number of Android applications (apps) is increasing dramatically, users face a serious problem to find relevant apps to their needs. Therefore, there is an important demand for app search engines or recommendation services where developing an accurate similarity method is a challenging issue. Contrary to malware detection, very fewer efforts have been devoted to similarity computation of apps. Furthermore, all the existing methods use the features obtained only from the app stores such as description and rating, which could be inaccurate, varied in different stores, and affected by language barrier; they totally neglect useful information clearly capturing the app’s functionalities and behaviors that can be mined from the apps themselves such as the API calls and manifest information. In this paper, we propose an effective method called SimAndro to compute the similarity of apps, which extracts the features based on the information obtained only from apps themselves and the Android platform without using information obtained from third-party sources such as app stores. SimAndro performs both feature extraction and similarity computation where the API calls, manifest information, package name, and strings are used as features. To compute the similarity score of an app-pair, a separate similarity score is computed based on each feature, and a weighted linear combination of these four scores is regarded as the final similarity score by utilizing an automatic weighting scheme based on TreeRankSVM. The results of extensive experiments with three real-world datasets and a dataset constructed by human experts demonstrate the effectiveness of SimAndro. |
abstract_unstemmed |
Abstract As the number of Android applications (apps) is increasing dramatically, users face a serious problem to find relevant apps to their needs. Therefore, there is an important demand for app search engines or recommendation services where developing an accurate similarity method is a challenging issue. Contrary to malware detection, very fewer efforts have been devoted to similarity computation of apps. Furthermore, all the existing methods use the features obtained only from the app stores such as description and rating, which could be inaccurate, varied in different stores, and affected by language barrier; they totally neglect useful information clearly capturing the app’s functionalities and behaviors that can be mined from the apps themselves such as the API calls and manifest information. In this paper, we propose an effective method called SimAndro to compute the similarity of apps, which extracts the features based on the information obtained only from apps themselves and the Android platform without using information obtained from third-party sources such as app stores. SimAndro performs both feature extraction and similarity computation where the API calls, manifest information, package name, and strings are used as features. To compute the similarity score of an app-pair, a separate similarity score is computed based on each feature, and a weighted linear combination of these four scores is regarded as the final similarity score by utilizing an automatic weighting scheme based on TreeRankSVM. The results of extensive experiments with three real-world datasets and a dataset constructed by human experts demonstrate the effectiveness of SimAndro. |
collection_details |
GBV_USEFLAG_A SYSFLAG_A GBV_SPRINGER |
container_issue |
17 |
title_short |
SimAndro: an effective method to compute similarity of Android applications |
url |
https://dx.doi.org/10.1007/s00500-019-03755-4 |
remote_bool |
true |
author2 |
Kim, Gyoosik Cho, Seong-je |
author2Str |
Kim, Gyoosik Cho, Seong-je |
ppnlink |
SPR006469531 |
mediatype_str_mv |
c |
isOA_txt |
false |
hochschulschrift_bool |
false |
doi_str |
10.1007/s00500-019-03755-4 |
up_date |
2024-07-03T23:19:12.046Z |
_version_ |
1803601840226435072 |
fullrecord_marcxml |
<?xml version="1.0" encoding="UTF-8"?><collection xmlns="http://www.loc.gov/MARC21/slim"><record><leader>01000caa a22002652 4500</leader><controlfield tag="001">SPR006506291</controlfield><controlfield tag="003">DE-627</controlfield><controlfield tag="005">20201124002909.0</controlfield><controlfield tag="007">cr uuu---uuuuu</controlfield><controlfield tag="008">201005s2019 xx |||||o 00| ||eng c</controlfield><datafield tag="024" ind1="7" ind2=" "><subfield code="a">10.1007/s00500-019-03755-4</subfield><subfield code="2">doi</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(DE-627)SPR006506291</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(SPR)s00500-019-03755-4-e</subfield></datafield><datafield tag="040" ind1=" " ind2=" "><subfield code="a">DE-627</subfield><subfield code="b">ger</subfield><subfield code="c">DE-627</subfield><subfield code="e">rakwb</subfield></datafield><datafield tag="041" ind1=" " ind2=" "><subfield code="a">eng</subfield></datafield><datafield tag="100" ind1="1" ind2=" "><subfield code="a">Hamednai, Masoud Reyhani</subfield><subfield code="e">verfasserin</subfield><subfield code="4">aut</subfield></datafield><datafield tag="245" ind1="1" ind2="0"><subfield code="a">SimAndro: an effective method to compute similarity of Android applications</subfield></datafield><datafield tag="264" ind1=" " ind2="1"><subfield code="c">2019</subfield></datafield><datafield tag="336" ind1=" " ind2=" "><subfield code="a">Text</subfield><subfield code="b">txt</subfield><subfield code="2">rdacontent</subfield></datafield><datafield tag="337" ind1=" " ind2=" "><subfield code="a">Computermedien</subfield><subfield code="b">c</subfield><subfield code="2">rdamedia</subfield></datafield><datafield tag="338" ind1=" " ind2=" "><subfield code="a">Online-Ressource</subfield><subfield code="b">cr</subfield><subfield code="2">rdacarrier</subfield></datafield><datafield tag="520" ind1=" " ind2=" "><subfield code="a">Abstract As the number of Android applications (apps) is increasing dramatically, users face a serious problem to find relevant apps to their needs. Therefore, there is an important demand for app search engines or recommendation services where developing an accurate similarity method is a challenging issue. Contrary to malware detection, very fewer efforts have been devoted to similarity computation of apps. Furthermore, all the existing methods use the features obtained only from the app stores such as description and rating, which could be inaccurate, varied in different stores, and affected by language barrier; they totally neglect useful information clearly capturing the app’s functionalities and behaviors that can be mined from the apps themselves such as the API calls and manifest information. In this paper, we propose an effective method called SimAndro to compute the similarity of apps, which extracts the features based on the information obtained only from apps themselves and the Android platform without using information obtained from third-party sources such as app stores. SimAndro performs both feature extraction and similarity computation where the API calls, manifest information, package name, and strings are used as features. To compute the similarity score of an app-pair, a separate similarity score is computed based on each feature, and a weighted linear combination of these four scores is regarded as the final similarity score by utilizing an automatic weighting scheme based on TreeRankSVM. The results of extensive experiments with three real-world datasets and a dataset constructed by human experts demonstrate the effectiveness of SimAndro.</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Similarity</subfield><subfield code="7">(dpeaa)DE-He213</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Android apps</subfield><subfield code="7">(dpeaa)DE-He213</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Feature extraction</subfield><subfield code="7">(dpeaa)DE-He213</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Automatic weighting</subfield><subfield code="7">(dpeaa)DE-He213</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">Kim, Gyoosik</subfield><subfield code="e">verfasserin</subfield><subfield code="4">aut</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">Cho, Seong-je</subfield><subfield code="e">verfasserin</subfield><subfield code="4">aut</subfield></datafield><datafield tag="773" ind1="0" ind2="8"><subfield code="i">Enthalten in</subfield><subfield code="t">Soft Computing</subfield><subfield code="d">Springer-Verlag, 2003</subfield><subfield code="g">23(2019), 17 vom: 12. Jan., Seite 7569-7590</subfield><subfield code="w">(DE-627)SPR006469531</subfield><subfield code="7">nnns</subfield></datafield><datafield tag="773" ind1="1" ind2="8"><subfield code="g">volume:23</subfield><subfield code="g">year:2019</subfield><subfield code="g">number:17</subfield><subfield code="g">day:12</subfield><subfield code="g">month:01</subfield><subfield code="g">pages:7569-7590</subfield></datafield><datafield tag="856" ind1="4" ind2="0"><subfield code="u">https://dx.doi.org/10.1007/s00500-019-03755-4</subfield><subfield code="z">lizenzpflichtig</subfield><subfield code="3">Volltext</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_USEFLAG_A</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">SYSFLAG_A</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">GBV_SPRINGER</subfield></datafield><datafield tag="951" ind1=" " ind2=" "><subfield code="a">AR</subfield></datafield><datafield tag="952" ind1=" " ind2=" "><subfield code="d">23</subfield><subfield code="j">2019</subfield><subfield code="e">17</subfield><subfield code="b">12</subfield><subfield code="c">01</subfield><subfield code="h">7569-7590</subfield></datafield></record></collection>
|
score |
7.400386 |