GPU-Based Static Data-Flow Analysis for Fast and Scalable Android App Vetting

Yu, Xiaodong; Wei, Fengguo; Ou, Xinming; Becchi, Michela; Bicer, Tekin; Yao, Danfeng

doi:10.1109/IPDPS47924.2020.00037

GPU-Based Static Data-Flow Analysis for Fast and Scalable Android App Vetting

Conference · Wed Jan 01 04:00:00 EST 2020

DOI:https://doi.org/10.1109/IPDPS47924.2020.00037· OSTI ID:1804071

Yu, Xiaodong; Wei, Fengguo; Ou, Xinming; Becchi, Michela; Bicer, Tekin; Yao, Danfeng

Many popular vetting tools for Android applications use static code analysis techniques. In particular, Interprocedural Data-Flow Graph (IDFG) construction is the computation at the core of Android static data-flow analysis and consumes most of the analysis time. Many analysis tools use a worklist algorithm, an iterative fixed-point approach, to construct the IDFG. In this paper, we observe that a straightforward GPU parallelization of the worklist algorithm leads to significant underutilization of the GPU resources. We identify four performance bottlenecks, namely, frequent dynamic memory allocations, high branch divergence, workload imbalance, and irregular memory access patterns. Accordingly, we propose GDroid, a GPU-based worklist algorithm implementation with multiple fine-grained optimizations tailored to common characteristics of Android applications. The optimizations considered are: matrix-based data structure, memory access-based node grouping, and worklist merging. Our experimental evaluation, performed on 1000 Android applications, shows that the proposed optimizations are beneficial to performance, and GDroid can achieve up to 128X speedups against a plain GPU implementation.

View Conference

Research Organization:: Argonne National Laboratory (ANL)

Sponsoring Organization:: National Science Foundation (NSF); USDOE Office of Science - Office of Advanced Scientific Computing Research (ASCR); USDOE Office of Science - Office of Basic Energy Sciences

DOE Contract Number:: AC02-06CH11357

OSTI ID:: 1804071

Country of Publication:: United States

Language:: English

References (39)

A GPU implementation of inclusion-based points-to analysis Mendez-Lojo, Mario; Burtscher, Martin; Pingali, Keshav Proceedings of the 17th ACM SIGPLAN symposium on Principles and Practice of Parallel Programming - PPoPP '12 https://doi.org/10.1145/2145816.2145831	conference	January 2012
GPU-Based Iterative Medical CT Image Reconstructions Yu, Xiaodong; Wang, Hao; Feng, Wu-chun Journal of Signal Processing Systems, Vol. 91, Issue 3-4 https://doi.org/10.1007/s11265-018-1352-0	journal	March 2018
Algorithmic Techniques for Solving Graph Problems on the Automata Processor Roy, Indranil; Jammula, Nagakishore; Aluru, Srinivas 2016 IEEE International Parallel and Distributed Processing Symposium (IPDPS) https://doi.org/10.1109/IPDPS.2016.116	conference	May 2016
SEP-graph: finding shortest execution paths for graph processing under a hybrid framework on GPU Wang, Hao; Geng, Liang; Lee, Rubao PPoPP '19: 24th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, Proceedings of the 24th Symposium on Principles and Practice of Parallel Programming https://doi.org/10.1145/3293883.3295733	conference	February 2019
Robotomata: A framework for approximate pattern matching of big data on an automata processor Yu, Xiaodong; Hou, Kaixi; Wang, Hao 2017 IEEE International Conference on Big Data (Big Data) https://doi.org/10.1109/BigData.2017.8257936	conference	December 2017
O3FA: A Scalable Finite Automata-based Pattern-Matching Engine for Out-of-Order Deep Packet Inspection Yu, Xiaodong; Feng, Wu-chun; Yao, Danfeng (Daphne) ANCS '16: Symposium on Architectures for Networking and Communications Systems, Proceedings of the 2016 Symposium on Architectures for Networking and Communications Systems https://doi.org/10.1145/2881025.2881034	conference	March 2016
AAlign: A SIMD Framework for Pairwise Sequence Alignment on x86-Based Multi-and Many-Core Processors Hou, Kaixi; Wang, Hao; Feng, Wu-Chun 2016 IEEE International Parallel and Distributed Processing Symposium (IPDPS) https://doi.org/10.1109/IPDPS.2016.115	conference	May 2016
Accelerating CUDA graph algorithms at maximum warp Hong, Sungpack; Kim, Sang Kyun; Oguntebi, Tayo Proceedings of the 16th ACM symposium on Principles and practice of parallel programming - PPoPP '11 https://doi.org/10.1145/1941553.1941590	conference	January 2011
Precise interprocedural dataflow analysis with applications to constant propagation Sagiv, Mooly; Reps, Thomas; Horwitz, Susan Theoretical Computer Science, Vol. 167, Issue 1-2 https://doi.org/10.1016/0304-3975(96)00072-2	journal	January 1996
Inter-procedural data-flow analysis with IFDS/IDE and Soot Bodden, Eric Proceedings of the ACM SIGPLAN International Workshop on State of the Art in Java Program analysis - SOAP '12 https://doi.org/10.1145/2259051.2259052	conference	January 2012
Information-Flow Analysis of Android Applications in DroidSafe Gordon, Michael I.; Kim, Deokhwan; Perkins, Jeff Proceedings 2015 Network and Distributed System Security Symposium https://doi.org/10.14722/ndss.2015.23089	conference	January 2015
Enterprise: breadth-first graph traversal on GPUs Liu, Hang; Huang, H. Howie SC15: The International Conference for High Performance Computing, Networking, Storage and Analysis, Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis https://doi.org/10.1145/2807591.2807594	conference	November 2015
Revisiting State Blow-Up: Automatically Building Augmented-FA While Preserving Functional Equivalence Yu, Xiaodong; Lin, Bill; Becchi, Michela IEEE Journal on Selected Areas in Communications, Vol. 32, Issue 10 https://doi.org/10.1109/JSAC.2014.2358840	journal	October 2014
cuBLASTP: Fine-Grained Parallelization of Protein Sequence Search on CPU+GPU Zhang, Jing; Wang, Hao; Feng, Wu-chun IEEE/ACM Transactions on Computational Biology and Bioinformatics, Vol. 14, Issue 4 https://doi.org/10.1109/TCBB.2015.2489662	journal	July 2017
Demystifying automata processing: GPUs, FPGAs or Micron's AP? Nourian, Marziyeh; Wang, Xiang; Yu, Xiaodong Proceedings of the International Conference on Supercomputing - ICS '17 https://doi.org/10.1145/3079079.3079100	conference	January 2017
Throughput-oriented GPU memory allocation Gelado, Isaac; Garland, Michael PPoPP '19: 24th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, Proceedings of the 24th Symposium on Principles and Practice of Parallel Programming https://doi.org/10.1145/3293883.3295727	conference	February 2019
Precise and compact modular procedure summaries for heap manipulating programs Dillig, Isil; Dillig, Thomas; Aiken, Alex ACM SIGPLAN Notices, Vol. 46, Issue 6 https://doi.org/10.1145/1993316.1993565	journal	June 2011
Scalable GPU graph traversal Merrill, Duane; Garland, Michael; Grimshaw, Andrew Proceedings of the 17th ACM SIGPLAN symposium on Principles and Practice of Parallel Programming - PPoPP '12 https://doi.org/10.1145/2145816.2145832	conference	January 2012
FlowDroid: precise context, flow, field, object-sensitive and lifecycle-aware taint analysis for Android apps Arzt, Steven; Rasthofer, Siegfried; Fritz, Christian ACM SIGPLAN Notices, Vol. 49, Issue 6 https://doi.org/10.1145/2666356.2594299	journal	June 2014
GPU acceleration of regular expression matching for large datasets: exploring the implementation space Yu, Xiaodong; Becchi, Michela Proceedings of the ACM International Conference on Computing Frontiers - CF '13 https://doi.org/10.1145/2482767.2482791	conference	January 2013
An Enhanced Image Reconstruction Tool for Computed Tomography on GPUs Yu, Xiaodong; Wang, Hao; Feng, Wu-chun CF '17: Computing Frontiers Conference, Proceedings of the Computing Frontiers Conference https://doi.org/10.1145/3075564.3078889	conference	May 2017
Soot: a Java bytecode optimization framework Vallée-Rai, Raja; Co, Phong; Gagnon, Etienne CASCON First Decade High Impact Papers on - CASCON '10 https://doi.org/10.1145/1925805.1925818	conference	January 2010
Do Android taint analysis tools keep their promises? Pauck, Felix; Bodden, Eric; Wehrheim, Heike ESEC/FSE '18: 26th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, Proceedings of the 2018 26th ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering https://doi.org/10.1145/3236024.3236029	conference	October 2018
cuART: Fine-Grained Algebraic Reconstruction Technique for Computed Tomography Images on GPUs Yu, Xiaodong; Wang, Hao; Feng, Wu-Chun 2016 16th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid) https://doi.org/10.1109/CCGrid.2016.96	conference	May 2016
EigenCFA: accelerating flow analysis with GPUs Prabhu, Tarun; Ramalingam, Shreyas; Might, Matthew Proceedings of the 38th annual ACM SIGPLAN-SIGACT symposium on Principles of programming languages - POPL '11 https://doi.org/10.1145/1926385.1926445	conference	January 2011
Optimized Non-contiguous MPI Datatype Communication for GPU Clusters: Design, Implementation and Evaluation with MVAPICH2 Wang, Hao; Potluri, Sreeram; Luo, Miao 2011 IEEE International Conference on Cluster Computing (CLUSTER) https://doi.org/10.1109/CLUSTER.2011.42	conference	September 2011
JN-SAF: Precise and Efficient NDK/JNI-aware Inter-language Static Analysis Framework for Security Vetting of Android Applications with Native Code Wei, Fengguo; Lin, Xingwei; Ou, Xinming CCS '18: 2018 ACM SIGSAC Conference on Computer and Communications Security, Proceedings of the 2018 ACM SIGSAC Conference on Computer and Communications Security https://doi.org/10.1145/3243734.3243835	conference	October 2018
Analysis of Code Heterogeneity for High-Precision Classification of Repackaged Malware Tian, Ke; Yao, Danfeng; Ryder, Barbara G. 2016 IEEE Security and Privacy Workshops (SPW) https://doi.org/10.1109/SPW.2016.33	conference	May 2016
Amandroid: A Precise and General Inter-component Data Flow Analysis Framework for Security Vetting of Android Apps Wei, Fengguo; Roy, Sankardas; Ou, Xinming CCS'14: 2014 ACM SIGSAC Conference on Computer and Communications Security, Proceedings of the 2014 ACM SIGSAC Conference on Computer and Communications Security https://doi.org/10.1145/2660267.2660357	conference	November 2014
An Efficient GPU Implementation of Inclusion-Based Pointer Analysis Su, Yu; Ye, Ding; Xue, Jingling IEEE Transactions on Parallel and Distributed Systems, Vol. 27, Issue 2 https://doi.org/10.1109/TPDS.2015.2397933	journal	February 2016
GPU-Aware MPI on RDMA-Enabled Clusters: Design, Implementation and Evaluation Wang, Hao; Potluri, Sreeram; Bureddy, Devendar IEEE Transactions on Parallel and Distributed Systems, Vol. 25, Issue 10 https://doi.org/10.1109/TPDS.2013.222	journal	October 2014
Implementation techniques for efficient data-flow analysis of large programs Atkinson, D. C.; Griswold, W. G. Proceedings IEEE International Conference on Software Maintenance. ICSM 2001 https://doi.org/10.1109/ICSM.2001.972711	conference	January 2001
A generic worklist algorithm for graph reachability problems in program analysis Rayside, D.; Kontogiannis, K. Proceedings of the Sixth European Conference on Software Maintenance and Reengineering https://doi.org/10.1109/CSMR.2002.995791	conference	January 2002
BFS-4K: An Efficient Implementation of BFS for Kepler GPU Architectures Busato, Federico; Bombieri, Nicola IEEE Transactions on Parallel and Distributed Systems, Vol. 26, Issue 7 https://doi.org/10.1109/TPDS.2014.2330597	journal	July 2015
Collusive Data Leak and More: Large-scale Threat Analysis of Inter-app Communications Bosu, Amiangshu; Liu, Fang; Yao, Danfeng (Daphne) ASIA CCS '17: ACM Asia Conference on Computer and Communications Security, Proceedings of the 2017 ACM on Asia Conference on Computer and Communications Security https://doi.org/10.1145/3052973.3053004	conference	April 2017
Precise interprocedural dataflow analysis via graph reachability Reps, Thomas; Horwitz, Susan; Sagiv, Mooly Proceedings of the 22nd ACM SIGPLAN-SIGACT symposium on Principles of programming languages - POPL '95 https://doi.org/10.1145/199448.199462	conference	January 1995
Profiling user-trigger dependence for Android malware detection Elish, Karim O.; Shu, Xiaokui; Yao, Danfeng (Daphne) Computers & Security, Vol. 49 https://doi.org/10.1016/j.cose.2014.11.001	journal	March 2015
Composite Constant Propagation: Application to Android Inter-Component Communication Analysis Octeau, Damien; Luchaup, Daniel; Dering, Matthew 2015 IEEE/ACM 37th IEEE International Conference on Software Engineering (ICSE) https://doi.org/10.1109/ICSE.2015.30	conference	May 2015
Fast and Efficient Graph Traversal Algorithm for CPUs: Maximizing Single-Node Efficiency Chhugani, Jatin; Satish, Nadathur; Kim, Changkyu 2012 IEEE International Symposium on Parallel & Distributed Processing (IPDPS), 2012 IEEE 26th International Parallel and Distributed Processing Symposium https://doi.org/10.1109/IPDPS.2012.43	conference	May 2012

Similar Records

Population Count on Intel® CPU, GPU, and FPGA

Conference · Tue Dec 31 23:00:00 EST 2019 · OSTI ID:1804082

Highly Efficient Compensation-Based Parallelism for Wavefront Loops on GPUs

Conference · Tue May 01 00:00:00 EDT 2018 · OSTI ID:1474547

Efficient Parallelization of Irregular Applications on GPU Architectures

Thesis/Dissertation · Sun Dec 31 23:00:00 EST 2023 · OSTI ID:2349242

Related Subjects

Android security
GPU
application-specific optimization
data-flow analysis
mobile application vetting
static program analysis
worklist algorithm

GPU-Based Static Data-Flow Analysis for Fast and Scalable Android App Vetting

Citation Formats

References (39)

Similar Records

Related Subjects