Unknown

Dataset Information

0

Attention based GRU-LSTM for software defect prediction.


ABSTRACT: Software defect prediction (SDP) can be used to produce reliable, high-quality software. The current SDP is practiced on program granular components (such as file level, class level, or function level), which cannot accurately predict failures. To solve this problem, we propose a new framework called DP-AGL, which uses attention-based GRU-LSTM for statement-level defect prediction. By using clang to build an abstract syntax tree (AST), we define a set of 32 statement-level metrics. We label each statement, then make a three-dimensional vector and apply it as an automatic learning model, and then use a gated recurrent unit (GRU) with a long short-term memory (LSTM). In addition, the Attention mechanism is used to generate important features and improve accuracy. To verify our experiments, we selected 119,989 C/C++ programs in Code4Bench. The benchmark tests cover various programs and variant sets written by thousands of programmers. As an evaluation standard, compared with the state evaluation method, the recall, precision, accuracy and F1 measurement of our well-trained DP-AGL under normal conditions have increased by 1%, 4%, 5%, and 2% respectively.

SUBMITTER: Munir HS 

PROVIDER: S-EPMC7932164 | biostudies-literature | 2021

REPOSITORIES: biostudies-literature

altmetric image

Publications

Attention based GRU-LSTM for software defect prediction.

Munir Hafiz Shahbaz HS   Ren Shengbing S   Mustafa Mubashar M   Siddique Chaudry Naeem CN   Qayyum Shazib S  

PloS one 20210304 3


Software defect prediction (SDP) can be used to produce reliable, high-quality software. The current SDP is practiced on program granular components (such as file level, class level, or function level), which cannot accurately predict failures. To solve this problem, we propose a new framework called DP-AGL, which uses attention-based GRU-LSTM for statement-level defect prediction. By using clang to build an abstract syntax tree (AST), we define a set of 32 statement-level metrics. We label each  ...[more]

Similar Datasets

| S-EPMC10513229 | biostudies-literature
| S-EPMC9299279 | biostudies-literature
| S-EPMC7437542 | biostudies-literature
| S-EPMC8627227 | biostudies-literature
| S-EPMC10797904 | biostudies-literature
| S-EPMC8736265 | biostudies-literature
| S-EPMC9455287 | biostudies-literature
| S-EPMC9990937 | biostudies-literature
| S-EPMC6882172 | biostudies-literature
| S-EPMC9313278 | biostudies-literature