Dataset Information

The data dimensionality reduction and bad data detection in the process of smart grid reconstruction through machine learning.

ABSTRACT: To detect false data injection attacks (FDIAs) in power grid reconstruction and solve the problem of high data dimension and bad abnormal data processing in the power system, thereby achieving safe and stable operation of the power grid system, this study introduces machine learning methods to explore the detection of FDIAs. First, through the utilization of the standard IEEE node system and the simulation of FDIAs under the condition of non-complete topology information, the construction of the attack data set is completed, and the MatPower tool is applied to simulate and analyze the data set. Second, based on the isolated Forest (iForest) abnormal score data processing algorithm combined with the Local Linear Embedding (LLE) data dimensionality reduction method, an algorithm for data feature extraction is constructed. Finally, based on the combination of the Convolutional Neural Network (CNN) and the Gated Recurrent Unit (GRU) network, an algorithm model for FDIAs detection is constructed. The results show that in the IEEE14-bus node and IEEE118-bus node systems, the overall distribution of the state estimated before and after the attack vector injection is consistent with the initial value. In the iFores algorithm, the number of iTree and the number of samples affect the extraction of abnormal score data. When the number of iTree n is determined to be 100, and the corresponding number of samples w is determined to be 10, the algorithm has the best detection effect. The FDIAs detection algorithm model based on CNN-GRU shows good detection effects under high attack intensity, with an accuracy rate of more than 95%, and its performance is better than other traditional detection algorithms. In this study, the bad data detection model based on deep learning has an active role in the realization of the safe and stable operation of the smart grid.

SUBMITTER: Yu B

PROVIDER: S-EPMC7540890 | biostudies-literature | 2020

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

The data dimensionality reduction and bad data detection in the process of smart grid reconstruction through machine learning.

Yu Bo B Wang Zheng Z Liu Shangke S Liu Xiaomin X Gou Ruixin R

PloS one 20201007 10

To detect false data injection attacks (FDIAs) in power grid reconstruction and solve the problem of high data dimension and bad abnormal data processing in the power system, thereby achieving safe and stable operation of the power grid system, this study introduces machine learning methods to explore the detection of FDIAs. First, through the utilization of the standard IEEE node system and the simulation of FDIAs under the condition of non-complete topology information, the construction of the ...[more]

PMID: 33027298

Similar Datasets

Project description:Network attacks pose a significant challenge for smart grid networks, mainly due to the existence of several multi-directional communication devices coupling consumers to the grid. One of the network attacks that can affect the smart grid is the distributed denial of service (DDoS), where numerous compromised communication devices/nodes of the grid flood the smart grid network with false data and requests, leading to disruptions in smart meters, data servers, and the state estimator, ultimately effecting the services for end-users. Machine learning-based strategies show distinctive benefits in resolving the challenge of securing the network from DDoS attacks. Regardless, a notable hindrance in deploying machine learning-based techniques is the requirement of model retraining whenever new attack classes arise. Practically, disrupting the normal operations of smart grid is really discouraged. To handle this challenge effectively and detect DDoS attacks without major disruptions, we propose the deployment of reconstructive deep learning techniques. A primary benefit of our proposed technique is the minimum disruption during the introduction of a new attack class, even after complete deployment. We trained several deep and shallow reconstructive models to get representations for each attack type separately, and we performed attack detection by class-specific reconstruction error-based classification. Our technique experienced rigid evaluation via multiple experiments using two well-acknowledged standard databases exclusively for DDoS attacks, including their subsets. Later, we performed a comparative estimation of our outcomes against six methods prevalent within the same domain. Our outcomes reveal that our technique attained higher accuracy, and notably eliminates the requirement of a complete model retraining in the event of the introduction of new attack classes. This method will not only boost the security of smart grid networks but also ensure the stability and reliability of normal operations, protecting the critical infrastructure from ever-evolving network attacks. As smart grid is advancing rapidly, our approach proposes a robust and adaptive way to overcome the continuous challenges posed by network attacks.

Project description:BackgroundIt is important to be able to predict, for each individual patient, the likelihood of later metastatic occurrence, because the prediction can guide treatment plans tailored to a specific patient to prevent metastasis and to help avoid under-treatment or over-treatment. Deep neural network (DNN) learning, commonly referred to as deep learning, has become popular due to its success in image detection and prediction, but questions such as whether deep learning outperforms other machine learning methods when using non-image clinical data remain unanswered. Grid search has been introduced to deep learning hyperparameter tuning for the purpose of improving its prediction performance, but the effect of grid search on other machine learning methods are under-studied. In this research, we take the empirical approach to study the performance of deep learning and other machine learning methods when using non-image clinical data to predict the occurrence of breast cancer metastasis (BCM) 5, 10, or 15 years after the initial treatment. We developed prediction models using the deep feedforward neural network (DFNN) methods, as well as models using nine other machine learning methods, including naïve Bayes (NB), logistic regression (LR), support vector machine (SVM), LASSO, decision tree (DT), k-nearest neighbor (KNN), random forest (RF), AdaBoost (ADB), and XGBoost (XGB). We used grid search to tune hyperparameters for all methods. We then compared our feedforward deep learning models to the models trained using the nine other machine learning methods.ResultsBased on the mean test AUC (Area under the ROC Curve) results, DFNN ranks 6th, 4th, and 3rd when predicting 5-year, 10-year, and 15-year BCM, respectively, out of 10 methods. The top performing methods in predicting 5-year BCM are XGB (1st), RF (2nd), and KNN (3rd). For predicting 10-year BCM, the top performers are XGB (1st), RF (2nd), and NB (3rd). Finally, for 15-year BCM, the top performers are SVM (1st), LR and LASSO (tied for 2nd), and DFNN (3rd). The ensemble methods RF and XGB outperform other methods when data are less balanced, while SVM, LR, LASSO, and DFNN outperform other methods when data are more balanced. Our statistical testing results show that at a significance level of 0.05, DFNN overall performs comparably to other machine learning methods when predicting 5-year, 10-year, and 15-year BCM.ConclusionsOur results show that deep learning with grid search overall performs at least as well as other machine learning methods when using non-image clinical data. It is interesting to note that some of the other machine learning methods, such as XGB, RF, and SVM, are very strong competitors of DFNN when incorporating grid search. It is also worth noting that the computation time required to do grid search with DFNN is much more than that required to do grid search with the other nine machine learning methods.

Dataset Information

The data dimensionality reduction and bad data detection in the process of smart grid reconstruction through machine learning.

Publications

The data dimensionality reduction and bad data detection in the process of smart grid reconstruction through machine learning.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets