Unknown

Dataset Information

0

A Data Quality Control Method for Seafloor Observatories: The Application of Observed Time Series Data in the East China Sea.


ABSTRACT: With the construction and deployment of seafloor observatories around the world, massive amounts of oceanographic measurement data were gathered and transmitted to data centers. The increase in the amount of observed data not only provides support for marine scientific research but also raises the requirements for data quality control, as scientists must ensure that their research outcomes come from high-quality data. In this paper, we first analyzed and defined data quality problems occurring in the East China Sea Seafloor Observatory System (ECSSOS). We then proposed a method to detect and repair the data quality problems of seafloor observatories. Incorporating data statistics and expert knowledge from domain specialists, the proposed method consists of three parts: a general pretest to preprocess data and provide a router for further processing, data outlier detection methods to label suspect data points, and a data interpolation method to fill up missing and suspect data. The autoregressive integrated moving average (ARIMA) model was improved and applied to seafloor observatory data quality control by using a sliding window and cleaning the input modeling data. Furthermore, a quality control flag system was also proposed and applied to describe data quality control results and processing procedure information. The real observed data in ECSSOS were used to implement and test the proposed method. The results demonstrated that the proposed method performed effectively at detecting and repairing data quality problems for seafloor observatory data.

SUBMITTER: Zhou Y 

PROVIDER: S-EPMC6111880 | biostudies-other | 2018 Aug

REPOSITORIES: biostudies-other

altmetric image

Publications

A Data Quality Control Method for Seafloor Observatories: The Application of Observed Time Series Data in the East China Sea.

Zhou Yusheng Y   Qin Rufu R   Xu Huiping H   Sadiq Shazia S   Yu Yang Y  

Sensors (Basel, Switzerland) 20180810 8


With the construction and deployment of seafloor observatories around the world, massive amounts of oceanographic measurement data were gathered and transmitted to data centers. The increase in the amount of observed data not only provides support for marine scientific research but also raises the requirements for data quality control, as scientists must ensure that their research outcomes come from high-quality data. In this paper, we first analyzed and defined data quality problems occurring i  ...[more]

Similar Datasets

| PRJNA418995 | ENA
| PRJEB78949 | ENA
| S-EPMC6449353 | biostudies-literature
| PRJNA682696 | ENA
| S-EPMC8004995 | biostudies-literature
| S-EPMC8752013 | biostudies-literature
| S-EPMC5113969 | biostudies-literature
| S-EPMC5428710 | biostudies-literature
| S-EPMC4917542 | biostudies-literature
| S-EPMC6684575 | biostudies-literature