Publication | Closed Access
Quality Control of Large Argo Datasets
117
Citations
12
References
2008
Year
Environmental MonitoringObservational NetworksEngineeringOceanographyEarth ScienceOcean MonitoringAbstract Argo FloatsMarine MeteorologyData ScienceData MiningCalibrationCoriolis Data CenterData IntegrationData ManagementOceanic SystemsOcean InstrumentationMarine GeologyData ModelingData OptimizationOcean TechnologyGeographyKnowledge DiscoveryData QualityDelayed Mode DatasetQuality ControlClimate DynamicsPhysical OceanographyMassive Data ProcessingBig Data
Argo floats have greatly enhanced global ocean interior observations, yet the expanding database now demands efficient quality‑control tools. This study demonstrates how classical optimal analysis can validate very large Argo datasets before operational or scientific use. Using the Coriolis data‑center analysis system, the authors examined weekly temperature and salinity fields, evaluated sensor‑error impacts, and performed twin experiments—including a pressure‑labeling test—to assess the system’s capacity to detect anomalies. Residuals extracted about two‑thirds of sensor errors, the method flagged 15 % of 482 Atlantic floats for salinity drift or offset, confirmed overall consistency in a delayed‑mode dataset of 120 floats, and successfully identified deficient instruments in a 36‑float 2003‑06 sample.
Abstract Argo floats have significantly improved the observation of the global ocean interior, but as the size of the database increases, so does the need for efficient tools to perform reliable quality control. It is shown here how the classical method of optimal analysis can be used to validate very large datasets before operational or scientific use. The analysis system employed is the one implemented at the Coriolis data center to produce the weekly fields of temperature and salinity, and the key data are the analysis residuals. The impacts of the various sensor errors are evaluated and twin experiments are performed to measure the system capacity in identifying these errors. It appears that for a typical data distribution, the analysis residuals extract 2/3 of the sensor error after a single analysis. The method has been applied on the full Argo Atlantic real-time dataset for the 2000–04 period (482 floats) and 15% of the floats were detected as having salinity drifts or offset. A second test was performed on the delayed mode dataset (120 floats) to check the overall consistency, and except for a few isolated anomalous profiles, the corrected datasets were found to be globally good. The last experiment performed on the Coriolis real-time products takes into account the recently discovered problem in the pressure labeling. For this experiment, a sample of 36 floats, mixing well-behaved and anomalous instruments of the 2003–06 period, was considered and the simple test designed to detect the most common systematic anomalies successfully identified the deficient floats.
| Year | Citations | |
|---|---|---|
Page 1
Page 1