Skip to content

Checking one-second data #1

@leonro

Description

@leonro

Checking one-second data

1. Data checking principle

A data-check routine for one second data should make use of one-minute (preferably checked data) and one-second step_1 data. Contents of these data sets are compared, and, for this purpose, partly recalculated based on INTERMAGNET and IAGA recommended techniques. A quick test will investigate data from a randomly chosen month and is sufficient to check the principal correctness underlying routines, formats and meta information. A full test will perform a detailed data check for all data. A report with all check details should be obtained and made accessible to the data submitter.

2. Issues to be checked

  • Submitted files and formats: Are all requested files available and are they submitted in correct and readable formats (IAGA-2002, IAF, IMCDF).

  • Meta information: Do all files contain the requested meta information and is this meta information consistent between all different files.

  • Data content: Are the IAF one-minute, one-hour, and daily averages consistent and obtained using IM filter procedures. Are F values provided for one-minute and one-second, and are they independent measures of the field. Are delta F variations within acceptable limits. IMCDF: Is the leap second table up-to-date.

  • Data consistency: Can one-minute data be reproduced by filtering one-second data using IM recommended procedures on filtering and outlier treatment.

  • Baseline variation: Are periodic measurements available. Are residuals and amplitudes in an acceptable range.

  • Eventually: Is the spectral data content as expected for a natural geomagnetic signal or are there any significant technical contributions. Noise level.

3. Step wise performance of the data check

(as performed with MagPy 0.9.6)

  1. directories and existence of files

  2. file names, access and basic header information

  3. data content and consistency of minute data

  4. second data and consistency with minute data (IM recommendations)

  5. basevalues and adopted baseline variation

  6. yearly means, meta information

  7. activity - "not included"

The results of each step are summarized and rated with a grade from 1 to 6. Grades of 5 and above indicate "significant" issues regarding IM recommendations to be considered. A detailed description on how to use MagPy for these tests can be found in the manual (http://cobs.zamg.ac.at/data/index.php/en/downloads/category/1-magnetism).

4. Issues and suggestions

  • Currently one second data is uploaded in two different formats using, however, various different packing routines. Should we suggest a common submission format? For IAGA2002 I would recommend daily zipped IAGA files. For IMCDF monthly files are already requested.

  • threshold for differences between minute data and IM filtered one-second data: how much difference do we tolerate?

5. Recommendations for analysis

  • To avoid downloading all files, one could mount the ftp directories on the local system. However, most files are uploaded as various different archives.
  • Run only quick tests first. Only for data passing quick test, run full test.
  • When using MagPy eventually skip step 5 (baseline) and 6 (meta, yearmean) for quick test, if minute data has been accepted
  • Some issues are easy to be solved, others give indications for underlying procedures.
    Example:
    - if filtered second data differs from minute data only for few days, this indicates that most likely the outlier treatment differs from the IAGA 90% rule
    - if the difference of filtered second data and minute data is evenly distributed with values exceeding the numerical uncertainty then most likely a different filter procedure (window, type) was used
    - if the difference is not evenly distributed and some "function" is observable, then eventually different baseline treatments might be the cause

6. Example analysis:

Please note, the following table contains only a test analysis and is not related to any decision from the data committee. All test are also used as a run-time test for MagPy. Error messages related to certain files structures (e.g. step 5 failure related to BLV files or step 6 related to yearmean files) do not necessary indicate that the file structure is wrong. MagPy eventually just cannot interpret the format correctly yet. With Python3 MagPy >= 0.9.6 is required for the analysis.

2016

Obs Format (second) QuickTest Observed issues QuickTest result FullTest Issues Summary
WIC IMCDF 1,1,1,1,1,3,1 Location and AltitudeDiff second data OK
UPS IMCDF daily 1,5,1,4, second data in daily cdfs, minor filter differences for two days, step5 failed Monthly files obligatory?
TUC IAGA (year zip) 1, iaga second data not readable
SJG IAGA (year zip) 1,1,5,1,3,3,2 inconsistency in IAF daily mean 2016-11-21, yearly means differ between yearmean and blv files second data OK
SIT IAGA (monthly zip) 1,1,3,4,3,3,2 hourly IAF complete?, filtered one-second differs slightly from one-minute second data OK, check z comp
SHU IAGA (year zip) 1,1,3,4,3,3,2 hourly IAF complete?, filtered one-second differs slightly from one-minute second data OK, check z comp
NEW
MMB IMCDF daily 1,5, second data in daily cdfs, IAF data not readable Monthly files obligatory?
MCQ
MAW IMCDF 1 no leap second information
LYC
LRM IMCDF 1 no leap second information
KNY
KDU
KAK
HRN
HON
HLP
HER
GNG
FRN
FRD
EBR IMCDF 1,3,1,1, file names not as expected for ImagCDF, step5 failed second data OK
DED
CTA
CSY
CNB
CMO
CKI
BSL
BRW
BOU
BEL
BDV
ASP
ABK

Metadata

Metadata

Labels

documentationImprovements or additions to documentation

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions