-
Notifications
You must be signed in to change notification settings - Fork 3
Checking one-second data #1
Description
Checking one-second data
1. Data checking principle
A data-check routine for one second data should make use of one-minute (preferably checked data) and one-second step_1 data. Contents of these data sets are compared, and, for this purpose, partly recalculated based on INTERMAGNET and IAGA recommended techniques. A quick test will investigate data from a randomly chosen month and is sufficient to check the principal correctness underlying routines, formats and meta information. A full test will perform a detailed data check for all data. A report with all check details should be obtained and made accessible to the data submitter.
2. Issues to be checked
-
Submitted files and formats: Are all requested files available and are they submitted in correct and readable formats (IAGA-2002, IAF, IMCDF).
-
Meta information: Do all files contain the requested meta information and is this meta information consistent between all different files.
-
Data content: Are the IAF one-minute, one-hour, and daily averages consistent and obtained using IM filter procedures. Are F values provided for one-minute and one-second, and are they independent measures of the field. Are delta F variations within acceptable limits. IMCDF: Is the leap second table up-to-date.
-
Data consistency: Can one-minute data be reproduced by filtering one-second data using IM recommended procedures on filtering and outlier treatment.
-
Baseline variation: Are periodic measurements available. Are residuals and amplitudes in an acceptable range.
-
Eventually: Is the spectral data content as expected for a natural geomagnetic signal or are there any significant technical contributions. Noise level.
3. Step wise performance of the data check
(as performed with MagPy 0.9.6)
-
directories and existence of files
-
file names, access and basic header information
-
data content and consistency of minute data
-
second data and consistency with minute data (IM recommendations)
-
basevalues and adopted baseline variation
-
yearly means, meta information
-
activity - "not included"
The results of each step are summarized and rated with a grade from 1 to 6. Grades of 5 and above indicate "significant" issues regarding IM recommendations to be considered. A detailed description on how to use MagPy for these tests can be found in the manual (http://cobs.zamg.ac.at/data/index.php/en/downloads/category/1-magnetism).
4. Issues and suggestions
-
Currently one second data is uploaded in two different formats using, however, various different packing routines. Should we suggest a common submission format? For IAGA2002 I would recommend daily zipped IAGA files. For IMCDF monthly files are already requested.
-
threshold for differences between minute data and IM filtered one-second data: how much difference do we tolerate?
5. Recommendations for analysis
- To avoid downloading all files, one could mount the ftp directories on the local system. However, most files are uploaded as various different archives.
- Run only quick tests first. Only for data passing quick test, run full test.
- When using MagPy eventually skip step 5 (baseline) and 6 (meta, yearmean) for quick test, if minute data has been accepted
- Some issues are easy to be solved, others give indications for underlying procedures.
Example:
- if filtered second data differs from minute data only for few days, this indicates that most likely the outlier treatment differs from the IAGA 90% rule
- if the difference of filtered second data and minute data is evenly distributed with values exceeding the numerical uncertainty then most likely a different filter procedure (window, type) was used
- if the difference is not evenly distributed and some "function" is observable, then eventually different baseline treatments might be the cause
6. Example analysis:
Please note, the following table contains only a test analysis and is not related to any decision from the data committee. All test are also used as a run-time test for MagPy. Error messages related to certain files structures (e.g. step 5 failure related to BLV files or step 6 related to yearmean files) do not necessary indicate that the file structure is wrong. MagPy eventually just cannot interpret the format correctly yet. With Python3 MagPy >= 0.9.6 is required for the analysis.
2016
| Obs | Format (second) | QuickTest | Observed issues | QuickTest result | FullTest | Issues | Summary |
|---|---|---|---|---|---|---|---|
| WIC | IMCDF | 1,1,1,1,1,3,1 | Location and AltitudeDiff | second data OK | |||
| UPS | IMCDF daily | 1,5,1,4, | second data in daily cdfs, minor filter differences for two days, step5 failed | Monthly files obligatory? | |||
| TUC | IAGA (year zip) | 1, | iaga second data not readable | ||||
| SJG | IAGA (year zip) | 1,1,5,1,3,3,2 | inconsistency in IAF daily mean 2016-11-21, yearly means differ between yearmean and blv files | second data OK | |||
| SIT | IAGA (monthly zip) | 1,1,3,4,3,3,2 | hourly IAF complete?, filtered one-second differs slightly from one-minute | second data OK, check z comp | |||
| SHU | IAGA (year zip) | 1,1,3,4,3,3,2 | hourly IAF complete?, filtered one-second differs slightly from one-minute | second data OK, check z comp | |||
| NEW | |||||||
| MMB | IMCDF daily | 1,5, | second data in daily cdfs, IAF data not readable | Monthly files obligatory? | |||
| MCQ | |||||||
| MAW | IMCDF | 1 | no leap second information | ||||
| LYC | |||||||
| LRM | IMCDF | 1 | no leap second information | ||||
| KNY | |||||||
| KDU | |||||||
| KAK | |||||||
| HRN | |||||||
| HON | |||||||
| HLP | |||||||
| HER | |||||||
| GNG | |||||||
| FRN | |||||||
| FRD | |||||||
| EBR | IMCDF | 1,3,1,1, | file names not as expected for ImagCDF, step5 failed | second data OK | |||
| DED | |||||||
| CTA | |||||||
| CSY | |||||||
| CNB | |||||||
| CMO | |||||||
| CKI | |||||||
| BSL | |||||||
| BRW | |||||||
| BOU | |||||||
| BEL | |||||||
| BDV | |||||||
| ASP | |||||||
| ABK |