Respect the timestamps - some data might be missing #105

@Ropzn

There is an issue that was present at least on Arielle during the MOSAiC cruise.
For technical reasons, some (30 s) measurements are occasionally missing entirely.
Picasso does not display these gaps correctly.

Example:
2019-11-13, check the period 22:00 to 24:00

The first plot is from https://polly.tropos.de/archive_link/recent_plots_new/arielle/2019/11/13/2019_11_13_Wed_ARI_18_00_02_RH.png
It does not show the missing time bins.

The second plot (sorry for the different time and height range) shows some missing time bins (10 min apart).
In the first plot you can also see that ten missing 30 s bins add up to 5 minutes, so the plot ends at 23:55.
(I think two more bins were missing that are not shown in fig. 2, since that happened before 21:00.)

[Figure 1: image]

versus

[Figure 2: missingTimeBins]

This is often not a big deal. But as soon as you change the time resolution or compare the Polly data with other data, there may be a time shift.
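To illustrate how such a shift builds up, here is a minimal Python/NumPy sketch (not Picasso code). It assumes a hypothetical 2-hour window with 30 s profiles where one profile is dropped every 10 minutes, and compares the recorded timestamps with a time axis that simply counts profiles. The function name `apparent_shift` and the synthetic data are made up for this example.

```python
import numpy as np

BIN_S = 30  # nominal profile spacing in seconds, as described in this issue


def apparent_shift(recorded_s, bin_s=BIN_S):
    """Difference between the true timestamps and an index-based time axis.

    The index-based axis assumes one profile per bin starting at the first
    timestamp, i.e. it ignores the recorded timestamps after the first one.
    """
    recorded_s = np.asarray(recorded_s)
    assumed_s = recorded_s[0] + bin_s * np.arange(recorded_s.size)
    return recorded_s - assumed_s


# Hypothetical data: 2 hours of 30 s bins, one bin dropped every 10 minutes.
nominal_s = np.arange(0, 2 * 3600, BIN_S)       # 240 expected bins
recorded_s = nominal_s[nominal_s % 600 != 300]  # 12 bins are missing

shift_s = apparent_shift(recorded_s)
print(f"profiles recorded: {recorded_s.size}")
print(f"shift at the end of the window: {shift_s[-1]} s")
```

In this synthetic case the index-based axis ends up 6 minutes behind the real timestamps, because each missing bin adds another 30 s of offset for everything that follows.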

Technical Reason:
Probably the measurement program could not access the nc file while it was being accessed by the backup procedure, which is scheduled every 10 minutes.
This issue is resolved ?? (ask Ronny) for Arielle (by shifting the cron job / task manager time of the backup procedure by a few seconds), and might have occurred only on the Arielle server due to slow hardware.
(I have not noticed it for the Tajikistan system.)

Discussion:
I'm not deep into Picasso, so I don't know whether this is just a plotting issue or whether timestamps are not respected throughout the processing chain.
Probably only this single system has/had this problem.
But this might also be the reason behind issue #91 ...?
So one could argue that, for the future, we only need a quality flag to intervene quickly if something like this happens. To make it really error-proof, one could either correct the data by inserting time bins with NaN measurements (in the raw data or within Picasso) or make the program respect timestamps.

Proposals:

  1. Include a quality check for missing time bins. (Little effort.)
  2. Correct the data within Picasso. (Little programming effort, but probably CPU-consuming?)
  3. Correct the data on the server, not with Picasso. (Probably not the nice way, because one should not touch the raw data.)
  4. Respect timestamps within Picasso. (Probably high programming effort, but low CPU cost.)
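Proposals 1 and 2 could look roughly like the following Python/NumPy sketch. It is not taken from Picasso; it assumes the timestamps are available as seconds in ascending order and that the signal array has time as its first dimension. The names `count_missing_bins` and `fill_missing_bins` are hypothetical.

```python
import numpy as np


def count_missing_bins(times_s, bin_s=30.0):
    """Number of missing bins in each gap between consecutive profiles."""
    dt = np.diff(np.asarray(times_s, dtype=float))
    n_missing = np.rint(dt / bin_s).astype(int) - 1
    return np.clip(n_missing, 0, None)


def fill_missing_bins(times_s, signal, bin_s=30.0):
    """Put profiles on a regular time grid; missing bins are filled with NaN."""
    times_s = np.asarray(times_s, dtype=float)
    signal = np.asarray(signal, dtype=float)
    # Index of each recorded profile on the regular grid.
    idx = np.rint((times_s - times_s[0]) / bin_s).astype(int)
    n_bins = idx[-1] + 1
    grid_s = times_s[0] + bin_s * np.arange(n_bins)
    filled = np.full((n_bins,) + signal.shape[1:], np.nan)
    filled[idx] = signal
    return grid_s, filled


# Hypothetical example: 5 recorded profiles with 3 height bins each.
times_s = [0, 30, 90, 120, 210]
signal = np.arange(15, dtype=float).reshape(5, 3)

missing = count_missing_bins(times_s)
grid_s, filled = fill_missing_bins(times_s, signal)

# Quality flag (proposal 1): raise attention if any bin is missing.
has_gaps = bool(missing.sum() > 0)
print(f"missing bins per gap: {missing}, flag: {has_gaps}")
print(f"regular grid has {grid_s.size} bins")
```

The quality check is just a difference of the time vector, so it should be cheap. The NaN fill allocates a new array of the full size, which is where the extra memory and CPU cost of proposal 2 would come from.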
