|
This repository consist in a collection of R-scripts to homogenise platform identifier information and to identify duplicate observations in the **International Comprehensive Ocean-Atmosphere Data Set** (ICOADS) marine data source.
|
|
This repository consist in a collection of R-scripts to homogenise platform identifier information and to identify duplicate observations in the **International Comprehensive Ocean-Atmosphere Data Set** (ICOADS) marine data source.
|
|
|
|
|
|
ICOADS is the world most extensive surface marine meteorological data collection. Contains ocean surface and atmospheric observations from the 1600's to present and is still receiving more data every year. The data base is made up of data from many different sources, there are several hundred combinations of the **DCK** (deck) and **SID** (sources) flags that indicate the origin of the data. Typically, **DCK** indicates the **type of data** (e.g. US Navy ships; Japanese Whaling Fleet) and **SID** provides more information about the data system or format (e.g. data stream extracted from the WMO global telecommunications systems, GIS).
|
|
ICOADS is the world most extensive surface marine meteorological data collection. Contains ocean surface and atmospheric observations from the 1600's to present and is still receiving more data every year. The data base is made up of observation reports from many different sources, there are several hundred combinations of the **DCK** (deck) and **SID** (sources) flags that indicate the origin of the data. Typically, **DCK** indicates the **type of data** (e.g. US Navy ships; Japanese Whaling Fleet) and **SID** provides more information about the data system or format (e.g. data stream extracted from the WMO global telecommunications systems, GIS).
|
|
|
|
|
|
Sometimes a single DCK is associated with a single SID, sometimes a single DCK will contain several SID and vice versa. Historically archives of marine data have been maintained by individual nations, and often these were shared so that the same observations appear in the archives of several nations. Truncated formats often did not contain sufficient information to identify the observations made by a particular ship or platform, and these compact formats sometimes converted or encoded data in different ways. For example, many observations do not have an identifier linking to the ship (**ID**) or platform (**pt**), and for those that do have such identifiers they may be different between data sources. The main types of duplicates are:
|
|
Sometimes a single DCK is associated with a single SID, sometimes a single DCK will contain several SID and vice versa. Historically archives of marine data have been maintained by individual nations, and often these were shared so that the same observations appear in the archives of several nations. Truncated formats often did not contain sufficient information to identify the observations made by a particular ship or platform, and these compact formats sometimes converted or encoded data in different ways. For example, many observations do not have an identifier linking to the ship (**ID**) or platform (**pt**), and for those that do have such identifiers they may be different between data sources. The main types of duplicates are:
|
|
|
|
|
... | | ... | |