Meta Data Standards

ODR curates each data sets following a set of standards covering the various aspects of the data. The key factors that are verified are:

Geo-spatial factors

Each data set made part of the platform is associated to the geographic territory where to which it is connected. The data is qualified against the following administrative structure of the state.

  • State
  • District
  • Block
  • Local Self Government
    • Municipal Corporation
    • Municipality
    • Panchayath

Granularity

Along with geographic coverage, granularity specify the level of detail available within the data.

Reference Time Period

For every official data set, it is important to have clarity on the time period to which the data is applicable.

Example:

  • District – Thiruvananthapuram
  • Granularity – Panchayath
  • Reference Time Period – 1-Apr-2020 – 31-Mar-2021

Machine Readable Formats

The datasets shared on the platform must be machine readable to ensure usability of the data.

Preferred formats:

  • CSV
  • TSV
  • JSON
  • XLSX
  • API

Errors and Missing Data

Ensure care while uploading datasets to the platform. Automated checks are in place to ensure that datasets containing blank values are not uploaded.

We encourage correcting any data issues that may negatively affect data quality, such as:

  • Null values in columns. If all values in a column are null, consider removing the column.
  • Duplicate rows in the data set