Metadata Standards
ODR curates each data set against a set of standards covering various aspects of the data. The key factors verified are:
Geo-spatial factors
Each data set added to the platform is associated with the geographic territory to which it relates. The data is qualified against the following administrative structure of the state:
- State
- District
- Block
- Local Self Government
- Municipal Corporation
- Municipality
- Panchayath
Granularity
Along with geographic coverage, granularity specifies the level of detail available within the data.
Reference Time Period
For every official data set, it is important to have clarity on the time period to which the data is applicable.
Example:
- District – Thiruvananthapuram
- Granularity – Panchayath
- Reference Time Period – 1-Apr-2020 – 31-Mar-2021
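The example above can be sketched as a small metadata record with basic consistency checks. This is an illustrative sketch only: the class name `DatasetMetadata`, its field names, and the `problems` method are hypothetical and do not describe ODR's actual schema. The allowed granularity values are taken from the administrative structure listed earlier.

```python
from dataclasses import dataclass
from datetime import date
from typing import List

# Administrative levels listed in this document, reused here as the
# allowed granularity values (an assumption made for illustration).
ADMIN_LEVELS = (
    "State", "District", "Block", "Local Self Government",
    "Municipal Corporation", "Municipality", "Panchayath",
)


@dataclass(frozen=True)
class DatasetMetadata:
    """Hypothetical metadata record; field names are illustrative only."""
    district: str
    granularity: str
    period_start: date
    period_end: date

    def problems(self) -> List[str]:
        """Return human-readable issues; an empty list means no issue was found."""
        issues = []
        if not self.district.strip():
            issues.append("district is empty")
        if self.granularity not in ADMIN_LEVELS:
            issues.append(f"unknown granularity: {self.granularity!r}")
        if self.period_start > self.period_end:
            issues.append("reference period starts after it ends")
        return issues


# The example from the text: Thiruvananthapuram, Panchayath level,
# reference period 1-Apr-2020 to 31-Mar-2021.
example = DatasetMetadata(
    district="Thiruvananthapuram",
    granularity="Panchayath",
    period_start=date(2020, 4, 1),
    period_end=date(2021, 3, 31),
)
print(example.problems())
```

Storing the reference period as two explicit dates, rather than as free text, makes it straightforward to compare or filter data sets by period.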
Machine Readable Formats
The datasets shared on the platform must be machine readable to ensure usability of the data.
Preferred formats:
- CSV
- TSV
- JSON
- XLSX
- API
Errors and Missing Data
Take care when uploading datasets to the platform. Automated checks are in place to ensure that datasets containing blank values are not uploaded.
We encourage correcting any data issues that may negatively affect data quality, such as:
- Null values in columns. If all values in a column are null, consider removing the column.
- Duplicate rows in the data set
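Both issues listed above can be checked locally before uploading. The following is a minimal sketch using only the Python standard library; it is not the platform's own automated check. The function name `quality_report`, the report keys, and the sample data are made up for illustration.

```python
import csv
import io


def quality_report(csv_text):
    """Count blank values per column, list fully blank columns, and count duplicate rows."""
    reader = csv.DictReader(io.StringIO(csv_text))
    columns = reader.fieldnames or []
    rows = list(reader)

    null_counts = {c: 0 for c in columns}
    seen = set()
    duplicate_rows = 0

    for row in rows:
        for c in columns:
            value = row.get(c)
            # Treat missing fields and whitespace-only strings as null.
            if value is None or value.strip() == "":
                null_counts[c] += 1
        key = tuple(row.get(c) for c in columns)
        if key in seen:
            duplicate_rows += 1  # exact repeat of an earlier row
        else:
            seen.add(key)

    # Columns where every value is null are candidates for removal.
    empty_columns = [c for c in columns if rows and null_counts[c] == len(rows)]
    return {
        "null_counts": null_counts,
        "empty_columns": empty_columns,
        "duplicate_rows": duplicate_rows,
    }


# Made-up sample data, for illustration only.
sample = """district,panchayath,population,remarks
Thiruvananthapuram,Panchayath A,100,
Thiruvananthapuram,Panchayath B,,
Thiruvananthapuram,Panchayath A,100,
"""

report = quality_report(sample)
print(report)
```

In this sample, the `remarks` column is blank in every row and would be a candidate for removal, and the third row repeats the first.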