Why and How of Data and Algorithm Standards
Craig Dsouza, WELL Labs, 8th July, 2024
WATER • ENVIRONMENT • LAND • LIVELIHOODS
Why Data and Algorithm Standards
Data Standards
Algorithm Standards
How do we compare performance of Data Standards
To increase trust & reduce friction in data and algorithm sharing and hence accelerate development of better data and algorithms.
reduce friction
accelerate development
An Example
The seasonal LULC product
WATER • ENVIRONMENT • LAND • LIVELIHOODS
Maps
Land Use Land Cover (LULC)
SUPERVISED
SUPERVISED
UNSUPERVISED
WATER • ENVIRONMENT • LAND • LIVELIHOODS
Models
Land Use Land Cover (LULC)
The seasonal LULC product
Sharing the Seasonal LULC product (Dataset)
An example of how we can share the LULC dataset using existing open geospatial data standards (STAC spec)
Data Provider
Accessing the Seasonal LULC product (Dataset)
An example of how we can access the LULC dataset using existing open geospatial data standards
Data User
Join this afternoon’s group activity to discuss publishing your datasets with open standards
Thank You!
Group Activity
Broad questions
•How to leverage data hosting efforts that are underway, E.g. Source Coop (for Big data)
•Build on top of metadata standards already in use, E.g. Open Imagery Network, ARIES for SEAA
•Explore their extensibility to the kind of data we will be dealing with: Remote sensed, secondary, primary
•Think through algorithm standards and to track data flow chains which more or less seem missing in other efforts
•Build processes for agreement on domain specific standards for various data products that we are producing, including primary data collection standards
Task
•Take some datasets and algorithms as examples and publish them with existing specs.