Published using Google Docs
WebVid-CoVR - Datasheet
Updated automatically every 5 minutes

This document presents a datasheet [2] for the WebVid-CoVR dataset.

  1. Motivation

  1. Composition

  1. Collection Process

  1. Preprocessing/cleaning/labeling

  1. Uses

  1. The videos may contain inappropriate or objectionable content, since they were scraped from the web without curation. The authors attempted to filter out some inappropriate content, but dataset consumers should be aware that objectionable material may still exist in the data.
  2. The automatically generated modification texts between video pairs may be noisy or not accurately describe the visual differences between videos, since they were generated from caption differences only. This could impact potential uses or analysis done on the textual aspects of the data.
  3. The lack of human curation means there could be biases or quality issues in the data. Consumers should evaluate the data carefully for their application.
  4. There may be copyright or terms of use issues with the original videos that were scraped, so consumers should review permissions and potential restrictions associated to their institutions.

To mitigate risks/harms, consumers could manually review and filter objectionable content, analyze texts to exclude noisy samples, and evaluate dataset distributions/biases before use. Working with subset samples or known high-quality portions may also help. Overall, the automated nature of the dataset generation process should be kept in mind.

  1. Distribution

  1. Maintenance

References

  1. Max Bain, Arsha Nagrani, Gül Varol, and Andrew Zisserman. Frozen in time: A joint video and image encoder for end-to-end retrieval. In ICCV, 2021.
  2. Timnit Gebru, Jamie Morgenstern, Briana Vecchione, Jennifer Wortman Vaughan, Hanna Wallach, Hal Daumé, and Kate Crawford. Datasheets for datasets. Communications of the ACM.