Skip to content

CSVW-EO Overview

CSVW-EO extends the W3C CSV on the Web (CSVW) standard with privacy-safe metadata for:

  • Differential Privacy (DP)
  • Dummy data generation
  • Structural dataset modeling
  • Public partition definitions
  • Safe schema publication

Overview

CSVW-EO allows organizations to publish assumptions and guarantees about datasets without exposing sensitive underlying records.

These assumptions may include:

  • dataset schema
  • nullable proportions
  • public categorical domains
  • grouping partitions
  • contribution bounds for DP
  • logical dependencies between columns

Warning

Some assumptions may themselves leak sensitive information. Metadata must always be manually reviewed before publication.

Main Concepts

CSVW-EO extends CSVW with:

Concept Purpose
Structural modeling Describe possible datasets
Dummy modeling Generate realistic fake datasets
DP contribution bounds Calibrate differential privacy
Public partitions Define safe grouping assumptions
Validation Ensure metadata consistency
File Purpose
csvw-eo-vocab.ttl RDF vocabulary
csvw-eo-context.jsonld JSON-LD context
csvw-eo-constraints.ttl SHACL validation
csvw-eo-library Python tooling