Data & Access

Our complete dataset is open and available for anyone.

Download Full Dataset

Use these persistent URLs for automated access to always get the latest version of the dataset.

Live Data Access

Use these persistent URLs for automated access to always get the latest version of the dataset.

CSV ENDPOINT

https://yourselftoscience.org/resources.csv

JSON ENDPOINT

https://yourselftoscience.org/resources.json

RDF/TTL ENDPOINT

https://yourselftoscience.org/resources.ttl

VOID ENDPOINT (Semantic Web)

Linked Data Descriptor
https://yourselftoscience.org/void.ttl

Licensing

Our dataset is dedicated to the public domain under the Creative Commons CC0 1.0 Universal Public Domain Dedication (CC0 1.0). You can copy, modify, and distribute the data, even for commercial purposes, without asking permission.

While not required, we appreciate credit to Yourself to Science when using our data.

Wikidata Integration

Each resource in our catalogue is mapped to Wikidata QIDs. Organizations, countries, and key entities reference their Wikidata identifiers, making the dataset interoperable with the global knowledge graph.

This alignment is maintained manually and is used to enrich existing Wikidata items and identify missing ones.

Dataset Schema

The dataset contains the following fields for each resource. For detailed definitions of each data type, visit our Full Data Dictionary.

  • id: A persistent, unique identifier (UUID) for the resource.
  • permalink: The permanent URI linking directly to the resource's dataset page.
  • slug: A user-friendly identifier used in the URL.
  • title: The name of the resource or study.
  • organizations: An array of organizations conducting the research, each with a name and optional Wikidata ID.
  • link: A URL to the resource's website.
  • dataTypes: An array of strings describing the types of data collected (e.g., "Genome", "Health data").
  • compensationType: The type of compensation offered ("donation", "payment", or "mixed").
  • origin: The country where the organization is based (Headquarters).
  • countries: An array of countries where the resource is available.
  • description: A brief description of the resource.
  • citations: An array of academic citations related to the resource.
  • compatibleSources: Known accepted dataset sources (e.g., "WGS", "23andMe").
  • resourceWikidataId: The main Wikidata QID aligned with the project.
  • entityCategory: The general type of the organization (e.g., "Non-Profit", "Government").
  • entitySubType: A more specific classification of the organization (e.g., "Research Foundation", "Regulatory Agency").