Building Blocks and Schemas for GA4GH Implementations
Please not: The content of this repository is being re-“invented” in schemablocks.org, as a wider GA4GH cross-workstream initiative. This site here should be considered a prototype for the new project …
This repository contains schema “blocks” for the GA4GH project, in a collaborative effort between members of the Clinical and Phenotypic Data Capture (GA4GH::CP) and the Genomic Knowledge Standards (GA4GH::GKS) and the Discovery work streams.
Such blocks can be
The project does not intent to build a monolithic API, but rather help to exchange usable components for creating implementations.
Currently, this site just represents skeleton schema elements, derived from the original, then monolithic GA4GH schema.
The primary documents are in the yaml directory, with JSON versions and examples extracted from them. The “readable” documentation is also created from the YAML files and can be accessed through the links below.
biosample
object.individual
object contains information which pertains to the whole biological entity biosamples are derived from (e.g. sex, heritable phenotypes…).The “genomic” parts of the schema recommendations do not yet represent authoritative recommendations of the GA4GH::GKS group, but rather reflect extended versions of the original, VCF-derived GA4GH schema. Examples for current use of this schema are e.g. in the arraymap.org and the Beacon+ projects.
variant
object includes attributes and examples for both structural (DUP, DEL …) and precise genome variants.callset
object is for technoical data and series information (e.g. used platform and analysis metods). It is not strictly needed for querying combined variant + biosample aspects, since in the current implementation the variant
object contains a reference to the biosample
it was derived from.