Building Blocks and Schemas for GA4GH Implementations
This document describes the attributes of the variant object. In its current implementation, variant (and related genomic objects such as callset) represents an extended version of the original, VCF-derived GA4GH schema. This format may be superseded or augmented based on current developments in the GA4GH::GKS work stream.
The schema is defined in the YAML file.
Property | Type | Format | Description |
---|---|---|---|
alternate_bases | string | | one or more bases relative to start position of the reference genome, replacing the reference_bases value; for precise variants |
biosample_id | | | The identifier ("biosample.id") of the biosample this variant was reported from. This is a shortcut to using the variant -> callset -> biosample chaining. |
callset_id | string | | The identifier ("callset.id") of the callset this variant is part of. |
created | string | | The creation time of this record, in ISO8601 |
digest | string | | concatenated unique specific elements of the variant |
end | array | int64 | array of 0 (for precise sequence variants), 1 or 2 (for imprecise end position of structural variant) integers |
genotype | array | | list of strings representing the (phased) alleles in which the variant was observed |
id | string | | The locally unique identifier of this variant (referenced as "variant_id"). |
info | | | additional variant information, as defined in the example and accompanying documentation |
mate_name | string | | Mate name (chromosome) for fusion (BRK) events; otherwise left empty. Accepting values 1-22, X, Y. |
reference_bases | string | | one or more bases at start position in the reference genome, which have been replaced by the alternate_bases value; for precise variants |
reference_name | string | | Reference name (chromosome). Accepting values 1-22, X, Y. |
start | array | int64 | array of 1 or 2 (for imprecise end position of structural variant) integers |
updated | string | | The time of the last edit of this record, in ISO8601 |
variant_type | string | | the variant type in case of a named (structural) variant (e.g. DUP, DEL, BRK ...) |
one or more bases relative to start position of the reference genome, replacing the reference_bases value; for precise variants
'alternate_bases' : "AC"
The identifier (“biosample.id”) of the biosample this variant was reported from. This is a shortcut to using the variant -> callset -> biosample chaining.
'biosample_id' : "pgx-bs-987647"
The identifier (“callset.id”) of the callset this variant is part of.
'callset_id' : "PGX_AM_CS_GSM1690424"
The creation time of this record, in ISO8601
'created' : "2017-10-25T07:06:03Z"
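As a minimal sketch, a timestamp in the form shown above can be produced and parsed with Python's standard library. The helper names `iso8601_utc` and `parse_iso8601_utc` are hypothetical and not part of the schema; the assumed layout (UTC, second precision, trailing `Z`) is inferred from the example value only.

```python
from datetime import datetime, timezone

# Assumed layout, taken from the example value "2017-10-25T07:06:03Z".
ISO8601_UTC = "%Y-%m-%dT%H:%M:%SZ"


def iso8601_utc(moment: datetime) -> str:
    """Format a timezone-aware datetime as UTC with second precision (hypothetical helper)."""
    return moment.astimezone(timezone.utc).strftime(ISO8601_UTC)


def parse_iso8601_utc(text: str) -> datetime:
    """Parse a timestamp in the assumed layout back into an aware datetime."""
    return datetime.strptime(text, ISO8601_UTC).replace(tzinfo=timezone.utc)


created = parse_iso8601_utc("2017-10-25T07:06:03Z")
print(iso8601_utc(created))  # 2017-10-25T07:06:03Z
```

The same helpers would apply to the updated attribute, which uses the same format.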
concatenated unique specific elements of the variant
'digest' : "4:12282-46465:DEL"
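The example value suggests a layout of reference name, start and end position, and variant type. The sketch below reproduces that layout for a structural variant with at least one start and one end value. The composition rule is an assumption inferred from this single example, and `make_digest` is a hypothetical helper, not part of the schema.

```python
def make_digest(reference_name, start, end, variant_type):
    """Join identifying elements as '<reference>:<start>-<end>:<type>' (assumed layout)."""
    # Assumption: for imprecise positions, the first start and the last end value are used.
    return f"{reference_name}:{start[0]}-{end[-1]}:{variant_type}"


print(make_digest(4, [12282], [46465], "DEL"))  # 4:12282-46465:DEL
```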
array of 0 (for precise sequence variants), 1 or 2 (for imprecise end position of structural variant) integers
'end' : [
21977798,
21978106
]
The following query returns all deletion variants with any overlap of the CDKN2A CDR:
db.variants.find( { "reference_name" : 9, "variant_type" : "DEL", "start" : { $lte : 21975098 }, "end" : { $gte : 21967753 } } )
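The positional part of the query above can also be expressed as a plain predicate, which may help when checking records outside the database. This is a sketch under assumptions: `overlaps_region` is a hypothetical helper, variants are plain dictionaries shaped like the examples in this document, and array fields are compared the way MongoDB compares arrays (a condition holds if any element satisfies it). The reference name and variant type are not checked here.

```python
def overlaps_region(variant, region_start, region_end):
    """Return True if the variant has any overlap with [region_start, region_end]."""
    # An empty "end" array (precise sequence variants) never matches, as in the query.
    starts_before_region_end = any(s <= region_end for s in variant["start"])
    ends_after_region_start = any(e >= region_start for e in variant["end"])
    return starts_before_region_end and ends_after_region_start


# Hypothetical record for illustration; positions are not taken from real data.
deletion = {"reference_name": 9, "variant_type": "DEL",
            "start": [21960000], "end": [21970000]}

print(overlaps_region(deletion, 21967753, 21975098))  # True
```

A variant overlaps the region when it starts at or before the region end and ends at or after the region start, which is why the start position is compared against the upper bound and the end position against the lower bound.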
list of strings representing the (phased) alleles in which the variant was observed
'genotype' : [
'1',
'.'
]
The locally unique identifier of this variant (referenced as “variant_id”).
'id' : "amvar-8754-7751-1119-8539"
additional variant information, as defined in the example and accompanying documentation
'info' : {
'cnv_value' : '-0.294',
'cnv_length' : 1205290
}
Mate name (chromosome) for fusion (BRK) events; otherwise left empty. Accepting values 1-22, X, Y.
'mate_name' : 14
one or more bases at start position in the reference genome, which have been replaced by the alternate_bases value; for precise variants
'reference_bases' : "G"
Reference name (chromosome). Accepting values 1-22, X, Y.
'reference_name' : 8
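The accepted values can be checked with a small validator. This is a minimal sketch, assuming values may arrive either as integers (as in the example) or as strings; `is_valid_reference_name` is a hypothetical helper and not part of the schema.

```python
# Accepted chromosome names as stated above: 1-22, X, Y.
ACCEPTED_NAMES = {str(n) for n in range(1, 23)} | {"X", "Y"}


def is_valid_reference_name(value):
    """Check a reference name given as int or str (hypothetical helper)."""
    # bool is a subclass of int in Python, so reject it explicitly.
    if isinstance(value, bool):
        return False
    return str(value) in ACCEPTED_NAMES


print(is_valid_reference_name(8), is_valid_reference_name("X"), is_valid_reference_name(23))
# True True False
```

The same check could be applied to mate_name whenever that attribute is not left empty.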
array of 1 or 2 (for imprecise end position of structural variant) integers
'start' : [
20867740
]
The time of the last edit of this record, in ISO8601
'updated' : "2022-11-11T09:45:13Z"
the variant type in case of a named (structural) variant (e.g. DUP, DEL, BRK …)
'variant_type' : "DEL"