Class: Gene

A region (or regions) that includes all of the sequence elements necessary to encode a functional transcript. A gene locus may include regulatory regions, transcribed regions and/or other functional sequence regions.

URI: biolink:Gene


Identifier prefixes

  • NCBIGene
  • HGNC
  • MGI
  • ZFIN
  • dictyBase
  • WB
  • WormBase
  • FB
  • RGD
  • SGD
  • OMIM
  • UMLS
  • Xenbase


  • is_a: NucleicAcidEntity - A nucleic acid entity is a molecular entity characterized by availability in gene databases of nucleotide-based sequence representations of its precise sequence; for convenience of representation, partial sequences of various kinds are included, even if they do not represent a physical molecule.

Uses Mixins

  • mixin: GeneOrGeneProduct - A union of gene loci or gene products. Frequently an identifier for one will be used as proxy for another
  • mixin: ThingWithTaxon - A mixin that can be used on any entity that can be taxonomically classified. This includes individual organisms; genes, their products and other molecular entities; body parts; biological processes

Referenced by class


Inherited from entity:

  • id REQ
    • Description: A unique identifier for an entity. Must be either a CURIE shorthand for a URI or a complete URI
    • range: String
    • in subsets: (translator_minimal)
  • iri OPT
    • Description: An IRI for an entity. This is determined by the id using expansion rules.
    • range: IriType
    • in subsets: (translator_minimal,samples)
  • category 0..*
    • Description: Name of the high level ontology class in which this entity is categorized. Corresponds to the label for the biolink entity type class.
  • In a neo4j database this MAY correspond to the neo4j label tag.
  • In an RDF database it should be a biolink model class URI. This field is multi-valued. It should include values for ancestors of the biolink class; for example, a protein such as Shh would have category values biolink:Protein, biolink:GeneProduct, biolink:MolecularEntity, … In an RDF database, nodes will typically have an rdf:type triples. This can be to the most specific biolink class, or potentially to a class more specific than something in biolink. For example, a sequence feature f may have a rdf:type assertion to a SO class such as TF_binding_site, which is more specific than anything in biolink. Here we would have categories {biolink:GenomicEntity, biolink:MolecularEntity, biolink:NamedThing}
  • type OPT
  • description OPT
    • Description: a human-readable description of an entity
    • range: NarrativeText
    • in subsets: (translator_minimal)
  • source OPT
    • Description: a lightweight analog to the association class ‘has provider’ slot, which is the string name, or the authoritative (i.e. database) namespace, designating the origin of the entity to which the slot belongs.
    • range: LabelType
    • in subsets: (translator_minimal)
  • provided by 0..*
    • Description: connects an association to the agent (person, organization or group) that provided it
    • range: Agent
  • has attribute 0..*
    • Description: connects any entity to an attribute
    • range: Attribute
    • in subsets: (samples)

Inherited from macromolecular machine mixin:

  • macromolecular machine mixin➞name OPT
    • Description: genes are typically designated by a short symbol and a full name. We map the symbol to the default display name and use an additional slot for full name
    • range: SymbolType

Inherited from molecular entity:

  • is metabolite OPT
    • Description: indicates whether a molecular entity is a metabolite
    • range: Boolean

Inherited from named thing:

Inherited from nucleic acid entity:

Inherited from thing with taxon:

  • in taxon 0..*
    • Description: connects an entity to its taxonomic classification. Only certain kinds of entities can be taxonomically classified; see ‘thing with taxon’
    • range: OrganismTaxon
    • in subsets: (translator_minimal)

Other properties

Aliases:   locus
In Subsets:   model_organism_database
Exact Mappings:   SO:0000704