SIGnaling Network Open Resource (Signor) Reference Ingest Guide

Source Information

InfoRes ID: infores:signor

Description: SIGNOR 3.0, https://signor.uniroma2.it, is a public repository that captures causal information and represents it according to an 'activity-flow' model. SIGNOR provides freely accessible static maps of causal interactions that can be tailored, pruned and refined to build dynamic and predictive models. Each signaling relationship is annotated with an effect (up/down-regulation) and with the mechanism (e.g. binding, phosphorylation, transcriptional activation) causing the regulation of the target entity. Since its latest release, SIGNOR has undergone a significant upgrade including: (i) a new website that offers an improved user experience and novel advanced search and graph tools; (ii) a significant content growth adding up to a total of approx. 33,000 manually annotated causal relationships between more than 8900 biological entities; (iii) an increase in the number of manually annotated pathways, currently including pathways deregulated by SARS-CoV-2 infection or involved in neurodevelopment, synaptic transmission and metabolism, among others; (iv) additional features such as a new model to represent metabolic reactions and a new confidence score assigned to each interaction.

Citations:
- Prisca Lo Surdo, Marta Iannuccelli, Silvia Contino, Luisa Castagnoli, Luana Licata, Gianni Cesareni, Livia Perfetto, SIGNOR 3.0, the SIGnaling network open resource 3.0: 2022 update, Nucleic Acids Research, Volume 51, Issue D1, 6 January 2023, Pages D631–D637, https://doi.org/10.1093/nar/gkac883

Data Access Locations:
- Signor 3.0 Downloads: https://signor.uniroma2.it/downloads.php (this page includes file sizes and simple data dictionaries for each download)

Data Provision Mechanisms: file_download

Data Formats: csv

Data Versioning and Releases: No consistent cadence for releases, but on average there are 1-2 releases each month. Versioning is based on the month and year of the release. Releases page / change log: https://signor.uniroma2.it/downloads.php

Ingest Information

Ingest Categories: primary_knowledge_provider

Utility: Signor is a rich source of manually curated genetic associations to other biological entities, which are an important type of edge for Translator query and reasoning use cases, including treatment predictions, gene-gene regulation predictions, and pathfinder queries. It is one of the sources that focus on drugs and genes.

Scope: This initial ingest of Signor covers curated Gene to Gene associations that report therapeutic and marker/mechanism relationships, and inferred statistical associations generated by Signor.

Relevant Files

| File Name | Location | Description |
| --- | --- | --- |
| signor_genes.csv | https://signor.uniroma2.it/downloads.php | Associations generated by knowledge assertions between 'Gene', 'Complex', 'Chemical', 'Phenotype', 'Protein', 'Smallmolecule', 'Proteinfamily', 'Stimulus', 'Fusion Protein', 'Mirna', 'Antibody' and 'Ncrna' |

Included Content

| File Name | Included Records | Fields Used |
| --- | --- | --- |
| signor_genes.csv | Associations generated by knowledge assertions with quality-controlled edges between 'Gene', 'Complex', 'Chemical', 'Phenotype', 'Protein', 'Smallmolecule', 'Proteinfamily', 'Mirna', 'Antibody' and 'Ncrna' | subject_identifier, subject_name, subject_category, object_identifier, object_name, object_category, predicate, original_predicate, provided_by, Primary_Knowledge_Source, publications, knowledge_level |
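The record selection described above can be sketched in a few lines of Python. This is a minimal illustration, not the ingest's actual code: it assumes that the `subject_category` and `object_category` columns hold the bare Signor type labels (for example 'Gene'), and the helper name `select_included` and the sample rows are hypothetical.

```python
import csv
import io

# Entity types listed under "Included Records". 'Stimulus' and 'Fusion Protein'
# appear in the file description but are not part of the included set.
INCLUDED_CATEGORIES = {
    "Gene", "Complex", "Chemical", "Phenotype", "Protein",
    "Smallmolecule", "Proteinfamily", "Mirna", "Antibody", "Ncrna",
}

# Columns listed under "Fields Used".
FIELDS_USED = [
    "subject_identifier", "subject_name", "subject_category",
    "object_identifier", "object_name", "object_category",
    "predicate", "original_predicate", "provided_by",
    "Primary_Knowledge_Source", "publications", "knowledge_level",
]


def select_included(rows):
    """Yield rows whose subject and object types are both included,
    projected onto FIELDS_USED (missing columns become empty strings)."""
    for row in rows:
        if (row.get("subject_category") in INCLUDED_CATEGORIES
                and row.get("object_category") in INCLUDED_CATEGORIES):
            yield {field: row.get(field, "") for field in FIELDS_USED}


# Invented placeholder rows, only to exercise the filter (assumption: not real Signor data).
SAMPLE = """subject_identifier,subject_name,subject_category,object_identifier,object_name,object_category,predicate
ID:1,geneA,Gene,ID:2,geneB,Gene,regulates
ID:3,stimX,Stimulus,ID:2,geneB,Gene,affects
"""

kept = list(select_included(csv.DictReader(io.StringIO(SAMPLE))))
```

With the placeholder input, only the Gene-to-Gene row survives; the row with a 'Stimulus' subject is dropped because that type is not in the included set.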

Future Content Considerations

edge_content: While the current ingest includes only edges that passed initial quality control (LLM and manual verification), future iterations will include additional edges that failed initial quality control.

edge_content: Consider adding new Biolink categories. 'Fusion Protein' is a potential descendant of 'biolink:Protein'; 'Stimulus' only partially overlaps with 'biolink:EnvironmentalProcess'.

Target Information

Target InfoRes ID: infores:catrax-pharmacogenomics

Edge Types

| Subject Categories | Predicate | Object Categories | Knowledge Level | Agent Type | UI Explanation |
| --- | --- | --- | --- | --- | --- |
| biolink:Gene | | biolink:Gene | knowledge_assertion | manual_agent | Signor records indicate that the gene regulates another gene, which maps best to the Biolink predicate 'regulates' with an additional directional qualifier. |
| biolink:ChemicalEntity | | biolink:Gene | knowledge_assertion | manual_agent | Signor records indicate that the chemical regulates a gene, which maps best to the Biolink predicate 'affects' with an additional directional qualifier. |
| biolink:Gene | | biolink:Phenotype | knowledge_assertion | manual_agent | Signor records indicate that the gene regulates a phenotype, which maps best to the Biolink predicate 'affects' with an additional directional qualifier. |
| biolink:Complex | | biolink:Gene | knowledge_assertion | manual_agent | Signor records indicate that the complex regulates a gene, which maps best to the Biolink predicate 'affects' with an additional directional qualifier. |
| biolink:Gene | | biolink:SmallMolecule | knowledge_assertion | manual_agent | Signor records indicate that the gene regulates a small molecule, which maps best to the Biolink predicate 'affects' with an additional directional qualifier. |
| biolink:ProteinFamily | | biolink:Gene | knowledge_assertion | manual_agent | Signor records indicate that the protein family regulates a gene, which maps best to the Biolink predicate 'regulates' with an additional directional qualifier. |
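The edge types above amount to a lookup from a (subject category, object category) pair to a predicate. The sketch below restates that lookup in Python, taking the predicate for each pair from its UI Explanation text. The `biolink:` prefix on the predicates, the table name `PREDICATE_BY_CATEGORY_PAIR`, and the function `predicate_for` are assumptions made for illustration. The directional qualifier is left out because the guide does not name it.

```python
# Predicate per category pair, as stated in each row's UI Explanation.
PREDICATE_BY_CATEGORY_PAIR = {
    ("biolink:Gene", "biolink:Gene"): "biolink:regulates",
    ("biolink:ChemicalEntity", "biolink:Gene"): "biolink:affects",
    ("biolink:Gene", "biolink:Phenotype"): "biolink:affects",
    ("biolink:Complex", "biolink:Gene"): "biolink:affects",
    ("biolink:Gene", "biolink:SmallMolecule"): "biolink:affects",
    ("biolink:ProteinFamily", "biolink:Gene"): "biolink:regulates",
}


def predicate_for(subject_category, object_category):
    """Return the predicate for a category pair, or None if the pair
    is not one of the edge types listed in this guide."""
    return PREDICATE_BY_CATEGORY_PAIR.get((subject_category, object_category))


gene_gene = predicate_for("biolink:Gene", "biolink:Gene")
chemical_gene = predicate_for("biolink:ChemicalEntity", "biolink:Gene")
unlisted = predicate_for("biolink:Gene", "biolink:Complex")
```

Returning `None` for unlisted pairs keeps the lookup explicit about which combinations the guide covers; the lookup is direction-sensitive, so Gene-to-Complex is not treated as equivalent to Complex-to-Gene.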

Provenance Information

Contributors:
- Qi Wei: code author, data modeling
- Yue Zhang: data modeling
- Guangrong Qin: data modeling, domain expertise
- Sierra Moxon: code support
- Matthew Brush: data modeling, domain expertise

Artifacts:
- Ingest Survey: https://docs.google.com/spreadsheets/d/1tqimhXxpWzQdfNxanpW-rAaLnmsP80YZUthY5O4mEc8/edit?gid=1223527032#gid=1223527032
- Ingest Ticket: https://github.com/NCATSTranslator/Data-Ingest-Coordination-Working-Group/issues/29