Skip to content

Results GREEKC



The Knowledge Commons

The analysis of gene regulation by transcription factor-mediated mechanisms is key to understanding how genomic information determines cellular differentiation and function. This regulation involves a coordinated interaction between various types of transcription factors, the DNA regions where they engage in gene-specific transcription regulation and the specific epigenetic context that defines the accessibility and proximity to their target genes. Systems approaches to study this require ample access to all knowledge that has been obtained and such knowledge should be available from public databases: The Knowledge Commons.


The COST Action Gene Regulation Ensemble Effort for the Knowledge Commons was proposed by the Gene Regulation Consortium (GRECO) and designed to improve the development of the Gene Regulation Knowledge Commons (GRKC): ‘The collection of freely accessible information resources, with data well annotated with unambiguous descriptors according to quality criteria and standards that allow seamless integration and interoperability as well as automated computational access with third-party software. GREEKC has worked toward improving the resources contributing to the GRKC by coordinating efforts in building and making available high-quality, curated databases, following a responsible research and innovation (RRI) approach. This RRI approach proved to be an extremely good fit with the main use of COST Actions: bringing together people in one room who would not normally discuss or consult each other.



Through the organization of nine Workshops, the GREEKC COST Action discussed improvements of the Sequence Ontology and the Gene Ontology, and the testing and use of curation tools including Noctua (which allows Gene Ontology-Causal Activity Modeling GO-CAM)) and Visual Syntax Markup (which aids a curator in the construction of annotations in a readable sentence format). Discussions stimulated the development of new curation guidelines, workflows, and data standards for the annotation of gene regulators at levels of proteins, ncRNA, gene, nucleotide sequence and interactions, regulatory complexes, and network information flow. A critical assessment was done to define what constitutes a DNA binding Transcription Factor, resulting in the GREEKC dbTF list. Given the importance of computationally generated knowledge, the crucial role of the Positional Weight Matrix (PWM) for predicting TF-TG interactions was the inspiration for a comprehensive benchmarking of public PWM models against large experimental reference sets, resulting in benchmarking protocols for future use.

GREEKC also coordinated the new annotation standard ‘Minimum Information about a Molecular Interaction Causal Statement’ (MI2CAST) and the data exchange language CausalTAB (the MITAB 2.8 format), for producing and sharing causal (regulatory) molecular interaction data. The use of text mining was discussed as a way to extract knowledge about regulatory interactions of transcription factors with target genes automatically from MedLine abstracts (, and the results (a large corpus of TF-TG sentences) was made available through the PSICQUIC web service and through the Cytoscape App BioGateway, allowing a network builder to validate the quality of the text mining result by linking out to the ExTRI sentence in the abstract.


GREEKC Training Schools results


Summary of results document

To get access to the Results Document Folder for this event, you can send an email to


Summary of results document

To appear in Biochimica et Biophysica Acta as Gene Ontology representation for transcription factor functions.


  • Ershova AS, Eliseeva IA, Nikonov OS, Fedorova AD, Vorontsov IE, Papatsenko D, Kulakovskiy IV. Enhanced C/EBP binding to G· T mismatches facilitates fixation of CpG mutations in cancer and adult stem cells. Cell Reports. 2021 Jun 8;35(10):109221. DOI: 10.1016/j.celrep.2021.109221

    Abramov S, Boytsov A, Bykova D, Penzar DD, Yevshin I, Kolmykov SK, Fridman MV, Favorov AV, Vorontsov IE, Baulin E, Kolpakov F. Landscape of allele-specific transcription factor binding in the human genome. Nature Communications. 2021 May 12;12(1):1-5. DOI: 10.1038/s41467-021-23007-0

    Carbon S, Douglass E, Good BM, Unni DR, Harris NL, Mungall CJ, Basu S, Chisholm RL, Dodson RJ, Hartline E, Fey P. The Gene Ontology resource: enriching a GOld mine. Nucleic Acids Research. 2021 Jan 8;49(D1):D325-34. DOI: 10.1093/nar/gkaa1113

    Gene Ontology Consortium. The gene ontology resource: 20 years and still GOing strong. Nucleic acids research. 2019 Jan 8;47(D1):D330-8. DOI: 10.1093/nar/gky1055

    Vera-Ramos JA, Juanes-Cortés B, Fernández-Breis JT, Gaudet P, Kuiper M, Lægreid A, Logie C, García MD, Schulz S. An Example of Multimodal Biological Knowledge Representation. InJOWO 2019. //

    Ambrosini G, Vorontsov I, Penzar D, Groux R, Fornes O, Nikolaeva DD, Ballester B, Grau J, Grosse I, Makeev V, Kulakovskiy I. Insights gained from a comprehensive all-against-all transcription factor binding motif benchmarking study. Genome biology. 2020 Dec;21:1-8. DOI: 10.1186/s13059-020-01996-3

    Perfetto L, Acencio ML, Bradley G, Cesareni G, Del Toro N, Fazekas D, Hermjakob H, Korcsmaros T, Kuiper M, Lægreid A, Lo Surdo P. CausalTAB: the PSI-MITAB 2.8 updated format for signalling data representation and dissemination. Bioinformatics. 2019 Oct 1;35(19):3779-85. DOI: 10.1093/bioinformatics/btz132

    Touré V, Vercruysse S, Acencio ML, Lovering RC, Orchard S, Bradley G, Casals-Casas C, Chaouiya C, Del-Toro N, Flobak Å, Gaudet P. The minimum information about a molecular interaction causal statement (MI2CAST). Bioinformatics. 2020 Apr 27. DOI: 10.1093/bioinformatics/btaa622

    Panni S, Lovering RC, Porras P, Orchard S. Non-coding RNA regulatory networks. Biochimica et Biophysica Acta (BBA)-Gene Regulatory Mechanisms. 2020 Jun 1;1863(6):194417. DOI: 10.1016/j.bbagrm.2019.194417

    Chatterjee A. Pure for me or impure for us: Pollution in the Gene Regulation Knowledge Commons. //

    Gabrielsen AM. Openness and trust in data-intensive science: the case of biocuration. Medicine, Health Care and Philosophy. 2020 Sep;23(3):497-504. DOI: 10.1007/s11019-020-09960-5

    Nydal R, Bennett G, Kuiper M, Lægreid A. Silencing trust: confidence and familiarity in re-engineering knowledge infrastructures. Medicine, Health Care and Philosophy. 2020 Sep;23:471-84. DOI: 10.1007/s11019-020-09957-0

    Shelley-Egan C, Gjefsen MD, Nydal R. Consolidating RRI and Open Science: understanding the potential for transformative change. Life Sciences, Society and Policy. 2020 Dec;16(1):1-4. DOI: 10.1186/s40504-020-00103-5

  • Bonello J, Cachia E, Alfino N. AutoFAIR – Automating FAIR Assessments Portal for Bioinformatics Tools. 2020 Submitted to BBA.

    Chatterjee A, Swierstra T, Kuiper M. The potential purity of polluted data: curator and user perspectives. 2021 Submitted to BBA.

    Gaudet P, Logie C, Lovering R, Kuiper M, Lægreid A, Thomas PD. Gene Ontology representation for transcription factor functions. 2021 Submitted to BBA.

    Juanes-Cortés B, Vera-Ramos JA, Lovering CR, Gaudet P, Laegreid A, Logie C, … & Fernández-Breis JT. Formalization of gene regulation knowledge using ontologies and Gene Ontology Causal Activity Models. 2021 Submitted to BBA. 

    Kuiper M, The GREEKC Consortium. The Gene Regulation Knowledge Commons, the Action Area of GREEKC. 2021 Submitted to BBA.

    Lovering RC, Gaudet P, Acencio ML, Ignatchenko A, Jolma A, Fornes O, Kuiper M, Kulakovskiy IV, Lægreid A, Martin MJ, Logie C. A GO catalogue of human DNA-binding transcription factors. bioRxiv, 28 Oct 2020, // 2021 Submitted to BBA.

    Sant D, Sinclair M, Mungall C, Schulz S, Zerbino D, Lovering R, Logie C, Eilbeck K. Sequence Ontology terminology for gene regulation. 2021 Submitted to BBA.

    Velthuijs N, Meldal BH, Geessinck Q, Porras P, Medvedeva Y, Zubritskiy A, Orchard S, Logie C. Integration of human co-factor complexes and sequence-specific DNA-binding factor interactomes. 2021 Submitted to BBA.

  • Almagro Hernández G. Modelos de análisis semántico de información y conocimiento genético y genómico para el estudio de enfermedades genéticas y cáncer. Proyecto de investigación:. 2021 Jan 14. //

    Touré V. Improving the FAIRness of causal interactions in systems biology. 2020 //

    Touré V, Zobolas J, Kuiper M, Vercruysse S. CausalBuilder: bringing the MI2CAST causal interaction annotation standard to the curator. Database. 2021 Jan 1;2021. DOI: 10.1093/database/baaa107

    Zobolas J, Touré V, Kuiper M, Vercruysse S. UniBioDicts: unified access to biological dictionaries. Bioinformatics. 2021 Jan 1;37(1):143-4. DOI: 10.1093/bioinformatics/btaa1065

    Porras P, Barrera E, Bridge A, Del-Toro N, Cesareni G, Duesbury M, Hermjakob H, Iannuccelli M, Jurisica I, Kotlyar M, Licata L. Towards a unified open access dataset of molecular interactions. Nature communications. 2020 Dec 1;11(1):1-2. DOI: 10.1038/s41467-020-19942-z

    Vera-Ramos JA, Roldán-García M, Hurtado S, Fernández-Breis JT, Schulz S. Deconstructing and recycling ontologies: A Gene Regulation ontology use case. 2021. Pending submission. 

By continuing to use the site, you agree to the use of cookies. more information

The cookie settings on this website are set to "allow cookies" to give you the best browsing experience possible. If you continue to use this website without changing your cookie settings or you click "Accept" below then you are consenting to this.