Skip to content

Finding and using Disease and Phenotype Ontologies

Warning

These materials are under construction and incomplete.

Prerequisites

  • None

Preparation

What is delivered as part of the course

Description: An introduction to the landscape of disease and phenotype terminologies and ontologies, and how they can be used to add value to your analysis.

Learning objectives

Tutorials

Additional materials and resources

Contributors

Major disease and phenotype ontologies that are available

A landscape analysis of major disease and phenotype ontologies that are currently available is here.

Decide which phenotype or disease ontology to use for different use cases

Different ontologies are build for different purposes and were created for various reasons. For example, some ontologies are built for text mining purposes, some are built for annotating data and downstream computational analysis.

The unified phenotype ontology (uPheno) aggregates species-specific phenotype ontologies into a unified resource. Several species-specific phenotype ontologies exist, such as the Human Phenotype Ontology, Mammalian Phenotype Ontology (http://www.informatics.jax.org/searches/MP_form.shtml) and many more.

Similarly to the phenotype ontologies, there are many disease ontologies that exist that are specific to certain areas of diseases, such as infectious diseases (e.g. Infectious Disease Ontology), cancer (e.g. National Cancer Institute Thesaurus), rare diseases (e.g. Orphanet), etc.

In addition, there are several more general disease ontologies, such as the Mondo Disease Ontology, the Human Disease Ontology (DO), SNOMED, etc.

Different disease ontologies may be built for different purposes; for example, ontologies like Mondo and DO are intended to be used for classifying data, and downstream computational analyses. Some terminologies are used for indexing purposes, such as International classification of Diseases (ICD). ICD-11 is intended for indexing medical encounters for the purposes of billing and coding. Some of the disease ontologies listed on the landscape contain terms that define diseases, such as Ontology for General Medical Sciences (OGMS) are upper level ontologies and are intended for integration with other ontologies.

When deciding on which phenotype or disease ontology to use, some things to consider:

  • Do you need a more specific ontology, such as a species-specific ontology, or do you need a more general ontology that is cross-species or covers more aspects of diseases?
  • Is the ontology open and free to use?
  • Does the description of the ontology describe it's intended use? For example, some ontologies are built for text mining purposes, some are built for annotating data and downstream computational analysis.
  • Is the ontology actively maintained?
  • Does the ontology contain the terms you need? If not, is there a mechanism to request changes and new terms and are the ontology developers responsive to change requests on their tracker?
  • Is the ontology widely used by the community? You can check things like active contributors on GitHub, usages described on the OBO Foundry page (for example http://obofoundry.org/ontology/mondo.html), published papers and citations.

Understand how to leverage disease and phenotype ontologies for advanced data analytics

How to integrate other data