Networks used

Warning

Gene IDs have to be consistent between input data (target genes, GMT and networks)
When data are retrieved by queries, HGNC IDs are used.

In this section, we present networks used in the use-cases.

We propose to apply the random walk with restart (RWR) approach on two different multilayers.

multilayerCompo — Fig. 18 : Multilayers composition: On the left, multilayer is composed of genes multilayer network and pathways/processes of interest network (disconnected network). On the right, the multilayer is composed of genes multiplayer network and disease-disease similarity network.

Multilayers are composed of:

genes multilayer network + pathways/processes of interest network (Fig. 18 - left part)
genes multilayer network + disease-disease similarity network (Fig. 18 - right part)

Genes multilayer network

The genes multilayer network is composed of three different networks. Nodes are identical between networks, but the relationships between them are coming from different sources. Networks are downloaded from the Network Data Exchange (NDEx) [1] (see Network downloading page).

Protein-Protein Interaction (PPI) network

The Protein-Protein Interaction (PPI) network is obtained from fusion of three datasets : Hi-Union and Lit-BM [2] and APID [3]. It’s composed of:

15,390 nodes
131,087 edges
UUID: bfac0486-cefe-11ed-a79c-005056ae23aa

This network is also used for the active module identification approach.

Molecular complexes network

Molecular complexes network is constructed from the fusion of Hu.map [4] and Corum [5]. It’s composed of:

8,497 nodes
62,073 edges
UUID: 419ae651-cf05-11ed-a79c-005056ae23aa

Reactome pathways network

The Reactome pathways network was build using data derived from Reactome protein-protein interaction data [6]. It’s composed of:

4,598 nodes
19,292 edges
UUID: b13e9620-cefd-11ed-a79c-005056ae23aa

Use-case command lines

odamnet networkDownloading  --netUUID bfac0486-cefe-11ed-a79c-005056ae23aa \
                            --networkFile useCases/InputData/multiplex/1/PPI_HiUnion_LitBM_APID_gene_names_190123.gr \
                            --simple True

odamnet networkDownloading  --netUUID 419ae651-cf05-11ed-a79c-005056ae23aa \
                            --networkFile useCases/InputData/multiplex/1/Complexes_gene_names_190123.gr \
                            --simple True

odamnet networkDownloading  --netUUID b13e9620-cefd-11ed-a79c-005056ae23aa \
                            --networkFile useCases/InputData/multiplex/1/Pathways_reactome_gene_names_190123.gr \
                            --simple True

Pathways/processes of interest network

In the use-case 1, we are using data retrieved from databases. So, we created a rare disease pathways network with data retrieved from WikiPathways [7]. The network is composed of 104 nodes and the bipartite contains 4,612 interactions between genes and rare disease pathways.

In the use-case 2, we are provided data from a previous study [8]. We created a disconnected network with pathways and processes related to Congenital Anomalies of the Kidney and Urinary Tract (CAKUT) as nodes. The network is composed of 13 nodes and the bipartite network contains 1,655 interactions between genes and pathways and processes related to CAKUT.

To know how to create these two networks, see the Network creation page.

Disease-disease similarity network

Disease-disease similarity network creation

Data were downloaded on June 11th, 2022

Annotations and ontologies are coming from HPO

We constructed a disease-disease network based on the phenotype similarity between diseases. A disease is defined as a set of phenotypes and each phenotype is associated to the Human Ontology Project IDs (HPO) [9].

The similarity score is calculated based on the number of shared phenotypes between two diseases ([10], [11], [12]). Every pairs of diseases have a similarity score. To create the network, we selected the 5 most similar diseases fo each disease.

The disease-disease similarity network contains 33,925 edges and 8,264 diseases.

Gene-disease bipartite

Data were downloaded on September 27th, 2022

Associations between genes and diseases file is coming from HPO

The molecular multilayer network is connected to the disease-disease similarity network with the gene-disease bipartite. The bipartite contains 6,564 associations (4,483 genes and 5,878 diseases).