Genes are sequences of nucleotides in DNA or RNA that carry hereditary information
and play an important role in healthy organisms. Gene mutations or abnormalities in
gene expression may lead to disease.
The disease-gene associations are curated from two databases, DisGeNET
and ClinVar, and together
contain 9533 genes, 11313 diseases and 83693 associations.
Column | Field |
---|---|
1 | Disease id (UMLS CUI) |
2 | Disease name |
3 | The Entrez id of a gene |
4 | The symbol of a gene |
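
The four-column layout above can be read into typed records and summarized with a few lines of Python. This is a minimal sketch, not the dataset's official loader: the tab delimiter, the absence of a header row, and the names `DiseaseGene`, `read_disease_gene` and `summarize` are all assumptions, and the sample rows are placeholders rather than real entries.

```python
import csv
import io
from typing import NamedTuple


class DiseaseGene(NamedTuple):
    disease_id: str    # column 1: UMLS CUI
    disease_name: str  # column 2
    entrez_id: str     # column 3
    gene_symbol: str   # column 4


def read_disease_gene(handle, delimiter="\t"):
    """Parse four-column rows; the tab delimiter is an assumption."""
    reader = csv.reader(handle, delimiter=delimiter)
    # Rows that do not have exactly four fields are skipped.
    return [DiseaseGene(*row) for row in reader if len(row) == 4]


def summarize(records):
    """Count distinct diseases, genes, and disease-gene pairs."""
    return {
        "diseases": len({r.disease_id for r in records}),
        "genes": len({r.entrez_id for r in records}),
        "associations": len({(r.disease_id, r.entrez_id) for r in records}),
    }


# Placeholder rows for illustration only, not taken from the dataset.
sample = io.StringIO(
    "C0000001\tExample disease A\t1001\tGENE1\n"
    "C0000001\tExample disease A\t1002\tGENE2\n"
    "C0000002\tExample disease B\t1001\tGENE1\n"
)
records = read_disease_gene(sample)
stats = summarize(records)
```

Run against the real file (opened with `open(path, newline="")`), `summarize` should reproduce the gene, disease and association counts quoted above if the delimiter assumption holds.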
Long non-coding RNAs (lncRNAs) are a class of non-coding RNAs longer than 200 nucleotides that have a great
impact on transcriptional and post-transcriptional regulation and on chromatin modification. Mutated and
dysfunctional lncRNAs negatively affect gene expression and can even cause disease.
The disease-lncRNA associations are curated from
LncRNADisease v2.0 and comprise 5865 lncRNAs, 426 diseases and 10059 associations.
Column | Field |
---|---|
1 | Disease id (UMLS CUI) |
2 | Disease name |
3 | The name of a lncRNA |
4 | The PubMed id of the related paper |
MicroRNAs (miRNAs) are small non-coding RNA molecules, about 22 nucleotides long, that are widely present in eukaryotes. They play an important role in regulating gene expression, the cell cycle, and the timing of organism development. Many studies support a link between miRNAs and human disease. The disease-miRNA associations are curated from HMDD and miRCancer and comprise 1609 miRNAs, 829 diseases and 40965 associations.
Column | Field |
---|---|
1 | Disease id (UMLS CUI) |
2 | Disease name |
3 | The name of a miRNA |
4 | The PubMed ID of the related paper |
Symptoms are the knowledge about an individual disease gathered by community health professionals and
general practitioners. They are crucial in clinical diagnosis and treatment and represent the
highest-level clinical phenotypes. An abnormal state in the human body often indicates an imbalance
of normal functions and can serve as a sign of disease. In practice, symptoms act
as a bridge for communication between doctors and patients, so they can be considered the most
directly observable characteristics of a disease. In this dataset, there are 6058 diseases and 321 symptoms.
Column | Field |
---|---|
1 | Disease id (UMLS CUI) |
2 | Disease name |
3 | Symptom id (MeSH identifier) |
4 | The name of a symptom |
Single nucleotide polymorphisms (SNPs) are DNA sequence polymorphisms caused by the mutation of a single nucleotide at the genome level. They are the most common form of heritable human variation and account for more than 90% of all known polymorphisms. Non-coding disease-associated SNPs in regulatory regions of the genome may disrupt gene function at the transcriptional or post-transcriptional level. The disease-SNP associations are collected from DisGeNET and from results retrieved with SNPcurator, a tool for exploring relationships between diseases and SNPs. The final set contains 347459 associations, 11154 diseases and 179685 SNPs.
Column | Field |
---|---|
1 | Disease id (UMLS CUI) |
2 | Disease name |
3 | The name of a SNP |
4 | The PubMed id of related paper |
Microbes are small, simple organisms that come in many kinds and live almost everywhere. Some microbes living in a host organism participate in various life activities and play an important role in maintaining the host's health. The disease-microbe associations are curated from the existing literature by text mining. In total, there are 32 human diseases and 289 microbes.
Column | Field |
---|---|
1 | Disease id (UMLS CUI) |
2 | Disease name |
3 | The name of a microbe |
4 | The organs used for microbiome sequencing |
5 | The deregulatory evidence of a microbe |
6 | The PubMed id of related paper |
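
Because this table has six columns rather than four, a dedicated record type helps keep the fields straight. The sketch below parses one line into a record and groups microbes by disease. It assumes tab-separated fields in the column order listed above; `DiseaseMicrobe`, `parse_line` and `microbes_by_disease` are hypothetical names, and the sample lines are placeholders, not real entries.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class DiseaseMicrobe:
    disease_id: str    # column 1: UMLS CUI
    disease_name: str  # column 2
    microbe: str       # column 3
    organ: str         # column 4: organ used for microbiome sequencing
    evidence: str      # column 5: deregulatory evidence
    pubmed_id: str     # column 6


def parse_line(line, delimiter="\t"):
    """Split one line into a record; the tab delimiter is an assumption."""
    fields = line.rstrip("\n").split(delimiter)
    if len(fields) != 6:
        raise ValueError(f"expected 6 fields, got {len(fields)}")
    return DiseaseMicrobe(*fields)


def microbes_by_disease(records):
    """Map each disease id to the set of microbes associated with it."""
    grouped = defaultdict(set)
    for rec in records:
        grouped[rec.disease_id].add(rec.microbe)
    return dict(grouped)


# Placeholder lines for illustration only, not taken from the dataset.
lines = [
    "C0000001\tExample disease A\tMicrobe X\tOrgan 1\tEvidence 1\t00000001",
    "C0000001\tExample disease A\tMicrobe Y\tOrgan 1\tEvidence 2\t00000002",
    "C0000002\tExample disease B\tMicrobe X\tOrgan 2\tEvidence 1\t00000003",
]
microbe_records = [parse_line(line) for line in lines]
grouped = microbes_by_disease(microbe_records)
```

Raising on a wrong field count, rather than skipping the row, makes a mistaken delimiter assumption visible immediately.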
Drugs are substances introduced into the body to correct abnormalities. Information about them is consolidated in specialized databases such as DrugBank and KEGG DRUG. The associations between diseases and drugs are an important resource for disease-related research. The disease-drug associations are curated from CTD and KEGG. In total, there are 10057 human diseases, 17454 drugs and 5708411 associations.
Column | Field |
---|---|
1 | Disease id (UMLS CUI) |
2 | Disease name |
3 | The id of a drug |
4 | The name of a drug |
A biological pathway is a series of interactions among molecules in a cell that leads to a
certain product or a change in the cell. Pathways are involved in molecular signaling and metabolic processes.
The disruption, loss or breakdown of a pathway often leads to disorders and
even disease in organisms.
The disease-pathway associations contain 10150 diseases, 2973 pathways and 1897916 associations.
Column | Field |
---|---|
1 | Disease id (UMLS CUI) |
2 | Disease name |
3 | The pathway id (KEGG, Reactome or WikiPathways) |
4 | The name of a pathway |
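
Since column 3 mixes identifiers from three sources, it can be useful to tell them apart. The sketch below guesses the source from the identifier's shape, using the conventional formats of each database: Reactome stable ids such as `R-HSA-109582`, WikiPathways ids such as `WP254`, and KEGG pathway ids such as `hsa04110`. Whether the ids in this dataset follow exactly these formats is an assumption, and `pathway_source` is a hypothetical helper.

```python
import re

# Conventional identifier shapes; the dataset's actual formatting is assumed.
_PATTERNS = [
    ("Reactome", re.compile(r"^R-[A-Z]{3}-\d+(\.\d+)?$")),
    ("WikiPathways", re.compile(r"^WP\d+$")),
    ("KEGG", re.compile(r"^[a-z]{2,4}\d{5}$")),
]


def pathway_source(pathway_id):
    """Return the likely source database, or "unknown" if no pattern fits."""
    candidate = pathway_id.strip()
    for name, pattern in _PATTERNS:
        if pattern.match(candidate):
            return name
    return "unknown"
```

For example, `pathway_source("hsa04110")` returns `"KEGG"`, while an id that matches none of the patterns returns `"unknown"` instead of being assigned to a database by guesswork.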