Pharmosome: an integrative and collective database for exploration and analysis of single nucleotide polymorphisms associated with disease

Peter T. Habib; Alsamman M. Alsamman; Sameh E. Hassanein; Kerolos M. Yousef; Aladdin Hamwieh

doi:10.12688/f1000research.21773.1

Software Tool Article

[version 1; peer review: 2 approved with reservations]

Peter T. Habib¹,², Alsamman M. Alsamman³, Sameh E. Hassanein¹,⁴, Kerolos M. Yousef¹, Aladdin Hamwieh²

PUBLISHED 10 Jan 2020

Author details

¹ Department of Bioinformatics and Functional Genomics, College of Biotechnology, Misr University for Science and Technology, Giza, Egypt
² Department of Biodiversity and Crop Improvement Program, International Center for Agricultural Research in the Dry Areas, Cairo, Egypt
³ Department of Genome Mapping, Molecular Genetics and Genome Mapping Laboratory, Agricultural Genetic Engineering Research Institute, Giza, Egypt
⁴ Department of Bioinformatics & Computer Networks, AGERI, Agricultural Research Center, Giza, Egypt

Peter T. Habib
Roles: Conceptualization, Investigation, Software, Validation, Visualization, Writing – Original Draft Preparation, Writing – Review & Editing

Alsamman M. Alsamman
Roles: Validation, Visualization, Writing – Original Draft Preparation, Writing – Review & Editing

Sameh E. Hassanein
Roles: Conceptualization, Project Administration, Supervision, Writing – Original Draft Preparation, Writing – Review & Editing

Kerolos M. Yousef
Roles: Validation, Writing – Review & Editing

Aladdin Hamwieh
Roles: Funding Acquisition, Supervision

Abstract

Current single nucleotide polymorphism (SNP) databases are limited to a narrow set of SNPs, which has led to a lack of interactivity between different databases, limited tools to analyze and manipulate the already existing data, and complexity in the graphical user interface. Here we introduce Pharmosome, a web-based, user-friendly and collective database for more than 30,000 human disease-related SNPs, with dynamic pipelines to explore SNPs associated with disease development, drug response and the pathways shared between different genes related to these SNPs. Pharmosome implements several tools to design primers to detect SNPs in large genomes and facilitates analysis of different SNPs to determine relationships between them by aligning sequences, constructing phylogenetic trees, and providing consensus sequences illustrating the connections between SNPs. Pharmosome was written in the Python programming language using the Django web framework in combination with HTML, CSS, and JavaScript to receive user inputs, and process and export the sorted result to the interface. Pharmosome is available from: https://pharmosome.herokuapp.com/.

Keywords

SNP, Disease, Python, Bioinformatics

Corresponding author: Peter T. Habib

Competing interests: No competing interests were disclosed.

Grant information: The author(s) declared that no grants were involved in supporting this work.

Copyright: © 2020 Habib PT et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

How to cite: Habib PT, Alsamman AM, Hassanein SE et al. Pharmosome: an integrative and collective database for exploration and analysis of single nucleotide polymorphisms associated with disease [version 1; peer review: 2 approved with reservations]. F1000Research 2020, 9:14 (https://doi.org/10.12688/f1000research.21773.1) First published: 10 Jan 2020, 9:14 (https://doi.org/10.12688/f1000research.21773.1) Latest published: 10 Jan 2020, 9:14 (https://doi.org/10.12688/f1000research.21773.1)

Introduction

With the expansion and deciphering of genomic data produced by sequencing technologies and the Human Genome Project¹, much has been learned about genomic features such as exons, introns, domains, and coding and non-coding sequences. Mutations have attracted tremendous attention because of their crucial impact on gene expression, in particular single nucleotide polymorphisms (SNPs)², which also act as molecular markers for associated traits. Additionally, the presence of a specific SNP can indicate which disease is likely to develop and which drugs may treat it, which is considered the ultimate goal of pharmacogenomics.

Pharmacogenetics, or its more inclusive version pharmacogenomics (which covers proteomic, genomic, epigenomic and transcriptomic effects on disease and drug response), has produced a vast body of research since 1997³, owing to its key importance in personalized medicine: it investigates how far genetic variations (e.g. SNPs) are involved in disease development and helps determine drug targets, so that the safety and efficacy of individualized drug therapy can be improved. Since the relationships between patients' molecular data and their disease phenotypes are complex and difficult to determine manually, scientists have begun to enrich the bioinformatics knowledge base with more sophisticated and accurate molecular tools to detect genetic variations. For example, SNPector is a recent tool developed by the authors to detect SNP effects on drug response and disease development⁴. Such tools allow interpretation of how these tiny variations may cause direct errors; for example, an X-SNP is a common SNP type that gives rise to premature termination codons that halt gene expression⁵. Some SNP types that alter the protein production process cause disease⁶, while regulatory SNPs⁷ may disrupt pathways, causing cascading errors that lead to the collapse of a whole pathway and thereby to disease development.

Knowledge about SNPs plays a central role in providing recommendations for practicing physicians. In addition, a wide range of research fields have arisen out of pharmacogenomics, such as vaccinomics, which studies aberrant immune responses to vaccines based on genetic makeup. For example, a specific SNP in the TLR3 gene was found to be responsible for the reduction of humoral immune responses and cell-mediated immunity to the measles vaccine⁸. Nutrigenomics is the science of gene–nutrient interactions, which involves research methods and clinical implementation to detect and treat nutrient-related diseases. One of the best known examples of nutrigenomics is lactase persistence, in which the gene encoding lactase is expressed past weaning. Lactase persistence in Europeans is caused by a polymorphism called “C-13910T” in the lactase phlorizin hydrolase gene promoter⁹, and the lack of C-13910T in many adults can lead to severe gastrointestinal discomfort and diarrhea after ingesting milk, due to the inability to metabolize lactose¹⁰.

Here we introduce Pharmosome, a web-based, user-friendly and collective database for more than 30,000 human disease-related SNPs, with dynamic pipelines to explore SNPs associated with disease development, drug response and the pathways shared between different genes related to these SNPs. Pharmosome implements several tools to design primers to detect SNPs in large genomes and facilitates analysis of different SNPs to determine relationships between them by aligning sequences, constructing phylogenetic trees, and providing consensus sequences illustrating the connections between SNPs.

Methods

Implementation

We collected SNP-related data (e.g. SNP ID, annotation, pathway and phenotypes) from PharmGKB¹¹, NCBI¹²,¹³, Ensembl¹⁴, DiseaseEnhancer¹⁵, GeneCards¹⁶ and Reactome¹⁷ in tab-delimited format in order to link the information available across these sources. The collected data were categorized into four main sub-databases: SNP, Gene, Chemical and Disease. The Python 3+ programming language was used to read, select, filter and sort the data and to link the Python scripts with the HTML and JavaScript code.
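The categorization step can be pictured with a minimal sketch. It assumes a tab-delimited file with `type`, `id` and `source` columns; these column names, the sample rows and the function name `load_sub_databases` are hypothetical and are not taken from the Pharmosome source code.

```python
import csv
import io
from collections import defaultdict

# The four sub-databases named in the text.
SUB_DATABASES = ("SNP", "Gene", "Chemical", "Disease")


def load_sub_databases(handle):
    """Group tab-delimited rows by their 'type' column (hypothetical schema)."""
    grouped = defaultdict(dict)
    for row in csv.DictReader(handle, delimiter="\t"):
        # Rows whose type is not one of the four sub-databases are skipped.
        if row["type"] in SUB_DATABASES:
            grouped[row["type"]][row["id"]] = row
    return grouped


# Illustrative rows only; the pathway ID is a made-up placeholder.
sample = (
    "type\tid\tsource\n"
    "SNP\trs75527207\tPharmGKB\n"
    "Gene\tCFTR\tNCBI\n"
    "Chemical\tIvacaftor\tPharmGKB\n"
    "Pathway\tPLACEHOLDER-PATHWAY\tReactome\n"
)
databases = load_sub_databases(io.StringIO(sample))
print(sorted(databases))
```

Keying each sub-database by ID keeps later lookups by SNP ID, gene symbol or chemical name to a single dictionary access.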

Data collection. About 50% of the data was downloaded from PharmGKB, which is considered the most common database for SNP annotation. These data comprise the associated phenotypes and the clinical perspectives. Because of storage space limitations, the remaining data are imported on demand by a set of Python functions that we built to access the Application Programming Interfaces (APIs) of the other databases: GeneCards, DiseaseEnhancer, Ensembl and Reactome. A user can thereby retrieve specific data using preset IDs, and this information is then exported to the HTML interface. The number of data entries collected by Pharmosome is shown in Table 1.
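As an illustration of on-demand retrieval, the sketch below builds a request to the public Ensembl REST `variation` endpoint and reduces the JSON reply to a few display fields. It is not the Pharmosome implementation: the function names are hypothetical, and the field names read in `summarise` (`name`, `mappings`, `seq_region_name`, `start`, `allele_string`) are an assumption about the response layout that should be checked against the Ensembl documentation.

```python
import json
import urllib.request

ENSEMBL_REST = "https://rest.ensembl.org"


def variation_url(rs_id, species="human"):
    """Build the request URL for one SNP ID."""
    return f"{ENSEMBL_REST}/variation/{species}/{rs_id}?content-type=application/json"


def fetch_variation(rs_id, timeout=30):
    """Download one variation record on demand (requires network access)."""
    with urllib.request.urlopen(variation_url(rs_id), timeout=timeout) as response:
        return json.load(response)


def summarise(record):
    """Keep only the fields an interface page might display (assumed layout)."""
    mapping = (record.get("mappings") or [{}])[0]
    return {
        "id": record.get("name"),
        "chromosome": mapping.get("seq_region_name"),
        "position": mapping.get("start"),
        "alleles": mapping.get("allele_string"),
    }


# Made-up record used to exercise summarise() without a network call.
example = {
    "name": "rs_example",
    "mappings": [{"seq_region_name": "7", "start": 100, "allele_string": "G/A"}],
}
print(summarise(example))
```

Separating the download from the summarising step means the parsing logic can be exercised offline, while `fetch_variation` is only called when a user actually requests a record.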

Table 1. Number of data entries collected from each database used by Pharmosome.

Type of data	NCBI	Ensembl	Reactome	PharmGKB	DiseaseEnhancer	GeneCards	KEGG	DrugBank
Gene	18,975	0	0	27,000	1,059	13,500	20,109	0
Transcript	0	381,060	0	0	0	0	0	0
Pathway	0	0	2,256	0	0	0	0	0
Disease	0	0	0	3,546	1,059	0	2,287	0
Chemical	0	0	0	3,393	0	0	0	11,926
SNP	440,000,000	0	0	10,780	0	0	0	0
Annotation	10,845	0	2,256	3,393	0	0	20,109	0

Data fetching and exporting. We constructed a Python module containing 12 functions. These functions connect in different ways to import the data from the databases above, either from tab-delimited files or through APIs. User-requested information is extracted and exported to the Pharmosome web interface. The Django web framework was used to build the Pharmosome web interface together with HTML. We used Django to build functions that receive the user input from the Pharmosome interface, process requests using a Python script, and finally export the result to the interface:

return render(request, 'WebInterface.html', context={})
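The `render` call above is the last step of a Django view: it passes a context dictionary to an HTML template. The framework-free sketch below shows the kind of processing that could fill that dictionary before it is handed to `render`. The lookup table, its single illustrative entry and the function name `build_context` are assumptions and do not reproduce the Pharmosome code.

```python
# Illustrative in-memory lookup table; a real deployment would query the
# sub-databases or an external API instead.
SNP_TABLE = {
    "rs75527207": {"gene": "CFTR", "chemical": "Ivacaftor"},
}


def build_context(query, table=SNP_TABLE):
    """Turn raw user input into the dictionary a template would receive."""
    snp_id = query.strip()  # tolerate stray whitespace from the form field
    record = table.get(snp_id)
    if record is None:
        return {"snp_id": snp_id, "found": False, "record": {}}
    return {"snp_id": snp_id, "found": True, "record": record}


print(build_context(" rs75527207 "))
print(build_context("rs_unknown"))
```

In a Django view, the returned dictionary would be supplied as the `context` argument, for example `render(request, 'WebInterface.html', context=build_context(user_input))`, where `user_input` stands for the submitted form value.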

Operation

Google Chrome is the recommended browser for the Pharmosome web interface (https://pharmosome.herokuapp.com/), but other internet browsers can also be used.

Functions of Pharmosome

SNP sub-database. In the SNP sub-database, users can enter the ID of a SNP (e.g. rs141033578) and receive output data about its related gene, chromosomal location, gene bands, a summary of the normal function of the gene, the gene part responsible for enhancing the disease occurrence, the pathway of the defective gene (if available), gene transcripts and different splicing variants. Pharmosome also provides information that can be used to retrieve data from recent studies (specifically, the SNP reference and alternative nucleotides and an explanation of how the SNP contributes to disease development and drug response). For example, an SNP in GSTP1 that replaces isoleucine with valine at amino acid position 105 of the protein, a change known to substantially diminish enzyme activity, was associated with overall survival in 107 patients with metastatic colorectal cancer who received 5-FU/oxaliplatin combination chemotherapy¹⁸.

Disease sub-database. The Disease sub-database allows users to search for disease data collected from the KEGG Disease and PharmGKB databases by entering the disease name. The search result is a list of genes responsible for the disease, the ID of the SNP occurring in each gene, a description of clinical annotations related to the gene and the gene-specific chemical used in the treatment. The sub-database can be used, for example, to identify the SNP, gene and chemical related to coronary artery disease caused by a high frequency of a particular polymorphism in the PLA2 gene. This gene encodes glycoprotein IIIa, which is associated with a high prevalence of premature myocardial infarction¹⁹,²⁰.

Chemical sub-database. The Chemical sub-database has data from four sources: KEGG, PharmGKB, DrugBank and ChemSpider. After users enter a chemical name, the output is a list of the chemical name, trade and generic names, structure, description and pharmacodynamics. For example, by inputting bortezomib (a proteasome inhibitor), users receive output data relating to the clinical success of bortezomib, which established the ubiquitin (Ub)-proteasome system as a key therapeutic target in multiple myeloma²¹,²².

Gene sub-database. The Gene sub-database links different sources (NCBI GenBank, Ensembl, Reactome and DiseaseEnhancer). In particular, the DiseaseEnhancer dataset represents a new approach that determines the gene part that is responsible for enhancing occurrence of disease. NCBI provides information about gene name, location, a summary of gene function and chromosomal location. Reactome displays the pathway in which genes are involved and gives an overview description. The Gene sub-database can be used to identify the pathway, splicing variants, disease-enhancing region, and genomic and proteomic expression profile, or simply to get general information. An example of this would be looking at the SLCO1B1 gene, for which various groups tested the hypothesis that polymorphisms in SLCO1B1 affect the pharmacokinetics and effects of drugs in humans²³.

SNP collector. The SNP collector is a tool within Pharmosome that is designed to find all SNPs present on a gene, related to a disease, or associated with a chemical compound. Users can choose between different options to collect SNPs together with their clinical annotations and the chemicals related to each SNP. For example, it could be used to find the SNP at position 118 of the mu opioid receptor gene, for which the variant receptor was reported to bind β-endorphin about three times more tightly than the wild type. Other regulatory SNPs in this region of the gene can be linked to other phenotypes²⁴.
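The collecting step amounts to filtering annotation records by whichever option the user chose. A minimal sketch follows, assuming each record is a dictionary with `snp`, `gene`, `chromosome`, `disease` and `chemical` keys; the record layout, the placeholder values and the function name `collect_snps` are hypothetical.

```python
# Placeholder records; the IDs and names are invented for illustration.
RECORDS = [
    {"snp": "rs_a", "gene": "GENE1", "chromosome": "1", "disease": "disease x", "chemical": "drug p"},
    {"snp": "rs_b", "gene": "GENE1", "chromosome": "1", "disease": "disease y", "chemical": "drug q"},
    {"snp": "rs_c", "gene": "GENE2", "chromosome": "7", "disease": "disease x", "chemical": "drug p"},
]


def collect_snps(records, **criteria):
    """Return SNP IDs whose fields match every given criterion (case-insensitive)."""
    wanted = {key: str(value).lower() for key, value in criteria.items() if value}
    return [
        record["snp"]
        for record in records
        if all(str(record.get(key, "")).lower() == value for key, value in wanted.items())
    ]


print(collect_snps(RECORDS, gene="gene1"))
print(collect_snps(RECORDS, disease="Disease X", chemical="drug p"))
```

Passing several criteria at once narrows the result to records that satisfy all of them, which is one way a combined search could be supported.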

Pick primer. Detecting an SNP in a DNA sample taken from a patient depends on designing an appropriate primer. Primers should be compatible with the flanking sequences of the SNP. As the presence of an SNP in the genome may result in disease and affect the choice of drug, there is a need to detect the presence of SNPs, e.g. for early disease diagnosis. The pick primer tool within Pharmosome designs primers to detect an SNP in the genome by retrieving the SNP sequence record from the NCBI database, locating the SNP position and designing primers 50 bases before, after and within the SNP sequence.
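The flank-based idea can be sketched as follows. This is a simplified illustration rather than the Pharmosome algorithm: it takes the outermost bases of each 50-base flank as candidate primers and ignores melting temperature, GC content and specificity, which a practical primer design would also need to check. The function names and the `primer_len` parameter are assumptions.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of an uppercase DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def pick_primers(sequence, snp_index, flank=50, primer_len=20):
    """Pick a forward and a reverse primer from the flanks around a SNP.

    snp_index is the 0-based position of the SNP within sequence.
    """
    if snp_index < flank or snp_index + flank >= len(sequence):
        raise ValueError("not enough flanking sequence around the SNP")
    upstream = sequence[snp_index - flank:snp_index]
    downstream = sequence[snp_index + 1:snp_index + 1 + flank]
    forward = upstream[:primer_len]  # start of the upstream flank
    reverse = reverse_complement(downstream[-primer_len:])  # end of the downstream flank
    return forward, reverse


# Synthetic sequence: 50 bases, the SNP position, then 50 more bases.
sequence = "A" * 50 + "G" + "C" * 50
forward, reverse = pick_primers(sequence, snp_index=50)
print(forward, reverse)
```

With this layout the amplified region spans both flanks, so the SNP sits in the middle of the product.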

SNP phylogeny. As discussed in previous sections, there is always some relationship between different SNPs due to the complicated interaction network between different genes. In order to determine how these SNPs are related to each other, the SNP phylogeny tool constructs a phylogenetic tree that illustrates the relationships between different SNPs²⁵ by downloading the SNP and flanking sequences and performing multiple sequence alignment to determine how closely each sequence is related to the others. This function could be used to clarify connections in studies such as that of Thompson et al., which showed an association of 43 SNPs in 16 genes with atorvastatin response²⁶.
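To make the notion of relatedness concrete, the sketch below computes pairwise distances between sequences that are assumed to be already aligned to the same length, and reports the closest pair, which is the first pair a distance-based tree method would join. The proportion-of-differences distance, the toy sequences and the function names are assumptions; the text does not state which alignment or tree-building method Pharmosome uses.

```python
from itertools import combinations


def p_distance(a, b):
    """Proportion of aligned positions at which two sequences differ."""
    if len(a) != len(b) or not a:
        raise ValueError("sequences must be non-empty and aligned to the same length")
    return sum(x != y for x, y in zip(a, b)) / len(a)


def distance_table(aligned):
    """Map each unordered pair of names to its p-distance."""
    return {
        (m, n): p_distance(aligned[m], aligned[n])
        for m, n in combinations(sorted(aligned), 2)
    }


def closest_pair(aligned):
    """Return the pair of names with the smallest distance."""
    table = distance_table(aligned)
    return min(table, key=table.get)


# Toy aligned flanking sequences (invented for illustration).
aligned = {"snp_a": "ACGTACGT", "snp_b": "ACGTACGA", "snp_c": "TTGTACCA"}
print(distance_table(aligned))
print(closest_pair(aligned))
```

A full tree would repeat this joining step on an updated distance table until all sequences are connected.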

Workflow

Figure 1 shows the flow of information to meet the needs of users.

Figure 1. Pharmosome workflow illustrating the processing in the background of the database and the relationships and links between different interface webpages.

Use case

Pharmosome deploys seven sub-databases and tools. Our aim in building Pharmosome was to make it as easy to use as possible. We designed each tool and sub-database to receive the user input with the minimum required parameters (as shown in Table 2). Users enter the target input they require and the Pharmosome interface automatically redirects to another page that shows the output results. Figure 2 to Figure 5 show output on the Pharmosome web interface.

Table 2. Use cases for Pharmosome detailing the input required in the web interface and a summary of the output of Pharmosome.

Function	Input required	Input example	Output description
SNP sub-database	SNP ID	rs75527207	SNP related disease and drug response annotation
Disease sub-database	Disease Name	Heart Failure	Gene involved in disease, chemical used in treatment, and SNP causing the disease (Figure 2)
Chemical sub-database	Chemical Name	Ivacaftor	Description of target disease (Figure 3)
Gene sub-database	Gene Symbol	CFTR	Gene annotation (Figure 4)
SNP collector	Gene, Chromosome, Disease, or Chemical	CFTR, 11, Heart Failure, or Ivacaftor	List of SNPs (Figure 5)
Pick primer	SNP ID	rs75527207	Forward and reverse primer
SNP phylogeny	SNP List	rs113993960 rs2853741 rs1045642 rs3857532 rs10817464 rs16969968	Illustration shows the phylogenetic tree of the input SNPs

Figure 2. Disease sub-database output on Pharmosome web interface after input of “Heart failure”.

The results consist of a list of drop-down menus. Each menu describes the disease annotation and the gene associated with the disease.

Figure 3. Chemical sub-database output on Pharmosome web interface after input of “Ivacaftor”.

The results consist of a list of drop-down menus. Each menu describes Drug/Chemical annotations.

Figure 4. Gene sub-database output on Pharmosome web interface after input of “CFTR”.

The results consist of a navigation bar, and each button expands to a different annotation.

Figure 5. SNP collector output on Pharmosome web interface after input of “CFTR”.

The results consist of a list of SNPs located on this gene.

Summary

In this study, we introduce Pharmosome, an integrative and collective database for exploring and analysing human SNPs and the associated disease and drug response. Our tool deploys various functions to determine the relationships between different SNPs, construct the consensus sequence between different SNPs and determine the pathways shared between different genes. Pharmosome also includes sub-databases to simplify, link and display data about gene functions, pathways, transcriptomes of genes, different splicing variants, clinical annotation, chemical structures and annotations of chemicals involved in the disease. The returned data are informative, user-friendly and easy to navigate. Pharmosome was written in Python 3.5, HTML and CSS, with Django (a Python web framework) used to link the Python scripts with the other languages.

Software availability

Pharmosome web interface: https://pharmosome.herokuapp.com/

Source code available from: https://github.com/peterhabib/Pharmosome_Web

Archived source code as at time of publication: http://doi.org/10.5281/zenodo.3583191²⁷.

License: MIT

References

1. Lander ES, Linton LM, Birren B, et al.: Initial sequencing and analysis of the human genome. Nature. 2001; 409(6822): 860–921.
2. Vignal A, Milan D, SanCristobal M, et al.: A review on SNP and other types of molecular markers and their use in animal genetics. Genet Sel Evol. 2002; 34(3): 275–305.
3. Gurwitz D, Pirmohamed M: Pharmacogenomics: the importance of accurate phenotypes. Pharmacogenomics. 2010; 11(4): 469–70.
4. Habib PT, Alsamman AM, Hassanein SE, et al.: SNPector: SNP inspection tool for diagnosing gene pathogenicity and drug response in a naked sequence [version 1; peer review: awaiting peer review]. F1000Res. 2019; 8: 2133.
5. Savas S, Tuzmen S, Ozcelik H: Human SNPs resulting in premature stop codons and protein truncation. Hum Genomics. 2006; 2(5): 274–86.
6. Bond J, Scott S, Hampshire DJ, et al.: Protein-truncating mutations in ASPM cause variable reduction in brain size. Am J Hum Genet. 2003; 73(5): 1170–7.
7. De Gobbi M, Viprakasit V, Hughes JR, et al.: A regulatory SNP causes a human genetic disease by creating a new transcriptional promoter. Science. 2006; 312(5777): 1215–7.
8. Dhiman N, Poland GA, Cunningham JM, et al.: Variations in measles vaccine-specific humoral immunity by polymorphisms in SLAM and CD46 measles virus receptors. J Allergy Clin Immunol. 2007; 120(3): 666–72.
9. Enattah NS, Sahi T, Savilahti E, et al.: Identification of a variant associated with adult-type hypolactasia. Nat Genet. 2002; 30(2): 233–7.
10. Kaput J, Rodriguez RL: Nutritional genomics: the next frontier in the postgenomic era. Physiol Genomics. 2004; 16(2): 166–77.
11. Hewett M, Oliver DE, Rubin DL, et al.: PharmGKB: the Pharmacogenetics Knowledge Base. Nucleic Acids Res. 2002; 30(1): 163–5.
12. Gene. Bethesda (MD): National Library of Medicine (US), National Center for Biotechnology Information; 2004; Accessed 2019-02-16.
13. dbSNP. Bethesda (MD): National Library of Medicine (US), National Center for Biotechnology Information; 1998; Accessed 2019-02-16.
14. Kersey PJ, Allen JE, Allot A, et al.: Ensembl Genomes 2018: an integrated omics infrastructure for non-vertebrate species. Nucleic Acids Res. 2018; 46(D1): D802–8.
15. Zhang G, Shi J, Zhu S, et al.: DiseaseEnhancer: a resource of human disease-associated enhancer catalog. Nucleic Acids Res. 2018; 46(D1): D78–84.
16. Safran M, Dalah I, Alexander J, et al.: GeneCards Version 3: the human gene integrator. Database (Oxford). 2010; 2010: baq020.
17. Fabregat A, Sidiropoulos K, Viteri G, et al.: Reactome diagram viewer: data structures and strategies to boost performance. Bioinformatics. 2018; 34(7): 1208–14.
18. Stoehlmacher J, Park DJ, Zhang W, et al.: Association between glutathione S-transferase P1, T1, and M1 genetic polymorphism and survival of patients with metastatic colorectal cancer. J Natl Cancer Inst. 2002; 94(12): 936–42.
19. Weiss EJ, Bray PF, Tayback M, et al.: A polymorphism of a platelet glycoprotein receptor as an inherited risk factor for coronary thrombosis. N Engl J Med. 1996; 334(17): 1090–4.
20. Walter DH, Hink U, Asahara T, et al.: The in vivo bioactivity of vascular endothelial growth factor/vascular permeability factor is independent of N-linked glycosylation. Lab Invest. 1996; 74(2): 546–56.
21. Richardson PG, Mitsiades CS, Hideshima T, et al.: Novel biological therapies for the treatment of multiple myeloma. Best Pract Res Clin Haematol. 2005; 18(4): 619–34.
22. Richardson PG, Schlossman RL, Alsina M, et al.: PANORAMA 2: panobinostat in combination with bortezomib and dexamethasone in patients with relapsed and bortezomib-refractory myeloma. Blood. 2013; 122(14): 2331–7.
23. König J, Seithel A, Gradhand U, et al.: Pharmacogenomics of human OATP transporters. Naunyn Schmiedebergs Arch Pharmacol. 2006; 372(6): 432–43.
24. Bond C, LaForge KS, Tian M, et al.: Single-nucleotide polymorphism in the human mu opioid receptor gene alters beta-endorphin binding and activity: possible implications for opiate addiction. Proc Natl Acad Sci U S A. 1998; 95(16): 9608–13.
25. Habib PT, Alsamman AM, Hamwieh A: BioAnalyzer: Bioinformatic software of routinely used tools for analysis of genomic data. Biotechnology. 2019; 10: 33–41.
26. Thompson JF, Man M, Johnson KJ, et al.: An association study of 43 SNPs in 16 candidate genes with atorvastatin response. Pharmacogenomics J. 2005; 5(6): 352–8.
27. Peter: Pharmosome: An Integrative and Collective Database for Explorations and Analysis of SNP Associated with Disease. Zenodo. 2019. http://www.doi.org/10.5281/zenodo.3583191

Open Peer Review

Reviewer Report 18 Nov 2021

Mulin Jun Li, Department of Epidemiology and Biostatistics, National Clinical Research Center for Cancer, Tianjin Medical University Cancer Institute and Hospital, Tianjin, 300070, China

Approved with Reservations

https://doi.org/10.5256/f1000research.24001.r98627

In this manuscript, the authors developed Pharmosome, a web-based, user-friendly and collective annotation database for exploring and analyzing SNPs associated with disease development, drug response and the pathways shared between different genes related to these SNPs. Pharmosome is a friendly tool to query pathogenic SNP and the relationship of disease or drug.

However, this study needs some more evaluations and validations:

Major:

In the disease sub-database, Pharmosome should provide the effect of drugs on the disease and clinical/preclinical/experimental levels of evidence.
In the primer design sub-database, Pharmosome should provide specific scores to indicate the quality of primers and the feasibility of subsequent experiments.
The authors should compare Pharmosome with similar databases such as PharmGKB, PreMedKB, and CIVIC to show its specific focus and superiority.
It seems that the authors restrict the input SNPs to those located in gene regions, but the majority of disease-associated SNPs lie in non-coding regions according to published GWAS and drug-response studies; non-coding SNPs should therefore not be excluded.

Minor:

The versions of all collected resources should be provided.
There are some errors/bugs in the website; for example, the hyperlink from an SNP on the disease page to the SNP page cannot be accessed by users. Please fix them accordingly.

Is the rationale for developing the new software tool clearly explained?
Yes

Is the description of the software tool technically sound?
Yes

Are sufficient details of the code, methods and analysis (if applicable) provided to allow replication of the software development and its use by others?
Yes

Is sufficient information provided to allow interpretation of the expected output datasets and any results generated using the tool?
No

Are the conclusions about the tool and its performance adequately supported by the findings presented in the article?
No

Competing Interests: No competing interests were disclosed.

Reviewer Expertise: Bioinformatics

I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard, however I have significant reservations, as outlined above.

Reviewer Report 19 Aug 2021

Fuyi Li, Department of Microbiology and Immunology, Peter Doherty Institute for Infection and Immunity, University of Melbourne, Melbourne, Vic, Australia

Approved with Reservations

https://doi.org/10.5256/f1000research.24001.r90928

This study developed a novel web-based database, Pharmosome, for human disease-related SNPs. The web page of the database is well-designed. The manuscript is well written and easy to follow.

I have several comments and suggestions:

The authors should provide a download page on the webserver of the database to allow the users to download the data in their database.
A detailed user manual or tutorial for the webserver of Pharmosome should be provided on the webserver to guide the users to use the database.
It would be better to provide a timeline of the database, which can help to record the different versions of the database.
The authors are suggested to provide a statistics page to show the statistic summary of the database.
It would be better to have an advanced search option to allow the users to search the data with a combination of variables.
The authors should also provide an example in each search box to show the users which format of the IDs or keywords are acceptable.

Is the rationale for developing the new software tool clearly explained?
Yes

Is the description of the software tool technically sound?
Yes

Are sufficient details of the code, methods and analysis (if applicable) provided to allow replication of the software development and its use by others?
Partly

Is sufficient information provided to allow interpretation of the expected output datasets and any results generated using the tool?
Yes

Are the conclusions about the tool and its performance adequately supported by the findings presented in the article?
Yes

Competing Interests: No competing interests were disclosed.

Reviewer Expertise: Bioinformatics and Computational biology

I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard; however, I have significant reservations, as outlined above.

Fuyi Li, University of Melbourne, Melbourne, Australia
Mulin Jun Li, Tianjin Medical University Cancer Institute and Hospital, Tianjin, China


Reviewer Report

18 Nov 2021 | for Version 1

Mulin Jun Li, Department of Epidemiology and Biostatistics, National Clinical Research Center for Cancer, Tianjin Medical University Cancer Institute and Hospital, Tianjin, 300070, China

Approved With Reservations

In this manuscript, the authors developed Pharmosome, a web-based, user-friendly, collective annotation database for exploring and analyzing SNPs associated with disease development and drug response, as well as the pathways shared between the genes related to these SNPs. Pharmosome is a convenient tool for querying pathogenic SNPs and their relationships to diseases or drugs.

However, this study needs further evaluation and validation:

Major:

In the disease sub-database, Pharmosome should provide the effect of drugs on the disease and clinical/preclinical/experimental levels of evidence.
In the primer design sub-database, Pharmosome should provide specific scores to indicate the quality of primers and the feasibility of subsequent experiments.
The authors should compare Pharmosome with similar databases such as PharmGKB, PreMedKB, and CIViC to show its specific focus and superiority.
The authors appear to restrict input SNPs to those located in gene regions, but according to published GWAS and drug-response studies, the majority of disease-associated SNPs lie in non-coding regions. Non-coding SNPs should therefore not be excluded.

Minor:

The versions of all collected resources should be provided.
The website has some errors or bugs; for example, the SNP hyperlink from the disease page to the SNP page cannot be accessed by users. Please fix these.

Is the rationale for developing the new software tool clearly explained?
Yes

Is the description of the software tool technically sound?
Yes

Are sufficient details of the code, methods and analysis (if applicable) provided to allow replication of the software development and its use by others?
Yes

Is sufficient information provided to allow interpretation of the expected output datasets and any results generated using the tool?
No

Are the conclusions about the tool and its performance adequately supported by the findings presented in the article?
No

Competing Interests

No competing interests were disclosed.

Reviewer Expertise

Bioinformatics

I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard; however, I have significant reservations, as outlined above.



[1] Lander ES, Linton LM, Birren B, et al.: Initial sequencing and analysis of the human genome. Nature. 2001; 409(6822): 860–921.

[2] Vignal A, Milan D, SanCristobal M, et al.: A review on SNP and other types of molecular markers and their use in animal genetics. Genet Sel Evol. 2002; 34(3): 275–305.

[3] Gurwitz D, Pirmohamed M: Pharmacogenomics: the importance of accurate phenotypes. Pharmacogenomics. 2010; 11(4): 469–70.

[4] Habib PT, Alsamman AM, Hassanein SE, et al.: SNPector: SNP inspection tool for diagnosing gene pathogenicity and drug response in a naked sequence [version 1; peer review: awaiting peer review]. F1000Res. 2019; 8: 2133.

[5] Savas S, Tuzmen S, Ozcelik H: Human SNPs resulting in premature stop codons and protein truncation. Hum Genomics. 2006; 2(5): 274–86.

[6] Bond J, Scott S, Hampshire DJ, et al.: Protein-truncating mutations in ASPM cause variable reduction in brain size. Am J Hum Genet. 2003; 73(5): 1170–7.

[7] De Gobbi M, Viprakasit V, Hughes JR, et al.: A regulatory SNP causes a human genetic disease by creating a new transcriptional promoter. Science. 2006; 312(5777): 1215–7.

[8] Dhiman N, Poland GA, Cunningham JM, et al.: Variations in measles vaccine-specific humoral immunity by polymorphisms in SLAM and CD46 measles virus receptors. J Allergy Clin Immunol. 2007; 120(3): 666–72.

[9] Enattah NS, Sahi T, Savilahti E, et al.: Identification of a variant associated with adult-type hypolactasia. Nat Genet. 2002; 30(2): 233–7.

[10] Kaput J, Rodriguez RL: Nutritional genomics: the next frontier in the postgenomic era. Physiol Genomics. 2004; 16(2): 166–77.

[11] Hewett M, Oliver DE, Rubin DL, et al.: PharmGKB: the Pharmacogenetics Knowledge Base. Nucleic Acids Res. 2002; 30(1): 163–5.

[12] Gene. Bethesda (MD): National Library of Medicine (US), National Center for Biotechnology Information; 2004; Accessed 2019-02-16.

[13] dbSNP. Bethesda (MD): National Library of Medicine (US), National Center for Biotechnology Information; 1998; Accessed 2019-02-16.

[14] Kersey PJ, Allen JE, Allot A, et al.: Ensembl Genomes 2018: an integrated omics infrastructure for non-vertebrate species. Nucleic Acids Res. 2018; 46(D1): D802–8.

[15] Zhang G, Shi J, Zhu S, et al.: DiseaseEnhancer: a resource of human disease-associated enhancer catalog. Nucleic Acids Res. 2018; 46(D1): D78–84.

[16] Safran M, Dalah I, Alexander J, et al.: GeneCards Version 3: the human gene integrator. Database (Oxford). 2010; 2010: baq020.

[17] Fabregat A, Sidiropoulos K, Viteri G, et al.: Reactome diagram viewer: data structures and strategies to boost performance. Bioinformatics. 2018; 34(7): 1208–14.

[18] Stoehlmacher J, Park DJ, Zhang W, et al.: Association between glutathione S-transferase P1, T1, and M1 genetic polymorphism and survival of patients with metastatic colorectal cancer. J Natl Cancer Inst. 2002; 94(12): 936–42.

[19] Weiss EJ, Bray PF, Tayback M, et al.: A polymorphism of a platelet glycoprotein receptor as an inherited risk factor for coronary thrombosis. N Engl J Med. 1996; 334(17): 1090–4.

[20] Walter DH, Hink U, Asahara T, et al.: The in vivo bioactivity of vascular endothelial growth factor/vascular permeability factor is independent of N-linked glycosylation. Lab Invest. 1996; 74(2): 546–56.

[21] Richardson PG, Mitsiades CS, Hideshima T, et al.: Novel biological therapies for the treatment of multiple myeloma. Best Pract Res Clin Haematol. 2005; 18(4): 619–34.

[22] Richardson PG, Schlossman RL, Alsina M, et al.: PANORAMA 2: panobinostat in combination with bortezomib and dexamethasone in patients with relapsed and bortezomib-refractory myeloma. Blood. 2013; 122(14): 2331–7.

[23] König J, Seithel A, Gradhand U, et al.: Pharmacogenomics of human OATP transporters. Naunyn Schmiedebergs Arch Pharmacol. 2006; 372(6): 432–43.

[24] Bond C, LaForge KS, Tian M, et al.: Single-nucleotide polymorphism in the human mu opioid receptor gene alters beta-endorphin binding and activity: possible implications for opiate addiction. Proc Natl Acad Sci U S A. 1998; 95(16): 9608–13.

[25] Habib PT, Alsamman AM, Hamwieh A: BioAnalyzer: Bioinformatic software of routinely used tools for analysis of genomic data. Biotechnology. 2019; 10: 33–41.

[26] Thompson JF, Man M, Johnson KJ, et al.: An association study of 43 SNPs in 16 candidate genes with atorvastatin response. Pharmacogenomics J. 2005; 5(6): 352–8.

[27] Peter: Pharmosome: An Integrative and Collective Database for Explorations and Analysis of SNP Associated with Disease. Zenodo. 2019. http://www.doi.org/10.5281/zenodo.3583191

