ALL Metrics
-
Views
-
Downloads
Get PDF
Get XML
Cite
Export
Track
Software Tool Article
Revised

Explicit interaction information from WikiPathways in RDF facilitates drug discovery in the Open PHACTS Discovery Platform

[version 2; peer review: 2 approved]
PUBLISHED 12 Oct 2018
Author details Author details
OPEN PEER REVIEW
REVIEWER STATUS

Abstract

Open PHACTS is a pre-competitive project to answer scientific questions developed recently by the pharmaceutical industry. Having high quality biological interaction information in the Open PHACTS Discovery Platform is needed to answer multiple pathway related questions. To address this, updated WikiPathways data has been added to the platform. This data includes information about biological interactions, such as stimulation and inhibition. The platform's Application Programming Interface (API) was extended with appropriate calls to reference these interactions.  These new methods of the Open PHACTS API are available now.

Keywords

Open PHACTS, drug discovery, semantic, bioinformatics, WikiPathways, pathway database, API

Revised Amendments from Version 1

  • The most significant change in this revision is the addition of two simple example workflows
  • Supplementary File 1 was added which includes example result outputs for the two remaining API calls not shown in the main manuscript
  • Updated the Implementation section to clarify the WikiPathways RDF generation process
  • The summary section also discusses creating more complex workflows and supported platforms and libraries
  • Explanation for acceptable URIs that can be used in API calls was added
  • Updated the Methods section to specify what happens in the API call when an interaction direction is not specified
  • The Methods section now also explains how ontological interaction type information can be retrieved from the JSON returned by the API call

See the authors' detailed response to the review by Augustin Luna
See the authors' detailed response to the review by Yi-An Chen

Introduction

Targeting proteins to ideally restore normal biological processes is a common starting point in drug discovery1. The Open PHACTS Discovery Platform (OPDP) was designed to help identify protein targets and information about their associations with each other24. The OPDP supports target identification and validation by including target-target interactions from WikiPathways57. Of these interaction networks, proteins sharing a downstream path allows investigation of alternative drug target combinations. Even the knowledge of which biological pathways participate in disease-related processes provides insight in the pathway topology between the targets. The importance and need of providing access to interaction information for real-world research questions was outlined in a recent Open PHACTS paper8.

The Open PHACTS project was born out of the desire to integrate pharmacological data from multiple precompetitive sources to efficiently address scientific questions that cannot be answered with single data sources8. It integrates data using linked data approaches3 from chemical and biological sources such as ChEBI, ChEMBL, UniProt, and WikiPathways6. However, the OPDP did not previously include calls to access specific up- and downstream interaction effects. This information is needed for questions related to drug repositioning and repurposing. Up- or downstream targets may be interesting alternatives with similar therapeutic effect to targets, for which it is particularly hard to develop a drug agent. Thus, finding a target that has already been drugged or is more drug tractable will be advantageous. Here we describe how to identify alternative targets in the same cellular pathway using OPDP against the WikiPathways data.

Methods

Implementation

The WikiPathways Resource Description Framework data (WPRDF) is released as part of the monthly releases5. The native format for WikiPathways is Graphical Pathway Markup Language (GPML) based on the eXtensible Markup Language (XML) standard. The RDF export is transformed from the original GPML. In the RDF representation we use two distinct controlled vocabularies, to distinguish between the graphical notation of a pathway and the biological meanings expressed in the pathway. This is done to allow integration with other pathway repositories which use other graphical notations or none. The WikiPathways RDF also includes details about directed and undirected interactions. Directed biochemical interactions capture the source and target which are depicted as an arrow in simple pathway drawings. WikiPathways adds biological meaning to interactions with Molecular Interaction Map (MIM) interaction types, like inhibitions, enzyme catalyzed reactions, and stimulations9, as well as Systems Biology Graphical Notation (SBGN) interactions10. Reactome pathways in WikiPathways use SBGN interactions11,12. However, because MIM and SBGN use different drawing styles, we normalize their inhibition types into a common inhibition type, defined by the WikiPathways ontology (https://vocabularies.wikipathways.org/wp).

The WikiPathways basic drawing tools also contain generic arrows and T-bar annotations that give the user the ability to create basic diagrams without the semantic meaning of MIM or SBGN notations. The interactions connecting these nodes are captured, but the only explicit information is that it is a directed interaction from a source to a target. To handle more complicated enzyme reaction drawings, where there is not a single line that directly connects targets in a cascade of enzymatic reactions, a query was developed that recognizes these types of reactions. However, this is not implemented in the current Open PHACTS Application Programming Interface (API).

Version 2.1 of the OPDP API contains three new calls for interactions and their pathways. The first call, /pathway/getInteractions, returns all interactions involved in a pathway. To use this feature, the user specifies a pathway URI and OPDP returns its interactions including information about direction and the connected entities. The direction information is relayed as a starting node having a wp:source annotation, while the end of the interaction has the wp:target annotation. In its simplest form, this means that if gene product A is interacting with a gene product B, then we have wp:source for product A and wp:target for product B. However, the presented new methods also support interactions with multiple sources and targets for more complex interactions that are more accurately represented this way.

The second added call, /pathways/interactions/byEntity, returns the direction of the interactions involving this entity. An entity is specified by a URI and can be a metabolite, protein, gene product, or RNA. API options allow the user to select only upstream or only downstream interactions. If a direction is not specified in the call, all the adjacent interactions will be retrieved regardless of their direction. The results also specify the interaction type (e.g. inhibition, stimulation, conversion). Vocabularies.wikipathways.org also identifies catalysis and binding events as well as a more generic directedInteraction in the case where the type of the interaction is not identified. This ability to select the interaction direction is specifically what allows users to answer scientific questions around upstream and downstream effects, such as those defined by Open PHACTS. The third API call is /pathways/interactions/byEntity/count which is a helper function that returns the number of interactions for a target.

Operation

The OPDP API calls are backed by SPARQL searches against the loaded WikiPathways RDF. The query parameters that are required or optional are given in the documentation of Open PHACTS (https://dev.openphacts.org/docs/2.1). As in previous versions, the API uses HTTP GET to call methods and needs a (free) application ID and key (see https://dev.openphacts.org/signup)3.

To ensure multiple URI schemes can be used to identify genes, proteins, and metabolites, the Open PHACTS platform uses an Identifier Mapping Service (IMS)6. This ensures that people can use Ensembl, NCBI Gene, and others for genes, UniProt, Ensembl, etc. for proteins, and HMDB, ChEBI, CAS registry number, and PubChem for metabolites. Furthermore, it supports identifiers.org formatted URIs, further simplifying entering identifiers13.

Example queries

We are demonstrating the platform with three example calls. All the API calls require use of an application ID and an application key. This key and ID can be acquired by creating a free Open PHACTS account. The first example is an application to the PI3K/AKT pathway for cell growth regulation which contain important targets for cancer treatment14. The AKT protein has a central role and usefully shows the API call’s ability to return connected elements with the /pathways/interactions/byEntity and the /pathway/getInteractions calls. The API calls can help aid drug discovery by taking a target, in this case AKT, and easily identify other connected proteins that could potentially be used as drug targets with a common downstream effect.

Figure 1 shows the web interface of the API call that returns the connectivity of the AKT2 target to both upstream or downstream proteins or gene products. This method allows the user to identify connections to other targets in the pathway. The results of that API call (Figure 2) show the AKT2 interaction with microRNA. A helper method (Figure 3): /pathways/interactions/byEntity/count is also included. It returns the number of all interactions in which an entity is participates. This helps the user get a sense of the prevalence of the queried entity with interactions in pathways found on WikiPathways. An example result for this query can be found in Supplementary Figure 1.

fb846bf4-45c1-4d52-bb6f-caf8643aca26_figure1.gif

Figure 1. Parameters (bottom) and curl command (top) for the GET /pathways/interactions/byEntity call.

The GET portion tells the API to retrieve data with the associated call. It takes an entity URI, the Ensembl ID for AKT2, and returns a list interactions for AKT2. The obligatory parameters are shown in bold. Entity IDs that are acceptable for queries include Ensembl, Entrez Gene, and UniProt for genes, proteins, and RNAs. For metabolites the ID sources HMDB, ChEBI, and ChemSpider, for example, are acceptable entity IDs

fb846bf4-45c1-4d52-bb6f-caf8643aca26_figure2.gif

Figure 2. Result in the JSON format of the AKT2 query from Figure 1.

The participants of the interaction are directed from source (hsa-let7b) to target (AKT2). It also shows the type of interaction (inhibition), and the biological types of the interaction participants.

fb846bf4-45c1-4d52-bb6f-caf8643aca26_figure3.gif

Figure 3. Parameters (bottom) and curl command (top) for the GET /pathways/interactions/byEntity/count call.

It takes a URI for an entity, in this case the Ensembl ID for AKT2 and returns a count of the interactions to which this gene product is involved. Only the entity URI, app ID, and app key are required fields. Optional parameters are pathway organism, direction, or type of interaction.

The other call implemented, /pathway/getInteractions (Figure 4), demonstrates an API call to return all interactions in the MicroRNAs in cardiomyocyte hypertrophy pathway15. This pathway has interaction details for AKT, mTOR, and PI3K, which are all important targets in cancer research16. For each interaction the participants are given and whether it is a directed or undirected interaction. An example result for this query can be seen in Supplementary Figure 2.

fb846bf4-45c1-4d52-bb6f-caf8643aca26_figure4.gif

Figure 4. Parameters (bottom) and curl command (top) for the /pathway/getInteractions call.

It is intended to take the pathway URI from WikiPathways and return a list of interaction involved in that particular pathway. Pathway URI, app ID, and app key are the only required values for this call.

Example workflows

In order to demonstrate the basic use of the introduced API methods, we developed two workflows, available in the Supplementary Material. One uses Python to return a file with the results in a table and the other uses a HTML webpage using the ops.js JavaScript client library17. More involved workflows have been developed for KNIME and Pipeline Pilot18,19.

The Python script example uses the Open PHACTS /pathway/getInteraction API call and prompts the user to enter a WikiPathways pathway number that they wish to query, such as 1544 for WikiPathways pathway WP1544. Invocation of the API call with the pathway identifier returns information about the directed interactions that are involved with the pathway. The information that is returned is the interaction ID used by WikiPathways, the interaction type, and URIs for the source and target of the interaction. In order to convert the URIs into something more readable, a SPARQL query is then executed to get labels, from the WikiPathways SPARQL endpoint, for the source and target of the interaction. The results are written to a file with the interaction ID, interaction type, URIs for the source and target, as well as alias IDs, the curl for the API call, the pathway ID used, and a number of interactions returned.

The second example uses a HTML5 webpage and the ops.js JavaScript client library to retrieve interactions for a particular gene, using the URI for the gene’s Ensembl identifier and the /pathways/interactions/byEntity API method. The ops.js library passes the returned JSON with interaction information to a callback function, where the interacting source and target are extracted and the interacting entity determined. For each interacting entity, which may be a protein, RNA, or small compound, a call to the /pathways/interactions/byEntity/count method is made to return the number of interaction that entity has.

Summary

While the calls identified here are simple calls, workflow tools make it possible to take advantage of the integrative nature of the OPDP to make API calls in succession. Two such workflow tools that work with the OPDP are KNIME and Pipeline Pilot. With these tools, it is possible to perform a directional query of a target and identify alternative targets that can then be queried against the chemistry calls to identify active compounds for these alternative targets. The client libraries ops.js, ops4j, and ropenphacts also support Open PHACTS and the interaction calls for pathways. This allows users to perform API calls to the OPDP using their preferred language or platform, such as JavaScript, Java, or R.

The addition of interactions with direction information allows OPDP to answering more of the pre-defined scientific questions2. The directional information allows the user to explore how proteins and gene products are connected with one another and easily access this information. This is illustrated in the example queries using the cancer target AKT.

Software availability

Online service: https://dev.openphacts.org/docs/2.1

Latest source code is available at: https://github.com/openphacts/OPS_LinkedDataApi

Archived source code of discussed version: https://doi.org/10.5281/zenodo.106825220

License: Apache License 2.0

Comments on this article Comments (0)

Version 2
VERSION 2 PUBLISHED 17 Jan 2018
Comment
Author details Author details
Competing interests
Grant information
Copyright
Download
 
Export To
metrics
Views Downloads
F1000Research - -
PubMed Central
Data from PMC are received and updated monthly.
- -
Citations
CITE
how to cite this article
Miller RA, Woollard P, Willighagen EL et al. Explicit interaction information from WikiPathways in RDF facilitates drug discovery in the Open PHACTS Discovery Platform [version 2; peer review: 2 approved]. F1000Research 2018, 7:75 (https://doi.org/10.12688/f1000research.13197.2)
NOTE: If applicable, it is important to ensure the information in square brackets after the title is included in all citations of this article.
track
receive updates on this article
Track an article to receive email alerts on any updates to this article.

Open Peer Review

Current Reviewer Status: ?
Key to Reviewer Statuses VIEW
ApprovedThe paper is scientifically sound in its current form and only minor, if any, improvements are suggested
Approved with reservations A number of small changes, sometimes more significant revisions are required to address specific details and improve the papers academic merit.
Not approvedFundamental flaws in the paper seriously undermine the findings and conclusions
Version 2
VERSION 2
PUBLISHED 12 Oct 2018
Revised
Views
8
Cite
Reviewer Report 23 Oct 2018
Augustin Luna, Dana-Farber Cancer Institute, Boston, MA, USA 
Approved
VIEWS 8
The authors satisfactorily ... Continue reading
CITE
CITE
HOW TO CITE THIS REPORT
Luna A. Reviewer Report For: Explicit interaction information from WikiPathways in RDF facilitates drug discovery in the Open PHACTS Discovery Platform [version 2; peer review: 2 approved]. F1000Research 2018, 7:75 (https://doi.org/10.5256/f1000research.17945.r39411)
NOTE: it is important to ensure the information in square brackets after the title is included in all citations of this article.
Views
8
Cite
Reviewer Report 19 Oct 2018
Yi-An Chen, National Institutes of Biomedical Innovation, Health and Nutrition, Osaka, Japan 
Approved
VIEWS 8
The author has addressed all the ... Continue reading
CITE
CITE
HOW TO CITE THIS REPORT
Chen YA. Reviewer Report For: Explicit interaction information from WikiPathways in RDF facilitates drug discovery in the Open PHACTS Discovery Platform [version 2; peer review: 2 approved]. F1000Research 2018, 7:75 (https://doi.org/10.5256/f1000research.17945.r39412)
NOTE: it is important to ensure the information in square brackets after the title is included in all citations of this article.
Version 1
VERSION 1
PUBLISHED 17 Jan 2018
Views
16
Cite
Reviewer Report 03 Apr 2018
Yi-An Chen, National Institutes of Biomedical Innovation, Health and Nutrition, Osaka, Japan 
Approved with Reservations
VIEWS 16
The authors add new features to the Open PHACTS APIs which facilitate querying reactions in WikiPathways.

Some comments:
  1. In the first example, the query for AKT2 in the figure 1 uses  http://identifiers.org/ensembl/ENSG00000105221 but the result
... Continue reading
CITE
CITE
HOW TO CITE THIS REPORT
Chen YA. Reviewer Report For: Explicit interaction information from WikiPathways in RDF facilitates drug discovery in the Open PHACTS Discovery Platform [version 2; peer review: 2 approved]. F1000Research 2018, 7:75 (https://doi.org/10.5256/f1000research.14318.r32325)
NOTE: it is important to ensure the information in square brackets after the title is included in all citations of this article.
  • Author Response 12 Oct 2018
    Ryan Miller, Department of Bioinformatics (BiGCaT), Maastricht University, Maastricht, The Netherlands
    12 Oct 2018
    Author Response
    1.  The Open PHACTS API includes the Identifier Mapping Service component which allows use of many identifier schemes, as long as suitable link sets are available (at http://data.openphacts.org/1.5/ims/linksets/). These sets ... Continue reading
COMMENTS ON THIS REPORT
  • Author Response 12 Oct 2018
    Ryan Miller, Department of Bioinformatics (BiGCaT), Maastricht University, Maastricht, The Netherlands
    12 Oct 2018
    Author Response
    1.  The Open PHACTS API includes the Identifier Mapping Service component which allows use of many identifier schemes, as long as suitable link sets are available (at http://data.openphacts.org/1.5/ims/linksets/). These sets ... Continue reading
Views
33
Cite
Reviewer Report 24 Jan 2018
Augustin Luna, Dana-Farber Cancer Institute, Boston, MA, USA 
Approved with Reservations
VIEWS 33
The tool provides additional features to the Open PHACTS platform to simplify the access of interaction information from WikiPathways. Some comments: 
  • The authors go into some detail about the MIM/SBGN representations used by WikiPathways. To my
... Continue reading
CITE
CITE
HOW TO CITE THIS REPORT
Luna A. Reviewer Report For: Explicit interaction information from WikiPathways in RDF facilitates drug discovery in the Open PHACTS Discovery Platform [version 2; peer review: 2 approved]. F1000Research 2018, 7:75 (https://doi.org/10.5256/f1000research.14318.r29933)
NOTE: it is important to ensure the information in square brackets after the title is included in all citations of this article.
  • Author Response 12 Oct 2018
    Ryan Miller, Department of Bioinformatics (BiGCaT), Maastricht University, Maastricht, The Netherlands
    12 Oct 2018
    Author Response
    1. WikiPathways is represented using GPML (Graphical Pathway Markup Language), which is a modified XML format.  The WikiPathways RDF is divided into two divisions, the GPML and the WPRDF.
    ... Continue reading
COMMENTS ON THIS REPORT
  • Author Response 12 Oct 2018
    Ryan Miller, Department of Bioinformatics (BiGCaT), Maastricht University, Maastricht, The Netherlands
    12 Oct 2018
    Author Response
    1. WikiPathways is represented using GPML (Graphical Pathway Markup Language), which is a modified XML format.  The WikiPathways RDF is divided into two divisions, the GPML and the WPRDF.
    ... Continue reading

Comments on this article Comments (0)

Version 2
VERSION 2 PUBLISHED 17 Jan 2018
Comment
Alongside their report, reviewers assign a status to the article:
Approved - the paper is scientifically sound in its current form and only minor, if any, improvements are suggested
Approved with reservations - A number of small changes, sometimes more significant revisions are required to address specific details and improve the papers academic merit.
Not approved - fundamental flaws in the paper seriously undermine the findings and conclusions
Sign In
If you've forgotten your password, please enter your email address below and we'll send you instructions on how to reset your password.

The email address should be the one you originally registered with F1000.

Email address not valid, please try again

You registered with F1000 via Google, so we cannot reset your password.

To sign in, please click here.

If you still need help with your Google account password, please click here.

You registered with F1000 via Facebook, so we cannot reset your password.

To sign in, please click here.

If you still need help with your Facebook account password, please click here.

Code not correct, please try again
Email us for further assistance.
Server error, please try again.