ALL Metrics
-
Views
-
Downloads
Get PDF
Get XML
Cite
Export
Track
Software Tool Article
Revised

CellMap visualizes protein-protein interactions and subcellular localization

[version 2; peer review: 2 approved]
PUBLISHED 01 Feb 2018
Author details Author details
OPEN PEER REVIEW
REVIEWER STATUS

This article is included in the Bioinformatics gateway.

Abstract

Many tools visualize protein-protein interaction (PPI) networks. The tool introduced here, CellMap, adds one crucial novelty by visualizing PPI networks in the context of subcellular localization, i.e. the location in the cell or cellular component in which a PPI happens. Users can upload images of cells and define areas of interest against which PPIs for selected proteins are displayed (by default on a cartoon of a cell). Annotations of localization are provided by the user or through our in-house database. The visualizer and server are written in JavaScript, making CellMap easy to customize and to extend by researchers and developers.

Keywords

subcellular location, biological visualization, protein-protein interaction

Revised Amendments from Version 1

In the new version of the manuscript, we have highlighted the origin of the data sources for the example deployment of the portal on http://cell.dallago.us and the behavior of the visualization tool in case there are multiple protein localizations for a protein displayed in the protein-protein interaction network visualization page.

See the authors' detailed response to the review by Augustin Luna
See the authors' detailed response to the review by Sandra Orchard

Introduction

Many tools visualize different aspects of protein-protein interaction (PPI) networks; the most prominent might be Cytoscape1. Existing visualizations of large PPI networks continue to be difficult to use. Some proteins interact with many hundreds or thousands of others. Often referred to as ‘PPI hairballs’, such hubs are in the way of understanding large data sets. Many ways have been proposed to resolve such hairballs through the addition of biologically meaningful dimensions such as pathways2 or time3.

Another dimension was first introduced a decade ago, namely the overlay of PPIs with subcellular localization4. Combining PPI networks with protein location provide an intuitive way of laying out PPI networks on a graphical representation of the cell, and might reduce the clutter from PPI hairballs. This decade-old solution4 no longer copes with today’s data, in terms of scalability nor of customizability and in terms of ease-of-use.

CellMap, the prototype introduced here, takes up on the idea of PPI visualization constrained by protein location, and provides a simple visual interface for users to explore protein location inside a cell. It presents this information in a graphically pleasant way and offers several customization features. The framework has been optimized to simplify future developments, such as the addition of further data dimensions (e.g. inclusion of protein trafficking). An instance of the tool with localization data from a previous publication that includes protein localizations of the human proteome5 and PPI data from the Human Integrated Protein-Protein Interaction rEference (HIPPIE) resource6 is available at http://cell.dallago.us.

Methods

Implementation

The CellMap prototype is an integrated portal that exposes API calls to retrieve images (representing cells) and protein information, as well as a frontend to visualize protein location and PPI data. The portal is fully written in JavaScript, namely in the JavaScript interpreter node.js (https://nodejs.org) for the backend and vanilla JavaScript for the frontend. The portal is deployed to the public through a Docker container. Docker is a technology that allows shipping of packaged services such as web applications to customers and users without the need to install dependencies other than the Docker engine (available through: https://www.docker.com). For the representation of cell images as maps, the Leaflet framework is used. Leaflet is a JavaScript-based tool used to represent maps (http://leafletjs.com).

Data about proteins are stored as JSON documents in a Mongo (http://mongodb.com) database. All information about the interaction partners and the subcellular localization of a protein is stored in a single JSON document, making the data structure simple to understand for non-experts and enabling them to deploy prototypes using their own data. Figure 1 schematically represents a protein data model (for a specific example for a protein object: http://cell.dallago.us/api/proteins/search/Q99943).

Operation

In CellMap, users can choose to upload new maps (images of cells). They can modify the location of regions of interest (ROIs) for a selected map (Figure 2), and visualize the locations of selected proteins on a map or render protein-protein interaction networks from a set of selected proteins.

To maintain a consistent coloring scheme for different cellular compartments throughout a set of different images, each compartment is assigned a unique color through the hash of the compartment’s name (e.g. light blue = vacuole, Figure 3B). Using this coloring approach, users might eventually learn to associate color with compartment. When proteins are loaded into the map, they are assigned pseudo-random coordinates representing a point that lies within the boundaries of the ROI in which they are localized (Figure 3D). A circle of a given radius is placed on the randomly generated point (Figure 3E-F), and the circle will be filled with the same color as the compartment in which the protein is located in (Figure 3B and 3F).

f564ed5a-095a-4992-83fa-6b7aea020bcd_figure1.gif

Figure 1. Diagram of the data representation in CellMap.

In the figure we present a diagram of the Protein class, which contains several attributes of type String, two fields of type timestamp and two arrays (in square brackets) that reference the Interactions and Localizations classes. The arrows highlight the referenced models. This simple representation of information about a protein, its protein-protein interaction partners and its localizations enables the tool to be reused with one’s own datasets.

f564ed5a-095a-4992-83fa-6b7aea020bcd_figure2.gif

Figure 2. Section of a screenshot of the CellMap editing tool on a private instance of the portal.

In the screenshot, an authorized user with editing capabilities draws a polygon (dark green) representing a new cellular compartment or region of interest (ROI). The user has a set of tools on the left side that can be used to draw polygons, lines, squares or circles. Once the new region has been drawn, the user can associate a cellular compartment through the dropdown input on the top-right and submit the new information to the server. The image used for this screenshot was taken from Wikimedia’s user Royroydeb, under CC BY-SA 4.0 (http://bit.ly/2fuYRiE) and is used in this figure for demonstrative purposes only, as using it on the online version of CellMap would infringe copyright.

f564ed5a-095a-4992-83fa-6b7aea020bcd_figure3.gif

Figure 3. Definition of an area and drawing of protein circle.

(A) Section of a cartoon image of a cell; (B) user-drawn polygon representing the area occupied by a vacuole; (C) how the section of the cartoon image is displayed on the PPI/map viewer; (D) random point calculation inside vacuole-polygon-defined area; (E) drawing of a protein circle located inside the vacuole, (F) result of loading a protein localized in the vacuole as shown by PPI/map viewer.

Users can choose between two visualization options: the subcellular location in the context of the protein-protein interaction viewer (PPI viewer, Figure 4A, http://cell.dallago.us/ppi), and the protein subcellular location viewer (Map viewer, Figure 4B, http://cell.dallago.us/map). The two viewers can load the same images of cells (maps) and collect localization data from the same source, in the publicly available instance by 5. The PPI viewer offers the possibility to overlay networks between proteins being visualized. The map viewer displays all locations reported for a given protein simultaneously, while the PPI viewer only displays only one location at a time (by default: the first localization in the array of localizations as described in the protein data model, Figure 1); users can manually change the location by clicking on the protein circle and selecting a new location from the information box (Figure 5). Both the PPI and the map viewer are enriched by several controls (Figure 6): The top-left controls enable actions including: the navigation to the home of CellMap (Figure 6, panels 1 and 2, A), switching from the map viewer to the PPI viewer and vice versa, keeping the proteins currently loaded in the view (Figure 6, panels 1 and 2, B), reducing the opacity of the cell map, highlighting the protein circles (Figure 6, panels 1 and 2, C), zooming in- and out of the map and PPI viewers (Figure 6, panels 1 and 2, D), and visualization of the global network among all proteins loaded in the visualizer (Figure 6, panel 1, E). The top-right control allows to temporarily hide loaded proteins or activate an overlay of the user-drawn localizations (Figure 6, panel 4). The top-center search panel allows users to load new proteins by searching for their UniProt identifier, primary gene or primary protein name7 into the viewer (Figure 6, panel 3).

f564ed5a-095a-4992-83fa-6b7aea020bcd_figure4.gif

Figure 4. Comparison between PPI viewer and map viewer.

The left view (A) shows the PPI viewer, which depicts the result of loading protein Q9NR71 and displays a circle for the first localization found in the array of locations (http://cell.dallago.us/ppi?p=Q9NR71); The right panel (B) shows the Map viewer, which depicts the result of loading the same protein Q9NR71 and displays a circle for the protein in each of its reported location (http://cell.dallago.us/map?p=Q9NR71). The red arrows are overlaid on top of the screenshots to highlight where the protein circles have been drawn in the viewers, since fitting the screenshot on the page reduces the overall size of the images.

f564ed5a-095a-4992-83fa-6b7aea020bcd_figure5.gif

Figure 5. Protein information box.

Top: information about the selected protein. Bottom: new localization selection box rendered in the PPI viewer when clicking on the protein circle (http://cell.dallago.us/ppi?p=Q9NR71).

f564ed5a-095a-4992-83fa-6b7aea020bcd_figure6.gif

Figure 6. Controls used in the different viewers.

(1) Top-left controls of PPI viewer; (2) top-left controls of map viewer; (3) top-center search panel of PPI/map viewer; (4) top-right layer control on PPI/map viewer.

To facilitate the retrieval of proteins and their interacting partners, CellMap provides basic search functionalities. Users can search for proteins based on their UniProt identifiers, by their gene identifiers or by their protein names. When performing the search, the page renders a grid containing boxes, each representing a different protein (Figure 7). Inside the boxes, the UniProt identifier for the protein that matched the search criterion is displayed. Starting on the top-right of every box a smaller colored square for each compartment is displayed in which that protein is localized. For proteins annotated to be in a single compartment, the border of the outer box (representing one protein as indicated by the UniProt ID in the center of the box) will get the color of that compartment (2nd box in Figure 7). Clicking on one of the colored squares will filter results based on the compartment represented by that color. In the bottom-right of each box, the total number of PPI partners are annotated.

f564ed5a-095a-4992-83fa-6b7aea020bcd_figure7.gif

Figure 7. Results of searching for protein “foxo”.

The screenshot of this section of the home page shows four proteins that match the search criterion “foxo” either by their UniProt identifier, primary gene name or primary protein name. The protein boxes contain the UniProt identifiers of the matched proteins (center) and display the number of interaction partners (bottom-right) and several color-filled boxes graphically representing the localizations reported for the matched proteins (top-left).

Discussion

Some CellMap functionality is exemplified by a heat shock protein (HSPA4; Heat shock 70 kDa protein 4, UniProt identifier P34932) with many interaction partners (338, according to HIPPIE, http://cbdm-01.zdv.uni-mainz.de/~mschaefer/hippie/query.php?s=HSPA4) in different compartments. The objective was to showcase how CellMap can simplify PPI hairballs. We visualize the same PPI network using CellMap (Figure 8A) and Cytoscape1 in the form of the Cytoscape.js version used by HIPPIE (Figure 8B) and the Cytoscape desktop version (Figure 8C).

None of the three viewers solves the PPI hairball problem completely. Without zooming in, the information density for 338 protein pairs is too high to be helpful. HIPPIE’s layout for Cytoscape.js (Figure 8B) clearly improves over the standard Cytoscape desktop version (Figure 8C) by centering the view around HSPA4, the protein of interest. In CellMap (Figure 8A) the biologically relevant differences between pairs from the same and from different compartments remain visible.

f564ed5a-095a-4992-83fa-6b7aea020bcd_figure8.gif

Figure 8.

PPI hub in CellMap (A), Cytoscape.js (B) and Cytoscape desktop (C). For HSPA4 (Heat shock 70 kDa protein 4, UniProt identifier P34932), we show some of the PPIs known (according to HIPPIE HSPA4 has 338 interaction partners). We chose this as one example of a protein with many more PPIs than the average protein (“PPI hub”). The figure compares how three different PPI viewers cope with the HSPA4 network: (A) CellMap (http://cell.dallago.us/protein/P34932), (B) HIPPIE’s Cytoscape.js visualizer and (C) the desktop version of Cytoscape. Proteins in CellMap are represented as colored dots on the map (image) of the cell, and upon selecting the protein of interest an overlay of edges is drawn. In Cytoscape and Cytoscape.js, proteins are represented as nodes containing a label (protein name as UniProt identifier), and edges are directly inferred from the data. The Cytoscape.js visualization was taken directly from HIPPIE. The Cytoscape network was automatically drawn upon loading the HIPPIE dataset and selecting the protein of interest and it’s direct neighbors.

By using a biologically relevant dimension (protein localization), instead of drawing nodes in positions based on edge weight (force layout of Cytoscape), some aspects of the protein and its partners become obvious at first glance, e.g. that HSPA4 interacts with many nuclear and cytoplasmic proteins, as well as with proteins that are secreted (extra-cellular) and located in the Endoplasmic Reticulum (ER, Figure 8). This may suggest the hypothesis HSPA4 to be an important hub involved in process spanning across compartments. Such a hypothesis is presented in our supplementary material (Figure SOM_1), where we analyze the visualization of the FOXO3 protein through CellMap.

One disadvantage of CellMap over the Cytoscape.js view is that the protein identifiers are not visible at all on the static image (protein identifiers become visible through mouse-over events in CellMap). However, in the image shown (Figure 8) the Cytoscape.js names also remain unreadable. Another problem with CellMap are the numbers displayed on edges (experimental reliability of the PPI as given by HIPPIE). In our view, this information is extremely important to look at interactions, but we are still lacking a more sophisticated mechanism to visualize these numbers.

CellNetVis8 is a recent tool that also connects localization with PPI networks. It emphasizes the way PPI networks are laid out through the adaptation of a so-called force-directed layout (using the tool While). Although CellMap and CellNetVis are founded on a similar idea, user experience and focus differ importantly. For instance, CellMap can be driven by data from users that define the number of compartments on a map, and provide localizations. In contrast, CellNetVis uses a fixed subset of compartments and an ad hoc diagram for the cell. Additionally, CellMap comes with out of the box data for the human proteome and allows the community to grow the tool by enriching datasets (images and localizations), whereas CellNetVis has a per-use approach, allowing to visualize networks stored in specialized XGMML files. Another unique aspect of CellMap is the openness to introduce further biologically meaningful dimensions (beyond location such as time or pathways) that increase the usefulness of PPI visualization tools to create new testable hypotheses.

Conclusions

CellMap is a prototype providing a portal exploring the idea of using protein subcellular location as the basis to construct more complete visualizations of biological data, such as protein-protein interactions (PPIs). Using this paradigm, we claim that additional information, such as pathways, can be layered on top of the current visualization of subcellular location to potentially generate meaningful biological insights. The source code for the portal is publicly available and an instance of the portal with location data from a previous publication about the subcellular localization of the human proteome5 and protein-protein interaction data from HIPPIE6 (http://cbdm-01.zdv.uni-mainz.de/~mschaefer/hippie) is running at http://cell.dallago.us. The visualization tool is written in JavaScript, thereby tapping into a very large user base for customized extensions and modifications. With the release of the prototype, we aim at creating a user base and awareness of the tool, ultimately collecting precious feedback from experimentalists and technical users alike.

Abbreviations

2D: two dimensions, API: Application Program Interface, ID: identifier, JSON: JavaScript Object Notation, PPI: protein-protein interaction, ROI: region of interest.

Software availability

The CellMap prototype is released as open source software under the GNU General Public License v3.0. Documentation, source code and viewer are available at https://github.com/sacdallago/cellmap. Archived source code as at the time of publication is available at https://doi.org/10.5281/zenodo.9043249. An example of use with protein localization data from a recent publication5 and from the HIPPIE database of protein-protein interactions6 is available at http://cell.dallago.us.

Comments on this article Comments (0)

Version 2
VERSION 2 PUBLISHED 11 Oct 2017
Comment
Author details Author details
Competing interests
Grant information
Copyright
Download
 
Export To
metrics
Views Downloads
F1000Research - -
PubMed Central
Data from PMC are received and updated monthly.
- -
Citations
CITE
how to cite this article
Dallago C, Goldberg T, Andrade-Navarro MA et al. CellMap visualizes protein-protein interactions and subcellular localization [version 2; peer review: 2 approved]. F1000Research 2018, 6:1824 (https://doi.org/10.12688/f1000research.12707.2)
NOTE: If applicable, it is important to ensure the information in square brackets after the title is included in all citations of this article.
track
receive updates on this article
Track an article to receive email alerts on any updates to this article.

Open Peer Review

Current Reviewer Status: ?
Key to Reviewer Statuses VIEW
ApprovedThe paper is scientifically sound in its current form and only minor, if any, improvements are suggested
Approved with reservations A number of small changes, sometimes more significant revisions are required to address specific details and improve the papers academic merit.
Not approvedFundamental flaws in the paper seriously undermine the findings and conclusions
Version 2
VERSION 2
PUBLISHED 01 Feb 2018
Revised
Views
8
Cite
Reviewer Report 12 Feb 2018
Sandra Orchard, European Bioinformatics Institute (EMBL-EBI), European Molecular Biology Laboratory, Cambridge, UK 
Approved
VIEWS 8
The authors have addressed my concerns and ... Continue reading
CITE
CITE
HOW TO CITE THIS REPORT
Orchard S. Reviewer Report For: CellMap visualizes protein-protein interactions and subcellular localization [version 2; peer review: 2 approved]. F1000Research 2018, 6:1824 (https://doi.org/10.5256/f1000research.15085.r30439)
NOTE: it is important to ensure the information in square brackets after the title is included in all citations of this article.
Version 1
VERSION 1
PUBLISHED 11 Oct 2017
Views
22
Cite
Reviewer Report 18 Jan 2018
Augustin Luna, Dana-Farber Cancer Institute, Harvard Medical School, Boston, MA, USA 
Approved
VIEWS 22
The tool provides an interesting feature to help declutter visualizations of biological networks using localization information. Some comments:
  • It would be good if the names of the used databases was stated in the last paragraph of
... Continue reading
CITE
CITE
HOW TO CITE THIS REPORT
Luna A. Reviewer Report For: CellMap visualizes protein-protein interactions and subcellular localization [version 2; peer review: 2 approved]. F1000Research 2018, 6:1824 (https://doi.org/10.5256/f1000research.13762.r29754)
NOTE: it is important to ensure the information in square brackets after the title is included in all citations of this article.
  • Author Response 29 Jan 2018
    Christian Dallago, Department of Informatics, Bioinformatics & Computational Biology, TUM (Technical University of Munich), Munich, Germany
    29 Jan 2018
    Author Response
    Dear Dr. Luna,

    Thank you very much for your input on our work.

    We have submitted a new version of the manuscript, which should address points one and four of your comment.

    As ... Continue reading
COMMENTS ON THIS REPORT
  • Author Response 29 Jan 2018
    Christian Dallago, Department of Informatics, Bioinformatics & Computational Biology, TUM (Technical University of Munich), Munich, Germany
    29 Jan 2018
    Author Response
    Dear Dr. Luna,

    Thank you very much for your input on our work.

    We have submitted a new version of the manuscript, which should address points one and four of your comment.

    As ... Continue reading
Views
43
Cite
Reviewer Report 13 Nov 2017
Sandra Orchard, European Bioinformatics Institute (EMBL-EBI), European Molecular Biology Laboratory, Cambridge, UK 
Approved with Reservations
VIEWS 43
The authors describe a tool for visualising PPI networks in the context of the subcellular localisation of the searched protein.
  1. I have twice tried to review this paper and both times the http://cell.dallago.us link gave me a
... Continue reading
CITE
CITE
HOW TO CITE THIS REPORT
Orchard S. Reviewer Report For: CellMap visualizes protein-protein interactions and subcellular localization [version 2; peer review: 2 approved]. F1000Research 2018, 6:1824 (https://doi.org/10.5256/f1000research.13762.r27544)
NOTE: it is important to ensure the information in square brackets after the title is included in all citations of this article.
  • Author Response 27 Nov 2017
    Christian Dallago, Department of Informatics, Bioinformatics & Computational Biology, TUM (Technical University of Munich), Munich, Germany
    27 Nov 2017
    Author Response
    Dear Dr. Orchard,

    Thank you for your valuable input on the tool!


    With regard to point 1: we apologize for the broken database connection, unfortunately, the deployment system missed that flag and ... Continue reading
COMMENTS ON THIS REPORT
  • Author Response 27 Nov 2017
    Christian Dallago, Department of Informatics, Bioinformatics & Computational Biology, TUM (Technical University of Munich), Munich, Germany
    27 Nov 2017
    Author Response
    Dear Dr. Orchard,

    Thank you for your valuable input on the tool!


    With regard to point 1: we apologize for the broken database connection, unfortunately, the deployment system missed that flag and ... Continue reading

Comments on this article Comments (0)

Version 2
VERSION 2 PUBLISHED 11 Oct 2017
Comment
Alongside their report, reviewers assign a status to the article:
Approved - the paper is scientifically sound in its current form and only minor, if any, improvements are suggested
Approved with reservations - A number of small changes, sometimes more significant revisions are required to address specific details and improve the papers academic merit.
Not approved - fundamental flaws in the paper seriously undermine the findings and conclusions
Sign In
If you've forgotten your password, please enter your email address below and we'll send you instructions on how to reset your password.

The email address should be the one you originally registered with F1000.

Email address not valid, please try again

You registered with F1000 via Google, so we cannot reset your password.

To sign in, please click here.

If you still need help with your Google account password, please click here.

You registered with F1000 via Facebook, so we cannot reset your password.

To sign in, please click here.

If you still need help with your Facebook account password, please click here.

Code not correct, please try again
Email us for further assistance.
Server error, please try again.