ALL Metrics
-
Views
-
Downloads
Get PDF
Get XML
Cite
Export
Track
Software Tool Article

Tessellate & Montage: Molecular analytics of cyclic conformations

[version 1; peer review: 1 approved, 1 approved with reservations]
PUBLISHED 08 Dec 2017
Author details Author details
OPEN PEER REVIEW
REVIEWER STATUS

Abstract

The conformations and shapes of macromolecular structures in biological and synthetic materials often define the macroscopic functions of the systems. Tessellate and Montage provide a standardized toolset for rapid reporting of large datasets allowing comparisons of cyclic molecule conformations (ring pucker) from structural databases and simulation trajectory data. This facilitates an understanding of the dynamic transition between common conformations and the flexible range in a ring that underlies molecular behaviour and recognition properties.

Keywords

multiqc, conformation, saccharides, visualisation, macrocycles

Introduction

Cyclic molecules and systems consisting of cyclic molecules play central roles in several chemical and biological processes1,2. RNA, DNA, proteins and carbohydrates are polymers that all contain cyclic moieties that exhibit wide ranging conformational variety. For example, the puckering of fructose can be used to illustrate the value of determining conformational statistics as recognition and binding to Fructokinase is due to pucker specificity. Similarly different sugar puckers of ribose in RNA and de-oxyribose in A-DNA and Z-DNA account for the molecular shape of the respective nucleotide helices2. In DNA-protein complexes any kink in the DNA axis is accompanied by a change in deoxy-ribose pucker, demonstrating the important connection pucker plays in molecular structure and flexibility3,4. In enzymatic transition states for glycosyl hydrolases and transferases, conformations change to provide the correct electronic state for the reaction to take place5.

Conformational analyses, which recognise shapes (Figure 1), are important for finding transition state leads, understanding molecular recognition events, ligand fitting in macrocycles and for molecular pattern matching.

9c758b14-0f2c-45c1-8d8b-2ee4453abb4e_figure1.gif

Figure 1. Representative canonical conformers for 5,6,7,8 cycles. Twist, chair, chair, crown.

We created a standardized toolset called Tessellate & Montage for comparisons of cyclic molecule conformations (ring pucker) from structural databases and simulation trajectory data. Tessellate can readily find, calculate and quantify cyclic conformations of cycles and macrocycles in datasets. Montage forked from MultiQC6 is a transferable reporting package for visualizing the conformations of cycles. Montage addresses the need to compare multiple molecular data sets by incorporating results from multiple analyses into a single report. Similar tools for calculating ring conformations are shown in Table 1, however these tools do not provide methods for calculating the ring pucker of macrocycles, nor do they have the reporting capability for comparing multiple types of molecular data sets in a single report. Existing pucker models include perpendicular displacements from a mean plane7,8, intracyclic torsions9,10 and triangular tessellation1115.

Table 1. Alternative tools for calculating ring pucker.

ResourceOpen source/FreeWebappDetails
RING16,17Yes, requires
password.
NoCP pucker.
Cremer-Pople (CP) parameter calculatorYesYesCP pucker.
VMD18, paper chain representation19YesNoIn VMD. CP pucker.
GlyTorsionYesYesPDB statistics using
ring torsion angles.
CHARMM CORREL command20NoNoIn CHARMM. CP
Pucker timeseries.

A further aim was to ensure that these tools (Tessellate and Montage) function independently from informatics workflows systems, while also being readily incorporated and fully functional as part of systems such as Galaxy21 and the Glycome Analytics Platform22.

Methods

Tessellate is a package for quantifying ring conformation and qualifying the conformer in terms of a canonical conformer (e.g. chair, boat). Developed in Python, Tessellate uses BioPython23 for accessing protein data bank files. Using Tessellate with Visual Molecular Dynamics (VMD) is possible through a tcl script provided. Visualising the results of this analysis is done using Montage.

Montage is a fork of the excellent MultiQC6 and provides reporting templates and functionality for computational chemistry and chemical glycobiology. Previously we prototyped reporting within a web platform, the aim here was to ensure that the visual reports connected easily into workflow systems such as Galaxy21, GAP22 and were also transferable outside of these platforms.

Implementation

These tools were designed to work in Linux.

Triangular tessellation for ring sizes 5–8 – Implemented as per the following papers11,14,15. Protein and water residues in the input structures are ignored by default.

Nomenclature and ring ordering – Conformational qualification depends on a consistent ring ordering according to the IUPAC definition24. Residues atoms are matched to an internal ligand dictionary and aligned in best-possible cyclic order to prevent atom ordering mistakes.

Ring finding algorithm – Cycles are detected using the Smallest Set of Smallest Rings (SSSR) ring perception algorithm25.

Conformational quantitation – Coordinates are matched to known canonical conformers coordinates. The best match is returned.

Macrocyclic – Macrocycle conformations are returned based on the centre of geometry of any subsidiary cycles.

Operation

Requirements: Python 3, Linux. For PDB analysis, BioPDB is required. Detailed installation instructions, package requirements and usage examples are provided in the README file of the released software (see Software availability section).

Tessellate reads in molecular data and outputs a summary of cyclic conformations (Input file types: list containing PDB IDs, PDB file, list of atomic coordinates. Output file types: JSON, txt). Montage reads the summary from Tessellate and produces a html report with a summary chart and charts for 5,6,7,8 rings found in the inputs (Input file types: JSON, txt. Output file types: HTML).

Usage example:

The workflow is as follows:

  • 1. Choose molecular data e.g. from the PDB or use VMD to create a list of atomic coordinates from molecular simulation data.

  • 2. Run Tessellate on the Linux command line e.g. `tessellate data/usecase-*DNA --input-format=pdblist --output-format=json --output-dir=output-usecase-rnadna`.

  • 3. Run Montage on the Linux command line e.g. `multiqc output-usecase-rnadna -m montage_tessellate`.

  • 4. Open report using a web browser.

Use cases

Simulation – Ribose in vacuo biased molecular dynamics calculation

The conformational changes during a biased free energy simulation for ribose. This analysis can be used to identify patterns in conformational change and confirm adequate sampling of phase space (Figure 2). VMD was used to open and read coordinate and trajectory files, and write out the atomic coordinates of the ribose 5 membered ring. The atomic coordinates are read in by Tessellate, producing pucker coordinates stored as JSON. MultiQC creates an HTML report from the JSON input.

9c758b14-0f2c-45c1-8d8b-2ee4453abb4e_figure2.gif

Figure 2. Scatter plot and histogram of ring pucker from timeseries data.

Comparative conformational character of DNA and RNA

The nucleotides A-DNA and Z-DNA tend to be C3’-endo (3E), but in B-DNA the tendency is to C2’-endo (2E); this accounts for the different molecular shape of the respective nucleotide helices. To explore this, PDB structures that are representative of A, B and Z DNA (1ANA, 3BSE, and 2DCG) were analysed (see Operation section and Figure 3).

9c758b14-0f2c-45c1-8d8b-2ee4453abb4e_figure3.gif

Figure 3. Histogram of ring pucker in different types of DNA, the planar conformation is due to the nitrogen base.

Cyclodextrins

An analysis of alpha cyclodextrin (ACD) from the PDB shows the glucose monomer conformations are mostly 4C1 (85.9%) with few deviations (Figure 4). The ring pucker of the macrocycle, as defined by treating the centre of geometry for each glucose in cyclodextrin as an atom, varies between planar (P) and half chair (H, e.g. 3H4, 5H4,2H3) conformations.

Results from the three use cases can be found in Supplementary File 1.

9c758b14-0f2c-45c1-8d8b-2ee4453abb4e_figure4.gif

Figure 4. Histograms of ring pucker in alpha cyclodextrin for the standard 6 membered cycle and for the macrocycle.

Conclusions

Tessellate enables users to compile datasets from the reservoir of PDB and simulation produced data. From this ensemble of experimentally and computationally determined ring structures, the relation between cyclic conformational preference and other variables can be discovered using Montage.

Abbreviations: ACD: Alpha Cyclodextrin; GAP: Glycome Analytics Platform; JSON: JavaScript Object Notation; PDB: Protein Data Bank; SSSR: Smallest Set of Smallest Rings

Software availability

Latest source code:

Tessellate: https://github.com/scientificomputing/Tessellate

Montage: https://github.com/scientificomputing/Montage

Archived source code as at the time of publication:

Tessellate: https://doi.org/10.5281/zenodo.106865626

Montage: https://doi.org/10.5281/zenodo.106869227

Tessellate License: Apache 2.0

Montage License: GNU GPLv3

Comments on this article Comments (0)

Version 1
VERSION 1 PUBLISHED 08 Dec 2017
Comment
Author details Author details
Competing interests
Grant information
Copyright
Download
 
Export To
metrics
Views Downloads
F1000Research - -
PubMed Central
Data from PMC are received and updated monthly.
- -
Citations
CITE
how to cite this article
Barnett C and Naidoo K. Tessellate & Montage: Molecular analytics of cyclic conformations [version 1; peer review: 1 approved, 1 approved with reservations]. F1000Research 2017, 6:2113 (https://doi.org/10.12688/f1000research.13261.1)
NOTE: If applicable, it is important to ensure the information in square brackets after the title is included in all citations of this article.
track
receive updates on this article
Track an article to receive email alerts on any updates to this article.

Open Peer Review

Current Reviewer Status: ?
Key to Reviewer Statuses VIEW
ApprovedThe paper is scientifically sound in its current form and only minor, if any, improvements are suggested
Approved with reservations A number of small changes, sometimes more significant revisions are required to address specific details and improve the papers academic merit.
Not approvedFundamental flaws in the paper seriously undermine the findings and conclusions
Version 1
VERSION 1
PUBLISHED 08 Dec 2017
Views
4
Cite
Reviewer Report 31 Jul 2018
B. Lachele Foley, Complex Carbohydrate Research Center, The University of Georgia, Athens, GA, USA 
Approved with Reservations
VIEWS 4
Relevant reviewer expertise: conformational properties of carbohydrates, especially puckering of 6-member rings; Linux; scientific computing, especially molecular modeling and the proper communication of simulation methods; communication of technical information to interdisciplinary and non-specialist audiences; some biochemistry of carbohydrates.

... Continue reading
CITE
CITE
HOW TO CITE THIS REPORT
Foley BL. Reviewer Report For: Tessellate & Montage: Molecular analytics of cyclic conformations [version 1; peer review: 1 approved, 1 approved with reservations]. F1000Research 2017, 6:2113 (https://doi.org/10.5256/f1000research.14386.r35646)
NOTE: it is important to ensure the information in square brackets after the title is included in all citations of this article.
Views
9
Cite
Reviewer Report 21 Dec 2017
Marcello Sega, Faculty of Physics, University of Vienna, Vienna, Austria 
Approved
VIEWS 9
The authors present two python-based packages for the analysis of ring conformations in molecular systems and for the visualization of the related data. The tessellate package implements various algorithms for the detection of cycles of different size (5 to 8 included) and ... Continue reading
CITE
CITE
HOW TO CITE THIS REPORT
Sega M. Reviewer Report For: Tessellate & Montage: Molecular analytics of cyclic conformations [version 1; peer review: 1 approved, 1 approved with reservations]. F1000Research 2017, 6:2113 (https://doi.org/10.5256/f1000research.14386.r28851)
NOTE: it is important to ensure the information in square brackets after the title is included in all citations of this article.

Comments on this article Comments (0)

Version 1
VERSION 1 PUBLISHED 08 Dec 2017
Comment
Alongside their report, reviewers assign a status to the article:
Approved - the paper is scientifically sound in its current form and only minor, if any, improvements are suggested
Approved with reservations - A number of small changes, sometimes more significant revisions are required to address specific details and improve the papers academic merit.
Not approved - fundamental flaws in the paper seriously undermine the findings and conclusions
Sign In
If you've forgotten your password, please enter your email address below and we'll send you instructions on how to reset your password.

The email address should be the one you originally registered with F1000.

Email address not valid, please try again

You registered with F1000 via Google, so we cannot reset your password.

To sign in, please click here.

If you still need help with your Google account password, please click here.

You registered with F1000 via Facebook, so we cannot reset your password.

To sign in, please click here.

If you still need help with your Facebook account password, please click here.

Code not correct, please try again
Email us for further assistance.
Server error, please try again.