ALL Metrics
-
Views
-
Downloads
Get PDF
Get XML
Cite
Export
Track
Software Tool Article

ENA Source Attribute Helper: An Application Programming Interface to facilitate accurate reference to biological source data

[version 1; peer review: 1 approved, 1 not approved]
PUBLISHED 13 Sep 2022
Author details Author details
OPEN PEER REVIEW
REVIEWER STATUS

This article is included in the Bioinformatics gateway.

This article is included in the EMBL-EBI collection.

Abstract

Background: Metadata attributes of sequences that accurately reference their biological sources, as specimens or other materials of origin, and link with natural history collections, are essential to facilitate the connections between different fields in life sciences and promote reusability of data. However, metadata used to reference the biological source of sequences available within the molecular data repositories are not always well structured or comprehensive.
Methods: Within the scope of the Horizon 2020 project Biodiversity Community Integrated Knowledge Library (BiCIKL), we have developed a tool, the European Nucleotide Archive (ENA) Source Attribute Helper Application Programming Interface (API), to help users accurately report biological source-related sequence and sample attributes. This tool currently focuses on the attributes in which specimens, cultures or other materials are identified, from which the sequence data were derived, and uses curated data to obtain the unique codes for the institutions and collections holding the vouchers. The API's main functions include the presentation of metadata associated with queried institutions or collections, validation of institution and collection codes in the attribute strings provided by the user, and the construction of an attribute string based on user-entered data. The API does not however support the search of voucher specimen codes, as these need to be obtained directly from the voucher institutions. We describe the API and discuss use cases for its different endpoints. The API is available at https://www.ebi.ac.uk/ena/sah/api/.
Conclusions: We expect the API to promote and support the initial submission and any subsequent curation of biological source attributes, and hereby contribute to better links between sequence data and natural history collections, and hence on to taxonomy and biodiversity research, towards increasing the discoverability, reusability and impact of data.

Keywords

European Nucleotide Archive, submission tools, validation functions, specimen voucher, culture collection, bio material, NCBI Biocollections

Introduction

The generation and archiving of sequence data and associated metadata at large scale have transformed and promoted research in the life sciences. Sequence data have been essential to scientific breakthroughs in several fields such as medicine, food security, evolutionary biology and biodiversity conservation.

One of the most important aspects of the archiving of sequence data is metadata management, which is crucial for the accurate description of Earth’s genetic and genomic biodiversity and its preservation in molecular sequence collections (Waterhouse et al. 2021). By holding enriched metadata, such as biological source attributes, that describe the material provenance of sequence data (allowing linking to the specimen of origin), molecular sequence collections facilitate connections between molecular biology, taxonomy, systematics and biodiversity research, increasing the discoverability and usability of data by researchers worldwide.

The infrastructure for storing and sharing of sequence data is the International Nucleotide Sequence Database Collaboration (INSDC, Arita et al. 2021) that operates between the DNA Data Bank of Japan (DDBJ, Fukuda et al. 2021), the National Centre for Biotechnology Information (NCBI, Sayers et al. 2021) and the European Nucleotide Archive (ENA, Cummins et al. 2022) that stands as its European node. The INSDC contains currently over 236 million sequences and 1.7 billion reads (https://www.ncbi.nlm.nih.gov/genbank/statistics/), holding a large body of associated metadata related to sequenced sample sources, as culture collection or natural history collection annotations. However, for a number of records these metadata may be incomplete or ambiguous, which hinders the linking of the sequence data to their origin and therefore reduces data reusability.

The Biodiversity Community Integrated Knowledge Library (BiCIKL) is a Horizon 2020 project that aims to establish open science practices in the biodiversity domain by providing Findable, Accessible, Interoperable and Reusable (FAIR) access and developing new methods and workflows for linking data along the biodiversity research cycle, namely from data resources of molecular biology, natural history collections, taxonomy, and literature (Penev et al. 2022). To take full advantage of these workflows a foundation of well-structured and accessible metadata is required, namely in the molecular sequence databases. Therefore, in the scope of this project we have developed a tool for driving the accurate and complete reporting of biological source metadata: the ENA Source Attribute Helper.

The metadata that refers to the biological source of sequence data is described for sequences in the source feature qualifiers that are embedded in the sequence flat files (INSDC 2021, https://www.ebi.ac.uk/ena/WebFeat/) and in the samples’ attributes. These will be hereafter referred simply as ‘attributes’. These attributes are submitted to the ENA at the time of data deposition or in subsequent updates. ENA holds many routes for data submission (such as web interfaces, RESTful Application Programming Interfaces (APIs) and locally installed command-line tools), so there is no single point of entry for supporting submission of accurate provenance metadata. Therefore, the Source Attribute Helper is a publicly accessible open-source tool that may be used as a free-standing service independently across platforms and workflows. The initial version of the tool focuses on the sequence and sample attributes that identify the specimen, culture, or material from which the sequence was derived, namely /specimen_voucher, /culture_collection, and /bio_material. These attributes are formatted according to the Darwin Core Standards (Wieczorek et al. 2012) and follow a Darwin Core Triplet format, composed of Institution code, collection code and the specimen, culture, or material id, accordingly (Table 1). The tool was developed to help users fetch accurate information regarding the institution and collections codes of the specimen, culture, or material, and construct and validate the string to be submitted as an attribute of the sequence data. The tool does not however support the search of voucher specimen codes, as these need to be obtained directly from the institutions.

Table 1. Attributes for the biological source of sequence data addressed in the current version of the ENA Source Attribute Helper API.

The value of the attributes follows a Darwin Core triplet format (according to Darwin Core Standards, Wieczorek et al. 2012). Specimen ID, culture ID, and material ID are mandatory values. Institution code is optional for specimen voucher and bio material, but mandatory for culture collection. Collection code is always optional. When collection code is provided, institution code is mandatory (see INSDC 2021, also available at https://www.ebi.ac.uk/ena/WebFeat/, for more details on these attributes). ENA, European Nucleotide Archive; API, Application Programming Interface; INSDC, International Nucleotide Sequence Database Collaboration.

AttributeDefinitionValue format
Specimen voucherIdentifier for the specimen from which the data was obtained[<institution-code>:[<collection-code>:]]<specimen_id>
Culture collectionIdentifier for the culture from which the data was obtained<institution-code>:[<collection-code>:]<culture_id>
Bio materialIdentifier for the biological material from which the data was obtained[<institution-code>:[<collection-code>:]]<material_id>

In this paper we describe the design and implementation of the API. We also describe use cases for its application, highlighting its utility for increasing the accuracy of biological source attributes in molecular databases and promoting reusability.

Methods

The ENA Source Attribute Helper, although described as a single tool, comprises several endpoints with different functions, namely, to display metadata associated with institutions or collections, to validate the attribute string provided by the user according to the institutions and collections database, and to construct the attribute string based on data input by the user. The code is available from GitHub and is archived with Zenodo (Jayathilaka & Gupta 2022).

For the retrieval of information on the institutions and collections and subsequent validation, the application uses the data available in the NCBI Biocollections (RRID:SCR_016459). NCBI Biocollections is a curated database of metadata for herbaria, museums, culture collections, and other natural history collections, that are connected to records in INSDC, and is maintained by the NCBI taxonomy group (Sharma et al. 2018). It includes institution and collection codes and their URLs, where available, that allow users to find additional information. New records are added to the database upon submission of information together with sequence records to INSDC. The ENA Source Attribute Helper consumes the curated data from NCBI Biocollections to get, validate and construct the values for the attributes (Tables 2 and 3). Currently, the available Biocollections database files are retrieved manually from the ftp server and imported into the ENA ElasticSearch datastore.

Table 2. Description of fields included in the NCBI Biocollections institutions database imported into the ENA ElasticSearch datastore.

NCBI, National Centre for Biotechnology Information; ENA, European Nucleotide Archive.

FieldExampleTypeRequirementNotes
_idx-P-TIEBpiSBteIpqVU0System set uuidSystem setSystem set uuid.
inst_id1111IntegerMandatoryProvided Institution id
inst_codeCAMZMStringMandatoryInstitution Code
unique_nameUMZCStringMandatoryInstitution Unique Name
synonymsAEICStringOptionalInstitute Synonym
inst_nameUniversity Museum of Zoology CambridgeStringMandatoryName of the Institution
countryUnited KingdomStringMandatoryInstitution Country
addressDowning Street, Cambridge, CB2 3EJ, CambridgeStringMandatoryAddress of the Institution
collection_typemuseumStringMandatoryType of Collection
qualifier_typespecimen_voucherStringMandatoryAttribute: specimen_voucher/bio_material/culture_collection
home_urlhttp://www.zoo.cam.ac.uk/museum/StringOptionalHome Page URL
url_ruleStringOptionalURL Rule Page

Table 3. Description of fields included in the NCBI Biocollections collections database imported into the ENA ElasticSearch datastore.

NCBI, National Centre for Biotechnology Information; ENA, European Nucleotide Archive.

FieldExampleTypeRequirementNotes
_iddee11d4e-63c6-4d90-983c-5c9f1e79e96cSystem set uuidSystem setSystem set uuid
coll_id222IntegerMandatoryProvided Collection id
inst_id12345IntegerMandatoryMapped Institution id
coll_codeAnnelidStringMandatoryCollection Code
coll_nameAnnelid CollectionStringMandatoryName of the Collection
coll_typemuseumStringMandatoryType of Collection
qualifier_typespecimen_voucherStringMandatoryAttribute: specimen_voucher/bio_material/culture_collection
coll_urlhttp://nature.ca/collections/inverts_e.cfmStringOptionalCollection Reference URL
coll_url_rulehttps://science.mnhn.fr/institution/mnhn/collection/ar/item/arStringOptionalCollection URL Rule

The development of the ENA Source Attribute Helper API was based on the following tools and frameworks:

  • 1. Spring Boot API framework: This is a framework used for building RESTFul APIs that are accessible from various platforms/clients including but not limited to web browsers, mobile devices, desktop applications etc.

  • 2. ElasticSearch datastore: The application utilises strengths of ElasticSearch datastore to enable text search over JavaScript Object Notation (JSON) data over multiple properties and provide suggestions/similar matches.

  • 3. Spring Data and object–relational mapping (ORM): The application utilises Spring Data libraries to create abstractions over repository and custom object mappings along with usage of an ORM (like Hibernate).

  • 4. Postman: This is an open-source tool for testing, monitoring and publishing APIs.

  • 5. Swagger: This is an Interface Description Language (IDL) used for the description of RESTful APIs. It allows the visualisation of the various API endpoints and an easy execution of the commands.

Implementation

Construct & validation flows

The main function of the API is to validate and construct the Attribute Values (see Table 1), to ensure that these are aligned with the format definition. The user inputs a code or name (at least one character) for the institution or collection and the application suggests the closest options available, so that the user can select the correct option. The construct and validation flows are represented in Figure 1.

2c09cb0a-2149-4f3d-86b9-05e7d8a0d542_figure1.gif

Figure 1. Construct & validation flow diagram.

Core

The core of the application is built using the Spring Boot framework, which follows a layered architecture approach (Presentation Layer, Business Layer, Persistence Layer, Database Layer) in which each layer communicates to other layers in a hierarchical order. The Database entities (ElasticSearch datastore) and the Backend APIs (Web API Layer) are represented in Figure 2.

2c09cb0a-2149-4f3d-86b9-05e7d8a0d542_figure2.gif

Figure 2. Construct & validation tool components of the application.

API, Application Programming Interface.

Operation

API endpoints

API endpoints are the channels through which other applications can communicate with or consume an API. They are represented by Universal Resource Locators (URL), which serve as points of entry.

Table 4 describes the various endpoints that this API provides. The endpoint to get error-codes is an additional endpoint that allows users to fetch the definitions of the error codes, which may be returned by the system. This may be useful for system integration and error handling on the client side.

Table 4. ENA Source Attribute Helper API Endpoints description and success and failure responses.

ENA, European Nucleotide Archive; API, Application Programming Interface.

API EndpointVerbActionSuccessFailure
/institution​/{ivalue}GETFinds Institution using institution name or code. If the institution name or code is not fully known, 1 or more characters can be provided. API searches both for exact matches and for partial matches by either institution name or institution code.200 OK400 Bad Request
/institution​/{institutionUniqueName}/collectionGETGets all collections by institution unique name200 OK400 Bad Request
/institution​/{institutionUniqueName}​/collection/{cvalue}GETGets collection by institution's unique name and collection code. If the collection name or code is not fully known, 1 or more characters can be provided.200 OK400 Bad Request
/validateGETValidates the provided attribute string200 OK400 Bad Request
/constructGETConstructs the attribute string200 OK400 Bad Request
/error-codesGETGets the error codes definition200 OK400 Bad Request

API access & tools

The API can be accessed using:

  • any Web Browser

  • any scripting/programming language-based REST client

  • command line tools like cURL and Wget

  • testing tools like Swagger user interface (UI) and Postman

Testing tools, such as the Swagger UI, facilitate the usage of the API even by a non-technical person (Figure 3), while other tools such as Postman may require a higher level of technical understanding to know how to consume the API.

2c09cb0a-2149-4f3d-86b9-05e7d8a0d542_figure3.gif

Figure 3. Swagger UI for the ENA Source Attribute Helper API.

UI, User Interface; ENA, European Nucleotide Archive; API, Application Programming Interface.

Using the Swagger interface

Swagger is a web browser based graphical UI that provides a set of form fields and hints for interacting with a RESTful API. It helps a user to interact with and test the API by hiding the complexity of building correct requests. It uses annotations and descriptions in the source code of the application to describe the API in human readable format. Figure 3 shows the general UI for Swagger, and an example of a JSON response for the /validate endpoint (which validates the given attribute string) is displayed in Figure 4. Figure 5 shows an example of a JSON response for the /construct endpoint, which validates input and constructs the attribute string based on the provided parameters.

2c09cb0a-2149-4f3d-86b9-05e7d8a0d542_figure4.gif

Figure 4. Swagger UI for /validate endpoint.

UI, User Interface.

2c09cb0a-2149-4f3d-86b9-05e7d8a0d542_figure5.gif

Figure 5. Swagger UI for /construct endpoint.

UI, User Interface.

Using the cURL command-line tool

cURL is a widely available free and open-source command-line tool for transferring data using URL syntax. Figures 6 and 7 show two examples of the usage of the /validate and /construct API endpoints to validate and construct the attribute string, respectively, using cURL.

2c09cb0a-2149-4f3d-86b9-05e7d8a0d542_figure6.gif

Figure 6. cURL request for /validate API endpoint.

API, Application Programming Interface.

2c09cb0a-2149-4f3d-86b9-05e7d8a0d542_figure7.gif

Figure 7. cURL request for /construct API endpoint.

API, Application Programming Interface.

Using the Postman API client

The Postman API client is a tool to easily explore, debug, and test APIs while also enabling users to define complex API requests for HTTP, REST, SOAP, GraphQL, and WebSockets. In the development of the tool, we engaged it to inspect API endpoints and their responses. An example JSON response for the /validate endpoint, which validates a given attribute string and presents the support data for the institution and collection values, is shown in Figure 8.

2c09cb0a-2149-4f3d-86b9-05e7d8a0d542_figure8.gif

Figure 8. Postman UI for /validate endpoint, showing the request and the obtained response.

UI, User Interface.

Other API clients - Python

Python is a scripting/programming language that allows a quick output and integrates systems more effectively. Figures 9 and 10 show basic code examples to demonstrate querying two of the available API endpoints - /validate and /construct.

2c09cb0a-2149-4f3d-86b9-05e7d8a0d542_figure9.gif

Figure 9. Python sample code for /validate API endpoint.

API, Application Programming Interface.

2c09cb0a-2149-4f3d-86b9-05e7d8a0d542_figure10.gif

Figure 10. Python sample code for /construct API endpoint.

API, Application Programming Interface.

Deployment

The ENA Source Attribute Helper application is deployed on the European Molecular Biology Laboratory's European Bioinformatics Institute (EMBL-EBI) infrastructure, and loads balanced on a cluster of servers for resiliency and high availability.

Use cases

Institution codes

Users submitting sequence related data may need to look for the unique code for the institution holding the voucher associated with the data. The API endpoint Get Institution allows the user to fetch the Institution details by providing either the institution name or code (Table 5). The type of attribute (‘qualifier_type’: specimen voucher, culture collection, or bio material) may also be optionally specified, but if none is provided the API will search within all attributes. The API searches both for exact matches and for partial matches. The API response will include the metadata for all institutions with exact or partial matches to the input value, allowing the user to confirm the details of the institution that is holding the voucher.

Table 5. Parameters required for the API endpoint Get Institution.

Requirements and an example are also provided. API, Application Programming Interface.

ParametersExampleTypeRequirementNotes
ivalueCAMZXStringMandatoryInstitution name/code - to search for institution(s)
qualifier_typespecimen_voucher, bio_material, culture_collectionStringOptionalFilters results for a specific attribute

Collection codes

Once the user knows the unique institution code for their voucher, they may need to identify the unique code for the collection to input in the attribute string. The API has two endpoints that allow users to search for the collection codes. In both endpoints the type of attribute (‘qualifier_type’: specimen voucher, culture collection, or bio material) may be optionally specified.

The endpoint Get Collections for the Institution allows to fetch all collections in a given institution by providing the institution’s unique name (unique code, Table 6). This operation looks only for an exact match of the institution's unique name and returns the complete list of collections within that institution and associated metadata. If the institution’s unique name is not found in the database the endpoint does not return any record.

Table 6. Parameters required for the API endpoint Get Collections for the Institution.

Requirements and an example are also provided. API, Application Programming Interface.

ParametersExampleTypeRequirementNotes
institutionUniqueNameCAMZXStringMandatoryInstitution’s unique name
qualifier_typespecimen_voucher, bio_material, culture_collectionStringOptionalFilters results for a specific attribute

The endpoint Get Collections by Institution Unique name and Collection Code allows users to obtain the metadata of a given collection of an institution by providing the institution’s unique name (unique code) and known collection code (Table 7). This endpoint searches for an exact match of the institution's unique name, and a full or partial match of the collection code and returns the metadata for the collection found. If the institution’s unique name or collection code are not found in the database, the endpoint does not return any record.

Table 7. Parameters required for the API endpoint Get Collections by Institution Unique name and Collection Code.

Requirements and an example are also provided. API, Application Programming Interface.

ParametersExampleTypeRequirementNotes
institutionUniqueNameCAMZXStringMandatoryInstitution’s unique name
cvalueherpStringMandatoryCollection Code
qualifier_typespecimen_voucher, bio_material, culture_collectionStringOptionalFilters results for a specific attribute

Validate an attribute

Users that are already aware of the format of the biological source attributes, and have information about the institution and collection codes, may use the API endpoint Validate Attribute to validate the attribute string. The user needs to provide the attribute string in the format detailed in Table 1, according to the attribute type. The attribute type (‘qualifier_type’: specimen voucher, culture collection, or bio material) may also be specified to narrow the search, but if none is provided the API will search within all attribute values (Table 8). The API performs the search for the exact match, but if none is found, a search for partial matches for the provided string will be performed. The response includes the type of match (match level exact or partial), a recommendation for the qualifier value (match) that may correspond to the input or include corrections to the unique values of the institution and collections, and the metadata of the referred institution and collections (Figure 8). If the match isn’t exact and there is more than one possible match to the attribute string input by the user, the response will include all possible matches and associated metadata.

Table 8. Parameters required for the API endpoint Validate attribute.

Requirements and an example are also provided. API, Application Programming Interface.

ParametersExampleTypeRequirementNotes
valueMSNT:FAZC:123456StringMandatoryAttribute string to be validated in the format described in Table 1
qualifier_typespecimen_voucher, bio_material, culture_collectionStringOptionalFilters results for a specific attribute

Construct the attribute

Users may use the API endpoint Construct the Attribute to help them obtain the correct attribute string for referring to the biological source of the voucher linked with the sequence data. In this endpoint the user needs to provide separately the expected values for the institution, collection, and the ID of the specimen, culture, or material, depending on the attribute type (Table 9). The type of attribute (‘qualifier_type’: specimen voucher, culture collection, or bio material) may also be specified to narrow the search, but if none is provided the API will search within all attribute values. As in the validate function, the API also searches for partial matches. The response includes the type of match (match level exact or partial), the constructed attribute string from the values input by the user (input value), a recommendation for the attribute string (match) that may correspond to the input or include corrections to the unique values of the institution and collections, and the metadata of the referred institution and collections. In Figure 7 (cURL request for /construct API endpoint) we can see a case where the match is only partial, as there is more than one option for the institution code provided by the user. In these situations, the response will include all possible matches and associated metadata.

Table 9. Parameters required for the API endpoint Construct the Attribute.

Requirements and an example are also provided. API, Application Programming Interface.

ParametersExampleTypeRequirementNotes
ivalueHSUVStringMandatoryInstitution’s unique name
cvalueBirdStringOptionalCollection Code
id123456StringMandatoryIdentifier value
qualifier_typespecimen_voucher, bio_material, culture_collectionStringOptionalFilters results for a specific attribute

Future steps

Further developments of the ENA Source Attribute Helper API are planned.

Regarding the retrieval of the data, an automated flow for getting the updated files from the NCBI servers regularly is planned for implementation.

The development of a Graphical User Interface (GUI) is also planned for implementation, likely embedded in, or accessible from, one or more of ENA's existing submission tools. This will allow more intuitive searches for Institutions and/or Collections metadata and the validation/construction of the qualifier values to be more accessible to inexperienced users. This UI will connect to the API to support features like:

  • Dynamic auto completion of user input

  • Visual indicators for attribute matches

  • Easy copying of constructed/validated attributes

  • Metadata browsing

Conclusions

Considering the increasing rates of generation and submission of sequence data to public repositories it becomes increasingly important to assure the greatest accuracy and precision of associated metadata. Hence, we have developed and deployed a tool that will considerably help users to provide accurate metadata for reference to the biological source of sequence data. We have described the ENA Source Attribute Helper API design and implementation and discussed its main usages. We expect this tool to promote and support the submission of better structured and more richly described data that will provide a stronger foundation to strengthen the value of natural history collections, taxonomic expertise, and biodiversity knowledge.

For biodiversity research, the wider availability of correctly structured biological source attributes in sequence data will, for instance, improve the linkage with distribution data in the Global Biodiversity Information Facility (GBIF; GBIF 2022). GBIF holds a data-clustering feature that identifies records that are potentially related by matching similar metadata entries (GBIF 2020). In the case of the INSDC Sequences dataset in GBIF the fields used for matching are the biological source attributes in the triple Darwin Core format (Grosjean & Robertson 2021). Therefore, we expect that the number of sequence records linked to specimens in natural history collections and to their distribution data will increase with the usage of the ENA Source Attribute Helper API. Monitoring these links will help us to measure the impact of the usage of this tool.

Overall, we expect the enrichment of the provenance metadata of sequences in molecular biology repositories to contribute to boost our understanding of, and effectiveness of response to global challenges such as biodiversity loss, ecosystem change and food security.

Data availability

Underlying data

The data used in this API are available at https://www.ncbi.nlm.nih.gov/biocollections. The API retrieves data from the institutions, collections and unique institutions codes files that are available for public access at https://ftp.ncbi.nih.gov/pub/taxonomy/biocollections/.

Software availability

Software available from: https://www.ebi.ac.uk/ena/sah/api/

Source code available from: https://github.com/enasequence/ena-source-annotation-helper

Archived source code at time of publication: https://doi.org/10.5281/zenodo.7063227 (Jayathilaka & Gupta 2022)

License: Apache License 2.0 license

Comments on this article Comments (0)

Version 1
VERSION 1 PUBLISHED 13 Sep 2022
Comment
Author details Author details
Competing interests
Grant information
Copyright
Download
 
Export To
metrics
Views Downloads
F1000Research - -
PubMed Central
Data from PMC are received and updated monthly.
- -
Citations
CITE
how to cite this article
Gupta V, Paupério J, Burgin J et al. ENA Source Attribute Helper: An Application Programming Interface to facilitate accurate reference to biological source data [version 1; peer review: 1 approved, 1 not approved]. F1000Research 2022, 11:1042 (https://doi.org/10.12688/f1000research.123934.1)
NOTE: If applicable, it is important to ensure the information in square brackets after the title is included in all citations of this article.
track
receive updates on this article
Track an article to receive email alerts on any updates to this article.

Open Peer Review

Current Reviewer Status: ?
Key to Reviewer Statuses VIEW
ApprovedThe paper is scientifically sound in its current form and only minor, if any, improvements are suggested
Approved with reservations A number of small changes, sometimes more significant revisions are required to address specific details and improve the papers academic merit.
Not approvedFundamental flaws in the paper seriously undermine the findings and conclusions
Version 1
VERSION 1
PUBLISHED 13 Sep 2022
Views
10
Cite
Reviewer Report 28 Oct 2022
Begoña Aguado, Centro de Biología Molecular Severo-Ochoa (CBMSO) (CSIC-UAM), Consejo Superior de Investigaciones Científicas, Universidad Autónoma de Madrid, Madrid, Spain 
Sandra Gonzalez, Centro de Biología Molecular Severo-Ochoa (CBMSO) (CSIC-UAM), Consejo Superior de Investigaciones Científicas, Universidad Autónoma de Madrid, Madrid, Spain 
Approved
VIEWS 10
In this manuscript, the authors present an interesting tool, the European Nucleotide Archive (ENA) Source Attribute Helper Application Programming Interface (API), to help users accurately report biological source-related sequence and sample attributes. The authors also provide links that help to ... Continue reading
CITE
CITE
HOW TO CITE THIS REPORT
Aguado B and Gonzalez S. Reviewer Report For: ENA Source Attribute Helper: An Application Programming Interface to facilitate accurate reference to biological source data [version 1; peer review: 1 approved, 1 not approved]. F1000Research 2022, 11:1042 (https://doi.org/10.5256/f1000research.136094.r150251)
NOTE: it is important to ensure the information in square brackets after the title is included in all citations of this article.
Views
14
Cite
Reviewer Report 14 Oct 2022
Sujeevan Ratnasingham, Centre for Biodiversity Genomics, University of Guelph, Guelph, ON, Canada 
Not Approved
VIEWS 14
The authors present an API endpoint they described as a Source Attribute Helper, a tool that provides attribute values. Upon testing the tool and reviewing its design, this reviewer can only conclude that it an API query interface for a ... Continue reading
CITE
CITE
HOW TO CITE THIS REPORT
Ratnasingham S. Reviewer Report For: ENA Source Attribute Helper: An Application Programming Interface to facilitate accurate reference to biological source data [version 1; peer review: 1 approved, 1 not approved]. F1000Research 2022, 11:1042 (https://doi.org/10.5256/f1000research.136094.r150253)
NOTE: it is important to ensure the information in square brackets after the title is included in all citations of this article.

Comments on this article Comments (0)

Version 1
VERSION 1 PUBLISHED 13 Sep 2022
Comment
Alongside their report, reviewers assign a status to the article:
Approved - the paper is scientifically sound in its current form and only minor, if any, improvements are suggested
Approved with reservations - A number of small changes, sometimes more significant revisions are required to address specific details and improve the papers academic merit.
Not approved - fundamental flaws in the paper seriously undermine the findings and conclusions
Sign In
If you've forgotten your password, please enter your email address below and we'll send you instructions on how to reset your password.

The email address should be the one you originally registered with F1000.

Email address not valid, please try again

You registered with F1000 via Google, so we cannot reset your password.

To sign in, please click here.

If you still need help with your Google account password, please click here.

You registered with F1000 via Facebook, so we cannot reset your password.

To sign in, please click here.

If you still need help with your Facebook account password, please click here.

Code not correct, please try again
Email us for further assistance.
Server error, please try again.