An ORCID based synchronization framework for a national CRIS ecosystem

João Mendes Moreira; Alcino Cunha; Nuno Macedo

doi:10.12688/f1000research.6499.1

Home Browse An ORCID based synchronization framework for a national CRIS ecosystem

ALL Metrics

Views

Downloads

Get PDF

Get XML

Export

▬

✚

Software Tool Article

An ORCID based synchronization framework for a national CRIS ecosystem

[version 1; peer review: 2 approved, 1 approved with reservations]

João Mendes Moreira¹, Alcino Cunha², Nuno Macedo²

PUBLISHED 06 Jul 2015

Author details Author details

¹ FCCN, Fundação para a Ciência e Tecnologia, Lisboa, 1700-066, Portugal
² HASLab, INESC TEC and Universidade do Minho, Campus de Gualtar, Braga, 4710-057, Portugal

OPEN PEER REVIEW

REVIEWER STATUS

This article is included in the Research on Research, Policy & Culture gateway.

This article is included in the Proceedings of the 2015 ORCID-Casrai Joint Conference collection.

Abstract

PTCRIS (Portuguese Current Research Information System) is a program aiming at the creation and sustained development of a national integrated information ecosystem, to support research management according to the best international standards and practices.

This paper reports on the experience of designing and prototyping a synchronization framework for PTCRIS based on ORCID (Open Researcher and Contributor ID). This framework embraces the "input once, re-use often" principle, and will enable a substantial reduction of the research output management burden by allowing automatic information exchange between the various national systems.

The design of the framework followed best practices in rigorous software engineering, namely well-established principles in the research field of consistency management, and relied on formal analysis techniques and tools for its validation and verification.

The notion of consistency between the services was formally specified and discussed with the stakeholders before the technical aspects on how to preserve said consistency were explored. Formal specification languages and automated verification tools were used to analyze the specifications and generate usage scenarios, useful for validation with the stakeholder and essential to certificate compliant services.

Keywords

CRIS, ORCID-CASRAI, PT CRIS, Synchronisation Framework

Corresponding author: João Mendes Moreira

Competing interests: The authors disclosed no competing interests.

Grant information: The author(s) declared that no grants were involved in supporting this work.

Copyright: © 2015 Mendes Moreira J et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

How to cite: Mendes Moreira J, Cunha A and Macedo N. An ORCID based synchronization framework for a national CRIS ecosystem [version 1; peer review: 2 approved, 1 approved with reservations]. F1000Research 2015, 4:181 (https://doi.org/10.12688/f1000research.6499.1) First published: 06 Jul 2015, 4:181 (https://doi.org/10.12688/f1000research.6499.1) Latest published: 06 Jul 2015, 4:181 (https://doi.org/10.12688/f1000research.6499.1)

1 Introduction

PTCRIS (Portuguese Current Research Information System) is a program, officially initiated in May 2014 by FCCN (Fundação para a Computação Científica Nacional), the FCT (Fundação para a Ciência e Tecnologia – the Portuguese Foundation for Science and Technology) unit responsible for planning, management and operation of the national research and education network, a high performance platform for developing and testing advanced communication applications and services. PTCRIS aims to ensure the creation and sustained development of a national integrated information ecosystem, to support research management according to the best international standards and practices.

One of the goals of PTCRIS is to reduce the burden of research output management, by adopting an “input once, re-use often” principle. In order to achieve this goal, a synchronization framework is being developed that relies on ORCID (http://www.orcid.org/) – a community-based service that aims to provide a registry of unique researcher identifiers and a method of linking research outputs to these identifiers, based on data collected from external sources – as a central hub for information exchange between the various national systems (including CV management systems, open-access repositories, and local CRIS systems) and international systems (WoK, Scopus, Datacite, etc.). Among other features, this framework will enable researchers (or managers) to register a given research output once at one of the interconnected national systems, and have that output automatically propagated to the other ones, thus ensuring global consistency of the stored information. The goal of this paper is to report precisely on the experience of designing and prototyping this synchronization framework.

The design of the synchronization framework followed well-established principles of rigorous software engineering. The main principle is that one should distinguish the what from the how: in this particular case, what is the desired notion of consistency between ORCID and each of the PTCRIS services, and how can a synchronization procedure be implemented to enforce such consistency. This allowed us to break down the discussion with the various stakeholders, first seeking an agreement concerning the what before dwelling in the technicalities of the how.

The second principle is that formal analysis methods and tools should be used to verify that the proposed artifacts follow desirable “well-behavedness” properties. Paraphrasing Richard Feynman, “the first principle is that you must not fool yourself, and you are the easiest person to fool”: the usage of a formal specification language and automatic verification tools allowed us to uncover several corner cases not easy to predict otherwise. In particular, we relied on the formal specification language Alloy¹ and its automatic Analyzer. This tool was also used to automatically generate usage scenarios that were useful for requirement elicitation and validation with the stakeholders, but will also be of major importance for certifying compliant PTCRIS services, by allowing rigorous testing of the proposed implementations.

The paper is organized as follows: Section 2 presents a brief overview of (and rationale for) the proposed architecture; Section 3 presents the methodology followed to achieve a trustworthy design for the various components of the framework; Section 4 describes the first prototype that was developed to validate and demo the proposed framework to the community; Section 5 briefly describes some related work; and, finally, Section 6 presents some conclusions and ideas for future work.

2 Architecture overview and rationale

Figure 1 presents an overview of the architecture of the PTCRIS synchronization framework, with some PTCRIS services shown in orange and ORCID sources in blue. PTCRIS is composed of several services with distinct objectives. Among those we have, for example:

Figure 1. PTCRIS synchronization architecture.

DeGóis The national academic CV management system (http://www.degois.pt), where information from researchers is stored and, with the proposed synchronization framework, shared across the PTCRIS ecosystem. DeGóis currently hosts around 22,000 academic CVs.
RCAAP The national open-access scientific repository portal (http://www.rcaap.pt), a platform that acts as an OAI-PMH (Open Archives Initiative Protocol for Metadata Harvesting) aggregator that harvests content from a network of institutional repositories (currently, around 70 in total) and open-access journals. RCAAP currently indexes around 213,000 publications.
SARI A DSpace hosting platform for institutional open-access repository services (it currently hosts 26 repositories).

As depicted in Figure 1, not all of these services are expected to synchronize bidirectionally with ORCID. For example, RCAAP will only export research outputs to ORCID, so that they can be harvested by other PTCRIS services. In contrast, institutional repositories (namely those hosted in SARI) will just use ORCID to harvest publications, thus liberating researchers from (the often mandatory task of) having to (manually) insert them. The academic CV management service DeGóis will both import and export research outputs. As the figure also depicts, at least in the earlier stages of deployment of the synchronization framework, some services will still synchronize directly with each other, for example the RCAAP aggregation of open-access publications from institutional repositories will still be performed directly.

There were several reasons that contributed to the choice of ORCID as the central hub for PTCRIS. The main ones are described below:

High coverage of predefined requirements. The requirements were grouped into three categories: general, functional and technical, and ORCID scored well in all of them. On the general level, items like documentation and support were considered. The main functional requirements were related with both the set of APIs (get, put, etc) and the completeness of the ORCID profile (it also supports most of the CASRAI academic funding CV elements but “services”). Usability issues were also analyzed, as this is a critical issue for PTCRIS. It was concluded that the interface for the integration of systems with ORCID is not only easy to use, but also becoming a standard.
The technical requirements were related with the easiness of implementation and infrastructure reliability and resilience (ORCID infrastructure is hosted in a world-class datacenter).
High interoperability with external sources. By the time ORCID was being considered to act as the hub for PTCRIS it was already interoperable with some of the most relevant and important sources (Crossref, Datacite, Scopus and WoK). Furthermore, its interoperability tends to increase as more sources are being added.
High ORCID coverage of the national research community. In late 2013 – early 2014, due to the research assessment exercise and for the purpose of carrying out a bibliometric study, around 15,000 researchers from Portuguese research units applied for an ORCID iD. These researchers were responsible for more than 90% of the Portuguese scientific output of the 5 years prior to 2013.
Sustainability. The costs of using ORCID as a hub are very small when compared with the alternative of developing and maintaining a homegrown hub.

Besides the benefits, risks and mitigation measures were also considered when deciding whether to use ORCID as a hub for PTCRIS. The most relevant risk identified was the collapse of the ORCID organization, but the probability of this event was considered to be low. Nevertheless, two mitigation measures were considered: install the hub locally using the ORCID source code (deposited in GitHub); populate the PTCRIS database with the mensal database copy provided to ORCID premium members.

3 Specification of the synchronization framework

This section presents the specification methodology that was used to achieve a trustworthy design for the PTCRIS synchronization framework. First, a formal specification of the data models and of the desired consistency predicates was developed. Then, synchronization procedures to enforce such consistency were specified and verified for several “well-behavedness” properties. These formal specifications were also used to automatically derive the already mentioned usage scenarios.

3.1 Data model

The synchronization framework operates at the user profile level, that is it intends to synchronize user profiles from the different PTCRIS services with the corresponding user profile from ORCID. The matching of users across these systems is a simple matter, since PTCRIS services can simply store (and most already do) the ORCID iD of the researcher locally. As such, the design of the framework focused only on a single user profile.

The main difficulties in the design of this framework stemmed from fundamental differences between the data model of ORCID and that of most PTCRIS services. As such, before presenting the desired consistency notion and synchronization procedures, we briefly present such data models. Here, we present only a very abstract view of the information stored in such profiles, focusing only on research outputs, namely works, and only on the attributes that are relevant for their synchronization. An ORCID user profile contains additional information, which PTCRIS is also interested in synchronizing among its services. However, some of this information is trivial to synchronize (e.g. education affiliations) while other, albeit not trivial, may be synchronized following the technique presented in this paper for works (e.g. funding information).

Figure 2 presents an abstract model of an ORCID user profile. For our purposes, a profile consists essentially of a set of works, each a record containing: a putcode, that uniquely identifies the work internally; a (possibly empty) set of external unique identifiers (UIDs) of the work; the source of the information in the record (which can be the user himself or any other external source associated with ORCID, such as Scopus, CrossRef, or, from now on, a PTCRIS service); any meta-data associated with the work, such as its title, publication year, publication type, authors, etc; and a boolean attribute marking whether the work is the one preferred by the user among similar ones (this boolean attribute is not directly returned by the ORCID API, but can be inferred from the order in which the works are stored in an ORCID profile, see discussion below).

Figure 2. Overview of the ORCID data model.

A distinctive feature of ORCID is precisely the possibility of using different external sources to automatically populate a user profile. This means that a user profile can contain different works that actually describe the same research output (possibly containing different or even contradictory meta-data). The ORCID web interface already groups together works that describe the same output, showing only the preferred one in the overview. The grouping mechanism is quite simple, and just assumes two works w₁ and w₂ are similar if, and only if, they have a shared UID or there is another work w₃ that is similar to both w₁ and w₂. Essentially, this recursive definition considers two works to be similar if, and only if, they share directly or indirectly (via transitivity) some UID.

ORCID imposes several constraints on this data model, such as: there cannot be two works with the same external source with shared UIDs; and among sets of similar works exactly one of them is the preferred one. The ORCID API also forces every work from an external source to have some UIDs assigned, but works added by the user via the web interface may still have an empty set of UIDs. The biggest difference of a user profile in a PTCRIS service (depicted in Figure 3) is that it does not support multiple versions of the same research output, nor the grouping feature of similar versions likewise to ORCID. To avoid confusion with ORCID works we will denote research outputs in PTCRIS as productions. The profile of a user in a PTCRIS service is essentially a set of productions, each a record with the following information: a key that uniquely identifies the production; a (possibly empty) set of UIDs; the associated meta-data; and a boolean field indicating whether the production is currently selected be the user to be exported to ORCID.

Figure 3. Overview of a PTCRIS service data model.

The PTCRIS synchronization framework is semi-automatic and notification-based. As such, each service will be required to support two kinds of notifications in a user profile: creation notifications, to alert the user that a new production has been found in ORCID; and modification notifications, to alert the user that new UIDs for an existing production have been found. The latter will be particularly useful for propagating UIDs between different PTCRIS services, in particular from open access repositories that provide handles for research outputs to academic CV management services, such as DeGóis.

Likewise to ORCID, this data model is subject to several constraints, such as disallowing exported productions to share UIDs or have no UIDs at all (to comply with the above ORCID guideline).

3.2 Consistency predicates

As stated above, we first specified and validated what is the desired notion of consistency between ORCID and each of the PTCRIS services. Formally, this consistency is a predicate of type ORCID × PTCRIS → Bool, that given two user profiles returns a boolean indicating whether they are consistent with each other. Typically, this consistency predicate is specified as a set of logical rules that must all be satisfied to render the profiles consistent.

The consistency between ORCID and a PTCRIS service was factorized in two modular consistency predicates whose rules were precisely defined in the design phase:

IMPORTED : ORCID × PTCRIS → Bool This consistency predicate should be enforced by every PTCRIS service that wishes to rely on the synchronization framework to harvest research outputs from ORCID, namely new publications and new UIDs of known publications. The general principle of IMPORTED is that every UID in ORCID should be harvested. The enforcement of this consistency predicate should be semi-automatic, based on a notification system, giving freedom to the user to select which outputs or UIDs he wishes to add to his PTCRIS profile.
EXPORTED : ORCID × PTCRIS → Bool This consistency predicate should be enforced by every PTCRIS service that wishes to be an ORCID source, and export its productions to ORCID, ensuring that other PTCRIS services can harvest them. The general principle of EXPORTED is that every exported production should be stored as a work in ORCID and then automatically kept up-to-date.

These consistency predicates are logically independent, in the sense that each can either hold or not, independently of the value of the other. A PTCRIS service may also wish to implement the conjunction of both, leading to a consistency predicate we denote as (fully) SYNCED:

SYNCED : ORCID × PTCRIS → Bool

SYNCED(o, p) $\dot{=}$ IMPORTED(o, p) ∧ EXPORTED(o, p)

Since the PTCRIS services do not support grouping likewise to ORCID, some caution must be exercised to avoid the proliferation of productions and notifications that describe the same research output. In particular, when an ORCID work is unknown to the PTCRIS service, the existence of a single creation notification, grouping all UIDs of its similar works should suffice to ensure consistency. This is just one of the rules that must be satisfied for IMPORTED to hold. IMPORTED is mainly focused on UID harvesting, the consistency of the meta-data being a secondary concern. However, meta-data still needs to be filled in when a creation is notified following the discovery of a group of (unknown) similar works. Since their meta-data can (and often does) differ, it is not clear how this meta-data extraction should be performed. On first glance, the obvious choice would be to pick the meta-data of the preferred work. Unfortunately, the following reasons prevent us from currently enforcing this behavior:

Since all groups of similar works must have a preferred work (essentially the one chosen to be displayed in the user web page), a default preferred is always chosen by ORCID when a new research output is imported or the current preferred one is deleted by the user.
The ORCID API does not currently distinguish such default preferred works from user-selected ones.
This means that the user might not have sanctioned the meta-data present in his preferred works at the time they are being imported into a PTCRIS service.
Unfortunately, meta-data is of highly variable quality in ORCID, with some sources currently publishing meta-data with gross mistakes, for example, wrong publication types.

As such, the choice of how to fill in the meta-data in creations was currently left for each PTCRIS service. Some will ignore the preferred and just allow the user to rank sources according to the perceived quality of their metadata, and then try to choose the meta-data of the work from the highest ranked source.

The EXPORTED consistency predicate is considerably simpler than IMPORTED. Essentially, the specified consistency rules force that there must exist a one-to-one correspondence between exported productions and works in ORCID whose source is the PTCRIS service.

3.3 Synchronization procedures

When the user profiles at ORCID and at the PTCRIS service are inconsistent how can they be automatically synchronized to recover the consistency? To achieve that, we have specified two separate synchronization procedures to be used when the service intends to enforce consistency according to IMPORTED or EXPORTED, respectively. These modular synchronization procedures can also be combined in a precise way, to recover the consistency in services that are enforcing both consistency predicates.

IMPORT : ORCID × PTCRIS → PTCRIS This synchronization procedure should be used to enforce the IMPORTED consistency predicate. The main principle is that it does not change the user profile in ORCID. Moreover the only changes it produces to the PTCRIS profile is to add and remove notifications.
EXPORT : ORCID × PTCRIS → ORCID This synchronization procedure should be used to enforce the EXPORTED consistency predicate. The main principle is that it does not change the user profile in PTCRIS. Moreover the only changes it produces to the ORCID profile is to add / delete / modify works whose source is the PTCRIS service.

With the help of automatic formal verification tools, the specified synchronization procedures were checked for several “well-behavedness” properties. The most important of those is correctness, that ensures that after running the synchronization procedures the user profiles in ORCID and in the PTCRIS service are indeed consistent:

IMPORTED(o, IMPORT(o, p))

EXPORTED(EXPORT(o, p), p)

Another important “well-behavedness” property is stability, ensuring that if we run the synchronization procedures on already consistent states the result is the same (modulo differences in keys):

IMPORTED(o, p) ⇒ IMPORT(o, p) = p

EXPORTED(o, p) ⇒ EXPORT(o, p) = o

Having stable synchronization procedures ensures that there is no need to explicitly check the consistency to determine if they should be run. If both user profiles are consistent, running the specified procedures would not affect them. In fact, the checking procedures have the same approximate complexity as the synchronizing procedures, and thus, no significant performance gains would be achieved by running them beforehand.

The two specified synchronization procedures can be combined to obtain a synchronization procedure that enforces SYNCED, the full consistency of the user profiles according to both IMPORTED and EXPORTED (to be used by services that wish to enforce both):

SYNC : ORCID × PTCRIS → ORCID × PTCRIS

SYNC(o, p) $\dot{=}$ let o′ = EXPORT(o, p)

in (o′, IMPORT(o′, p))

The specified order of execution is not arbitrary. In fact, it is the only order that ensures that the resulting procedure is both correct and stable:

SYNCED(SYNC(o, p))

SYNCED(o, p) ⇒ SYNC(o, p) = (o, p)

In particular, if the user profiles in ORCID and PTCRIS are not consistent according to EXPORTED, running the EXPORT procedure can make them inconsistent according to IMPORTED. As such, IMPORT must be run after EXPORT to ensure that full consistency is attained. A concrete example is presented in the following section.

3.4 Usage scenarios

The formal specification of the data models, consistency predicates, and synchronization procedures, allowed the usage of automatic analysis tools (namely, the Alloy Analyzer model finder) to generate a large number of diverse usage scenarios. This section presents one of the generated scenarios. Although small, we believe it is interesting enough to convey the usefulness of this process for requirement validation and for implementation testing.

As the initial state of this scenario, consider the PTCRIS and ORCID user profiles presented below, over which both the IMPORTED and EXPORTED consistency predicates are enforced (simulating, for instance, the DeGóis CV management system). The PTCRIS profile consists of two productions, none selected to be exported, that do not share UIDs, and thus cannot be considered similar, as depicted in Figure 4.

Figure 4. Initial PTCRIS profile.

The ORCID profile contains two groups of similar works that correspond to the two productions, depicted in Figure 5 (preferred works are depicted with round shapes).

Figure 5. Initial ORCID profile.

Even though the UIDs of Work1 and Work2 are not exact matches (nor their meta-data), they both share the EID1 identifier, and thus are considered similar and grouped by ORCID. These two profiles are IMPORTED-consistent because all UIDs from ORCID are known to the PTCRIS: Production0 contains the identifiers from Work0 while Production1 aggregates the identifiers from Work1 and Work2. Note that the PTCRIS productions actually contain additional UIDs not known to ORCID; this does not affect the consistency of the profiles since the goal of IMPORTED is to harvest information from ORCID to the PTCRIS service. Since no production is selected to be exported, the profiles are also EXPORTED-consistent.

Now imagine that the user, after examining the production’s meta-data concluded that the two productions at the PTCRIS profile actually represent the same research output. To unify them, the user introduces a UID from Production0 in Production1 (e.g., DOI0), rendering them similar. Then, this update can be propagated to other services by exporting Production1 to ORCID that acts as the research hub, as depicted in Figure 6 (productions set to be exported are denoted by bold frames).

Figure 6. PTCRIS profile after user update.

At this point, the profiles are no longer EXPORTED-consistent, so an identical ORCID work must be created from the exported production. After running the EXPORT procedure, the updated ORCID profile is depicted in Figure 7.

Figure 7. ORCID profile after EXPORT execution.

This update reflected the intentions of the user: the introduction of Work3 in the ORCID profile due to the exportation of Production1, unified the two groups under a single group of similar works. (In this scenario, Work2, one of the preferred works in the initial ORCID profile, was preserved as the preferred, while Work0 was demoted. At the moment it is not clear how ORCID would select the preferred in this situation, so our specification also considers as a acceptable possible outcome a profile where Work0 is the preferred one.) However, this has consequences to the IMPORTED-consistency of the profile, since productions related with Work0 need now be updated with the new UIDs. Thus, this update needs to be propagated to all other relevant services (like the SARI repositories), but also to the PTCRIS service that triggered this update, since it was assumed to enforce both IMPORTED and EXPORTED Concretely, when IMPORT is run back to the PTCRIS profile, Production0 is matched with the whole group of works, resulting in the profile depicted at Figure 8, where a modification notification is associated to Production0 to add all harvested UIDs.

Figure 8. PTCRIS profile after `IMPORT` execution.

Since the EXPORT procedure may introduce IMPORTED-inconsistencies, the service enforcing both consistency predicates should always run IMPORT after EXPORT, that is, the SYNC procedure specified above.

4 Prototype implementation

As proof of concept, a prototype of this synchronization framework was implemented and shown to the FCCN community in its annual meeting, held in February 2015 (http://jornadas.fccn.pt). In this prototype, the following PTCRIS services and systems were involved:

DeGóis The national CV system, already supporting ORCID iDs and a preliminary version of the IMPORT and EXPORT synchronization procedures. This new version of this system is expected to be released to the community in the 3rd Quarter of 2015.
RCAAP The national OAI-PMH aggregator, implementing a preliminary version of the EXPORT synchronization procedure.
SARI The DSpace platform used to provide institutional repository services. Version 5.1 of DSpace was used with some minor adjustments aimed to support ORCID iD (for the purpose of the demo only). Notice that the current version of DSpace, branch JSPUI, still does not support ORCID.
OJS Platform used to provide hosting to open-access journals. Like DSpace, the Open Journal System was used with minor adjustments to support ORCID iD.

The demo at the aforementioned event involved the following steps:

1. At DeGóis, create a new user profile and run IMPORT to populate it with research outputs harvested from ORCID.
2. Insert a new production at DeGóis (entitled “The gap between technologies and science”) and run EXPORT to send it to ORCID.
3. At OJS submit and approve a new article from the same researcher (entitled “Registo submetido a Revista com OJS 1”).
4. At SARI deposit an article (entitled “Portuguese repositories bloom : the RCAAP project”) in a institutional open-access repository.
5. These two articles are harvested by RCAAP, and then the researcher runs EXPORT to send them to ORCID. The state of the user profile at ORCID after these steps can be seen in Figure 9.

Figure 9. User profile at ORCID.
6. At DeGóis run the IMPORT procedure to harvest these two outputs. The state of the user profile at DeGóis after this step can be seen in Figure 10.

Figure 10. User profile at DeGóis.

This prototype showed, in our opinion, compelling and real use cases for the synchronization framework. The prototype still differs from the upcoming release version of the framework in the following aspects:

The RCAAP portal still does not implement the final version of the EXPORT procedure, that relies on ORCID Metadata Round Trip functionality to automatically feed updates to the ORCID profile.
The DSpace based SARI platform does not yet implement the IMPORT procedure. From the end user perspective, this is one of the most expected features, since it will free researchers from manually filling in meta-data. From the FCT perspective, it is also of critical importance, as the notification based synchronization service will increase the output deposit rate, and thus facilitate its open-access mandate.

5 Related work

The laws presented in Section 3.3 are standard “well-behavedness” laws in synchronization frameworks, namely on those for bidirectional transformation, whose goal is precisely to maintain two artifacts consistent by means of two transformations that propagate updates from each to the other (for an overview of this research field please see 2). To be more precise, our formalization is based on the concrete framework of constraint maintainers, proposed by Meertens³, and later used by Stevens⁴ to formalize the OMG (Object Management Group) standard bidirectional transformation language QVT-R (Query/View/Transformation - Relations)⁵. In fact, an interesting question is whether the domain specific QVT-R language could be used instead of the general purpose Alloy to formalize the consistency predicates, and later used with a QVT-R engine (for example, the Echo tool^6,7) to implement the synchronization procedures.

Alloy and its Analyzer have been previously used in the validation of transformation specifications, namely for transformations specified in QVT-R⁸ and ATL (ATLAS Transformation Language)⁹. Likewise to these approaches, we have also used Alloy to verify properties of the specified consistency predicates. However, we also relied on its model finding functionalities to generate scenarios that helped the different stakeholders consensually establishing the system’s requirements.

CV management systems and open-access repositories typically connect with ORCID only in the IMPORT context and only support creation notifications, not allowing the user to EXPORT research outputs back to ORCID. Such is the case of services like Impactstory (http://impactstory.org), ScienceOpen (http://www.scienceopen.com) and Symplectic’s Elements (http://symplectic.co.uk/products/elements). The exception is Thomson Reuters’ ResearcherID (http://www.researcherid.com), which also aims to provide a unique researcher identifier, and that allows the user to export research outputs back to the ORCID profile. Interestingly, some of these services, like Impactstory and Elements, resort to ORCID only to harvest UIDs and then retrieve meta-data from other trusted services, ignoring the actual ORCID works. As a consequence, any grouping of works (possibly enforced by the user) is ignored, contradicting the perspective of ORCID as a central hub for research outputs (in our scenario, there would be a different notification for each supported UID). For ORCID works without UIDs assigned, Impactstory retrieves the ORCID meta-data, while Elements currently ignores such works. In contrast, ResearcherID considers each ORCID work as an independent entry, not embracing its essence as an aggregator of research outputs from varied sources, which may lead to several duplicated entries (in our scenario, there would be a different notification for each work). Our proposed approach sits between these two approaches: while IMPORT does focus on the retrieval of UIDs, it also considers how these UIDs are grouped in the ORCID profile. In the EXPORT context, ResearcherID, unlike our approach, does not keep track of previously exported outputs. Since the ORCID API does not allow sources to introduce works with repeated UIDs, the user is currently not able to update works from ResearcherID without previously deleting them from the ORCID profile. Outputs without UIDs are duplicated in the ORCID profile when exported; our framework forbids the exportation of productions without UIDs to avoid this issue.

6 Conclusion

This paper reported on the experience of developing an ORCID based synchronization framework for PTCRIS. This synchronization framework was recently prototyped and demoed at a national research community event, receiving quite positive feedback. During its design, formal analysis techniques and tools were used with excellent results, in particular to automatically generate usage scenarios (namely, corner cases) that proved very useful to help clarify and validate the requirements with the stakeholders. The first stable and detailed specification of the synchronization framework will be made available soon as an open-access report. We expect to have certified implementations of that specification in DeGóis, RCAAP, SARI, and two local CRIS systems by the end of 2015.

This paper is based on the ORCID API v1.2. Version 2.0, currently in early stages of development, is expected to affect some details of the described synchronization framework, but not the overall concepts. The most relevant change is that groups of similar works, as well as the preferred among them, will become explicit in the ORCID data model. This will simplify the IMPORT procedure, since it will no longer need to compute the groups of similar works.

In a future version we intend to design alternative IMPORT and EXPORT procedures with more sophisticated behaviors. For example, depending on user feedback we may consider implementing a notification dismiss feature, to accommodate users that may want to register different research outputs in their ORCID and PTCRIS profiles. Another possible interesting feature would be to allow IMPORT to somehow recognize edits to works in ORCID and automatically incorporate them in the respective PTCRIS profile.

Author contributions

João Mendes Moreira is responsible at the FCCN unit of FCT for the PTCRIS program, the proponent of the synchronization architecture with ORCID as hub (Section 2), and led the prototype implementation both at DeGóis and RCAAP (Section 4). Alcino Cunha and Nuno Macedo are active researchers in the topic of model management and synchronization, and were responsible for the formal specification and verification of the synchronization framework (Section 3).

Competing interests

The authors disclosed no competing interests.

Grant information

The author(s) declared that no funds were involved in supporting this research.

Acknowledgements

The authors would like to thank the DeGóis and RCAAP teams, namely Carlos Pinto, Luis Valério, Pedro Lopes, and José Carvalho, for their effort in the prototype implementation and for the fruitful discussions that helped us clarify and improve the specification of the framework. The authors would also like to thank the ORCID team, namely Laura Paglione, for the valuable comments on Section 3 and insight provided about the ORCID API.

Faculty Opinions recommended

References

1. Jackson D: Software Abstractions: Logic, Language, and Analysis. MIT Press, revised edition, 2012. Reference Source
2. Czarnecki K, Foster J, Hu Z, et al.: Bidirectional transformations: A cross-discipline perspective. In ICMT’09 of LNCS. 2009; 5563. : 260–283. Publisher Full Text
3. Meertens L: Designing constraint maintainers for user interaction. In Third Workshop on Programmable Structured Documents. Tokyo University, 2005. Reference Source
4. Stevens P: Bidirectional model transformations in QVT: semantic issues and open questions. Software and System Modeling. 2010; 9(1): 7–20. Publisher Full Text
5. MOF 2.0 Query/View/Transformation specification (QVT). OMG, version 1.1. 2011. Reference Source
6. Macedo N, Cunha A: Implementing QVT-R bidirectional model transformations using Alloy. In FASE’13 of LNCS, Springer, 2013; 7793. : 297–311. Publisher Full Text
7. Macedo N, Guimarães T, Cunha A, et al.: Model repair and transformation with Echo. In ASE’13. IEEE, 2013; 694–697. Publisher Full Text
8. Cabot J, Clarisó R, Guerra E, et al.: Verification and validation of declarative model-to-model transformations through invariants. J Syst Softw. 2010; 83(2): 283–302. Publisher Full Text
9. Büttner F, Egea M, Cabot J, et al.: Verification of ATL transformations using transformation models and model finders. In ICFEM’12 of LNCS, Springer. 2012; 7635. : 198–213. Publisher Full Text

Comments on this article Comments (2)

Version 1

VERSION 1 PUBLISHED 06 Jul 2015

Author Response 26 Feb 2018

João Mendes Moreira, Foundation for Science and Technology, Portugal

26 Feb 2018

Author Response

Concept video: https://vimeo.com/251179542

Demo video: https://vimeo.com/251312395
Competing Interests: No competing interests were disclosed.
Concept video: https://vimeo.com/251179542

Demo video: https://vimeo.com/251312395
Concept video: https://vimeo.com/251179542

Demo video: https://vimeo.com/251312395
Competing Interests: No competing interests were disclosed. Close
Report a concern
Author Response 09 Jun 2017

João Mendes Moreira, Foundation for Science and Technology, Portugal

09 Jun 2017

Author Response

Information on the implementation of the framework can be found at:

https://orcid.org/blog/2017/05/30/collect-connect-sync-ptcris-synchronization-framework

https://doi.org/10.23640/07243.5051746.V1

João Mendes Moreira
Competing Interests: No competing interests were disclosed.
Information on the implementation of the framework can be found at:

https://orcid.org/blog/2017/05/30/collect-connect-sync-ptcris-synchronization-framework

https://doi.org/10.23640/07243.5051746.V1

João Mendes Moreira
Information on the implementation of the framework can be found at:

https://orcid.org/blog/2017/05/30/collect-connect-sync-ptcris-synchronization-framework

https://doi.org/10.23640/07243.5051746.V1

João Mendes Moreira
Competing Interests: No competing interests were disclosed. Close
Report a concern
Comment

Author details Author details

¹ FCCN, Fundação para a Ciência e Tecnologia, Lisboa, 1700-066, Portugal
² HASLab, INESC TEC and Universidade do Minho, Campus de Gualtar, Braga, 4710-057, Portugal

Competing interests

The authors disclosed no competing interests.

Grant information

The author(s) declared that no grants were involved in supporting this work.

Article Versions (1)

version 1

Published: 06 Jul 2015, 4:181

https://doi.org/10.12688/f1000research.6499.1

© 2015 Mendes Moreira J et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Download

Export To

metrics

	Views	Downloads
F1000Research	-	-
PubMed Central Data from PMC are received and updated monthly.	-	-

Citations

SEE MORE DETAILS

CITE

how to cite this article

Mendes Moreira J, Cunha A and Macedo N. An ORCID based synchronization framework for a national CRIS ecosystem [version 1; peer review: 2 approved, 1 approved with reservations]. F1000Research 2015, 4:181 (https://doi.org/10.12688/f1000research.6499.1)

NOTE: If applicable, it is important to ensure the information in square brackets after the title is included in all citations of this article.

track

receive updates on this article

Track an article to receive email alerts on any updates to this article.

Open Peer Review

Current Reviewer Status: ?

Key to Reviewer Statuses VIEW HIDE

ApprovedThe paper is scientifically sound in its current form and only minor, if any, improvements are suggested

Approved with reservations A number of small changes, sometimes more significant revisions are required to address specific details and improve the papers academic merit.

Not approvedFundamental flaws in the paper seriously undermine the findings and conclusions

Version 1

VERSION 1

PUBLISHED 06 Jul 2015

Views

Reviewer Report 07 Aug 2015

Simeon Warner, Division of Library Information Systems, Cornell University Library, Cornell University, Ithaca, NY, USA

Approved

https://doi.org/10.5256/f1000research.6975.r9344

This article describes work to use ORCID iDs as the basis for synchronization of data between distributed components of a national CRIS system in Portugal. Portugal has very high ORCID coverage because of the mandated use of ORCID iDs for a 2013/2014 research assessment exercise, and thus provides a good test-bed for ORCID iD based synchronization. The background for this work and the architecture used are well described in the introductory sections.

My principal criticism of the article is that it describes a formal modeling process, that was used to assess and refine the synchronization strategy, in insufficient detail for a reader assess the correctness of or repeat the analysis. Yet, the description of this modeling process nonetheless introduces a number of formalisms that distract from the flow of the article. Ideally, the description of the formal modeling process would be separated out and then expanded to a point where, along with the Alloy source files, the reader could repeat the analysis (plans to provide these details are mentioned in the conclusion, the article should be updated to link to them when available). Because this is a descriptive software tools article, I feel that this deficiency does not undermine the article's usefulness or conclusions.

The later sections of the article describe the prototype implementation, a modest amount of additional work to be completed before a production release, and related work. A number of other issues would benefit from discussion, such as the handling of edits to works in ORCID and other sources; user testing, understanding and acceptance of the system; and limitations on the type of research output included beyond publications.

Overall I feel this is a useful and timely description of new work in an evolving area and thus approve it as is. I hope the authors will follow-up with an article evaluating the system in production.

Competing Interests: I serve on the board of ORCID Inc. and thus favor the adoption of ORCID iDs. This article does not compare identifier solutions but simply describes the use of ORCID iDs, and I feel that I am able to review it impartially.

I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.

CITE

Report a concern

Author Response 29 Sep 2015

Alcino Cunha, HASLab, INESC TEC and Universidade do Minho, Campus de Gualtar, Braga, 4710-057, Portugal

29 Sep 2015

Author Response

First of all, many thanks for your review and invaluable feedback.

Concerning your principal criticism, we agree that the detail in the paper is insufficient for the reader to assess ... Continue reading First of all, many thanks for your review and invaluable feedback.

Concerning your principal criticism, we agree that the detail in the paper is insufficient for the reader to assess the formal design methodology that was followed. The detailed specifications of the synchronization framework can now be found at http://ptcris.pt/en/hub-ptcris-en/. Two versions of the document are available: v0.3 concerning ORCID API 1.2 and v0.4 concerning ORCID API 2.0. The former contains the formal Alloy models in Appendix, but they were removed from the later, which was substantially simplified based on comments from our partners. The formal Alloy models used in the design of v0.4 can be found in the following git repository: https://gitlab.fccn.pt/dev-ptcris/ptcrisync/tree/master

Your remaining suggestions are quite relevant also, and some of them are already on our future work list, namely: improve the synchronization mechanism with versioning info to track changes to information; conduct proper user evaluation once the first implementations are released (hopefully, still in 2015); and extend the synchronization mechanism to handle other research outputs. Concerning the later, for research outputs that also have unique identifiers associated the extension is rather trivial. This is the case of research projects, already stored in ORCID.
First of all, many thanks for your review and invaluable feedback.

Concerning your principal criticism, we agree that the detail in the paper is insufficient for the reader to assess the formal design methodology that was followed. The detailed specifications of the synchronization framework can now be found at http://ptcris.pt/en/hub-ptcris-en/. Two versions of the document are available: v0.3 concerning ORCID API 1.2 and v0.4 concerning ORCID API 2.0. The former contains the formal Alloy models in Appendix, but they were removed from the later, which was substantially simplified based on comments from our partners. The formal Alloy models used in the design of v0.4 can be found in the following git repository: https://gitlab.fccn.pt/dev-ptcris/ptcrisync/tree/master

Your remaining suggestions are quite relevant also, and some of them are already on our future work list, namely: improve the synchronization mechanism with versioning info to track changes to information; conduct proper user evaluation once the first implementations are released (hopefully, still in 2015); and extend the synchronization mechanism to handle other research outputs. Concerning the later, for research outputs that also have unique identifiers associated the extension is rather trivial. This is the case of research projects, already stored in ORCID.
Competing Interests: No competing interests were disclosed. Close
Report a concern
Respond or Comment

COMMENTS ON THIS REPORT

Author Response 29 Sep 2015

Alcino Cunha, HASLab, INESC TEC and Universidade do Minho, Campus de Gualtar, Braga, 4710-057, Portugal

29 Sep 2015

Author Response

First of all, many thanks for your review and invaluable feedback.

Concerning your principal criticism, we agree that the detail in the paper is insufficient for the reader to assess ... Continue reading First of all, many thanks for your review and invaluable feedback.

Concerning your principal criticism, we agree that the detail in the paper is insufficient for the reader to assess the formal design methodology that was followed. The detailed specifications of the synchronization framework can now be found at http://ptcris.pt/en/hub-ptcris-en/. Two versions of the document are available: v0.3 concerning ORCID API 1.2 and v0.4 concerning ORCID API 2.0. The former contains the formal Alloy models in Appendix, but they were removed from the later, which was substantially simplified based on comments from our partners. The formal Alloy models used in the design of v0.4 can be found in the following git repository: https://gitlab.fccn.pt/dev-ptcris/ptcrisync/tree/master

Your remaining suggestions are quite relevant also, and some of them are already on our future work list, namely: improve the synchronization mechanism with versioning info to track changes to information; conduct proper user evaluation once the first implementations are released (hopefully, still in 2015); and extend the synchronization mechanism to handle other research outputs. Concerning the later, for research outputs that also have unique identifiers associated the extension is rather trivial. This is the case of research projects, already stored in ORCID.
First of all, many thanks for your review and invaluable feedback.

Concerning your principal criticism, we agree that the detail in the paper is insufficient for the reader to assess the formal design methodology that was followed. The detailed specifications of the synchronization framework can now be found at http://ptcris.pt/en/hub-ptcris-en/. Two versions of the document are available: v0.3 concerning ORCID API 1.2 and v0.4 concerning ORCID API 2.0. The former contains the formal Alloy models in Appendix, but they were removed from the later, which was substantially simplified based on comments from our partners. The formal Alloy models used in the design of v0.4 can be found in the following git repository: https://gitlab.fccn.pt/dev-ptcris/ptcrisync/tree/master

Your remaining suggestions are quite relevant also, and some of them are already on our future work list, namely: improve the synchronization mechanism with versioning info to track changes to information; conduct proper user evaluation once the first implementations are released (hopefully, still in 2015); and extend the synchronization mechanism to handle other research outputs. Concerning the later, for research outputs that also have unique identifiers associated the extension is rather trivial. This is the case of research projects, already stored in ORCID.
Competing Interests: No competing interests were disclosed. Close
Report a concern

Views

Reviewer Report 28 Jul 2015

Keith G. Jeffery, Keith G. Jeffery Consultants, Faringdon, UK

Approved with Reservations

https://doi.org/10.5256/f1000research.6975.r9347

The paper describes a synchronisation method across multiple research information systems Holding CV information and full-text articles with metadata) in Portugal using ORCID as the essential linking / identification mechanism. The synchronisation protocol is specified formally and a prototype has been constructed which is reported to have been received successfully.

The paper provides a useful advance in attempts to ensure consistency in research output publications and thus should be made widely available.

There are several aspects of relevance which are not covered - or covered poorly - in the paper.

The paper does not 'stand back' and describe that the need for synchronisation arises because of a lack of policy by national research funding organisations in terms of what should be recorded/stored and how it should be recorded/stored. This policy vacuum has led - in all countries not just Portugal - to the provision of multiple independent incompatible systems and inconsistent information;
The synchronisation discussed covers only publications; there are many other aspects of research information that may be recorded inconsistently in multiple systems. This restriction should be made clear;
The data models do not seem to be recording temporal information about (a) the valid time (interval) relating to the publication and (b) the transaction time relating to the publication appearance and any subsequent changes in the digital stored representation. These temporal elements are necessary for precedence, provenance and version differentiation. The need for and lack of temporal information should be discussed;
The mechanisms for grouping 'like' publications are likely to produce inconsistent results. Relying one one or more equivalent identifiers is clearly not sufficient and the paper correctly indicates that metadata may be inconsistent. However, some measure of 'separation distance' based on metadata element values could - and probably should - be used. This would be facilitated by point 5 below;
The data models used in the participating systems are very simple and do not really reflect the real world. There exist in the real world complex inter-relationships between (versions of) publications (and indeed between publications and other entities in the research world such as persons, organisations, projects, funding, datasets, software, equipment...) and their correct representation requires a syntax and semantics beyond simple identification. The use of an overarching, superset data model in PCTRIS with formal syntax and declared semantics (thus ensuring referential and functional integrity) and providing a superset over the metadata of the collection of connected systems would give a greater chance of correct assignment of publications to groupings of 'like' publications;

However, as indicated, the paper provides a useful advance in the state of the art and forms a good basis for discussion of more complete solutions to a very difficult problem.

Competing Interests: No competing interests were disclosed.

I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard, however I have significant reservations, as outlined above.

CITE

Report a concern

Author Response 29 Sep 2015

Alcino Cunha, HASLab, INESC TEC and Universidade do Minho, Campus de Gualtar, Braga, 4710-057, Portugal

29 Sep 2015

Author Response
First of all, many thanks for your review and invaluable feedback.

Here are some comments on your remarks:
1. You are absolutely right, and in a future version of the paper we
... Continue reading
First of all, many thanks for your review and invaluable feedback.

Here are some comments on your remarks:
You are absolutely right, and in a future version of the paper we will improve the introduction to incorporate your remark.

Again, you are right. The present synchronization framework focus only on research outputs that are covered by ORCID and which have UIDs associated, namely publications and projects. We thought we had that clearly stated in the introduction, but apparently not. We will do so in a future version.

Indeed, our current version of the synchronization framework does not incorporate such temporal information, but this is one of the key points for future work, namely: improve the synchronization mechanism with versioning info to track changes to information, and thus reliably recognize and propagate changes to metadata. To help us in that task, we have already enrolled in the PTCRIS team some distributed systems researchers, whose research is precisely focused on distributed data aggregation, synchronization, and dissemination.

(and 5.) The unique identifier vs. metadata based synchronization was (and still is) a question of much debate inside the PTCRIS team. We are fully aware of the limitations of opting for the the former, but ultimately we opted for that because we believe it is more transparent (and easy to understand) to the end user, in particular because it relies on the same publication matching algorithm that is currently used by ORCID. ORCID plays a central role in this framework, and having a completely different matching algorithm would be quite confusing for a user that frequently uses both ORCID and the PTCRIS services. As soon as the first services implementing the proposed mechanism are released (hopefully until the end of 2015), we will conduct a proper evaluation to assess the reliability and usability of the system. We hope to include the results of such evaluation in a future version of the paper.
First of all, many thanks for your review and invaluable feedback.

Here are some comments on your remarks:
You are absolutely right, and in a future version of the paper we will improve the introduction to incorporate your remark.

Again, you are right. The present synchronization framework focus only on research outputs that are covered by ORCID and which have UIDs associated, namely publications and projects. We thought we had that clearly stated in the introduction, but apparently not. We will do so in a future version.

Indeed, our current version of the synchronization framework does not incorporate such temporal information, but this is one of the key points for future work, namely: improve the synchronization mechanism with versioning info to track changes to information, and thus reliably recognize and propagate changes to metadata. To help us in that task, we have already enrolled in the PTCRIS team some distributed systems researchers, whose research is precisely focused on distributed data aggregation, synchronization, and dissemination.

(and 5.) The unique identifier vs. metadata based synchronization was (and still is) a question of much debate inside the PTCRIS team. We are fully aware of the limitations of opting for the the former, but ultimately we opted for that because we believe it is more transparent (and easy to understand) to the end user, in particular because it relies on the same publication matching algorithm that is currently used by ORCID. ORCID plays a central role in this framework, and having a completely different matching algorithm would be quite confusing for a user that frequently uses both ORCID and the PTCRIS services. As soon as the first services implementing the proposed mechanism are released (hopefully until the end of 2015), we will conduct a proper evaluation to assess the reliability and usability of the system. We hope to include the results of such evaluation in a future version of the paper.
Competing Interests: No competing interests were disclosed. Close
Report a concern
Respond or Comment

COMMENTS ON THIS REPORT

Author Response 29 Sep 2015

Alcino Cunha, HASLab, INESC TEC and Universidade do Minho, Campus de Gualtar, Braga, 4710-057, Portugal

29 Sep 2015

Author Response
First of all, many thanks for your review and invaluable feedback.

Here are some comments on your remarks:
1. You are absolutely right, and in a future version of the paper we
... Continue reading
First of all, many thanks for your review and invaluable feedback.

Here are some comments on your remarks:
You are absolutely right, and in a future version of the paper we will improve the introduction to incorporate your remark.

Again, you are right. The present synchronization framework focus only on research outputs that are covered by ORCID and which have UIDs associated, namely publications and projects. We thought we had that clearly stated in the introduction, but apparently not. We will do so in a future version.

Indeed, our current version of the synchronization framework does not incorporate such temporal information, but this is one of the key points for future work, namely: improve the synchronization mechanism with versioning info to track changes to information, and thus reliably recognize and propagate changes to metadata. To help us in that task, we have already enrolled in the PTCRIS team some distributed systems researchers, whose research is precisely focused on distributed data aggregation, synchronization, and dissemination.

(and 5.) The unique identifier vs. metadata based synchronization was (and still is) a question of much debate inside the PTCRIS team. We are fully aware of the limitations of opting for the the former, but ultimately we opted for that because we believe it is more transparent (and easy to understand) to the end user, in particular because it relies on the same publication matching algorithm that is currently used by ORCID. ORCID plays a central role in this framework, and having a completely different matching algorithm would be quite confusing for a user that frequently uses both ORCID and the PTCRIS services. As soon as the first services implementing the proposed mechanism are released (hopefully until the end of 2015), we will conduct a proper evaluation to assess the reliability and usability of the system. We hope to include the results of such evaluation in a future version of the paper.
First of all, many thanks for your review and invaluable feedback.

Here are some comments on your remarks:
You are absolutely right, and in a future version of the paper we will improve the introduction to incorporate your remark.

Again, you are right. The present synchronization framework focus only on research outputs that are covered by ORCID and which have UIDs associated, namely publications and projects. We thought we had that clearly stated in the introduction, but apparently not. We will do so in a future version.

Indeed, our current version of the synchronization framework does not incorporate such temporal information, but this is one of the key points for future work, namely: improve the synchronization mechanism with versioning info to track changes to information, and thus reliably recognize and propagate changes to metadata. To help us in that task, we have already enrolled in the PTCRIS team some distributed systems researchers, whose research is precisely focused on distributed data aggregation, synchronization, and dissemination.

(and 5.) The unique identifier vs. metadata based synchronization was (and still is) a question of much debate inside the PTCRIS team. We are fully aware of the limitations of opting for the the former, but ultimately we opted for that because we believe it is more transparent (and easy to understand) to the end user, in particular because it relies on the same publication matching algorithm that is currently used by ORCID. ORCID plays a central role in this framework, and having a completely different matching algorithm would be quite confusing for a user that frequently uses both ORCID and the PTCRIS services. As soon as the first services implementing the proposed mechanism are released (hopefully until the end of 2015), we will conduct a proper evaluation to assess the reliability and usability of the system. We hope to include the results of such evaluation in a future version of the paper.
Competing Interests: No competing interests were disclosed. Close
Report a concern

Views

Reviewer Report 27 Jul 2015

Mikael K. Elbaek, Technical Information Center of Denmark, Technical University of Denmark, Kongens Lyngby, Denmark

Approved

https://doi.org/10.5256/f1000research.6975.r9346

The paper by Moreira et al. provides an interesting and relevant proto-type solution to the synchronization problem of several parallel research information systems in a national CRIS (Current Research Information Systems) setting. In the Portuguese CRIS landscape there are no local CRIS-systems (i.e. Pure, Converis, Symplectic etc.) in order to understand the Portuguese implementation it is important to make this distinction that the PTCRIS is an umbrella (or agregator) that collects data from several sources including a network of repositories SARI (DSpace), a national research portal (RCAAP) an OAI-PMH service provider, e.g. aggregator and a national CV management system (DeGóis). PTCRIS in that sense is not one system or piece of software. Even though this architecture may be unique to Portugal the implications of using ORCID as a syncornization framework is interesting and relevant for other national Research Information/CRIS-architectures. The result is interesting and important for developers of national and international research infrastructures – this is why there are also a few points where the paper may be improved.

In the paper the authors provides an analytic and generic approach to specifying the relevant synchronization procedures asking first what and then how. The article does not go into the details of the what but describes briefly how this data has been collected and analysed. In this reviewers opinion a more detailed description of the what and a description of the selected use-cases would provide a more comprehensible analysis and clear definition of the selected scenarios. Currently the authors provides abstract descriptions of the synchronization scenarios (see chapter 3.2: Consistency predicates). Because of this level of abstraction it is not completely clear who the end-users are, is it the system managers, system owners, research managers or the researchers (authors and owners of the ORCID profiles).
The proposed framework is focussing on the technical implementation of synchronization. But how will the users or researchers e.g. the ORCID profile owners influence the robustness of the synchronization job. One example is that ORCID profile owners may not want to have the same records (works/productions) in their PTCRIS profile as the want in the ORCID profile? The stakeholders of the PTCRIS could be the national government or the university of the researcher who wants all publications listed of a researcher (i.e. the numbers counts in assessments) whereas the researcher may only want a selected number of publications (e.g. the best) to be listed visible. This is briefly discussed in the conclusion. It could be suggested to separate discussion and future perspectives i.e. to a separate section.
The metadata quality must be a major challenge to this synchronization framework using ORCID. It is well-known that metadata about research outputs and funding data are often not consistent or of an acceptable quality. It is not clear from the article how the CRIS infrastructure is used in Portugal. If it is used for a research assessment i.e. bibliometric assessment this must be a major issue. It would be relevant to have discussed this issue a bit further in the paper. However since the paper is presenting a prototype framework – it is acceptable that it is not dealt with in details. I hope that this will be explored further in later writings from the authors – also when ORCIDs API version 2.0 will be in production and the metadata roundtrip also will be in production.
Annotations – there are some use of annotations like SYNCED (o, p) where o, p could have been described to help the reader.
The description and use of PTCRIS sometimes gives the impression of PTCRIS as a single entity or system – while as far as this reviewer understands it is an umbrella that encompasses several systems that together collects and exposes the collected research output, actors and activities of Portugal. It currently includes: DeGóis, RCAAP and SARI.
The authors do not discuss issues with researchers having more ORCIDs – is this not an issues for the proposed synchronization framework?

Overall the paper is solid and provides an important stepping stone towards a better use and re-use of research information using ORCID as a hub. And I would be happy to accept the paper as is – but I hope that the comments will provide thought for possible revisions or further work in a later article.

Competing Interests: No competing interests were disclosed.

I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.

CITE

Report a concern

Author Response 29 Sep 2015

Alcino Cunha, HASLab, INESC TEC and Universidade do Minho, Campus de Gualtar, Braga, 4710-057, Portugal

29 Sep 2015

Author Response
First of all, many thanks for your review and invaluable feedback.

Here are some comments on your remarks:
1. Indeed the paper focus more on the design methodology, leaving out many details
... Continue reading
First of all, many thanks for your review and invaluable feedback.

Here are some comments on your remarks:
Indeed the paper focus more on the design methodology, leaving out many details that would be necessary to fully understand the synchronization framework. The detailed specifications of the proposed framework can now be found at http://ptcris.pt/hub-ptcris-en/. Two versions of the document are available: v0.3 concerning ORCID API 1.2 and v0.4 concerning ORCID API 2.0. The later version was improved and substantially simplified based on comments from the “end users” that are the target of these documents, namely the systems managers that will be responsible for the implementation of the synchronization procedures at their services. Unfortunately, we have still not yet written user manuals for the researchers, the real end users of the system. When we do so, they will be available at the same web site.

Maybe it is not clear in the paper, but the user can already select at each PTCRIS service which research outputs (namely publications) he wants to export to ORCID, and thus propagated to other services. Also, the import mechanism is notification based, so the user can choose not to import some of the research outputs. Also at ORCID, he can set the privacy level of the research outputs so that they are visible for importing by ORCID members, but not to the general public. As such, we believe the framework already provides the flexibility you propose.

Indeed, metadata quality is a major problem to be considered. We believe this problem will be less severe in the PTCRIS ecosystem, because some of the services (namely OA aggregators, such as RCAAP) will be mandated to export to ORCID (using the metadata roundtrip facility you refer) records with “good enough” metadata. Users will be recommended to give preference to such records in ORCID, and we also intend to have the import procedure biased to prefer metadata from such reliable sources.

You are right, and that will be improved in a future version. In this case, o stands for the user ORCID profile and p for the user profile at the PTCRIS service intending to synchronize with ORCID.

You are absolutely right, and we intend to incorporate your remark in a future version of the paper.

This should not be a problem, because due to the mandated use of ORCID iDs in the recent national research assessment exercises, most Portuguese researchers already have a single ORCID iD registered at the national science foundation, and are thus fully aware of what is their “official” ORCID iD that will be used in the PTCRIS ecosystem.
First of all, many thanks for your review and invaluable feedback.

Here are some comments on your remarks:
Indeed the paper focus more on the design methodology, leaving out many details that would be necessary to fully understand the synchronization framework. The detailed specifications of the proposed framework can now be found at http://ptcris.pt/hub-ptcris-en/. Two versions of the document are available: v0.3 concerning ORCID API 1.2 and v0.4 concerning ORCID API 2.0. The later version was improved and substantially simplified based on comments from the “end users” that are the target of these documents, namely the systems managers that will be responsible for the implementation of the synchronization procedures at their services. Unfortunately, we have still not yet written user manuals for the researchers, the real end users of the system. When we do so, they will be available at the same web site.

Maybe it is not clear in the paper, but the user can already select at each PTCRIS service which research outputs (namely publications) he wants to export to ORCID, and thus propagated to other services. Also, the import mechanism is notification based, so the user can choose not to import some of the research outputs. Also at ORCID, he can set the privacy level of the research outputs so that they are visible for importing by ORCID members, but not to the general public. As such, we believe the framework already provides the flexibility you propose.

Indeed, metadata quality is a major problem to be considered. We believe this problem will be less severe in the PTCRIS ecosystem, because some of the services (namely OA aggregators, such as RCAAP) will be mandated to export to ORCID (using the metadata roundtrip facility you refer) records with “good enough” metadata. Users will be recommended to give preference to such records in ORCID, and we also intend to have the import procedure biased to prefer metadata from such reliable sources.

You are right, and that will be improved in a future version. In this case, o stands for the user ORCID profile and p for the user profile at the PTCRIS service intending to synchronize with ORCID.

You are absolutely right, and we intend to incorporate your remark in a future version of the paper.

This should not be a problem, because due to the mandated use of ORCID iDs in the recent national research assessment exercises, most Portuguese researchers already have a single ORCID iD registered at the national science foundation, and are thus fully aware of what is their “official” ORCID iD that will be used in the PTCRIS ecosystem.
Competing Interests: No competing interests were disclosed. Close
Report a concern
Respond or Comment

COMMENTS ON THIS REPORT

Author Response 29 Sep 2015

Alcino Cunha, HASLab, INESC TEC and Universidade do Minho, Campus de Gualtar, Braga, 4710-057, Portugal

29 Sep 2015

Author Response
First of all, many thanks for your review and invaluable feedback.

Here are some comments on your remarks:
1. Indeed the paper focus more on the design methodology, leaving out many details
... Continue reading
First of all, many thanks for your review and invaluable feedback.

Here are some comments on your remarks:
Indeed the paper focus more on the design methodology, leaving out many details that would be necessary to fully understand the synchronization framework. The detailed specifications of the proposed framework can now be found at http://ptcris.pt/hub-ptcris-en/. Two versions of the document are available: v0.3 concerning ORCID API 1.2 and v0.4 concerning ORCID API 2.0. The later version was improved and substantially simplified based on comments from the “end users” that are the target of these documents, namely the systems managers that will be responsible for the implementation of the synchronization procedures at their services. Unfortunately, we have still not yet written user manuals for the researchers, the real end users of the system. When we do so, they will be available at the same web site.

Maybe it is not clear in the paper, but the user can already select at each PTCRIS service which research outputs (namely publications) he wants to export to ORCID, and thus propagated to other services. Also, the import mechanism is notification based, so the user can choose not to import some of the research outputs. Also at ORCID, he can set the privacy level of the research outputs so that they are visible for importing by ORCID members, but not to the general public. As such, we believe the framework already provides the flexibility you propose.

Indeed, metadata quality is a major problem to be considered. We believe this problem will be less severe in the PTCRIS ecosystem, because some of the services (namely OA aggregators, such as RCAAP) will be mandated to export to ORCID (using the metadata roundtrip facility you refer) records with “good enough” metadata. Users will be recommended to give preference to such records in ORCID, and we also intend to have the import procedure biased to prefer metadata from such reliable sources.

You are right, and that will be improved in a future version. In this case, o stands for the user ORCID profile and p for the user profile at the PTCRIS service intending to synchronize with ORCID.

You are absolutely right, and we intend to incorporate your remark in a future version of the paper.

This should not be a problem, because due to the mandated use of ORCID iDs in the recent national research assessment exercises, most Portuguese researchers already have a single ORCID iD registered at the national science foundation, and are thus fully aware of what is their “official” ORCID iD that will be used in the PTCRIS ecosystem.
First of all, many thanks for your review and invaluable feedback.

Here are some comments on your remarks:
Indeed the paper focus more on the design methodology, leaving out many details that would be necessary to fully understand the synchronization framework. The detailed specifications of the proposed framework can now be found at http://ptcris.pt/hub-ptcris-en/. Two versions of the document are available: v0.3 concerning ORCID API 1.2 and v0.4 concerning ORCID API 2.0. The later version was improved and substantially simplified based on comments from the “end users” that are the target of these documents, namely the systems managers that will be responsible for the implementation of the synchronization procedures at their services. Unfortunately, we have still not yet written user manuals for the researchers, the real end users of the system. When we do so, they will be available at the same web site.

Maybe it is not clear in the paper, but the user can already select at each PTCRIS service which research outputs (namely publications) he wants to export to ORCID, and thus propagated to other services. Also, the import mechanism is notification based, so the user can choose not to import some of the research outputs. Also at ORCID, he can set the privacy level of the research outputs so that they are visible for importing by ORCID members, but not to the general public. As such, we believe the framework already provides the flexibility you propose.

Indeed, metadata quality is a major problem to be considered. We believe this problem will be less severe in the PTCRIS ecosystem, because some of the services (namely OA aggregators, such as RCAAP) will be mandated to export to ORCID (using the metadata roundtrip facility you refer) records with “good enough” metadata. Users will be recommended to give preference to such records in ORCID, and we also intend to have the import procedure biased to prefer metadata from such reliable sources.

You are right, and that will be improved in a future version. In this case, o stands for the user ORCID profile and p for the user profile at the PTCRIS service intending to synchronize with ORCID.

You are absolutely right, and we intend to incorporate your remark in a future version of the paper.

This should not be a problem, because due to the mandated use of ORCID iDs in the recent national research assessment exercises, most Portuguese researchers already have a single ORCID iD registered at the national science foundation, and are thus fully aware of what is their “official” ORCID iD that will be used in the PTCRIS ecosystem.
Competing Interests: No competing interests were disclosed. Close
Report a concern

Comments on this article Comments (2)

Version 1

VERSION 1 PUBLISHED 06 Jul 2015

Author Response 26 Feb 2018

João Mendes Moreira, Foundation for Science and Technology, Portugal

26 Feb 2018

Author Response

Concept video: https://vimeo.com/251179542

Demo video: https://vimeo.com/251312395
Competing Interests: No competing interests were disclosed.
Concept video: https://vimeo.com/251179542

Demo video: https://vimeo.com/251312395
Concept video: https://vimeo.com/251179542

Demo video: https://vimeo.com/251312395
Competing Interests: No competing interests were disclosed. Close
Report a concern
Author Response 09 Jun 2017

João Mendes Moreira, Foundation for Science and Technology, Portugal

09 Jun 2017

Author Response

Information on the implementation of the framework can be found at:

https://orcid.org/blog/2017/05/30/collect-connect-sync-ptcris-synchronization-framework

https://doi.org/10.23640/07243.5051746.V1

João Mendes Moreira
Competing Interests: No competing interests were disclosed.
Information on the implementation of the framework can be found at:

https://orcid.org/blog/2017/05/30/collect-connect-sync-ptcris-synchronization-framework

https://doi.org/10.23640/07243.5051746.V1

João Mendes Moreira
Information on the implementation of the framework can be found at:

https://orcid.org/blog/2017/05/30/collect-connect-sync-ptcris-synchronization-framework

https://doi.org/10.23640/07243.5051746.V1

João Mendes Moreira
Competing Interests: No competing interests were disclosed. Close
Report a concern
Comment

Open Peer Review

Reviewer Status

Reviewer Reports

	Invited Reviewers
	1	2	3
Version 1 06 Jul 15	read	read	read

Mikael K. Elbaek, Technical University of Denmark, Kongens Lyngby, Denmark
Keith G. Jeffery, Keith G. Jeffery Consultants, Faringdon, UK
Simeon Warner, Cornell University, Ithaca, USA

Comments on this article

All Comments(2)

Add a comment

Browse by related subjects

Back to all reports

Reviewer Report

35 Views

07 Aug 2015 | for Version 1

Simeon Warner, Division of Library Information Systems, Cornell University Library, Cornell University, Ithaca, NY, USA

35 Views Cite this report Responses(1)

Approved

Competing Interests

I serve on the board of ORCID Inc. and thus favor the adoption of ORCID iDs. This article does not compare identifier solutions but simply describes the use of ORCID iDs, and I feel that I am able to review it impartially.

I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.

Respond to this report

Responses (1)

Author Response

29 Sep 2015

Alcino Cunha, HASLab, INESC TEC and Universidade do Minho, Campus de Gualtar, Braga, 4710-057, Portugal

First of all, many thanks for your review and invaluable feedback.

Concerning your principal criticism, we agree that the detail in the paper is insufficient for the reader to assess the formal design methodology that was followed. The detailed specifications of the synchronization framework can now be found at http://ptcris.pt/en/hub-ptcris-en/. Two versions of the document are available: v0.3 concerning ORCID API 1.2 and v0.4 concerning ORCID API 2.0. The former contains the formal Alloy models in Appendix, but they were removed from the later, which was substantially simplified based on comments from our partners. The formal Alloy models used in the design of v0.4 can be found in the following git repository: https://gitlab.fccn.pt/dev-ptcris/ptcrisync/tree/master

Your remaining suggestions are quite relevant also, and some of them are already on our future work list, namely: improve the synchronization mechanism with versioning info to track changes to information; conduct proper user evaluation once the first implementations are released (hopefully, still in 2015); and extend the synchronization mechanism to handle other research outputs. Concerning the later, for research outputs that also have unique identifiers associated the extension is rather trivial. This is the case of research projects, already stored in ORCID.

View more View less

Competing Interests

No competing interests were disclosed.

Back to all reports

Reviewer Report

43 Views

28 Jul 2015 | for Version 1

Keith G. Jeffery, Keith G. Jeffery Consultants, Faringdon, UK

43 Views Cite this report Responses(1)

Approved With Reservations

The paper does not 'stand back' and describe that the need for synchronisation arises because of a lack of policy by national research funding organisations in terms of what should be recorded/stored and how it should be recorded/stored. This policy vacuum has led - in all countries not just Portugal - to the provision of multiple independent incompatible systems and inconsistent information;
The synchronisation discussed covers only publications; there are many other aspects of research information that may be recorded inconsistently in multiple systems. This restriction should be made clear;
The data models do not seem to be recording temporal information about (a) the valid time (interval) relating to the publication and (b) the transaction time relating to the publication appearance and any subsequent changes in the digital stored representation. These temporal elements are necessary for precedence, provenance and version differentiation. The need for and lack of temporal information should be discussed;
The mechanisms for grouping 'like' publications are likely to produce inconsistent results. Relying one one or more equivalent identifiers is clearly not sufficient and the paper correctly indicates that metadata may be inconsistent. However, some measure of 'separation distance' based on metadata element values could - and probably should - be used. This would be facilitated by point 5 below;
The data models used in the participating systems are very simple and do not really reflect the real world. There exist in the real world complex inter-relationships between (versions of) publications (and indeed between publications and other entities in the research world such as persons, organisations, projects, funding, datasets, software, equipment...) and their correct representation requires a syntax and semantics beyond simple identification. The use of an overarching, superset data model in PCTRIS with formal syntax and declared semantics (thus ensuring referential and functional integrity) and providing a superset over the metadata of the collection of connected systems would give a greater chance of correct assignment of publications to groupings of 'like' publications;

However, as indicated, the paper provides a useful advance in the state of the art and forms a good basis for discussion of more complete solutions to a very difficult problem.

Competing Interests

No competing interests were disclosed.

Respond to this report

Responses (1)

Author Response

29 Sep 2015

Alcino Cunha, HASLab, INESC TEC and Universidade do Minho, Campus de Gualtar, Braga, 4710-057, Portugal

First of all, many thanks for your review and invaluable feedback.

Here are some comments on your remarks:

You are absolutely right, and in a future version of the paper we will improve the introduction to incorporate your remark.
Again, you are right. The present synchronization framework focus only on research outputs that are covered by ORCID and which have UIDs associated, namely publications and projects. We thought we had that clearly stated in the introduction, but apparently not. We will do so in a future version.
Indeed, our current version of the synchronization framework does not incorporate such temporal information, but this is one of the key points for future work, namely: improve the synchronization mechanism with versioning info to track changes to information, and thus reliably recognize and propagate changes to metadata. To help us in that task, we have already enrolled in the PTCRIS team some distributed systems researchers, whose research is precisely focused on distributed data aggregation, synchronization, and dissemination.
(and 5.) The unique identifier vs. metadata based synchronization was (and still is) a question of much debate inside the PTCRIS team. We are fully aware of the limitations of opting for the the former, but ultimately we opted for that because we believe it is more transparent (and easy to understand) to the end user, in particular because it relies on the same publication matching algorithm that is currently used by ORCID. ORCID plays a central role in this framework, and having a completely different matching algorithm would be quite confusing for a user that frequently uses both ORCID and the PTCRIS services. As soon as the first services implementing the proposed mechanism are released (hopefully until the end of 2015), we will conduct a proper evaluation to assess the reliability and usability of the system. We hope to include the results of such evaluation in a future version of the paper.

View more View less

Competing Interests

No competing interests were disclosed.

Back to all reports

Reviewer Report

33 Views

27 Jul 2015 | for Version 1

Mikael K. Elbaek, Technical Information Center of Denmark, Technical University of Denmark, Kongens Lyngby, Denmark

33 Views Cite this report Responses(1)

Approved

In the paper the authors provides an analytic and generic approach to specifying the relevant synchronization procedures asking first what and then how. The article does not go into the details of the what but describes briefly how this data has been collected and analysed. In this reviewers opinion a more detailed description of the what and a description of the selected use-cases would provide a more comprehensible analysis and clear definition of the selected scenarios. Currently the authors provides abstract descriptions of the synchronization scenarios (see chapter 3.2: Consistency predicates). Because of this level of abstraction it is not completely clear who the end-users are, is it the system managers, system owners, research managers or the researchers (authors and owners of the ORCID profiles).
The proposed framework is focussing on the technical implementation of synchronization. But how will the users or researchers e.g. the ORCID profile owners influence the robustness of the synchronization job. One example is that ORCID profile owners may not want to have the same records (works/productions) in their PTCRIS profile as the want in the ORCID profile? The stakeholders of the PTCRIS could be the national government or the university of the researcher who wants all publications listed of a researcher (i.e. the numbers counts in assessments) whereas the researcher may only want a selected number of publications (e.g. the best) to be listed visible. This is briefly discussed in the conclusion. It could be suggested to separate discussion and future perspectives i.e. to a separate section.
The metadata quality must be a major challenge to this synchronization framework using ORCID. It is well-known that metadata about research outputs and funding data are often not consistent or of an acceptable quality. It is not clear from the article how the CRIS infrastructure is used in Portugal. If it is used for a research assessment i.e. bibliometric assessment this must be a major issue. It would be relevant to have discussed this issue a bit further in the paper. However since the paper is presenting a prototype framework – it is acceptable that it is not dealt with in details. I hope that this will be explored further in later writings from the authors – also when ORCIDs API version 2.0 will be in production and the metadata roundtrip also will be in production.
Annotations – there are some use of annotations like SYNCED (o, p) where o, p could have been described to help the reader.
The description and use of PTCRIS sometimes gives the impression of PTCRIS as a single entity or system – while as far as this reviewer understands it is an umbrella that encompasses several systems that together collects and exposes the collected research output, actors and activities of Portugal. It currently includes: DeGóis, RCAAP and SARI.
The authors do not discuss issues with researchers having more ORCIDs – is this not an issues for the proposed synchronization framework?

Competing Interests

No competing interests were disclosed.

I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.

Respond to this report

Responses (1)

Author Response

29 Sep 2015

Alcino Cunha, HASLab, INESC TEC and Universidade do Minho, Campus de Gualtar, Braga, 4710-057, Portugal

First of all, many thanks for your review and invaluable feedback.

Here are some comments on your remarks:

Indeed the paper focus more on the design methodology, leaving out many details that would be necessary to fully understand the synchronization framework. The detailed specifications of the proposed framework can now be found at http://ptcris.pt/hub-ptcris-en/. Two versions of the document are available: v0.3 concerning ORCID API 1.2 and v0.4 concerning ORCID API 2.0. The later version was improved and substantially simplified based on comments from the “end users” that are the target of these documents, namely the systems managers that will be responsible for the implementation of the synchronization procedures at their services. Unfortunately, we have still not yet written user manuals for the researchers, the real end users of the system. When we do so, they will be available at the same web site.
Maybe it is not clear in the paper, but the user can already select at each PTCRIS service which research outputs (namely publications) he wants to export to ORCID, and thus propagated to other services. Also, the import mechanism is notification based, so the user can choose not to import some of the research outputs. Also at ORCID, he can set the privacy level of the research outputs so that they are visible for importing by ORCID members, but not to the general public. As such, we believe the framework already provides the flexibility you propose.
Indeed, metadata quality is a major problem to be considered. We believe this problem will be less severe in the PTCRIS ecosystem, because some of the services (namely OA aggregators, such as RCAAP) will be mandated to export to ORCID (using the metadata roundtrip facility you refer) records with “good enough” metadata. Users will be recommended to give preference to such records in ORCID, and we also intend to have the import procedure biased to prefer metadata from such reliable sources.
You are right, and that will be improved in a future version. In this case, o stands for the user ORCID profile and p for the user profile at the PTCRIS service intending to synchronize with ORCID.
You are absolutely right, and we intend to incorporate your remark in a future version of the paper.
This should not be a problem, because due to the mandated use of ORCID iDs in the recent national research assessment exercises, most Portuguese researchers already have a single ORCID iD registered at the national science foundation, and are thus fully aware of what is their “official” ORCID iD that will be used in the PTCRIS ecosystem.

View more View less

Competing Interests

No competing interests were disclosed.

Alongside their report, reviewers assign a status to the article:

Approved - the paper is scientifically sound in its current form and only minor, if any, improvements are suggested

Approved with reservations - A number of small changes, sometimes more significant revisions are required to address specific details and improve the papers academic merit.

Not approved - fundamental flaws in the paper seriously undermine the findings and conclusions

[1] 1. Jackson D: Software Abstractions: Logic, Language, and Analysis. MIT Press, revised edition, 2012. Reference Source

[2] 2. Czarnecki K, Foster J, Hu Z, et al.: Bidirectional transformations: A cross-discipline perspective. In ICMT’09 of LNCS. 2009; 5563. : 260–283. Publisher Full Text

[3] 3. Meertens L: Designing constraint maintainers for user interaction. In Third Workshop on Programmable Structured Documents. Tokyo University, 2005. Reference Source

[4] 4. Stevens P: Bidirectional model transformations in QVT: semantic issues and open questions. Software and System Modeling. 2010; 9(1): 7–20. Publisher Full Text

[5] 5. MOF 2.0 Query/View/Transformation specification (QVT). OMG, version 1.1. 2011. Reference Source

[6] 6. Macedo N, Cunha A: Implementing QVT-R bidirectional model transformations using Alloy. In FASE’13 of LNCS, Springer, 2013; 7793. : 297–311. Publisher Full Text

[7] 7. Macedo N, Guimarães T, Cunha A, et al.: Model repair and transformation with Echo. In ASE’13. IEEE, 2013; 694–697. Publisher Full Text

[8] 8. Cabot J, Clarisó R, Guerra E, et al.: Verification and validation of declarative model-to-model transformations through invariants. J Syst Softw. 2010; 83(2): 283–302. Publisher Full Text

[9] 9. Büttner F, Egea M, Cabot J, et al.: Verification of ATL transformations using transformation models and model finders. In ICFEM’12 of LNCS, Springer. 2012; 7635. : 198–213. Publisher Full Text

An ORCID based synchronization framework for a national CRIS ecosystem

Abstract

Keywords

1 Introduction

2 Architecture overview and rationale

Figure 1. PTCRIS synchronization architecture.

3 Specification of the synchronization framework

3.1 Data model

Figure 2. Overview of the ORCID data model.

Figure 3. Overview of a PTCRIS service data model.

3.2 Consistency predicates

3.3 Synchronization procedures

3.4 Usage scenarios

Figure 4. Initial PTCRIS profile.

Figure 5. Initial ORCID profile.

Figure 6. PTCRIS profile after user update.

Figure 7. ORCID profile after EXPORT execution.

Figure 8. PTCRIS profile after IMPORT execution.

4 Prototype implementation

Figure 9. User profile at ORCID.

Figure 10. User profile at DeGóis.

5 Related work

6 Conclusion

Author contributions

Competing interests

Grant information

Acknowledgements

References

Comments on this article Comments (2)

Open Peer Review

Comments on this article Comments (2)

Open Peer Review

Reviewer Status

Reviewer Reports

Comments on this article

Browse by related subjects

Competing Interests Policy

Stay Updated

Figure 8. PTCRIS profile after `IMPORT` execution.