Open laboratory notebooks: good for science, good for society, good for scientists

The fundamental goal of the growing open science movement is to increase the efficiency of the global scientific community and accelerate progress and discoveries for the common good. Central to this principle is the rapid disclosure of research outputs in open-access peer-reviewed journals and on pre-print servers. The next bold step in this direction is open laboratory notebooks, where research scientists share their research — including detailed protocols, negative and positive results — online and in near-real-time to synergize with their peers. Here, we highlight the benefits of open lab notebooks to science, society and scientists, and discuss the challenges that this nascent movement is facing. We also present the implementation and progress of our own initiative at openlabnotebooks.org, with more than 20 active contributors after one year of operation.


Amendments from Version 1 Introduction
The function of the scientific peer-reviewed system is to provide greater confidence that published research is scientifically sound. This system is widely accepted as the best available, although imperfect (as peer reviewers may miss technical flaws or be biased) 1 , to guide the global scientific community towards progress. Peer-reviewed publishing is also used by research scientists, funders and institutions as a mechanism to claim ownership of their discoveries. As a result, the community widely believes that findings should be kept secret until they are published in a peer-reviewed journal. This tradition of secrecy, which protects the scientist as opposed to the science, has been transmitted from mentor to trainee for centuries (Galileo kept his discoveries to himself until they were published). In the life sciences, this belief can reach nearmystical levels 2 , and can be compounded by constraints associated with patent protection procedures or the absence of clear mechanism to make one's data publicly available. The peer-review and publication process grew in an era where communication was largely in paper format. Today, in the age of instant communication, one would imagine there should be more efficient ways to operate.

Open lab notebooks: good for science and society
We believe that open laboratory notebooks, where research scientists record their work online and in near-real time, are an efficient way to disseminate data before it is published in peerreviewed journals, and has several advantages over the traditional "release after publication" system 3 . First, making the data accessible within weeks rather than keeping it hidden for years means that others will be able to build upon the research, and avoid spending time and resources on redundant experiments 4 . Second, open lab notebooks should include detailed protocols that can be reproduced, which is often not the case in peer-reviewed publications 5,6 . Third, negative data, which are almost never disclosed in the current publishing system but are provided in open lab notebooks, can sometime provide important insight 7,8 . Fourth, open lab notebooks offer a space for anyone to comment on experimental records. This allows experts to provide insight, but also to flag technically unsound experiments, thereby reducing the potential for flawed science to appear in peer-reviewed journals and in pre-print media. Open lab notebooks can therefore help save time, resources, and knowledge. If adopted by many, they should lead to a more synergistic way to do science and to more efficient use of public funds.

Good for scientists
Many believe that the chances of getting scooped before one publishes their work in a peer-reviewed journal increase when openly sharing their work online 9 . We argue that open lab notebooks have compensating advantages that are good for scientists. To succeed in academia, one must get funding, assert primacy over discoveries, be known in a field of research and be able to present work and ideas clearly and convincingly. Open lab notebooks can help in all aspects. Second, results in open lab notebook are date-stamped, thus claiming temporal priority of the data. Indeed, public repositories such as Zenodo 10 add a date-stamp to depositions, and assign a citable DOI to open lab notebook records (detailed below): once a record has been published, it can no longer be modified, but revised versions can be appended if necessary.
Third, early career scientists can use their open notebooks to connect with their peers and with experts in the field, start new collaborations and build their own network. Fourth, the use of open lab notebooks provides opportunity to present work clearly and concisely to both experts and non-experts. This is an important skill to master in order to write convincing grant applications. Fifth, junior scientists will also find their open lab notebook a good medium to showcase their technical skills and scientific insight, and may find it useful to add a link in their resume when applying for their next position. Finally, many will find a personal satisfaction in embracing open science and FAIR data principles 11 .

Implementation of an open lab notebook platform
Open lab notebooks have been pioneered and championed by a number of practitioners but remain a niche activity in the scientific community.  14,15 . Each post is composed of two documents. (1) A detailed and rigorous experimental record, including all data and protocols, which experts can evaluate, comment on or build upon ( Figure 1); (2) a blog, aimed at the non-specialist that explains in simple terms the motivation and rational for the experiment, summarizes results -positive and negative -and outlines next steps (Figure 2). The blogs, posted at openlabnotebooks.org, are managed by a webserver downloaded from wordpress.org (the open-source online system LabTrove  would be a valid alternative 16 ), archived weekly to GitHub (repository https://github.com/thesgc/static-openlabnotebooks), quarterly to archive.org (https://wayback.archive-it.org/6473/*/ https:/opennotebook.thesgc.org/), and link to the experimental records, which are deposited at Zenodo (zenodo.org), but can also be made available from other public repositories, such as GitHub (github.com) or Figshare (figshare.com). While the experimental details posted at Zenodo are important scientifically, the blog written in layman's term can be used to engage with scientists that may have a complementary set of expertise for future collaborations as well as other stakeholders in the research process, including patient groups, a dimension that most in academia are missing.
The Zenodo repository enables sharing research outputs from across all fields of research, creation and curation of complete digital repositories, flexible licensing with controlled degree of openness and safe storage of the data for the future in the same cloud infrastructure as CERN's own LHC research data. Open laboratory notebooks need to guarantee that the data will remain accessible, in order to avoid the fate suffered by the pioneer open notebook of Jean-Claude Bradley, which is still accessible while its associated raw data wiki is not. Zenodo is strongly committed to preserving the data it archives. CERN has existed since 1954 and has an experimental program defined for the next 20+ years. Each file has two replicas located on different disk servers. In the highly unlikely event that Zenodo closes operations, they guarantee migration of all content to other suitable repositories, and since all uploads have DOIs, citations and links to Zenodo resources (including data) will not be affected.
The ultimate goal of this open lab notebook initiative is not only to increase the impact of our work but also, along with precursors in the field such as Open Source Malaria (http://opensourcemalaria.org/) and other isolated open lab notebook efforts, to inspire others to follow, and contribute to the creation of a new open science movement in the life sciences. While it is too early to judge the success of this initiative, the number of contributing scientists and institutions is steadily increasing (Figure 3). While only one scientist was contributing in November 2017, 23 scientists from six institutions (University of Toronto, University of Oxford, University of North Carolina, University of Leicester, the Karolinska Institute in Sweden and University of Montpellier in France) are recording their work at openlabnotebooks.org as of December 2018.
As importantly, impact is also increasing, judging by the average number of views per experimental record calculated from statistical data available at Zenodo.org (Figure 3). Some reports raised a considerable interest. For instance, the crystal structure of USP5 in complex with small molecule fragments has 821 unique views and 324 unique downloads as of December 2018 17 . If the initiative is successful, we anticipate that within three to five years, usage metrics are comparable at openlabnotebooks.org and bioRxiv, the preprint server for biology.
Data posted at openlabnotebooks.org are raising interest in academic groups, but also in the industry. For instance, a notebook contributor was directly contacted by a big pharmaceutical company to further discuss the results that he had shared online, and a big biotech company asked permission to another contributor to include their data in a presentation at a public scientific meeting. Some of the research reported at openlabnotebooks.org is of direct relevance to patient groups. For instance, four scientists record their results on testing chemical inhibitors of the kinase ALK2, a potential therapeutic target for the treatment of the pediatric brain tumor diffuse intrinsic pontine glioma (DIPG), and the heterotopic ossification disorder fibrodysplasia ossificans progressive (FOP) 18,19 . The compounds, developed by the open science biotech company M4KPharma, are still in pre-clinical phase of development but should ultimately lead to clinical trials for these incurable diseases 20 . Scientists working on projects with a clear path to the clinic are eager to share their enthusiasm and commitment with patient groups (sometimes using social media to announce their latest open notebook post) who, in turn, follow their work.

The challenges of open lab notebooks
Three antagonizing points that inhibit scientists from starting their own open lab notebook are the fear of being scooped, the inability to report collaborative work when collaborators want to keep data secret, and the concern that an open notebook will take time away from an already overburdened schedule 21 . The language barrier for non-native English speakers, and the availability of open lab notebook solutions can also be challenging. It is indeed likely that maintaining an open lab notebook increases the chances of being scooped, but it is too early at this point to know whether this effect is minor or significant. Paradoxically, and given the territorial nature of the current frameworks for funding and managing scientific research, entries in one's open lab notebook may mark one's area very effectively, especially in a conceivable future when funding trusts and councils start looking into them. We would argue that most, if not all, scientists get scooped during their career, and that open lab notebooks serve as a safety net for early career scientists who have a citable record of their work if they ever get scooped. Obtaining permission from collaborators to report collaborative work in open lab notebooks can be challenging. We believe that the best way to avoid such a situation is to clearly state at the outset of a collaboration the intention to adopt open science principles 22 . Scientists are more likely to agree if presented with the idea well in advance. The time invested in practicing clear, concise and engaging scientific writing is not lost on one's career. After some practice, maintaining an open lab notebook should not take more time than using a regular lab notebook.
Open notebooks being published before peer-review, there is a risk that dubious experiments, erroneous analysis or misinterpretations find their way on open platforms, get amplified over the internet and mislead colleague scientists, patient groups or other communities. Once they become indexed by popular search engines, open lab notebooks could become a source of pollution of the scientific (and non-scientific) literature. This risk, which is not limited to open notebooks but extends to the increasing number of Journals that adopt a post-publication peer-review mechanism, is real, serious, and should be monitored. We believe that the best way to mitigate this risk is for open notebooks to provide a platform for open comments. In principle, this could be an even stronger quality control than the current peer-review system in place in most scientific journals, as the number of "open reviewers" for any given report is limitless. At the moment, we find that very few comments are posted at openlabnotebooks.org, a platform that is only a year old, but we see that comments are mainstream, and sometimes turn into healthy discussions at Open Source Malaria, a pioneer in the field 23 .

Future directions and conclusion
Open lab notebooks represent a major departure from current practices in science (especially biomedical sciences) and hold a mix of promises and risks. As the community producing these lab notebooks is increasing, there is an opportunity to move beyond ideology and anecdotal data to evidence-based policy design. In the spirit of openness, we call on colleagues from both the life science and the social sciences communities to conduct systematic evaluation of the benefits and downsides of open lab notebooks. It will be important to compare several parameters on a yearly basis. These may include the frequency of research being scooped among scientists disclosing their work in open lab notebooks versus a less open reference group; the frequency of new collaborations; the frequency of comments and ideas received by the authors of open notebooks; and instances where open lab notebooks were essential for compliance with funder or institutional requirements. More difficult to assess will be issues such as recognition, career progression, speeding up research, and impact on reproducibility, but they could all be addressed with appropriate questionnaires and data analytics.
Our goal is to see the number of open lab notebooks increase exponentially over the coming years. Future implementation of novel features, such as the ability to search for experiments containing compounds with specific chemical templates, is expected to extend the reach of the platform to medicinal and computational chemists. Indexing of open lab notebooks by popular search engines such as Google Scholar (which already indexes pre-prints and other non-peerreviewed documents) would increase the visibility and impact of open notebooks. Importantly, open lab notebook data deposited at Zenodo.org is already searchable with Google's Dataset search engine. To further encourage scientists to break free from the tradition of secrecy that has been passed on for generations, a cultural change needs to be supported at institutional and governmental levels. Funding bodies are starting to define and enforce open science publication practices 24 . Similarly, universities could take a more proactive role, for instance by including adhesion to open-access principles as an evaluation criteria for career advancement 25 . Indeed, while strong incentives described above already exist for junior scientists to start their own open lab notebook, the benefit to their PIs who already have established a professional network and don't need to showcase their skills is not always as clear. As long as scientists are not convinced that open science is good for them, Science 2.0 will have to wait.

Data availability
No data are associated with this article. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Jean-Baptiste Poline
McGill University, McGill University, Montreal, QC, Canada The article by Matthieu Schapira and Rachel Harding "Open laboratory notebooks: good for science, good for society, good for scientists" is a clear presentation of the benefits of open notebook for the scientific community. The central argument is that one of the main benefits is an increased efficiency (both in time and cost) of the scientific process through the faster re-utilization of positive or negative results. The authors discuss in detail the challenges of the initiative and the results of the current openlabnotebook.org implementation and results.
I liked in particular the straightforward discussion on the current incentive system from the authors. The perceived or real danger of being scooped is a clear obstacle to the efficient use of these. The arguments of why opennotebooks are good for scientists are clear and compelling although based mostly on anecdotal data.
The implementation of the infrastructure is solid, reusing standard and long lasting components (zenodo, github). The search capacity across notebooks could have been detailed more. An component around the standardization of terms, use of controlled vocabulary (or even ontologies) could be added in the future. For instance, an extension of the figshare data documentation system would be helpful to label notebook appropriately.
The uptake in both the building and the use of openlabnotebooks is promising, although absolute number of adaption are still limited since this is still at early stages.
The challenges for wider adoption are clear -(namely, fear of being scooped, collaborators with non open practices, and fear that the time invested would not be worth (in terms of recognition), risk of non solid information being released. The solutions to mitigate these challenges are not as well developed, but as this is a new technology and practice, only the future will say if these are indeed going to prevent the openlabnotebooks developments. I think that the close involvement of both national and private funding agencies could rapidly solve for these adoption challenges in a top-down manner.
Last, I have one question: what is the process to join the openlabnotebooks consortium?

Are all factual statements correct and adequately supported by citations? Yes
Are arguments sufficiently supported by evidence from the published literature? Yes

Are the conclusions drawn balanced and justified on the basis of the presented arguments? Yes
No competing interests were disclosed. Competing Interests: Reviewer Expertise: Brain Imaging data analysis, reproducible science, Open science, Neuro-informatics, Imaging genetics, neuro-data-science, publishing and cutture of science I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard. Secondly, it would be enormously helpful to understand how the authors anticipate that open labnotebooks would operate in the context of the current scientific communication framework. Do you expect labnotebooks to replace peer-review journals or to complement them? How might a researcher convey the overarching hypothesis that they are looking to address through the set of experiments run across a series of notebook entries? It seems that neither the labnotebook approach nor the existing publication approach are sufficient to fully convey scientific output by themselves. I would very much enjoy understanding the authors' perspective on this.
Along these same lines, I am curious as to how notebook readers will incorporate this form of communication into their workflow. It will have to be useful -and easy -in order to gain the desired level of adoption. I appreciated the inclusion of the notebook use statistics -it would be helpful to understand how users found these posts, what kind of users were attracted, and how they responded to this medium as a form of scientific discourse.
The role of the gatekeeper was also not addressed. Open LabNotebooks are reliant on a series of The role of the gatekeeper was also not addressed. Open LabNotebooks are reliant on a series of technical platforms. These platforms will have an impact on the overall success based on their sustainability models, their archiving approaches, and their need for financial viability. Is there a cost to using the system -and, if not, how is it funded? Are the content fully owned by the authors and do they remain under full control of the authors? What are the responsibilities of the platforms and how might their choices impact the approach?
Finally, the authors have a tremendous opportunity to ground some of this theoretical work in their own practical application. It would be useful to understand how your scientists' experiences compared to those were anticipated. For example, one frequent concern heard by researchers is one of time -they are worried that use of the system will require an additional burden in an already over-committed schedule. However, the open lab notebook community remains small, the practice is not consistently defined or implemented and the impact of these efforts in the field have not been systematically evaluated."

Secondly, it would be enormously helpful to understand how the authors anticipate that open labnotebooks would operate in the context of the current scientific communication framework. Do you expect labnotebooks to replace peer-review journals or to complement them? How might a researcher convey the overarching hypothesis that they are looking to address through the set of experiments run across a series of notebook entries? It seems that neither the labnotebook approach nor the existing publication approach are sufficient to fully convey scientific output by themselves. I would very much enjoy understanding the authors' perspective on this.
We see open lab notebooks as a mechanism for scientists to share their work before they publish it in peer-reviewed journals. As such, they are complementary to peer-reviewed publication. This is specified in the article as follows: "We believe that open laboratory notebooks, where research scientists record their work online and in near-real time, are an efficient way to disseminate data before it is published in peer-reviewed journals, and …"

Along these same lines, I am curious as to how notebook readers will incorporate this form of communication into their workflow. It will have to be useful -and easy -in order to gain the desired level of adoption. I appreciated the inclusion of the notebook use statistics -it would be helpful to understand how users found these posts, what kind of users were attracted, and how they responded to this medium as a form of scientific discourse.
We agree that much is to be gained from knowing the demographics and motivations of open lab notebook readers. The readers input in the well populated comment section of the platform Open Source Malaria strongly suggest that most readers are scientists in the field, sharing common goals and interests.

The role of the gatekeeper was also not addressed. Open LabNotebooks are reliant on a series of technical platforms. These platforms will have an impact on the overall success based on their sustainability models, their archiving approaches, and their need for financial viability. Is there a cost to using the system -and, if not, how is it funded? Are the content fully owned by the authors and do they remain under full control of the authors? What are the responsibilities of the platforms and how might their choices impact the approach?
The server hosting openlabnotebooks.org is maintained by the Structural Genomics Consortium at University of Oxford. It relies on Wordpress, which is an open source platform, and requires limited maintenance. Consequently, the service is free for both contributors and readers. Zenodo is hosted by the CERN and is also free. The issue of permanence, sustainability, archiving and ownership are important ones and are addressed in our response to Reviewer 1 (sections #6 and ownership are important ones and are addressed in our response to Reviewer 1 (sections #6 and #17) and discussed in the revised version of the manuscript.
Finally, the authors have a tremendous opportunity to ground some of this theoretical work in their own practical application. It would be useful to understand how your scientists' experiences compared to those were anticipated. For example, one frequent concern heard by researchers is one of time -they are worried that use of the system will require an additional burden in an already over-committed schedule. How much time do the current users spend on their posts? Did this reduce over time? This same approach can be used to address the anticipate benefits.
When initiating this project, prospective open notebookers were surveyed regarding their concerns, questions and attitudes about running their own open notebook projects which highlighted many of the concerns you raise, in particular, the time burden. We are planning to conduct a follow-up survey once the number of scientists joining our platform increases. not as criticism but as testament to our shared enthusiasm for this subject and its importance in the future of research.

1)
. In the introduction, reasons are suggested for why scientists may keep results secret. We Secrecy would suggest that there are two important reasons that are not explicitly mentioned: i) that the scientist may want to patent something, and ii) that the scientist cannot be bothered to work out how to release research using atypical means. The first point is alluded to where mention is made of ownership, and the second point is alluded to by the mention of "paper" but we would argue these two factors are significant enough that they should be made explicit.

2)
. We'd be interested in whether there is a justification for the statement "Many believe that Careers openly sharing work online will limit career opportunities." If there is none, then perhaps rephrase this more as a possibility? 3) . The statement "Grant applications that highlight the use of open lab notebooks are being Grants viewed positively" may be true (one hopes it is), but the evidence presented doesn't support that statement (the grants may have been funded because the science was so good, regardless of the dissemination plan), so again, this probably needs to be made more aspirational.

4)
. The authors need to address what happens if an experiment is recorded containing a mistake, Errors and a mistake that might propagate an ungrounded conclusion. Is there a danger in leading via colleagues down the wrong path? Is there a danger to the reputation of young scientists? Would incorrect conclusions, if indexed by search engines, lead to literature pollution that might be hard to correct?

5)
. One must make the assumption that few people will ever open lab notebooks, just as Machines read few people read regular lab notebooks. More needs to be made of this, since a) it is essential, rather than just desirable, that the notebook is searched and indexed by e.g. Google, and b) the entries need, ideally, to be machine readable. Could the authors comment on this -i.e. how Googleable the contents are (beyond the Dataset Search they mention) and what can be done to ensure that the entries can be understood by machines?

6)
. There are many web links in the article. This is an academic publication, which is Permanence intended to last forever. It's likely those URLs will not last forever. Is there a way that the pages pointed to can be archived somehow? An important, relevant example: this manuscript does not refer to one of the pioneers of open lab notebooks -indeed he coined the phrase "open notebook science" -Jean-Claude Bradley. Bradley's blog can still be but the wiki that used to house all the raw data has . In accessed gone publishing our most recent paper that made use of open lab notebooks , we took pains to archive the lab notebooks on a repository to mitigate potential loss by external providers, and to back up web pages (as PDFs) to similar places. Can the authors address the two issues here: i) should we link to web pages in academic articles without backing them up? 2) How are the lab notebooks backed up and archived? 7) . The authors mention "results in open lab notebook are date-stamped." It would be Time Stamps interesting to know whether the authors have looked into whether the date stamps are "convincing" from a legal standpoint. Would they, for example be sufficient to claim priority in a legal challenge? Is there an issue with the fact that ELN pages can be edited, over an extended period of time, or is it sufficient to ensure that the page has a robust revision history?

8)
. A scooper could claim that they were simply unaware of the open scientist's work. This Scooping seems a reasonable defense. What needs to happen for ignorance of an open lab notebook not to be a defense against ignorance of prior art? 1 defense against ignorance of prior art? 9) . We are cited twice (thank you), but there are two other relevant papers the authors might be Citations interested in citing should these be deemed by them to be appropriate: i) an extensive discussion on the use cases of an open source ELN called Labtrove pioneered by Jeremy Frey's team at Southampton, and ii) a large medchem project by Open Source Malaria which was conducted entirely using open lab notebooks and which may be instructive in terms of how to publish a paper based on open notebook work (also described informally . here 10) . . The language barrier is mentioned, but we're not sure this is relevant to this paper.

Language
Language is already a barrier in the current system, though English is essentially a across lingua franca science.

13)
. What is the license that covers the authors' i) blog posts and ii) notebooks? To what extent Licence is this an important choice? For example, can others take and re-use the content without restriction, or only for non-commercial purposes?

14)
. The authors mention that the ELN entries include "all data". This is an important feature of Raw Data open ELNs that distinguishes the practice from a great deal of open science in which highlights might be discussed, or blogged about, but without the attendant raw data. We investigated briefly to see how much of the raw data could be accessed in the authors' cases. We have noticed that a fair number of the lab notebook entries recorded by the team in Zenodo contains PDF/Word-style summaries of data, rather than the data themselves. This limits re-use of the data. Could the authors comment on the central importance of the availability of all raw data?

Examples:
This blog post mentions performing synthesis of analogues based on lit procedures but there's no link to an ELN. Same with page. this And . this This page has a link (in the 2nd paragraph) to a (structure?) data file but no backlink from the data back to the blog page, risking orphaning of content? This blog page does have a link to the ELN. There is more experimental detail, but there is no raw characterization data. Also the to the blog page is broken. backlink Many biology entries contain pictures of purifications and for the most part, the ELN pages only have a docx or pdf there. Should there be any raw data available for these kinds of pages, or are docx and PDF enough?

15)
. It is mentioned that is is desirable to have "the ability to search for experiments containing Identifiers compounds with specific chemical templates". We agree. Some ELNs already allow this (e.g. Chemotion and C6H6 ). We have found a stop-gap is the manual inclusion of chemical strings such as SMILES, InChI. Can the authors comment on whether similar things can be included in order to allow machines to understand the biology contained within the entries, e.g. UniProt numbers? . Is there a way of distilling out access by non-team people, or even access by the Metrics Access re author of the page who may have accessed multiple times during the editing of a page?

17)
. When a researcher moves on, does that researcher have any responsibility towards the Ownership ELN? e.g. if a mistake is found, who needs to correct it? Is it the responsibility of the PI to act as future curator?

1) . In the introduction, reasons are suggested for why scientists may keep results secret. Secrecy
We would suggest that there are two important reasons that are not explicitly mentioned: i) that the scientist may want to patent something, and ii) that the scientist cannot be bothered to work out how to release research using atypical means. The first point is alluded to where mention is made of ownership, and the second point is alluded to by the mention of "paper" but we would argue these two factors are significant enough that they should be made explicit.
Points well taken. The following statement was added to the Introduction "…and can be compounded by constraints associated with patent protection procedures or the absence of clear mechanism to make one's data publicly available."

2)
. We'd be interested in whether there is a justification for the statement "Many believe Careers that openly sharing work online will limit career opportunities." If there is none, then perhaps rephrase this more as a possibility?
This was not clear. The sentence was replaced as follows: "Many believe that the chances of getting scooped before one publishes their work in a peer-reviewed journal increase when openly sharing their work online [9]"

3)
. The statement "Grant applications that highlight the use of open lab notebooks are Grants being viewed positively" may be true (one hopes it is), but the evidence presented doesn't support that statement (the grants may have been funded because the science was so good, regardless of the dissemination plan), so again, this probably needs to be made more aspirational.
This was revised as follows: "Our personal observations seem to indicate that grant applications highlighting the use of open lab notebooks are being viewed positively."

4) Errors. The authors need to address what happens if an experiment is recorded containing a mistake, and a mistake that might propagate via an ungrounded conclusion. Is there a danger in leading colleagues down the wrong path? Is there a danger to the reputation of young scientists? Would incorrect conclusions, if indexed by search engines, lead to literature pollution that might be hard to correct?
We agree that this important point was missing. The following was added to the section "The challenges of open lab notebooks": Open notebooks being published before peer-review, there is a risk that dubious experiments, erroneous analysis or misinterpretations find their way on open platforms, get amplified over the internet and mislead colleague scientists, patient groups or other communities. Once they become indexed by popular search engines, open lab notebooks could become a source of pollution of the scientific (and non-scientific) literature. This risk, which is not limited to open notebooks but extends to the increasing number of Journals that adopt a post-publication peer-review mechanism, is real and serious. We believe that the best way to mitigate this risk is for open mechanism, is real and serious. We believe that the best way to mitigate this risk is for open notebooks to provide a platform for open comments. In principle, this could be an even stronger quality control than the current peer-review system in place in most scientific journals, as the number of "open reviewers" for any given report is limitless. At the moment, we find that very few comments are posted at openlabnotebooks.org, a platform that is only a year old, but we see that comments are mainstream, and sometimes turn into healthy discussions at Open Source Malaria, a pioneer in the field.

5)
. One must make the assumption that few people will ever read open lab notebooks, Machines just as few people read regular lab notebooks. More needs to be made of this, since a) it is essential, rather than just desirable, that the notebook is searched and indexed by e.g. Google, and b) the entries need, ideally, to be machine readable. Could the authors comment on this -i.e. how Googleable the contents are (beyond the Dataset Search they mention) and what can be done to ensure that the entries can be understood by machines?
We believe that in the life sciences, most would find the constraint of formatting their notebook in a standardized language an unacceptable burden. Open lab notebooks should be indexable by Google or other search engines as free text records. Chemists would benefit from the integration of cheminformatics functionality, such as fingerprint-based substructure or similarity searches. These tools are openly provided by RDKit, and we expect that they will soon be integrated into open lab notebook platforms.

6)
. There are many web links in the article. This is an academic publication, which is Permanence intended to last forever. It's likely those URLs will not last forever. Is  We appreciate this concern. We have multiple layers of backup for the openlabnotebooks.org web-site including snapshots, replication, and off-site tape storage. We also export all blog posts automatically at the end of each week to the following GitHub repository . The Information Technology Services at https://github.com/thesgc/static-openlabnotebooks University of Toronto is now also maintaining an archive of the website at , which is updated quarterly. https://wayback.archive-it.org/6473/*/https:/opennotebook.thesgc.org/ The data posted on Zenodo is also backed-up, as specified at . Archiving each website that one cites is an impressive https://about.zenodo.org/infrastructure/ commitment to the permanence of one's work. We assume that consent should first be obtained to archive someone else's research output. We are not planning to follow this path at the moment. The following statement was added to the article: "The blogs, posted at openlabnotebooks.org, are managed by a webserver downloaded from wordpress.org, archived weekly to GitHub (repository ), quarterly to archive.org https://github.com/thesgc/static-openlabnotebooks (https://wayback.archive-it.org/6473/*/https:/opennotebook.thesgc.org/), and link…" 1 (https://wayback.archive-it.org/6473/*/https:/opennotebook.thesgc.org/), and link…" We also added the following section: "Open laboratory notebooks need to guarantee that the data will remain accessible, in order to avoid the fate suffered by the pioneer open notebook of Jean-Claude Bradley, which is still accessible while its associated raw data wiki is not. Zenodo is strongly committed to preserving the data it archives. CERN has existed since 1954 and has an experimental program defined for the next 20+ years. Each file copy has two replicas located on different disk servers. In the highly unlikely event that Zenodo closes operations, they guarantee migration of all content to other suitable repositories, and since all uploads have DOIs, citations and links to Zenodo resources (including data) will not be affected."

7)
. We believe that whether a particular electronic date stamp is sufficient to prove the date of disclosure would depend on how well the system was configured and how good the records were logged. Website businesses do depend on date stamping to establish when a particular change to their terms & conditions was made, for example. Our understanding is that the system implemented at Zenodo clearly indicates when a record was posted and when it was edited, which should be sufficient evidence for open notebooks to count as prior art.
To clarify the date-stamp/versioning mechanism in place at Zenodo, we added the statement "once a record has been published, it can no longer be modified, but revised versions can be appended if necessary"

8)
. A scooper could claim that they were simply unaware of the open scientist's work. Scooping This seems a reasonable defense. What needs to happen for ignorance of an open lab notebook not to be a defense against ignorance of prior art?
We believe that scientists learn to live with the potential and reality of being scooped. Plagiarism is another problem, but we believe that it is more an academic conduct issue than a legal issue. If materials & methods and results published in a date-stamped open notebook are later authored by another scientist in a peer-reviewed journal, this should in principle raise reputational problem for that person.

9)
. We are cited twice (thank you), but there are two other relevant papers the authors Citations might be interested in citing should these be deemed by them to be appropriate: i) an extensive discussion on the use cases of an open source ELN called Labtrove pioneered by Jeremy Frey's team at Southampton, and ii) a large medchem project by Open Source Malaria which was conducted entirely using open lab notebooks and which may be instructive in terms of how to publish a paper based on open notebook work (also described informally . here 2 1 We added a citation to the LabTrove paper in the section "Implementation of an open lab notebook platform", and to the Open Source Malaria paper just above the "Future directions and conclusion" section.

10)
. Yes. Examples include researchers from a pharmaceutical company that contacted a scientist to further discuss his results, a big biotech company that kindly asked permission to include data from an open notebook at a conference talk, and academics interested in specific chemical templates included in an open report. . The language barrier is mentioned, but we're not sure this is relevant to this paper. Language Language is already a barrier in the current system, though English is essentially a lingua franca across science.
We agree, but we have examples of scientists who do not feel confident enough in English to use our platform.

13)
. What is the license that covers the authors' i) blog posts and ii) notebooks? To what Licence extent is this an important choice? For example, can others take and re-use the content without restriction, or only for non-commercial purposes?
We currently license all of our open notebook blog posts and Zenodo deposits using a Creative Commons Attribution license. This allows our work to be shared and reused as widely as possible whilst asking for attribution of the original work only. More complex licenses may put off readers of the notebooks from reusing the content, which is in opposition of the open lab notebook ethos.

14)
. The authors mention that the ELN entries include "all data". This is an important Raw Data feature of open ELNs that distinguishes the practice from a great deal of open science in which highlights might be discussed, or blogged about, but without the attendant raw data. We investigated briefly to see how much of the raw data could be accessed in the authors' cases. We have noticed that a fair number of the lab notebook entries recorded by the team in Zenodo contains PDF/Word-style summaries of data, rather than the data themselves. This limits re-use of The benefits of publishing with F1000Research:

Comments on this article
Your article is published within days, with no editorial bias You can publish traditional articles, null/negative results, case reports, data notes and more The peer review process is transparent and collaborative Your article is indexed in PubMed after passing peer review Dedicated customer support at every stage For pre-submission enquiries, contact research@f1000.com