<?xml version="1.0" encoding="UTF-8"?><!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.2 20190208//EN" "http://jats.nlm.nih.gov/publishing/1.2/JATS-journalpublishing1.dtd"><article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="data-paper" dtd-version="1.2" xml:lang="en">
    <front>
        <journal-meta>
            <journal-id journal-id-type="pmc">F1000Research</journal-id>
            <journal-title-group>
                <journal-title>F1000Research</journal-title>
            </journal-title-group>
            <issn pub-type="epub">2046-1402</issn>
            <publisher>
                <publisher-name>F1000 Research Limited</publisher-name>
                <publisher-loc>London, UK</publisher-loc>
            </publisher>
        </journal-meta>
        <article-meta>
            <article-id pub-id-type="doi">10.12688/f1000research.8414.1</article-id>
            <article-categories>
                <subj-group subj-group-type="heading">
                    <subject>Data Note</subject>
                </subj-group>
                <subj-group>
                    <subject>Articles</subject>
                    <subj-group>
                        <subject>Public Engagement</subject>
                    </subj-group>
                    <subj-group>
                        <subject>Publishing &amp; Peer Review</subject>
                    </subj-group>
                    <subj-group>
                        <subject>Science &amp; Medical Education</subject>
                    </subj-group>
                </subj-group>
            </article-categories>
            <title-group>
                <article-title>Innovations in scholarly communication - global survey on research tool usage</article-title>
                <fn-group content-type="pub-status">
                    <fn>
                        <p>[version 1; peer review: 2 approved]</p>
                    </fn>
                </fn-group>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author" corresp="yes">
                    <name>
                        <surname>Kramer</surname>
                        <given-names>Bianca</given-names>
                    </name>
                    <uri content-type="orcid">https://orcid.org/0000-0002-5965-6560</uri>
                    <xref ref-type="corresp" rid="c1">a</xref>
                    <xref ref-type="aff" rid="a1">1</xref>
                </contrib>
                <contrib contrib-type="author" corresp="no">
                    <name>
                        <surname>Bosman</surname>
                        <given-names>Jeroen</given-names>
                    </name>
                    <xref ref-type="aff" rid="a1">1</xref>
                </contrib>
                <aff id="a1">
                    <label>1</label>Utrecht University Library, Utrecht, The Netherlands</aff>
            </contrib-group>
            <author-notes>
                <corresp id="c1">
                    <label>a</label>
                    <email xlink:href="mailto:b.m.r.kramer@uu.nl">b.m.r.kramer@uu.nl</email>
                </corresp>
                <fn fn-type="con">
                    <p>BK and JB contributed equally to setting up, carrying out and reporting on the survey.</p>
                </fn>
                <fn fn-type="conflict">
                    <p>
                        <bold>Competing interests: </bold>During the runtime of the survey Jeroen Bosman accepted an invitation from the RIO Journal to become a subject editor. Bianca Kramer and Jeroen Bosman are both members of the steering committee of the Force11 Scholarly Communication Working Group. F1000Research was one of the partners that distributed the survey using a custom URL.</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>18</day>
                <month>4</month>
                <year>2016</year>
            </pub-date>
            <pub-date pub-type="collection">
                <year>2016</year>
            </pub-date>
            <volume>5</volume>
            <elocation-id>692</elocation-id>
            <history>
                <date date-type="accepted">
                    <day>1</day>
                    <month>4</month>
                    <year>2016</year>
                </date>
            </history>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2016 Kramer B and Bosman J</copyright-statement>
                <copyright-year>2016</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access article distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <self-uri content-type="pdf" xlink:href="https://f1000research.com/articles/5-692/pdf"/>
            <abstract>
                <p>Many new websites and online tools have come into existence to support scholarly communication in all phases of the research workflow. To what extent researchers are using these and more traditional tools has been largely unknown. This 2015&#x2013;2016 survey aimed to fill that gap. Its results may help decision making by stakeholders supporting researchers and may also help researchers wishing to reflect on their own online workflows. In addition, information on tool usage can inform studies of changing research workflows.</p>
                <p>The online survey employed an open, non-probability sample. A largely self-selected group of 20663 researchers, librarians, editors, publishers and other groups involved in research took the survey, which was available in seven languages. The survey was open from May 10, 2015 to February 10, 2016. It captured information on tool usage for 17 research activities, stance towards open access and open science, and expectations of the most important development in scholarly communication. Respondents&#x2019; demographics included research roles, country of affiliation, research discipline and year of first publication.</p>
            </abstract>
            <kwd-group kwd-group-type="author">
                <kwd>scholarly communication</kwd>
                <kwd>research workflow</kwd>
                <kwd>survey</kwd>
                <kwd>innovation</kwd>
                <kwd>tools</kwd>
            </kwd-group>
            <funding-group>
                <funding-statement>The survey was supported by a &#x20ac;600 grant from the VOGIN-fonds for subscription to pro-versions of web tools used to distribute the survey and support the flow of data. Utrecht University Library provided the resources to have the survey and parts of the survey website translated into six languages and part of the foreign language answers translated back into English.</funding-statement>
                <funding-statement>
                    <italic>The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.</italic>
                </funding-statement>
            </funding-group>
        </article-meta>
    </front>
    <body>
        <sec sec-type="intro">
            <title>Introduction</title>
            <p>Many websites and tools exist to support researchers in handling information in all phases of the research cycle. For the first time, a multidisciplinary and multilingual survey, carried out in 2015&#x2013;2016, details the usage of such tools. Insights from these data may help researchers and those who support them in their decisions to improve the efficiency, openness and reliability of research workflows. Anonymized data from the survey are available in both raw (multilingual) and cleaned (all-English) versions (Data availability; 
                <xref ref-type="bibr" rid="ref-1">1</xref>). Details on data collection and a full description of the data are provided in this Data Note.</p>
        </sec>
        <sec>
            <title>Setup of the survey</title>
            <p>The survey includes four questions on demographics, 17 on tool usage (with pre-selected answer options and free-text answer), two on support of Open Access and Open Science (yes/no/don&#x2019;t know), one open question on the expected most important development in scholarly communication (free-text answer), one (optional) question asking for an email address and one question asking whether participants would be willing to be contacted for follow-up research. See the 
                <xref ref-type="other" rid="SM1">Supplementary material</xref> for the full list of survey questions in all languages.</p>
            <p>Questions on demographics asked about country of current or last affiliation, research discipline, research role and career stage. Country of affiliation and research discipline were included because there are indications of strong variation in tool usage and publication cultures across these parameters. Our classification of research discipline (seven categories) was based on the broad classification from Scopus, with some modifications:</p>
            <p>
				
                <list list-type="bullet">
                    <list-item>
                        <p>Physical sciences (which in Scopus includes mathematics) - from which we made Engineering &amp; Technology (including computer science) into a separate category</p>
                    </list-item>
                    <list-item>
                        <p>Life sciences</p>
                    </list-item>
                    <list-item>
                        <p>Health sciences - which we renamed Medicine</p>
                    </list-item>
                    <list-item>
                        <p>Social sciences - from which we made Arts &amp; Humanities and Law into separate categories.</p>
                    </list-item>
                </list>
			</p>
            <p>Research role (which included various academic roles, but also supporting roles such as publisher, librarian and funder) and career stage (proxied by the year of first publication, in six date ranges) were included to allow testing of hypotheses, e.g. that innovation in workflows depends on the degree to which people are conditioned by traditions in research practice. In addition, demographic data can serve to assess and correct for bias.</p>
            <p>The bulk of the survey consisted of questions on tool usage for 17 activities in the research workflow (see 
                <xref ref-type="other" rid="SM1">Supplementary material</xref> and 
                <xref ref-type="table" rid="T4">Table 4</xref>). These activities were selected from our database of research tools [
                <ext-link ext-link-type="uri" xlink:href="http://bit.ly/innoscholcomm-list">http://bit.ly/innoscholcomm-list</ext-link>], which distinguishes 30 research activities in seven phases of the research workflow and lists over 600 tools for these activities. The activities included in the survey were chosen for their overall importance (for example, we included a question on writing tools but not on translation tools) and for their spread across the research workflow, covering discovery, analysis and writing as well as publication, outreach and assessment. For each of the 17 activities, the survey offered seven tools as preset answers and an eighth answer option to indicate use of any other tools (
                <xref ref-type="fig" rid="f1">Figure 1</xref>), followed by a question asking respondents to specify those. The seven preset tools were chosen from the database of tools mentioned above. In most cases we included 4&#x2013;5 of the best-known tools, together with 2&#x2013;3 newer, smaller and in some cases still experimental tools, to stimulate respondents to also mention any less well-known tools they might use. Only in exceptional cases were tools offered as preset answer options in more than one question. Participants could skip any question (except the demographic questions on research role, country of affiliation and research discipline) that they felt did not apply to them or that they were otherwise unwilling to answer. Finally, people with a role supporting research were explicitly asked to base their answers to the questions on tools on what they would advise researchers to use.</p>
            <fig fig-type="figure" id="f1" orientation="portrait" position="float">
                <label>Figure 1. </label>
                <caption>
                    <title>Examples of survey questions with preset answer options.</title>
                    <p>
                        <bold>A</bold>) Question on sharing notebooks/protocols/workflows. 
                        <bold>B</bold>) Question on measuring impact.</p>
                </caption>
                <graphic orientation="portrait" position="float" xlink:href="https://f1000research-files.f1000.com/manuscripts/9058/e3b83ecc-7c2a-4f71-8198-8fb3c57fb1fd_figure1.gif"/>
            </fig>
            <p>All questions were entered into the cloud-based survey form software Typeform (
                <ext-link ext-link-type="uri" xlink:href="http://www.typeform.com">http://www.typeform.com</ext-link>). Typeform allows for ample use of graphics. These were used for all preset answers to tool usage questions. For these we used existing logos of tools and some self-made text logos. This made it very easy for respondents to recognize tools they used and enter most of their answers by simply clicking images.</p>
        </sec>
        <sec>
            <title>Distribution of the survey; sampling</title>
            <p>The survey was live on the Typeform website for a 9-month period between May 10, 2015 and February 10, 2016. Submitted responses were stored by Typeform; a backup in CSV format was made at regular intervals and stored on a university server.</p>
            <p>The sample used was a fully open, self-selected, non-probability sample, meaning that the survey was open for anyone to take, with no systematic control on who took it. We used a hybrid of sampling methods, including snowball sampling and quota sampling. Distribution was targeted to researchers and people supporting research, both through direct and indirect distribution. Direct distribution included messages with the link to the survey on Twitter (e.g. in reply to people mentioning that their paper/abstract/poster/manuscript had been accepted), mailing lists, our own survey website, blog posts, including 
                <ext-link ext-link-type="uri" xlink:href="http://blogs.lse.ac.uk/impactofsocialsciences/2015/11/11/101-innovations-in-scholarly-communication/">one on the widely read LSE Impact blog</ext-link>, a podcast 
                <ext-link ext-link-type="uri" xlink:href="http://scholarlykitchen.sspnet.org/2015/07/22/scholarly-kitchen-podcast-101-innovations-and-scientific-workflow/">interview on the Scholarly Kitchen website</ext-link> and during meetings the authors attended. Indirect distribution was carried out by 108 partners, who distributed the survey among their constituencies (either through a direct email message, inclusion in a newsletter or a message on the organisation&#x2019;s website or intranet) in exchange for the anonymized data from that population. Of these, 
                <ext-link ext-link-type="uri" xlink:href="https://web.archive.org/web/20160320225405/https:/101innovations.wordpress.com/custom-url-option/">65 organizations</ext-link> agreed to have their role disclosed. The 108 partners consisted of 76 universities (often through their libraries), 10 hospitals, 11 publishers and 11 other organizations. Some of these organizations also distributed our translations of the survey (see below). In addition, many individuals and organizations publicized the survey through various channels, e.g. through Twitter and other social media, in blogs and by inclusion in conference presentations. We did not specifically target students and know that many partners also did not do so.</p>
            <p>We offered respondents no financial incentives or gifts to stimulate uptake. However, all respondents were offered the option to receive automatic feedback (
                <xref ref-type="fig" rid="f2">Figure 2</xref>) on how their choices of tools compared to those of their peer group (based on research roles entered). For this we used a dataflow from Typeform via Google Drive (
                <ext-link ext-link-type="uri" xlink:href="http://drive.google.com">http://drive.google.com</ext-link>, for calculations and creating the graphs) to WordPress (
                <ext-link ext-link-type="uri" xlink:href="http://www.wordpress.com">http://www.wordpress.com</ext-link>, to publish the graphs). To transfer data between these tools we used Zapier (
                <ext-link ext-link-type="uri" xlink:href="http://www.zapier.com">http://www.zapier.com</ext-link>).</p>
            <fig fig-type="figure" id="f2" orientation="portrait" position="float">
                <label>Figure 2. </label>
                <caption>
                    <title>Example of automatic feedback received by survey participants.</title>
                    <p>Classification: 
                        <bold>Traditional tools (Trad)</bold> - Add no functionality compared to print era, except online accessibility; 
                        <bold>Modern tools (Mod)</bold> - Use scale and linking possibilities of the internet to increase speed and efficiency; 
                        <bold>Innovative tools (Inn)</bold> - Actually change &#x2018;the way it&#x2019;s always been done&#x2019; &#x2013; e.g. user-driven, different business models, changes in the sequence of research activities, shifting stakeholder roles; 
                        <bold>Experimental tools (Exp)</bold> - Represent radical change, with sometimes uncertain technologies and outcomes; still under development. Tools were scored on a scale of 1 (traditional) to 4 (experimental); the chart shows average scores per workflow phase. Tools mentioned as &#x2018;others&#x2019; are not included at this stage.</p>
                </caption>
                <graphic orientation="portrait" position="float" xlink:href="https://f1000research-files.f1000.com/manuscripts/9058/e3b83ecc-7c2a-4f71-8198-8fb3c57fb1fd_figure2.gif"/>
            </fig>
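            <p>The scoring behind the feedback chart described in the caption of Figure 2 can be illustrated with a minimal sketch. The actual feedback was calculated in Google Drive; the Python code below is only an illustration of the arithmetic. The function name <monospace>average_score_per_phase</monospace> and the example selections are hypothetical and do not come from the survey data. Only the 1 to 4 scale and the averaging per workflow phase are taken from the caption.</p>
            <preformat>
```python
from collections import defaultdict

# Score scale from the Figure 2 caption: 1 (traditional) to 4 (experimental).
SCORES = {"Trad": 1, "Mod": 2, "Inn": 3, "Exp": 4}


def average_score_per_phase(selected_tools):
    """Average the class scores of the selected tools for each workflow phase.

    selected_tools is an iterable of (workflow_phase, tool_class) pairs,
    where tool_class is one of the keys of SCORES.
    """
    by_phase = defaultdict(list)
    for phase, tool_class in selected_tools:
        by_phase[phase].append(SCORES[tool_class])
    return {phase: sum(values) / len(values) for phase, values in by_phase.items()}


# Hypothetical respondent: the tool classes below are invented for illustration.
example = [
    ("Discovery", "Trad"),
    ("Discovery", "Inn"),
    ("Writing", "Mod"),
    ("Outreach", "Exp"),
    ("Outreach", "Mod"),
]

print(average_score_per_phase(example))
```
            </preformat>
            <p>In this sketch, tools entered as &#x2018;others&#x2019; would simply be left out of the input, in line with the note in the caption that such tools are not included at this stage.</p>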
        </sec>
        <sec>
            <title>Translation of the survey</title>
            <p>To address cultural and language bias, and simply to increase uptake in non-English language areas, we had the survey translated into six world languages: Spanish, French, Chinese, Russian, Japanese and Arabic. These languages were selected based on observed 
                <ext-link ext-link-type="uri" xlink:href="https://web.archive.org/web/20160321155933/https:/101innovations.wordpress.com/2015/08/23/4000-survey-responses-geographical-distribution-and-the-need-for-translation/">underrepresentation of these language areas</ext-link> after four months of having the survey available only in English. However, this was done only after attaining initial success with attracting respondents to the survey and after getting requests for translation. Translations became available in the 6
                <sup>th</sup> month (Spanish and French), the 7
                <sup>th</sup> month (Chinese and Russian), the 8
                <sup>th</sup> month (Japanese) and the 9
                <sup>th</sup> month (Arabic) of the survey period.</p>
            <p>The survey was professionally translated, and reviewed by at least two native speakers (one researcher and one librarian). All questions and preselected answer options were kept identical across different language versions. However, in five of the six foreign language versions (the exception being Arabic) we included one additional question at the end of the survey on the use of tools targeting that specific language area. This was done to increase commitment, to stimulate respondents to also mention language-specific tools and to be able to check answers given here against tools mentioned as &#x2018;others&#x2019; in the regular survey questions.</p>
        </sec>
        <sec>
            <title>Distribution of responses</title>
            <p>In total, 20663 valid survey responses were received. Obvious spam responses (n=6) were removed from the data.</p>
            <p>
				
                <bold>Distribution channels</bold> - Responses received could be traced back to distribution channels by way of a suffix attached to the survey URL (
                <xref ref-type="table" rid="T1">Table 1</xref>). Although in absolute numbers the foreign language versions contributed only modestly to the overall response numbers (
                <xref ref-type="table" rid="T2">Table 2</xref>), they were important in stimulating responses from the respective language areas (
                <xref ref-type="fig" rid="f4">Figure 4</xref>).</p>
            <table-wrap id="T1" orientation="portrait" position="anchor">
                <label>Table 1. </label>
                <caption>
                    <title>Survey responses by distribution channel.</title>
                </caption>
                <table content-type="article-table" frame="hsides">
                    <thead>
                        <tr>
                            <th align="left" colspan="1" rowspan="1">Channel</th>
                            <th align="right" colspan="1" rowspan="1">Responses</th>
                        </tr>
                    </thead>
                    <tbody>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">Mailing lists</td>
                            <td align="right" colspan="1" rowspan="1">485</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">Partners: publishers</td>
                            <td align="right" colspan="1" rowspan="1">9070</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">Partners: universities &amp; hospitals</td>
                            <td align="right" colspan="1" rowspan="1">6463</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">Partners: others</td>
                            <td align="right" colspan="1" rowspan="1">541</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">Survey website</td>
                            <td align="right" colspan="1" rowspan="1">2604</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">Twitter</td>
                            <td align="right" colspan="1" rowspan="1">1220</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">Social media other than Twitter</td>
                            <td align="right" colspan="1" rowspan="1">57</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">Other / unknown</td>
                            <td align="right" colspan="1" rowspan="1">223</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">
						
                                <bold>Total</bold>
					</td>
                            <td align="right" colspan="1" rowspan="1">
						
                                <bold>20663</bold>
					</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">Responses removed (spam)</td>
                            <td align="right" colspan="1" rowspan="1">6</td>
                        </tr>
                    </tbody>
                </table>
            </table-wrap>
            <table-wrap id="T2" orientation="portrait" position="anchor">
                <label>Table 2. </label>
                <caption>
                    <title>Survey responses by language version of the survey.</title>
                </caption>
                <table content-type="article-table" frame="hsides">
                    <thead>
                        <tr>
                            <th align="left" colspan="1" rowspan="1">Language version of
                                <break/>survey</th>
                            <th align="right" colspan="1" rowspan="1">Responses</th>
                        </tr>
                    </thead>
                    <tbody>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">English</td>
                            <td align="right" colspan="1" rowspan="1">17785</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">Spanish</td>
                            <td align="right" colspan="1" rowspan="1">1052</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">French</td>
                            <td align="right" colspan="1" rowspan="1">955</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">Russian</td>
                            <td align="right" colspan="1" rowspan="1">330</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">Chinese</td>
                            <td align="right" colspan="1" rowspan="1">265</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">Japanese</td>
                            <td align="right" colspan="1" rowspan="1">258</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">Arabic</td>
                            <td align="right" colspan="1" rowspan="1">18</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">
						
                                <bold>Total</bold>
					</td>
                            <td align="right" colspan="1" rowspan="1">
						
                                <bold>20663</bold>
					</td>
                        </tr>
                    </tbody>
                </table>
            </table-wrap>
            <p>
				
                <bold>Country of current or last affiliation</bold> - Partly helped by the translations, we received a very broad response from across the globe, with at least one response from 151 countries and at least 20 responses from each of 64 countries (
                <xref ref-type="fig" rid="f4">Figure 4</xref>).</p>
            <p>
				
                <bold>Research discipline</bold> - The largest group of respondents was from social science and economics. Other disciplines were also well represented, with only law lagging (
                <xref ref-type="table" rid="T3">Table 3</xref>, 
                <xref ref-type="fig" rid="f3">Figure 3A</xref>).</p>
            <fig fig-type="figure" id="f3" orientation="portrait" position="float">
                <label>Figure 3. </label>
                <caption>
                    <title>Demographic distributions of survey responses.</title>
                    <p>
			
                        <bold>A</bold>) Mentions of research discipline(s) (multiple answers possible, 25820 answers given, N=20663). 
                        <bold>B</bold>) Responses by research role (n=20663). 
                        <bold>C</bold>) Responses by year of first publication (n=20663).</p>
                </caption>
                <graphic orientation="portrait" position="float" xlink:href="https://f1000research-files.f1000.com/manuscripts/9058/e3b83ecc-7c2a-4f71-8198-8fb3c57fb1fd_figure3.gif"/>
            </fig>
            <fig fig-type="figure" id="f4" orientation="portrait" position="float">
                <label>Figure 4. </label>
                <caption>
                    <title>Survey response levels per 100 billion US$ GDP (2013).</title>
                    <p>Number of survey responses per 100 billion US$ GDP for all countries; weighted mean of all countries with at least 1 response: 27.3, median: 27.0.</p>
                </caption>
                <graphic orientation="portrait" position="float" xlink:href="https://f1000research-files.f1000.com/manuscripts/9058/e3b83ecc-7c2a-4f71-8198-8fb3c57fb1fd_figure4.gif"/>
            </fig>
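            <p>The normalization used in Figure 4 can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors&#x2019; own calculation: the country names and values in <monospace>SAMPLE</monospace> are hypothetical, and the weighted mean is assumed here to be weighted by GDP, which makes it equal to total responses divided by total GDP. The caption does not state the weighting that was actually used.</p>
            <preformat>
```python
from statistics import median

UNIT = 1.0e11  # 100 billion US$

# Hypothetical values for illustration only, NOT taken from the survey data.
# Each entry maps a country name to (number of responses, GDP in US$).
SAMPLE = {
    "Country A": (300, 1.0e12),
    "Country B": (50, 2.5e11),
    "Country C": (12, 4.0e10),
}


def responses_per_unit_gdp(responses, gdp_usd):
    """Number of responses per 100 billion US$ of GDP."""
    return responses / (gdp_usd / UNIT)


def summarize(sample):
    """Return per-country rates, the GDP-weighted mean rate and the median rate."""
    rates = {
        name: responses_per_unit_gdp(responses, gdp)
        for name, (responses, gdp) in sample.items()
    }
    total_responses = sum(responses for responses, _ in sample.values())
    total_gdp_units = sum(gdp for _, gdp in sample.values()) / UNIT
    # Assumption: a GDP-weighted mean of the rates, which reduces to
    # total responses divided by total GDP (in units of 100 billion US$).
    weighted_mean = total_responses / total_gdp_units
    return rates, weighted_mean, median(rates.values())


rates, weighted_mean, median_rate = summarize(SAMPLE)
print(rates, weighted_mean, median_rate)
```
            </preformat>
            <p>Normalizing by GDP rather than by population is the choice made in Figure 4; the same function could be reused with another denominator if a different normalization were preferred.</p>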
            <table-wrap id="T3" orientation="portrait" position="anchor">
                <label>Table 3. </label>
                <caption>
                    <title>Mentions of research discipline(s) (multiple answers possible, 25820 answers given, N=20663).</title>
                </caption>
                <table content-type="article-table" frame="hsides">
                    <thead>
                        <tr>
                            <th align="left" colspan="1" rowspan="1">Research discipline</th>
                            <th align="right" colspan="1" rowspan="1">Mentions</th>
                        </tr>
                    </thead>
                    <tbody>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">Physical Sciences</td>
                            <td align="right" colspan="1" rowspan="1">2644</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">Engineering &amp; Technology</td>
                            <td align="right" colspan="1" rowspan="1">3838</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">Life Sciences</td>
                            <td align="right" colspan="1" rowspan="1">5246</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">Medicine</td>
                            <td align="right" colspan="1" rowspan="1">3879</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">Social Sciences &amp; Economics</td>
                            <td align="right" colspan="1" rowspan="1">6465</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">Arts &amp; Humanities</td>
                            <td align="right" colspan="1" rowspan="1">3228</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">Law</td>
                            <td align="right" colspan="1" rowspan="1">520</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">
						
                                <bold>Total</bold>
					</td>
                            <td align="right" colspan="1" rowspan="1">
						
                                <bold>25820</bold>
					</td>
                        </tr>
                    </tbody>
                </table>
            </table-wrap>
            <p>
				
                <bold>Research role</bold> - The vast majority of respondents are from inside academia (from students to professors) (
                <xref ref-type="table" rid="T4">Table 4</xref>, 
                <xref ref-type="fig" rid="f3">Figure 3C</xref>). Relatively few students responded, probably because many did not yet consider themselves active researchers. Other groups are also much smaller, which allows for less detailed analysis.</p>
            <table-wrap id="T4" orientation="portrait" position="anchor">
                <label>Table 4. </label>
                <caption>
                    <title>Survey responses by research role (n=20663).</title>
                </caption>
                <table content-type="article-table" frame="hsides">
                    <thead>
                        <tr>
                            <th align="left" colspan="1" rowspan="1">Research role</th>
                            <th align="right" colspan="1" rowspan="1">Responses</th>
                        </tr>
                    </thead>
                    <tbody>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">Professor/Associate professor/
                                <break/>Assistant professor</td>
                            <td align="right" colspan="1" rowspan="1">8610</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">Postdoc</td>
                            <td align="right" colspan="1" rowspan="1">2312</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">PhD student</td>
                            <td align="right" colspan="1" rowspan="1">3974</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">Bachelor/Master student</td>
                            <td align="right" colspan="1" rowspan="1">1756</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">Librarian</td>
                            <td align="right" colspan="1" rowspan="1">1517</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">Publisher</td>
                            <td align="right" colspan="1" rowspan="1">199</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">Industry/Government</td>
                            <td align="right" colspan="1" rowspan="1">677</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">Other</td>
                            <td align="right" colspan="1" rowspan="1">1618</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">
						
                                <bold>Total</bold>
					</td>
                            <td align="right" colspan="1" rowspan="1">
						
                                <bold>20663</bold>
					</td>
                        </tr>
                    </tbody>
                </table>
            </table-wrap>
            <p>
				
                <bold>Career stage</bold> - 
                <xref ref-type="table" rid="T5">Table 5</xref> shows career stage of respondents carrying out research as measured by year of first publication (
                <xref ref-type="fig" rid="f3">Figure 3C</xref>). Interestingly, the distribution is fairly even, indicating interest in the topic of the survey across ages and career stages. Please note that the answer &#x2018;
                <italic toggle="yes">not published (yet)</italic>&#x2019; may indicate that the respondent is at the beginning of a research career, but also that the respondent has a role in which publishing is not a primary task. To identify these separate populations, demographic data for career stage can be combined with those on research role.</p>
            <table-wrap id="T5" orientation="portrait" position="anchor">
                <label>Table 5. </label>
                <caption>
                    <title>Survey responses by year of first publication (n=20663).</title>
                </caption>
                <table content-type="article-table" frame="hsides">
                    <thead>
                        <tr>
                            <th align="left" colspan="1" rowspan="1">Year of 1
                                <sup>st</sup> publication</th>
                            <th align="right" colspan="1" rowspan="1">Responses</th>
                        </tr>
                    </thead>
                    <tbody>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">Before 1991</td>
                            <td align="right" colspan="1" rowspan="1">2763</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">1991&#x2013;2000</td>
                            <td align="right" colspan="1" rowspan="1">3454</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">2001&#x2013;2005</td>
                            <td align="right" colspan="1" rowspan="1">2505</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">2006&#x2013;2010</td>
                            <td align="right" colspan="1" rowspan="1">3763</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">2011&#x2013;2016</td>
                            <td align="right" colspan="1" rowspan="1">4763</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">Not published (yet)</td>
                            <td align="right" colspan="1" rowspan="1">3300</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">No answer</td>
                            <td align="right" colspan="1" rowspan="1">115</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">
						
                                <bold>Total</bold>
					</td>
                            <td align="right" colspan="1" rowspan="1">
						
                                <bold>20663</bold>
					</td>
                        </tr>
                    </tbody>
                </table>
            </table-wrap>
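            <p>Combining career stage with research role, as suggested above, amounts to a cross-tabulation of the two variables. The following is a minimal sketch only: the column names <monospace>role</monospace> and <monospace>first_pub</monospace> and the five sample rows are made up for illustration and do not reflect the actual variable names or contents of the dataset.</p>
            <code language="python">
```python
import csv
import io
from collections import Counter

# Made-up rows; the column names "role" and "first_pub" are hypothetical.
SAMPLE = """role,first_pub
PhD student,Not published (yet)
Librarian,Not published (yet)
Professor,Before 1991
PhD student,2011-2016
Librarian,Not published (yet)
"""

def crosstab(rows, row_key, col_key):
    """Count records for every (row_key, col_key) value pair."""
    return Counter((r[row_key], r[col_key]) for r in rows)

rows = list(csv.DictReader(io.StringIO(SAMPLE)))
counts = crosstab(rows, "role", "first_pub")

# Split the 'not published (yet)' group by research role.
unpublished = {role: n for (role, stage), n in counts.items()
               if stage == "Not published (yet)"}
print(unpublished)
```
            </code>
            <p>In such a breakdown, unpublished respondents in student roles would point to early-career researchers, while unpublished respondents in roles such as librarian or publisher would point to positions in which publishing is not a primary task.</p>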
        </sec>
        <sec>
            <title>Population, sample size &amp; response rate estimation</title>
            <p>With an open, self-selected survey like this one there is no fixed sample size, so reporting response rates is not straightforward. However, we have estimated the total number of people targeted in our distribution efforts (1.4 million, 
                <xref ref-type="table" rid="T6">Table 6</xref>). This number represents an upper limit, as it does not account for overlap between the populations reached through the various modes of distribution. Based on this estimate, the overall response rate is 1.5%. We can also relate the number of responses to officially reported numbers of researchers (i.e. response compared with total target population) and look at response rates from specific partners that distributed the survey to a defined number of researchers (i.e. response of a subset of the population). This latter approach also allows for comparison of response rates across different modes of distribution. For instance, where the survey was distributed via a mass mailing, the response rate varied between 1 and 10 percent and was reached in less than a week. Where partners used an indirect message to an undefined set of people (e.g. a message on an intranet or on social media), very few responses were generated (typically a few dozen, even when the stated target group contained many thousands of people), and it often took months to reach that number.</p>
            <table-wrap id="T6" orientation="portrait" position="anchor">
                <label>Table 6. </label>
                <caption>
                    <title>Population, sample size and response rate indicators.</title>
                </caption>
                <table content-type="article-table" frame="hsides">
                    <thead>
                        <tr>
                            <th colspan="1" rowspan="1"/>
                            <th align="right" colspan="1" rowspan="1">Size</th>
                            <th align="left" colspan="1" rowspan="1">Rate</th>
                        </tr>
                    </thead>
                    <tbody>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">Population size: worldwide number (head counts)
                                <break/>of researchers, based on [
                                <xref ref-type="bibr" rid="ref-2">2</xref>, p. 31]</td>
                            <td align="right" colspan="1" rowspan="1" valign="top">7.8 M</td>
                            <td colspan="1" rowspan="1"/>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">Sample size: estimation of total number of people
                                <break/>targeted by survey distribution;
                                <break/>breakdown:
                                <break/>- Twitter, direct (@ tweets, estimated)
                                <break/>- Twitter, indirect (general tweets, estimated)
                                <break/>- Mailing lists (not deduplicated)
                                <break/>- Others (blogs, meetings) (estimated)
                                <break/>- Distribution by custom URL partners (estimated),
                                <break/>among which:
                                <break/>- - Universities
                                <break/>- - Publishers
                                <break/>- - Hospitals
                                <break/>- - Others</td>
                            <td align="right" colspan="1" rowspan="1" valign="top">~1.4 M
                                <break/>
                                <break/>
						
                                <break/>2700
                                <break/>8773
                                <break/>25799
                                <break/>7000
                                <break/>~1.3 M
                                <break/>
						
                                <break/>155921
                                <break/>1136401
                                <break/>6333
                                <break/>17033</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">~18% (=relative sample size)</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">Response size</td>
                            <td align="right" colspan="1" rowspan="1">20663</td>
                            <td align="left" colspan="1" rowspan="1">1.5% (= response rate)</td>
                        </tr>
                    </tbody>
                </table>
            </table-wrap>
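            <p>The two rates in Table 6 follow directly from the three sizes reported there. As a minimal sketch, using only figures copied from Table 6 (the variable and function names are illustrative):</p>
            <code language="python">
```python
# Figures copied from Table 6; variable names are illustrative only.
POPULATION = 7_800_000   # worldwide researcher head count
TARGETED = 1_400_000     # estimated number of people reached (upper limit)
RESPONSES = 20_663       # survey responses received

def percentage(part, whole):
    """Return part/whole expressed as a percentage."""
    return 100.0 * part / whole

relative_sample_size = percentage(TARGETED, POPULATION)
response_rate = percentage(RESPONSES, TARGETED)

# Sum of the custom URL partner channels listed in Table 6.
partner_reach = 155_921 + 1_136_401 + 6_333 + 17_033

print(f"relative sample size: {relative_sample_size:.0f}%")
print(f"response rate: {response_rate:.1f}%")
print(f"custom URL partner reach: {partner_reach}")
```
            </code>
            <p>Because the targeted number is an upper limit, a response rate computed this way should be read as a conservative estimate.</p>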
        </sec>
        <sec>
            <title>Completeness of the responses</title>
            <p>Not all respondents answered every question, and not all answers were valid. 
                <xref ref-type="table" rid="T7">Table 7</xref> shows the number of answers per question and, where applicable, the number of valid answers. It also shows how many respondents indicated that they used tools other than those offered as preset answers (or had another research role), and how many of those specified these other tools or research roles.</p>
            <table-wrap id="T7" orientation="portrait" position="float">
                <label>Table 7. </label>
                <caption>
                    <title>Number of answers per survey question.</title>
                    <p># answers = total number of answers per survey question; # answers valid (*) = number of valid answers per survey question (where applicable); # answers yes (**) = number of respondents answering &#x2018;yes&#x2019; per survey question (where applicable); # others = number of respondents that checked the &#x2018;other&#x2019; option per survey question (where applicable); # others specified = number of respondents that specified &#x2018;others&#x2019; as free text answers.</p>
                </caption>
                <table content-type="article-table" frame="hsides">
                    <thead>
                        <tr>
                            <th align="left" colspan="1" rowspan="1">Question</th>
                            <th align="left" colspan="1" rowspan="1"># answers</th>
                            <th align="left" colspan="1" rowspan="1"># answers
                                <break/>valid* or
                                <break/>yes**</th>
                            <th align="left" colspan="1" rowspan="1"># others</th>
                            <th align="left" colspan="1" rowspan="1"># others
                                <break/>specified</th>
                        </tr>
                    </thead>
                    <tbody>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">
								
                                <bold>
									
                                    <italic toggle="yes">Demographics</italic>
								</bold>
							</td>
                            <td colspan="1" rowspan="1"/>
                            <td colspan="1" rowspan="1"/>
                            <td colspan="1" rowspan="1"/>
                            <td colspan="1" rowspan="1"/>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">Research role</td>
                            <td align="right" colspan="1" rowspan="1">20663</td>
                            <td colspan="1" rowspan="1"/>
                            <td align="right" colspan="1" rowspan="1">1534</td>
                            <td align="right" colspan="1" rowspan="1">1531</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">Country</td>
                            <td align="right" colspan="1" rowspan="1">20663</td>
                            <td align="right" colspan="1" rowspan="1">20608*</td>
                            <td colspan="1" rowspan="1"/>
                            <td colspan="1" rowspan="1"/>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">Discipline</td>
                            <td align="right" colspan="1" rowspan="1">20663</td>
                            <td colspan="1" rowspan="1"/>
                            <td colspan="1" rowspan="1"/>
                            <td colspan="1" rowspan="1"/>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">Year of 1st publication</td>
                            <td align="right" colspan="1" rowspan="1">20548</td>
                            <td colspan="1" rowspan="1"/>
                            <td colspan="1" rowspan="1"/>
                            <td colspan="1" rowspan="1"/>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">
								
                                <bold>
									
                                    <italic toggle="yes">Tool usage per activity</italic>
								</bold>
							</td>
                            <td colspan="1" rowspan="1"/>
                            <td colspan="1" rowspan="1"/>
                            <td colspan="1" rowspan="1"/>
                            <td colspan="1" rowspan="1"/>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">Search</td>
                            <td align="right" colspan="1" rowspan="1">20453</td>
                            <td colspan="1" rowspan="1"/>
                            <td align="right" colspan="1" rowspan="1">8009</td>
                            <td align="right" colspan="1" rowspan="1">7340</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">Alerts</td>
                            <td align="right" colspan="1" rowspan="1">20238</td>
                            <td colspan="1" rowspan="1"/>
                            <td align="right" colspan="1" rowspan="1">3479</td>
                            <td align="right" colspan="1" rowspan="1">2933</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">Access</td>
                            <td align="right" colspan="1" rowspan="1">16463</td>
                            <td colspan="1" rowspan="1"/>
                            <td align="right" colspan="1" rowspan="1">4900</td>
                            <td align="right" colspan="1" rowspan="1">4276</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">Read</td>
                            <td align="right" colspan="1" rowspan="1">20029</td>
                            <td colspan="1" rowspan="1"/>
                            <td align="right" colspan="1" rowspan="1">3584</td>
                            <td align="right" colspan="1" rowspan="1">3271</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">Analyze</td>
                            <td align="right" colspan="1" rowspan="1">18577</td>
                            <td colspan="1" rowspan="1"/>
                            <td align="right" colspan="1" rowspan="1">6876</td>
                            <td align="right" colspan="1" rowspan="1">6366</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">Share protocols/notebooks</td>
                            <td align="right" colspan="1" rowspan="1">7426</td>
                            <td colspan="1" rowspan="1"/>
                            <td align="right" colspan="1" rowspan="1">5015</td>
                            <td align="right" colspan="1" rowspan="1">3540</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">Write</td>
                            <td align="right" colspan="1" rowspan="1">20354</td>
                            <td colspan="1" rowspan="1"/>
                            <td align="right" colspan="1" rowspan="1">2354</td>
                            <td align="right" colspan="1" rowspan="1">2186</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">Reference management</td>
                            <td align="right" colspan="1" rowspan="1">16471</td>
                            <td colspan="1" rowspan="1"/>
                            <td align="right" colspan="1" rowspan="1">2908</td>
                            <td align="right" colspan="1" rowspan="1">2268</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">Share publications</td>
                            <td align="right" colspan="1" rowspan="1">15658</td>
                            <td colspan="1" rowspan="1"/>
                            <td align="right" colspan="1" rowspan="1">3477</td>
                            <td align="right" colspan="1" rowspan="1">2961</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">Share data/code</td>
                            <td align="right" colspan="1" rowspan="1">7516</td>
                            <td colspan="1" rowspan="1"/>
                            <td align="right" colspan="1" rowspan="1">3660</td>
                            <td align="right" colspan="1" rowspan="1">2239</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">Select journal</td>
                            <td align="right" colspan="1" rowspan="1">11901</td>
                            <td colspan="1" rowspan="1"/>
                            <td align="right" colspan="1" rowspan="1">3071</td>
                            <td align="right" colspan="1" rowspan="1">2277</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">Publish</td>
                            <td align="right" colspan="1" rowspan="1">15646</td>
                            <td colspan="1" rowspan="1"/>
                            <td align="right" colspan="1" rowspan="1">1931</td>
                            <td align="right" colspan="1" rowspan="1">1277</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">Share posters/presentations</td>
                            <td align="right" colspan="1" rowspan="1">7752</td>
                            <td colspan="1" rowspan="1"/>
                            <td align="right" colspan="1" rowspan="1">3219</td>
                            <td align="right" colspan="1" rowspan="1">1994</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">Outreach</td>
                            <td align="right" colspan="1" rowspan="1">11539</td>
                            <td colspan="1" rowspan="1"/>
                            <td align="right" colspan="1" rowspan="1">3899</td>
                            <td align="right" colspan="1" rowspan="1">2932</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">Researcher profiles</td>
                            <td align="right" colspan="1" rowspan="1">17374</td>
                            <td colspan="1" rowspan="1"/>
                            <td align="right" colspan="1" rowspan="1">1583</td>
                            <td align="right" colspan="1" rowspan="1">1239</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">Peer review</td>
                            <td align="right" colspan="1" rowspan="1">4783</td>
                            <td colspan="1" rowspan="1"/>
                            <td align="right" colspan="1" rowspan="1">2010</td>
                            <td align="right" colspan="1" rowspan="1">495</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">Measure impact</td>
                            <td align="right" colspan="1" rowspan="1">13213</td>
                            <td colspan="1" rowspan="1"/>
                            <td align="right" colspan="1" rowspan="1">1872</td>
                            <td align="right" colspan="1" rowspan="1">1304</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">Language-specific tools</td>
                            <td align="right" colspan="1" rowspan="1">2238</td>
                            <td colspan="1" rowspan="1"/>
                            <td align="right" colspan="1" rowspan="1">207</td>
                            <td align="right" colspan="1" rowspan="1">116</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">
                                <bold>
                                    <italic toggle="yes">Other questions</italic>
                                </bold>
                            </td>
                            <td colspan="1" rowspan="1"/>
                            <td colspan="1" rowspan="1"/>
                            <td colspan="1" rowspan="1"/>
                            <td colspan="1" rowspan="1"/>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">Most important development</td>
                            <td align="right" colspan="1" rowspan="1">12209</td>
                            <td align="right" colspan="1" rowspan="1">12060</td>
                            <td colspan="1" rowspan="1"/>
                            <td colspan="1" rowspan="1"/>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">Support Open Access</td>
                            <td align="right" colspan="1" rowspan="1">19013</td>
                            <td colspan="1" rowspan="1"/>
                            <td colspan="1" rowspan="1"/>
                            <td colspan="1" rowspan="1"/>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">Support Open Science</td>
                            <td align="right" colspan="1" rowspan="1">19157</td>
                            <td colspan="1" rowspan="1"/>
                            <td colspan="1" rowspan="1"/>
                            <td colspan="1" rowspan="1"/>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">E-mail address</td>
                            <td align="right" colspan="1" rowspan="1">9562</td>
                            <td colspan="1" rowspan="1"/>
                            <td colspan="1" rowspan="1"/>
                            <td colspan="1" rowspan="1"/>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1">Can we contact you?</td>
                            <td align="right" colspan="1" rowspan="1">18464</td>
                            <td align="right" colspan="1" rowspan="1">10033**</td>
                            <td colspan="1" rowspan="1"/>
                            <td colspan="1" rowspan="1"/>
                        </tr>
                    </tbody>
                </table>
            </table-wrap>
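            <p>Two simple completeness indicators can be derived from Table 7: the share of all 20663 respondents who answered a question, and the share of respondents ticking &#x2018;other&#x2019; who also specified it. The sketch below uses three rows copied from Table 7; the function and variable names are illustrative.</p>
            <code language="python">
```python
# (answers, others, others specified) copied from Table 7 for three activities.
TABLE_7 = {
    "Search": (20453, 8009, 7340),
    "Share data/code": (7516, 3660, 2239),
    "Peer review": (4783, 2010, 495),
}
TOTAL_RESPONSES = 20663  # all survey responses

def completeness(answers, others, specified, total=TOTAL_RESPONSES):
    """Return (answer rate, share of 'other' ticks that were specified)."""
    return answers / total, specified / others

rates = {name: completeness(*row) for name, row in TABLE_7.items()}
for name, (answer_rate, specified_share) in rates.items():
    print(f"{name}: answered {answer_rate:.1%}, specified {specified_share:.1%}")
```
            </code>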
        </sec>
        <sec>
            <title>Anonymization of the data</title>
            <p>On our website and in the survey itself, we guaranteed participants only anonymized data would be shared. We anonymized the data by:</p>
            <p>
				
                <list list-type="bullet">
                    <list-item>
                        <p>Removing email addresses where given;</p>
                    </list-item>
                    <list-item>
                        <p>Removing information on the specific custom URL through which the response was received;</p>
                    </list-item>
                    <list-item>
                        <p>Generalizing research role specifications where traceable to specific persons (either directly or through combining with other information);</p>
                    </list-item>
                    <list-item>
                        <p>Generalizing information given about the country of affiliation (sometimes much more detailed affiliations were given);</p>
                    </list-item>
                    <list-item>
                        <p>Removing identifiable information from free text answers.</p>
                    </list-item>
                </list>
			</p>
            <p>We had to be extra careful because we share not only the full data but also subsets containing just the data of respondents invited by the respective partners through the custom survey URLs. Where those partners were academic institutions or hospitals, they know the institutional affiliation of respondents in that subset, which makes identification from free text answers more likely.</p>
        </sec>
        <sec>
            <title>Cleaning and harmonization of the data</title>
            <p>For the cleaned dataset we harmonized free-text answers by correcting spelling (of e.g. country names and tool names), unifying acronyms and full names, and grouping similar answers that used different phrasing (e.g. &#x201c;library databases&#x201d; and &#x201c;bibliographic databases&#x201d;). For country of affiliation, we also replaced names of areas that constitute part of a country with the name of the country as a whole. For this we used the 
                <ext-link ext-link-type="uri" xlink:href="https://web.archive.org/web/20160321160118/http:/www.un.org/en/members/">UN list of member and observer states</ext-link>. For instance, responses from people in overseas areas of France and Britain were assigned the main country as country of affiliation. Among the answers specifying other tools used for a certain activity, responses that contained identifying information and could not be generalized to a more generic tool name were categorized as &#x201c;other&#x201d;. Answers in which respondents indicated that they either use no specific tool for an activity or do not engage in the activity were removed. As we chose not to let respondents specify reasons for not answering questions, these answers are conceptually no different from cases where respondents skipped a question altogether.</p>
            <p>Both raw answers and cleaned/harmonized answers are available as separate datafiles, but identifying information is removed from raw answers to guarantee anonymity (see above).</p>
        </sec>
        <sec>
            <title>Reverse translation of foreign language answers</title>
            <p>Reverse translation of answers given in languages other than English was initially done by using 
                <ext-link ext-link-type="uri" xlink:href="https://translate.google.co.uk/">Google Translate</ext-link>. The use of automated translation was justified as most answers contained just simple text, e.g. names or descriptions of tools used. For the answers on the open question on expectations of the most important development in scholarly communication, translations provided by Google Translate were manually checked by the authors for French and Spanish, and in cases of doubt help from a native speaker with domain knowledge was requested. Free text answers to this question given in Chinese, Arabic, Russian and Japanese were also translated by a professional translation service. These translations were compared with the Google Translate texts and in cases of major discrepancies the translations were put before a native speaker with domain knowledge. In all cases, both the original answers and the most suitable translation are provided in the dataset, except where identifying information was removed from raw answers to guarantee anonymity (see above).</p>
        </sec>
        <sec>
            <title>Observed and expected biases in the data</title>
            <p>Given the nature of the data collection, we expect biases to be present in the data. The demographic data we collected can be used both to assess biases (by comparing against known distributions within the target population) and to overcome them, e.g. by zooming in during analyses. For instance, if the distribution over research roles seems disproportionate, one could focus analysis on one group only. Where that is not viable, raking is a statistical method that can be used to correct distributions, provided the distribution in the overall population is known. Of course, this only needs to be done if one suspects the variable at hand to be correlated with that distribution.</p>
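            <p>To illustrate raking, the sketch below applies iterative proportional fitting to a two-way table of counts, alternately rescaling rows and columns until both sets of totals match known population totals. All numbers are hypothetical and chosen only for illustration; the function name is ours, every cell is assumed to be positive, and the row and column targets are assumed to add up to the same grand total. It is not the procedure used for this dataset.</p>
            <code language="python">
```python
def rake(table, row_targets, col_targets, iterations=100, tol=1e-9):
    """Iterative proportional fitting on a 2-D table of positive counts."""
    fitted = [[float(cell) for cell in row] for row in table]
    for _ in range(iterations):
        # Scale each row so that it sums to its target total.
        for i, target in enumerate(row_targets):
            total = sum(fitted[i])
            fitted[i] = [cell * target / total for cell in fitted[i]]
        # Scale each column so that it sums to its target total.
        for j, target in enumerate(col_targets):
            total = sum(row[j] for row in fitted)
            for row in fitted:
                row[j] *= target / total
        # Stop once the row totals still agree after the column step.
        error = max(abs(sum(row) - t) for row, t in zip(fitted, row_targets))
        if tol > error:
            break
    return fitted

# Hypothetical sample counts: rows are two roles, columns two disciplines.
sample = [[40, 10], [30, 20]]
# Hypothetical known population totals, on the same scale as the sample.
role_totals = [60, 40]
discipline_totals = [50, 50]

fitted = rake(sample, role_totals, discipline_totals)
# Weight per cell: how much each respondent in that cell should count.
weights = [[f / s for f, s in zip(frow, srow)]
           for frow, srow in zip(fitted, sample)]
print(weights)
```
            </code>
            <p>Respondents in over-represented cells receive a weight below 1 and those in under-represented cells a weight above 1; the weights can then be applied when tabulating any variable suspected to correlate with the demographics used.</p>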
            <p>To check for regional bias we compared numbers of responses per country to the size of that country&#x2019;s GDP
                <sup>
                    <xref ref-type="bibr" rid="ref-4">4</xref>
                </sup>, which we took as a crude proxy for the number of researchers. 
                <xref ref-type="fig" rid="f4">Figure 4</xref> depicts that bias. Measured thus, the Netherlands and some other small European countries are represented far above average, and many West African and Central and Southeast Asian countries far below average or not at all. Given their large absolute sizes, the low levels of response in countries such as China and Korea are noteworthy.</p>
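            <p>One way to express such a comparison, shown here purely as a hedged sketch, is the ratio of a country's share of responses to its share of total GDP. The exact measure behind the figure is not specified in this note, so the index below is an assumption, and the function name and the numbers in the example are hypothetical.</p>
            <preformat>
```python
def representation_index(responses, gdp):
    """Share of responses divided by share of GDP, per country.

    responses: dict mapping country to number of survey responses
    gdp: dict mapping country to GDP (any consistent unit)
    Values above 1 suggest over-representation, values below 1
    under-representation; 0 means no responses at all.
    """
    total_responses = sum(responses.values())
    total_gdp = sum(gdp.values())
    index = {}
    for country, value in gdp.items():
        response_share = responses.get(country, 0) / total_responses
        gdp_share = value / total_gdp
        index[country] = response_share / gdp_share
    return index


# Made-up numbers for three fictional countries.
example = representation_index(
    {"Country A": 30, "Country B": 10},
    {"Country A": 1.0, "Country B": 2.0, "Country C": 1.0},
)
```
            </preformat>
            <p>Because GDP is only a crude proxy for the number of researchers, such an index indicates the direction of a regional bias rather than its precise size.</p>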
            <p>Biases not directly related to the demographic parameters included in the survey will be harder to assess. For instance, we were unable to confirm whether there is bias related to the degree to which people are interested in or concerned about scholarly communication issues.</p>
        </sec>
        <sec>
            <title>Data description, data storage and sharing</title>
            <p>The total size of both the raw and cleaned versions of the data is 20663 records and 178 variables, of which 162 are for the tools questions and 16 are for demographics and other general questions. The file format is CSV. These files, together with the 
                <xref ref-type="other" rid="SM1">supplementary material</xref>, are bundled into one zipped, citable data set with a DOI.</p>
            <p>The measurement level of the majority of the data is nominal (tools used, affiliation, role, discipline), in a few cases ordinal (indication of support for Open Access and Open Science) and only once interval (year ranges for year of first publication).</p>
            <p>For permanent storage, the anonymized data are deposited in Zenodo under a CC0 license. In addition, raw data will be stored for up to five years on secure Utrecht University servers for further analysis, with email information in files separate from the rest of the data.</p>
            <p>In addition, we have made the data available through an interactive dashboard on Silk (
                <ext-link ext-link-type="uri" xlink:href="http://dashboard101innovations.silk.co/">http://dashboard101innovations.silk.co/</ext-link>) to enable quick visual exploration of the data.</p>
        </sec>
        <sec>
            <title>Consent</title>
            <p>The research is subject to the code of conduct of the Dutch Association of Universities (VSNU)
                <sup>
                    <xref ref-type="bibr" rid="ref-3">3</xref>
                </sup>.</p>
        </sec>
        <sec>
            <title>Data availability</title>
            <p>The data referenced by this article are under copyright with the following copyright statement: Copyright: &#x00a9; 2016 Kramer B and Bosman J</p>
            <p>Data associated with the article are available under the terms of the Creative Commons Zero "No rights reserved" data waiver (CC0 1.0 Public domain dedication).
                <ext-link ext-link-type="uri" xlink:href="http://creativecommons.org/publicdomain/zero/1.0/"/>
            </p>
            <p>
                <italic toggle="yes">Zenodo</italic>: Global survey on research tool usage, doi: 
                <ext-link ext-link-type="uri" xlink:href="https://dx.doi.org/10.5281/zenodo.49583">https://dx.doi.org/10.5281/zenodo.49583</ext-link>
                <sup>
                    <xref ref-type="bibr" rid="ref-1">1</xref>
                </sup>
            </p>
        </sec>
    </body>
    <back>
        <ack>
            <title>Acknowledgments</title>
            <p>We gratefully acknowledge support of 108 partners that agreed to distribute the survey among their constituency, as well as the people who generously spent time and effort in reviewing the translations and testing the survey implementation in their native language.</p>
        </ack>
        <sec id="SM1" sec-type="supplementary-material">
            <title>Supplementary material</title>
            <list list-type="bullet">
                <list-item>
                    <p>
                        <ext-link ext-link-type="uri" xlink:href="https://f1000researchdata.s3.amazonaws.com/supplementary/8414/300a78fa-e12e-4789-ab1f-4cb604012e9a.zip">Survey questionnaire text and graphics, in English, Spanish, French, Chinese, Japanese, Russian and Arabic</ext-link>.</p>
                </list-item>
            </list>
        </sec>
        <ref-list>
            <ref id="ref-1">
                <label>1</label>
                <mixed-citation publication-type="data">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Bosman</surname>
                            <given-names>J</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Kramer</surname>
                            <given-names>B</given-names>
                        </name>
					</person-group>:
                    <article-title>Global survey on research tool usage.</article-title>
                    <source>
						
                        <italic toggle="yes">Zenodo.</italic>
					</source>
                    <year>2016</year>.
                    <ext-link ext-link-type="uri" xlink:href="https://dx.doi.org/10.5281/zenodo.49583">Data Source</ext-link>
                </mixed-citation>
            </ref>
            <ref id="ref-2">
                <label>2</label>
                <mixed-citation publication-type="book">
                    <collab>UNESCO</collab>:
                    <article-title>UNESCO science report - towards 2030.</article-title> Paris: UNESCO, <year>2015</year>.
                    <ext-link ext-link-type="uri" xlink:href="http://unesdoc.unesco.org/images/0023/002354/235407e.pdf">Reference Source</ext-link>
                </mixed-citation>
            </ref>
            <ref id="ref-3">
                <label>3</label>
                <mixed-citation publication-type="book">
                    <collab>VSNU</collab>:
                    <article-title>The Netherlands Code of conduct for academic practice.</article-title> (revised edition 2014). Amsterdam: VSNU, <year>2014</year>.
                    <ext-link ext-link-type="uri" xlink:href="http://vsnu.nl/files/documenten/Domeinen/Onderzoek/The_Netherlands_Code%20of_Conduct_for_Academic_Practice_2004_(version2014).pdf">Reference Source</ext-link>
                </mixed-citation>
            </ref>
            <ref id="ref-4">
                <label>4</label>
                <mixed-citation publication-type="book">
                    <collab>World Bank</collab>:
                    <article-title>GDP at market prices (current US$).</article-title> Washington: World Bank, <year>2016</year>.
                    <ext-link ext-link-type="uri" xlink:href="http://data.worldbank.org/indicator/NY.GDP.MKTP.CD">Reference Source</ext-link>
                </mixed-citation>
            </ref>
        </ref-list>
    </back>
    <sub-article article-type="reviewer-report" id="report14417">
        <front-stub>
            <article-id pub-id-type="doi">10.5256/f1000research.9058.r14417</article-id>
            <title-group>
                <article-title>Reviewer response for version 1</article-title>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author">
                    <name>
                        <surname>Peters</surname>
                        <given-names>Isabella</given-names>
                    </name>
                    <xref ref-type="aff" rid="r14417a1">1</xref>
                    <role>Referee</role>
                    <uri content-type="orcid">https://orcid.org/0000-0001-5840-0806</uri>
                </contrib>
                <contrib contrib-type="author">
                    <name>
                        <surname>Nuredini</surname>
                        <given-names>Kaltrina</given-names>
                    </name>
                    <xref ref-type="aff" rid="r14417a1">1</xref>
                    <role>Co-referee</role>
                </contrib>
                <aff id="r14417a1">
                    <label>1</label>ZBW Leibniz Information Centre for Economics, Kiel, Germany</aff>
            </contrib-group>
            <author-notes>
                <fn fn-type="conflict">
                    <p>
                        <bold>Competing interests: </bold>No competing interests were disclosed.</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>1</day>
                <month>7</month>
                <year>2016</year>
            </pub-date>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2016 Peters I and Nuredini K</copyright-statement>
                <copyright-year>2016</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access peer review report distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <related-article ext-link-type="doi" id="relatedArticleReport14417" related-article-type="peer-reviewed-article" xlink:href="10.12688/f1000research.8414.1"/>
            <custom-meta-group>
                <custom-meta>
                    <meta-name>recommendation</meta-name>
                    <meta-value>approve</meta-value>
                </custom-meta>
            </custom-meta-group>
        </front-stub>
        <body>
            <p>The authors present a data note which aims at describing a data set by giving details on how data was collected and processed and which software or protocols were used, but which will not provide an analysis of the data, results, or conclusions.</p>
            <p>The authors met these F1000Research requirements and, accordingly, describe the setup of the survey, give details on the sampling methods and ways of disseminating the survey request, and briefly introduce the distribution of responses. They also discuss the population, sample size and response rate as well as the completeness of responses and observed and expected biases in the data. Information on the post-hoc data processing (i.e., anonymization, cleaning, and harmonization of data) is also given. The data note finishes with a quantitative description of the data, how it is stored (i.e. openly on Zenodo, as required by F1000Research), and how it can be accessed.</p>
            <p>Overall, the description of the data and the data processing is sound, seems reasonable, and, as far as I can assess, meets the standards of studies of that kind. The data generation is also suitable for investigations of the usage of tools, and the data set will serve the understanding of scholarly communication in the digital era in general and on social media in particular. Moreover, the data set can answer not only whether researchers use particular tools but also for what purposes and in which steps of the research cycle.</p>
            <p>However, to get a more complete view of how exactly the data has been collected and processed, as well as to enhance repeatability of the study and to aid interpretation of results in later research making use of this data set, I recommend adding information on the following questions (which mostly refer to initial premises set by the authors of the survey and which have to be known in order to comprehend the processing steps that have been taken): 
                <list list-type="order">
                    <list-item>
                        <p>Activities were selected from a database developed by the authors. How did you create the database &#x2013; how did you find the entries? Could tool providers register themselves? How complete is it?</p>
                    </list-item>
                    <list-item>
                        <p>Described activities in the survey were chosen for their overall importance/ the most well-known tools were selected as answers: How do you define &#x201c;overall importance&#x201d; and &#x201c;most well-known&#x201d;? How did you determine this selection? Can you provide evidence (even if this is a data note)? Have you taken into account disciplinary peculiarities?</p>
                    </list-item>
                    <list-item>
                        <p>Was it possible to choose more than one tool as answer in the survey (adding up to answer numbers &gt;100%)?</p>
                    </list-item>
                    <list-item>
                        <p>Was it possible for participants to answer the survey more than once? Have you detected any bot-like behavior?</p>
                    </list-item>
                    <list-item>
                        <p>Six obvious spam answers have been removed from the data set: can you give examples of what was considered spam?</p>
                    </list-item>
                </list>
            </p>
            <p>Reviewer Expertise:</p>
            <p>NA</p>
            <p>We confirm that we have read this submission and believe that we have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.</p>
        </body>
    </sub-article>
    <sub-article article-type="reviewer-report" id="report14542">
        <front-stub>
            <article-id pub-id-type="doi">10.5256/f1000research.9058.r14542</article-id>
            <title-group>
                <article-title>Reviewer response for version 1</article-title>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author">
                    <name>
                        <surname>Illingworth</surname>
                        <given-names>Samuel</given-names>
                    </name>
                    <xref ref-type="aff" rid="r14542a1">1</xref>
                    <role>Referee</role>
                </contrib>
                <aff id="r14542a1">
                    <label>1</label>School of Research, Enterprise &amp; Innovation, Manchester Metropolitan University, Manchester, UK</aff>
            </contrib-group>
            <author-notes>
                <fn fn-type="conflict">
                    <p>
                        <bold>Competing interests: </bold>No competing interests were disclosed.</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>28</day>
                <month>6</month>
                <year>2016</year>
            </pub-date>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2016 Illingworth S</copyright-statement>
                <copyright-year>2016</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access peer review report distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <related-article ext-link-type="doi" id="relatedArticleReport14542" related-article-type="peer-reviewed-article" xlink:href="10.12688/f1000research.8414.1"/>
            <custom-meta-group>
                <custom-meta>
                    <meta-name>recommendation</meta-name>
                    <meta-value>approve</meta-value>
                </custom-meta>
            </custom-meta-group>
        </front-stub>
        <body>
            <p>This is an exceptionally well-designed survey, which was carried out professionally and effectively. The results of this survey will be incredibly useful for future researchers who want to gain an insight into current practices relating to innovations in scholarly communications.&#x00a0;</p>
            <p>The transparency of this data set, both upon completion and throughout the collection period, is commendable and is something that I would like to hold up as an example of best practice. The authors have worked tirelessly to ensure that this data set is of the greatest possible value to the wider research community. In particular, the use of the WordPress blog and the presentation of the final data set on Silk are processes that I would like to see repeated by others.</p>
            <p>I have only a couple of queries relating to the survey's design and implementation: 
                <list list-type="order">
                    <list-item>
                        <p>What quota sampling strategy was used? In the&#x00a0;
                            <bold>Distribution of the survey; sampling </bold> section the authors mention that quota sampling was used, but how was this done, which quotas were selected, and why were they chosen?</p>
                    </list-item>
                    <list-item>
                        <p>In the&#x00a0;
                            <bold>Translation of the survey</bold>&#x00a0;section the authors mention that the Arabic survey did not include "one additional question at the end of the survey on the use of tools targeting that specific language area." Why was this the case?</p>
                    </list-item>
                </list> Apart from these two small details, I would like to commend the authors on such an excellent dataset, which sets a very high standard from research design right through to dissemination of results. I am also very much looking forward to what future analysis of the data will reveal about current&#x00a0;practices relating to innovations in scholarly communications.</p>
            <p>Reviewer Expertise:</p>
            <p>NA</p>
            <p>I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.</p>
        </body>
    </sub-article>
</article>
