ALL Metrics
-
Views
-
Downloads
Get PDF
Get XML
Cite
Export
Track
Method Article

A citation-based, author- and age-normalized, logarithmic index for evaluation of individual researchers independently of publication counts

[version 1; peer review: 2 approved]
PUBLISHED 22 Sep 2015
Author details Author details
OPEN PEER REVIEW
REVIEWER STATUS

Abstract

The use of citation metrics for evaluation of individual researchers has dramatically increased over the last decade. However, currently existing indices either are based on misleading premises or are cumbersome to implement. This leads to poor assessment of researchers and creates dangerous trends in science, such as overproduction of low quality articles. Here we propose an index (namely, the L-index) that does not depend on the number of publications, accounts for different co-author contributions and age of publications, and scales from 0.0 to 9.9. Moreover, it can be calculated with the help of freely available software.

Keywords

Research, scientific, metrics, impact, weighted, bibliometric, scientometric, Publish or Perish

Introduction

There is an ever-present need to evaluate researchers’ performance, because resources are limited and contenders are numerous. Before the advent of journal impact factors (JIFs,1), all evaluations were performed via peer review. Although JIFs were intended for librarians to decide which journals to subscribe to, they have become a commonly used proxy for the quality of journal articles2. However, the distribution of citations to individual articles within a journal is highly skewed. Twenty five percent of the most highly cited articles can account for 90% of a journal’s IF3. The rest of the articles receive a few citations each, if any. Thus, using JIFs for assessing the quality of individual articles and, further, for evaluating researchers is categorically not recommended4.

The rapid development of the internet and electronic citation databases, such as PubMed, Google Scholar, Web of Science, Scopus, CiteSeer and others, has made it easier to count citations of individual articles. It is now possible to automatically calculate the total number of citations that the publications of a given researcher have accumulated. However, these numbers can range from 1 to 100,000 and, obviously, do not represent the equal variation in researchers’ capabilities. For example, human IQ scores vary only about 2–4 fold5.

There are two main reasons why the total number of citations cannot be used to adequately compare individual researchers. First, there is usually more than one author for each publication. Some of the most highly cited articles, such as reports from experiments on particle accelerators6 or from genome sequencings7, and guidelines for medical practitioners8, have from tens to hundreds of co-authors, usually listed in alphabetical order. Thus, it is inadequate to assign all the thousands of citations to each of those authors. A similar situation is when a researcher operates some very expensive and thus rare equipment, and is listed on papers of all other researchers who perform experiments on that equipment9. It is clear that the researcher also does not deserve all citations of those papers, as his contribution is purely technical. Many ways to divide citations between co-authors have been proposed10,11, but the only practical way is to split them equally, i.e. to assign each co-author 1/n of the citations, where n is the total number of co-authors.

Another factor is that citations accumulate with time. A researcher who started his career 30 years ago will undoubtedly have more citations than a young postdoc, but this does not necessarily mean that the former is a better scientist. Moreover, a paper that has been highly cited is likely to be cited even more in the future12. Thus, citations exhibit the behavior of preferential attachment, which results in their distribution according to the power law13. These considerations make it necessary to adjust for the age of each publication, in order to properly assess current capabilities and impact of researchers, not their past successes, and to partially compensate for the preferential attachment. Dividing the number of citations by the age of the publication in years seems to be an adequate measure, as it mirrors the power law distribution that citations have.

Finally, a large variety of individual citation metrics have been proposed14, the most widely disseminated of which is the h-index15. The drawback of the majority of these metrics is that they take into consideration the number of publications. For example, the h-index can never exceed the total number of publications a scientist has. However, several researchers of undisputed scientific merit, such as Sir Isaac Newton, Gregor Mendel or Peter Higgs, have published only a few, however significant, works. This lack in the number of publications leads them to have h-indices of 4, 1 and 9, respectively, which are disparagingly low. All other derivatives of the h-index, as well as all indices that take into account publication counts, suffer from the same drawback and hence should never be used for evaluation purposes. However, they are used, promoting a grueling and futile quest for quantity of publications, at the expense of quality, reflected in the infamous “publish or perish” catchphrase16. Fortunately, this issue has been recently called to public attention, most notably in the form of the San Francisco Declaration on Research Assessment, and some measures have been proposed17.

Overall, there is an immense need for a simple but reliable indicator for individual researcher assessment. Here, we propose such an index, which accounts for different co-author contributions and age of publications, and does not depend on the number of publications. Moreover, it conveniently ranges from 0.0 to 9.9, and can be calculated with the help of freely available software.

Methods

To address the concerns highlighted in the introduction of this article, we have set out to construct an index that accounts for different co-author contributions and age of publications. This has initially led us to the following formula:

I=i=1Nciaiyi(1)

where Ipreliminary index, ci – number of citations to i-th publication, ai – number of authors of i-th publication, yi – age in years of i-th publication, N – number of publications.

We then decided to estimate the range of values that our preliminary index can have. First, we calculated I for a hypothetical PhD student who recently received the first citation to his first paper, with 5 authors: I=15×1=0.2. Next, we calculated I for Albert Einstein and Charles Darwin, two of the most prominent and well-known scientists. To this aim, we utilized the freely available software Publish or Perish. This program imports citation data from Google Scholar and allows removal of irrelevant results, such as publications of homonym authors. A parameter AWCRpA (age-weighted citation rate per author) can be obtained from this program, and is equivalent to I from formula (1). The AWCRpA values for Einstein and Darwin, as calculated by Publish or Perish at the time of writing this article, were 6466 and 6178, respectively. Thus, even upon correcting for multiple authorship and age, citations vary by approximately 30,000 fold. As it is unlikely that the human brain can exhibit such a tremendous difference in its efficiency or talents, normalized citations should be mapped to a more appropriate scale, in order to make them more useful in meaningfully comparing researchers. It seems that a 1–10 scale is optimal, because it is closer to the true variation in human intellectual or other capabilities5, and is widely used in various metrics, thus being more intuitive. The natural logarithm function appears to be ideal for this scaling purpose. Compared to the square root or the cube root, the natural logarithm allows better resolution of differences between the majority of scientists, with the exception of the most prominent ones (Figure 1). To account for some of the negative values that arise when there is less than one normalized citation, the resulting index is increased by one point. In extreme cases where the index value still remains negative, it is advised to simply consider it as zero.

3d28ba53-ad21-41d2-b627-bbb33b5ffcf5_figure1.gif

Figure 1. The natural logarithm function increases evenly across the citation range.

The natural logarithm (ln cn), cube root (cn3) and square root (cn) functions of normalized citations (cn) are shown.

Finally, the formula for the Logarithm index (L-index) has become:

L=ln(i=1Nciaiyi)+1(2)

where ci – number of citations to i-th publication, ai – number of authors of i-th publication, yi – age in years of i-th publication, N – number of publications.

When i=1Nciaiyi is calculated as AWCRpA in Publish or Perish, formula (2) takes the form:

L=lnAWCRpA+1(3)

Equation 3 has been used to obtain all L-index values in this article.

To calculate typical L-indices for a PhD student, a postdoc and a principal investigator (PI), we averaged the L-index values for 5 PhD students, 10 postdocs and 15 PIs that we personally know (see Supplementary Table 1).

Results and discussion

Figure 2 shows the typical scale of the L-index with the indication of the values for ten of some of the most prominent and widely recognized scientists, as well as the typical values for a PhD student, a postdoc and a principal investigator (PI) (see Supplementary Table 1).

3d28ba53-ad21-41d2-b627-bbb33b5ffcf5_figure2.gif

Figure 2. The L-index scale and notable examples.

The typical range of the L-index is shown. The positions of 10 of the most famous scientists are indicated, along with their L-index scores. The positions of a typical PhD student, a postdoc and a principal investigator (PI) are also displayed. The raw data can be seen in Supplementary Table 1.

It can be seen from this figure that the L-index adequately captures the intuitive ranking, i.e. PhD student < Postdoc < PI < … < Albert Einstein. Moreover, it allows the objective (or, at least, statistically averaged collective subjective) quantitative assessment of researchers, which is a virtue that traditional peer review cannot accomplish. However, in cases where L-indices of the applicants are equal up to one decimal place, we strongly suggest the use of peer review, involving thorough examination of their publications, rather than differentiation of scientists based on the second decimal place, to avoid false precision and statistical bias. In case of young researchers that have only a few citations, it is also advisable to use peer review, as the limited data do not allow for the statistically robust calculation of the citation index.

The L-index can increase or decrease with time, as it depends on the age of publications. Thus, it favors the impact of recent publications and gives a much needed advantage to younger researchers. However, if a scientist has made such a significant discovery that its impact only increases with time, his L-index will stay high regardless of the age of the publication. Perfect examples of this are Albert Einstein and Charles Darwin. Despite them ceasing to publish original work decades ago, their L-indices are still higher than those of the absolute majority of current researchers (Figure 2).

The quantitative comparison of the L-index with other evaluation indices, such as the h-index, is purposefully avoided in this article, for the reason that those indices have been designed on different premises, such as to account for the number of publications. When evaluating the performance of a researcher, it should first be decided which parameter is considered adequate for the purpose – the number of publications, which does not tell anything about their quality, or the number of citations, which, however indirectly, indicates the impact that the publications have made. If the latter option is selected, the L-index can help to account for the effects of multiple co-authorship and aging of publications, and present the results in a simple and intuitive form.

Comments on this article Comments (0)

Version 2
VERSION 2 PUBLISHED 22 Sep 2015
Comment
Author details Author details
Competing interests
Grant information
Copyright
Download
 
Export To
metrics
Views Downloads
F1000Research - -
PubMed Central
Data from PMC are received and updated monthly.
- -
Citations
CITE
how to cite this article
Belikov AV and Belikov VV. A citation-based, author- and age-normalized, logarithmic index for evaluation of individual researchers independently of publication counts [version 1; peer review: 2 approved]. F1000Research 2015, 4:884 (https://doi.org/10.12688/f1000research.7070.1)
NOTE: If applicable, it is important to ensure the information in square brackets after the title is included in all citations of this article.
track
receive updates on this article
Track an article to receive email alerts on any updates to this article.

Open Peer Review

Current Reviewer Status: ?
Key to Reviewer Statuses VIEW
ApprovedThe paper is scientifically sound in its current form and only minor, if any, improvements are suggested
Approved with reservations A number of small changes, sometimes more significant revisions are required to address specific details and improve the papers academic merit.
Not approvedFundamental flaws in the paper seriously undermine the findings and conclusions
Version 1
VERSION 1
PUBLISHED 22 Sep 2015
Views
111
Cite
Reviewer Report 17 Nov 2015
Danielle M. Colbert-Lewis, Department of James E. Shepard Memorial Library, North Carolina Central University, Durham, NC, USA 
Approved
VIEWS 111
This title is appropriate for the content of the article and the abstract represents a suitable summary for the work. The design, methods and analysis used in the article represented a thorough approach to the subject. The conclusions are suitable, ... Continue reading
CITE
CITE
HOW TO CITE THIS REPORT
Colbert-Lewis DM. Reviewer Report For: A citation-based, author- and age-normalized, logarithmic index for evaluation of individual researchers independently of publication counts [version 1; peer review: 2 approved]. F1000Research 2015, 4:884 (https://doi.org/10.5256/f1000research.7610.r10836)
NOTE: it is important to ensure the information in square brackets after the title is included in all citations of this article.
Views
146
Cite
Reviewer Report 16 Oct 2015
Youngim Jung, Korea Institute of Science and Technology Information, Daejeon, South Korea 
Approved
VIEWS 146
This paper proposes a novel index based on the number of citations, normalized by the number of authors and the date of publication, which is then scaled from 0.0 to 9.9 using the natural logarithm. The index proposed is intuitively ... Continue reading
CITE
CITE
HOW TO CITE THIS REPORT
Jung Y. Reviewer Report For: A citation-based, author- and age-normalized, logarithmic index for evaluation of individual researchers independently of publication counts [version 1; peer review: 2 approved]. F1000Research 2015, 4:884 (https://doi.org/10.5256/f1000research.7610.r10421)
NOTE: it is important to ensure the information in square brackets after the title is included in all citations of this article.

Comments on this article Comments (0)

Version 2
VERSION 2 PUBLISHED 22 Sep 2015
Comment
Alongside their report, reviewers assign a status to the article:
Approved - the paper is scientifically sound in its current form and only minor, if any, improvements are suggested
Approved with reservations - A number of small changes, sometimes more significant revisions are required to address specific details and improve the papers academic merit.
Not approved - fundamental flaws in the paper seriously undermine the findings and conclusions
Sign In
If you've forgotten your password, please enter your email address below and we'll send you instructions on how to reset your password.

The email address should be the one you originally registered with F1000.

Email address not valid, please try again

You registered with F1000 via Google, so we cannot reset your password.

To sign in, please click here.

If you still need help with your Google account password, please click here.

You registered with F1000 via Facebook, so we cannot reset your password.

To sign in, please click here.

If you still need help with your Facebook account password, please click here.

Code not correct, please try again
Email us for further assistance.
Server error, please try again.