Skip to main content
Dryad

Data from: Impact of lexical and sentiment factors on the popularity of scientific papers

Cite this dataset

Sienkiewicz, Julian; Altmann, Eduardo G. (2016). Data from: Impact of lexical and sentiment factors on the popularity of scientific papers [Dataset]. Dryad. https://doi.org/10.5061/dryad.nj938

Abstract

We investigate how textual properties of scientific papers relate to the number of citations they receive. Our main finding is that correlations are nonlinear and affect differently the most cited and typical papers. For instance, we find that, in most journals, short titles correlate positively with citations only for the most cited papers, whereas for typical papers, the correlation is usually negative. Our analysis of six different factors, calculated both at the title and abstract level of 4.3 million papers in over 1500 journals, reveals the number of authors, and the length and complexity of the abstract, as having the strongest (positive) influence on the number of citations.

Usage notes