Skip to main content

Advertisement

Log in

Google Scholar, Microsoft Academic, Scopus, Dimensions, Web of Science, and OpenCitations’ COCI: a multidisciplinary comparison of coverage via citations

  • Published:
Scientometrics Aims and scope Submit manuscript

A Correction to this article was published on 04 December 2020

This article has been updated

Abstract

New sources of citation data have recently become available, such as Microsoft Academic, Dimensions, and the OpenCitations Index of CrossRef open DOI-to-DOI citations (COCI). Although these have been compared to the Web of Science Core Collection (WoS), Scopus, or Google Scholar, there is no systematic evidence of their differences across subject categories. In response, this paper investigates 3,073,351 citations found by these six data sources to 2,515 English-language highly-cited documents published in 2006 from 252 subject categories, expanding and updating the largest previous study. Google Scholar found 88% of all citations, many of which were not found by the other sources, and nearly all citations found by the remaining sources (89–94%). A similar pattern held within most subject categories. Microsoft Academic is the second largest overall (60% of all citations), including 82% of Scopus citations and 86% of WoS citations. In most categories, Microsoft Academic found more citations than Scopus and WoS (182 and 223 subject categories, respectively), but had coverage gaps in some areas, such as Physics and some Humanities categories. After Scopus, Dimensions is fourth largest (54% of all citations), including 84% of Scopus citations and 88% of WoS citations. It found more citations than Scopus in 36 categories, more than WoS in 185, and displays some coverage gaps, especially in the Humanities. Following WoS, COCI is the smallest, with 28% of all citations. Google Scholar is still the most comprehensive source. In many subject categories Microsoft Academic and Dimensions are good alternatives to Scopus and WoS in terms of coverage.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10

Similar content being viewed by others

Explore related subjects

Discover the latest articles and news from researchers in related subjects, suggested using machine learning.

Change history

  • 04 December 2020

    In the original publication of the article, Fig. 5 was incorrectly published.

Notes

  1. https://web.archive.org/web/20170105184616/https:/academic.microsoft.com/FAQ

  2. https://i4oc.org/.

  3. https://www.elsevier.com/icsr/icsrlab.

  4. https://citeseerx.ist.psu.edu/index.

  5. https://www.researchgate.net/.

  6. https://www.lens.org/.

  7. https://www.semanticscholar.org/.

  8. In the case of COCI, the results cannot reflect the full coverage of CrossRef given the incomplete availability of reference lists in this source. Nevertheless, including it in the analysis will inform us of what proportion of citations are currently available in the public domain.

  9. Visser et al. (2020) found that a large number of citations missing from Microsoft Academic were caused by missing reference lists in the citing documents. As far as we know no study has analysed how many missing citations in Google Scholar are caused by missing reference lists.

  10. https://scholar.google.com/citations?view_op=list_classic_articles&hl=en&by=2006.

  11. https://msr-apis.portal.azure-api.net/docs/services/academic-search-api.

  12. https://www.dimensions.ai/scientometric-research/.

  13. https://osf.io/gnb72/ (2019 folder).

  14. Google Scholar/Microsoft Academic: https://osf.io/g8z42/; Google Scholar/Dimensions: https://osf.io/bwv5s/.

  15. Google Scholar/Microsoft Academic: https://osf.io/jqwah/; Google Scholar/Dimensions: https://osf.io/xnf24/.

  16. https://www.tandfonline.com/doi/abs/10.1191/1478088706QP063OA.

  17. https://osf.io/gmrju/.

  18. https://osf.io/bzha2/.

  19. https://osf.io/f36sn/.

  20. https://osf.io/7qzmk/.

  21. https://osf.io/4gtdc/.

  22. https://osf.io/ctzb7/.

  23. https://osf.io/rz4cj/.

  24. https://osf.io/v6bgy/.

  25. https://osf.io/vafzp/.

  26. https://osf.io/87cdh/.

  27. https://osf.io/bqpz4/.

  28. https://osf.io/p26ua/.

  29. https://osf.io/fngph/.

  30. https://osf.io/pdnxt/.

  31. https://osf.io/xjhfw/.

  32. https://osf.io/ugvh3/.

  33. https://osf.io/6vrnp/.

  34. https://osf.io/x9g3e/.

  35. https://osf.io/54xky/.

  36. https://osf.io/fa8sr/.

  37. https://osf.io/9584j/.

  38. https://osf.io/h7jt2/.

  39. https://osf.io/ghws2/.

  40. https://osf.io/gpyse/.

  41. https://osf.io/rsj4m/.

  42. https://osf.io/bvr3p/.

  43. https://osf.io/vmdbx/.

  44. https://osf.io/zd53e/.

  45. https://osf.io/q529p/.

  46. https://osf.io/qcdsh/.

  47. https://osf.io/sfd2g/.

  48. https://osf.io/a9mtx/.

  49. https://osf.io/n2e98/.

  50. https://osf.io/za5ks/.

  51. https://osf.io/3npwu/.

  52. https://osf.io/7qb8v/.

  53. https://osf.io/n5j8v/.

  54. https://albertomartin.shinyapps.io/citation_overlap_2019/.

References

Download references

Acknowledgements

We thank Medialab UGR (Universidad de Granada) for providing funding to cover the cost of hosting the interactive web applicationFootnote 54 created to explore the data used in this study. We thank Digital Science for providing free access to the Dimensions API. We thank Jing Xuan Xie for translating the abstract to Chinese. We thank Asura Enkhbayar for suggesting the use of an upset plot in Fig. 2. Lastly, we thank two anonymous reviewers for their thoughtful comments, which have helped improved the manuscript substantially.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Alberto Martín-Martín.

Additional information

The original online version of this article was revised: In the original publication of the article, Fig. 5 was incorrectly published.

Appendices

Appendix 1: Complete list of Venn diagrams computed for this study

No subject aggregation

 

Two-set Venn diagrams (all subject categories)

https://osf.io/bwpaq/

Three-set Venn diagrams (all subject categories)

https://osf.io/jkrge/

Aggregated by 8 subject areas

 

Google Scholar–Microsoft Academic–Scopus

https://osf.io/h7m8s/

Google Scholar–Microsoft Academic–Dimensions

https://osf.io/7v4kr/

Google Scholar–Microsoft Academic–Web of Science

https://osf.io/fn3yh/

Google Scholar–Microsoft Academic–COCI

https://osf.io/s3bmp/

Google Scholar–Scopus–Dimensions

https://osf.io/q8ecx/

Google Scholar–Scopus–Web of Science

https://osf.io/qkc2a/

Google Scholar–Scopus–COCI

https://osf.io/mrvdb/

Google Scholar–Dimensions–Web of Science

https://osf.io/nwm83/

Google Scholar–Dimensions–COCI

https://osf.io/dzs5x/

Google Scholar–Web of Science–COCI

https://osf.io/64chg/

Microsoft Academic–Scopus–Dimensions

https://osf.io/hgzn6/

Microsoft Academic–Scopus–Web of Science

https://osf.io/f7xpa/

Microsoft Academic–Scopus–COCI

https://osf.io/c6tpz/

Microsoft Academic–Dimensions–Web of Science

https://osf.io/f5zxs/

Microsoft Academic–Dimensions–COCI

https://osf.io/ry87a/

Microsoft Academic–Web of Science–COCI

https://osf.io/vxyj4/

Scopus–Dimensions–Web of Science

https://osf.io/xqg3y/

Scopus–Dimensions–COCI

https://osf.io/jmvb6/

Aggregated by 252 subject categories (zipped)

 

Google Scholar–Microsoft Academic

https://osf.io/v4ek3/

Google Scholar–Scopus

https://osf.io/umsyw/

Google Scholar–Dimensions

https://osf.io/jqmuy/

Google Scholar–Web of Science

https://osf.io/4b8uq/

Google Scholar–COCI

https://osf.io/gytuh/

Microsoft Academic–Scopus

https://osf.io/jw2bt/

Microsoft Academic–Dimensions

https://osf.io/a2mp7/

Microsoft Academic–Web of Science

https://osf.io/2hkxq/

Microsoft Academic–COCI

https://osf.io/ch4gb/

Scopus–Dimensions

https://osf.io/q4swk/

Scopus–Web of Science

https://osf.io/qcpbh/

Scopus–COCI

https://osf.io/2xvbh/

Dimensions–Web of Science

https://osf.io/pdycb/

Dimensions–COCI

https://osf.io/j7qte/

Web of Science–COCI

https://osf.io/mnwe7/

Appendix 2: Complete list of boxplots computed for this study

Subject category-level overlap data aggregated by 8 subject areas

Google Scholar–Microsoft Academic

https://osf.io/b94xp/

Google Scholar–Scopus

https://osf.io/rvbw9/

Google Scholar–Dimensions

https://osf.io/ubtrm/

Google Scholar–Web of Science

https://osf.io/7wb49/

Google Scholar–COCI

https://osf.io/7ekdr/

Microsoft Academic–Scopus

https://osf.io/jx7by/

Microsoft Academic–Dimensions

https://osf.io/x4257/

Microsoft Academic–Web of Science

https://osf.io/rdw7g/

Microsoft Academic–COCI

https://osf.io/f8a9e/

Scopus–Dimensions

https://osf.io/3a97k/

Scopus–Web of Science

https://osf.io/w4zv3/

Scopus–COCI

https://osf.io/jtnyu/

Dimensions–Web of Science

https://osf.io/gsjwm/

Dimensions–COCI

https://osf.io/sr4wu/

Web of Science - COCI

https://osf.io/6dkw4/

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Martín-Martín, A., Thelwall, M., Orduna-Malea, E. et al. Google Scholar, Microsoft Academic, Scopus, Dimensions, Web of Science, and OpenCitations’ COCI: a multidisciplinary comparison of coverage via citations. Scientometrics 126, 871–906 (2021). https://doi.org/10.1007/s11192-020-03690-4

Download citation

  • Received:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11192-020-03690-4

Keywords

Profiles

  1. Enrique Orduna-Malea