How has wikipedia become a more reliable source of information since its early years?

  • Journal List
  • Gigascience
  • v.8(12); 2019 Dec
  • PMC6889752

Gigascience. 2019 Dec; 8(12): giz139.

Abstract

Wikipedia is by far the largest online encyclopedia, and the number of errors it contains is on par with the professional sources even in specialized topics such as biology or medicine. Yet, the academic world is still treating it with great skepticism because of the types of inaccuracies present there, the widespread plagiarism from Wikipedia, and historic biases, as well as jealousy regarding the loss of the knowledge dissemination monopoly. This article argues that it is high time not only to acknowledge Wikipedia's quality but also to start actively promoting its use and development in academia.

Keywords: Wikipedia, academia, online encyclopedia, knowledge quality, free knowledge

Background

In 2005, Nature published a study describing Wikipedia as going “head to head” with Britannica [1]. While the claim was disputed by Britannica, since then Wikipedia has grown 6-fold in the number of articles; is >85 times the size of 120-volume Encyclopedia Britannica, measured by word count; and has substantially improved its quality.

Admittedly, standards of quality are shaped by peer-to-peer local language communities and vary widely among Wikipedia projects, and also between articles within languages [2]. Yet, the quality of Wikipedia articles is very high [3]. This is true even in many specialized topics, such as anatomy, biology, or medicine, where Wikipedia is as accurate as the professional sources [4–6], even though sometimes it does not score high on readability.

Yet, Wikipedia is still treated with suspicion by the professoriate and sneered at in academic circles [7]. This is especially disturbing, as academics are best positioned to shape Wikipedia [8], because of their expertise, as well as because of their access to students, who can improve Wikipedia for coursework under their supervision. Thus, it may be worthwhile to consider the reasons for scholars’ reluctance to openly use, recommend, and incorporate Wikipedia into coursework.

Main Text

Some of the reasons for these reservations may be legitimate. Although Wikipedia has a similar number of errors to professional and peer-reviewed sources [4–6], the types of inaccuracies on Wikipedia are different. They may involve replacing the content of an article with nonsense, or someone's name with a slur. There is no question that such vandalism damages the perception of the quality of Wikipedia as a whole. Still, Wikipedia takes vandalism seriously and constantly develops new methods of combating malicious edits, including, e.g., machine learning algorithms, as well as human patrolling. The sorts of vandalism that pass through may misinform the readers but are overall quite rare, especially in popular articles. More importantly, most vandalism is easily spotted and as such is harmful mainly to the image of Wikipedia as a trustworthy source, and does not actually misinform the readers.

Another reason for academia's dislike of Wikipedia may be its association with plagiarism. Students are notorious for copying from Wikipedia. However, this is clearly an unfortunate testimony to its quality and should not be held against Wikipedia, just as it should not be held against any other plagiarized academic resource. On a side note, Wikipedia has iron-clad copyright policies and treats plagiarism more seriously than regular media.

Some other reasons may be related to a historic bias, a perception of Wikipedia as not rigorous enough, or underestimation of the ability of amateurs to disseminate knowledge in a robust way. As scholars, we should be able to confront and eliminate such biases once we are presented with evidence, and many studies show that Wikipedia delivers high-quality output in practice, even if in theory it may seem impossible. Wikipedia simply is a living testament to Linus's Law: “given enough eyeballs, all bugs are shallow,” and the more edited articles are actually more accurate. It may be surprising and strange, but the results speak for themselves. Over time Wikipedia's quality has improved substantially, and yet it is still perceived in a static and dated way, as from the time of its inception.

Some professors dislike it when students cite Wikipedia. While no encyclopedia should be the only source in academic-level essays, it should be emphasized that our primary duty is to report and accurately refer to all sources that were actually used, with no exceptions. Academic honesty and transparency are crucial for scholarly work, and it is difficult to understand why citing specifically Wikipedia is taboo.

Yet, the most important reason for animosity towards Wikipedia may be that it challenges the existing institutional hierarchy of knowledge distribution and is much more successful in reaching the public than academic publications. We, the professors, were the only ones legitimized to disseminate academic knowledge. Now, we have to compete with a product of anonymous amateurs, which has a readership much wider than any of us could ever dream of. In fact, Wikipedia systematically compensates for the lack of credentials by heavy emphasis on reliable sources. It is a paradox: Wikipedia is one of the 10 most popular websites in the world according to TopSites, and by most measures it is the most widely read knowledge repository on Earth, but still it is often treated as not worth academic attention.

We need to change this. Writing a Wikipedia article is a perfect academic assignment for students. It requires finding reliable, verifiable sources, synthesizing their content, writing an encyclopedic entry: a true paragon of scholarly effort and transferable information literacy skills. Moreover, it makes the professor's life so much easier because a new article is often checked for plagiarism and commented on by members of the community. However, I believe there are even more important reasons for students and scholars to appreciate Wikipedia. Billions of people do not have access to free knowledge. We are the 1% in terms of knowledge access privilege; developing Wikipedia, the common good of humanity, is our moral obligation. The fact that Wikipedia development makes our coursework easier is only a nice bonus.

Conclusions

There are already initiatives in computational biology or genetics aimed at developing Wikipedia articles from these topics by scholars [9]. GeneWiki project, established to transfer information about relationships and functions of all human genes from scientific resources to Wikipedia, already contains 10,000 distinct gene pages, viewed >50 million times per year [10]. Nevertheless, Wikipedia development is not yet routinely considered as valuable in tenure reviews, and Wikipedia article writing is not yet a mainstream coursework assignment in colleges. It is high time to make that happen. In 2019 Wikipedia turned 18, so maybe academics should start treating it as an adult.

Competing interests

The author is a member of the Wikimedia Foundation Board of Trustees.

Author's information

D.J. is Professor and Head of the Management in Networked and Digital Societies (MINDS) department at Kozminski University, associate faculty at Berkman-Klein Center for Internet and Society at Harvard University, and fellow at MIT Center for Collective Intelligence. He serves on the Wikimedia Foundation Board of Trustees. In 2014 he published Common Knowledge? An Ethnography of Wikipedia (Stanford University Press).

Funding

Working on this article was possible thanks to grant No. PPN/BEK/2018/1/00009 from the Polish National Agency for Academic Exchange.

Supplementary Material

giz139_GIGA-D-19-00332_Original_Submission

giz139_GIGA-D-19-00332_Revision_1

giz139_Response_to_Reviewer_Comments_Original_Submission

giz139_Reviewer_1_Report_Original_Submission

Thomas Shafee -- 10/18/2019 Reviewed

giz139_Supplemental_Files

References

1. Giles J. Internet encyclopaedias go head to head. Nature. 2005;438:900–1. [PubMed] [Google Scholar]

2. Jemielniak D, Wilamowski M. Cultural diversity of quality of information on Wikipedias. J Assoc Inf Sci Technol. 2017;68:2460–70. [Google Scholar]

3. Michelucci P, Dickinson JL. The power of crowds. Science. 2016;351:32–3. [PubMed] [Google Scholar]

4. James R. WikiProject medicine: Creating credibility in consumer health. J Hosp Librariansh. 2016;16:344–51. [Google Scholar]

5. Mesgari M, Okoli C, Mehdi M, et al... “The sum of all human knowledge”: A systematic review of scholarly research on the content of Wikipedia. J Assoc Inf Sci Technol. 2015;66:219–45. [Google Scholar]

6. London DA, Andelman SM, Christiano AV, et al... Is Wikipedia a complete and accurate source for musculoskeletal anatomy?. Surg Radiol Anat. 2019;41(10):1187. [PubMed] [Google Scholar]

7. Jemielniak D, Aibar E. Bridging the gap between Wikipedia and academia. J Assoc Inf Sci Technol. 2016;67:1773–6. [Google Scholar]

8. Shafee T, Mietchen D, Su AI. Academics can help shape Wikipedia. Science. 2017;357:557–8. [PubMed] [Google Scholar]

9. Mietchen D, Wodak S, Wasik S, et al... Submit a topic page to PLOS Computational Biology and Wikipedia. PLoS Comput Biol. 2018;14:e1006137. [PMC free article] [PubMed] [Google Scholar]

10. Tsueng G, Good BM, Ping P, et al... Gene Wiki Reviews-Raising the quality and accessibility of information about the human genome. Gene. 2016;592:235–8. [PMC free article] [PubMed] [Google Scholar]


Articles from GigaScience are provided here courtesy of Oxford University Press


How does the Internet research differ from using a traditional library?

Libraries provide free access to scholarly books, journals, newspapers, encyclopedias, and other print reference sources. A lot of information on the Internet is FREE, except scholarly materials. A paid subscription is required to access.

How do experts suggest that you view the relationship between looking for research resources on the Internet and research at a library multiple choice question?

How do experts suggest that you view the relationship between looking for research resources on the Internet and research at a library? Internet sources supplement library research.

Which are major styles most commonly used by communication scholars for citing sources in a bibliography multiple select question?

Citing Your Sources MLA (Modern Language Association), Chicago, and APA (American Psychological Association) are three of the most commonly used citation styles, but there are hundreds.

Which of the following terms is used to describe a document that contains a list of all the books periodicals and other resources in the library?

BIBLIOGRAPHY: A list of sources. It may be published as a book, a journal article or pamphlet. A bibliography may include CITATIONS of books, journal articles, web sites, etc. TO BIND - (A periodical): To join several issues of a magazine or journal in one volume with a hard cover.