Back
Deep Dive · 2w ago

Edit Wars: The Battle for Wikipedia Integrity

0:00 15:21
wikipediawikimedia-foundationinternet-culturecontent-creationarbitration-committee

Other episodes by Kitty Cat.

If you liked this, try these.

The full episode, in writing.

Wikipedia disputes start when volunteer editors disagree about article content, internal Wikipedia rules, or alleged misconduct. These disagreements often erupt into "edit wars," where editors repeatedly override each other’s changes instead of reaching a consensus. Edit wars became so common that the Wikipedia community instituted the "three revert rule" in 2004, which prohibits editors from making more than three reverts on a single page within 24 hours. Subsequent scholarship found that this rule reportedly cut the number of reverts in half, showing a measurable reduction in these visible conflicts.
One of the longest edit wars documented occurred in 2008 over the biography of Mustafa Kemal Atatürk, Turkey’s first president. This particular war involved 105 reverts by 20 different users, marking an unusually persistent and multi-participant conflict. Such intense edit wars often appear in articles relating to highly controversial topics. For example, articles on abortion or the Israeli–Palestinian conflict routinely attract entrenched opposing sides. However, edit wars can also develop over topics less obviously contentious, like the nationality of the artist Francis Bacon.
To analyze and identify these conflicts, researchers have developed platforms like Contropedia. This tool measures the level of controversy in articles by tracking patterns of reverts and mutual re-reverts over time. In 2012, Yasseri and colleagues introduced a pattern recognition algorithm for detecting disputes that avoided relying on language-based criteria. Their method enabled comparisons of controversial articles across different languages and cultures. By 2014, Yasseri’s team published a list of the most controversial articles in ten different language versions of Wikipedia, including Arabic, Hebrew, and Hungarian. The 2014 study identified "Israel," "Adolf Hitler," "The Holocaust," and "God" as the most hotly debated articles across those languages.
A 2021 study used discussion structure—specifically, the "ABA" pattern, where two editors comment back and forth before a third joins—to predict conflict-prone discussions with 80% accuracy. Other research found that the use of certain phrases and pronouns can indicate the level of politeness or willingness to collaborate, helping further refine models for spotting brewing disputes.
Disputes are not confined to content, but often center on whether articles or content should be deleted. English Wikipedia has had more than 400,000 Articles for Deletion (AfD) discussions since 2004. In 2017, Wikipedia put new restrictions on article creation, which resulted in a sharp decline in the number of AfD submissions. As of 2018, about 64% of AfD debates ended with the article being deleted, while 24% of discussions resulted in keeping the article. Nearly all these debates are closed by a Wikipedia administrator, signifying the importance of trusted veteran editors in dispute resolution.
Deletion debates differ considerably between language versions. In English Wikipedia, about 20% of AfD comments cite Wikipedia policy to justify a stance, while in German and Turkish Wikipedias it is less than 3%. The dominance of experienced editors is pronounced: out of over 160,000 users who participated in AfD discussions, more than half of all comments were posted by just 1,218 users. This concentration of influence by a small core of veteran editors has become more marked over time.
Deletion disputes are not confined to articles but also include categories and lists. In 2019, researchers Elijah Mayfield and Alan W. Black created a natural language processing model to forecast AfD outcomes. They found that the first comment in a deletion debate had a "herd effect," predicting outcomes over 20% better than baseline models.
Gatekeeping is a central factor in Wikipedia disputes. In a comparative study analyzing French and Spanish decolonization cases, more active editors experienced fewer deletions and formed rival camps, functioning as powerful gatekeepers. The same study found that editors who engaged more frequently on Talk pages—where disputes are supposed to be ironed out—had fewer of their edits deleted. Rival groups tended to delete content from opposing camps: for example, editors associated with France were more likely to delete contributions from Algeria, and vice versa.
Talk pages are the front line for many editorial disputes. Wikipedia policy encourages editors to seek consensus through discussion on these pages. However, these conversations often include impoliteness, condescension, and pointed criticism. A study of 120 Talk pages found that "Wikipedians do not prolong the conflicts" and that impolite comments often received no response at all, two-fifths of the time. When there was a response, 37% were defensive, such as explaining oneself or asking for more details, while 53.5% of responses to rudeness were offensive, escalating the conflict. Another study determined that personal attacks are reciprocated immediately in 26% of cases.
Editors use a variety of tactics to disagree, ranging from insults and derailing to counter-argument and refutation. Higher quality rebuttals—those that directly address points—correlate with more constructive outcomes. Other coordination tactics include asking questions, providing relevant information, suggesting compromises, or even admitting ignorance. Deferential wording, such as "by the way" or hedging statements, helps to reduce the intensity of disputes by signaling openness to compromise.
During editing disputes, Wikipedians tend to take on five conversational roles: architect, content expert, moderator, policy wonk, and wordsmith. Studies indicate that those who focus on the content—the expert and the wordsmith—are more likely to be successful in disputes than those who focus solely on rules or process. Ironically, when editors invoke Wikipedia policy in general content disputes, a behavior known as "wiki-lawyering," it often escalates conflict rather than resolving it. However, in deletion debates, citing policy such as "Notability" has been shown to help settle disputes more consistently.
Wikipedia has developed several formal mechanisms for dispute resolution over its history. For content disagreements, editors may request third party opinions or submit disputes to the Dispute Resolution Noticeboard (DRN). The DRN rarely results in a binding compromise; of 2,520 cases up to mid-2020, 237 were resolved, 149 failed, and 2,134—about 85%—were closed without a result. Another key mechanism is the Requests for Comment (RfC) system, where disputes are circulated to uninvolved editors via a bot and are meant to be resolved within 30 days. In a seven-year span, there were over 7,300 RfC discussions, but a significant number went stale due to either lack of attention or the overwhelming complexity of the issue.
For user conduct disputes, the Arbitration Committee—called ArbCom—serves as the highest authority. Created in 2003, ArbCom handles intractable disputes, especially those involving conduct issues among editors. Between 2004 and 2020, more than 500 complaints were submitted to ArbCom. Cases before the committee are highly structured, though the decision-making process is flexible and informal. ArbCom has been criticized for favoring parties who are more socially skilled or effective in navigating Wikipedia’s procedures.
Wikipedia disputes often spill over into the media, especially when internal conflicts become public. After the Wikimedia Foundation banned a user in 2019, the incident was widely covered, and 21 Wikipedia administrators resigned in protest, exposing deep rifts in the community. These episodes highlight how internal Wikipedia disputes can become public controversies, affecting both reputation and governance.
Not all disputes are detrimental; some scholars and Wikipedia leaders have argued that adversarial editing is an essential part of collaboration. A well-managed dispute can surface new information and lead to higher quality articles. For example, a 2017 laboratory experiment found that controversial topics on German Wikipedia actually attracted more editors, suggesting some users are motivated to participate precisely because of the chance to challenge opposing viewpoints.
Research into Wikipedia disputes peaked in 2012, with overall Wikipedia editing activity peaking in 2007. A 2023 review identified 217 academic articles about Wikipedia contributor goals, interactions, and collaboration. Of those, 34 focused on the causes and impact of conflict, conflict resolution mechanisms, and how to measure or predict disputes.
Wikipedia disputes have also been analyzed for their impact on the community. Prolonged or repeated conflicts are widely seen as a drain on volunteer energy and are associated with a competitive, conflict-based culture. Some studies have linked the prevalence of disputes with gender imbalances, noting that the editing culture may reflect conventional masculine roles and deter participation by women. Research has documented how conflict and impoliteness can harm personal identities, violate boundaries, and diminish the willingness of volunteers to contribute. Toxic comments have been shown to reduce the activity levels of volunteer editors.
Entrenched editor conflicts also detract from the quality and neutrality of Wikipedia articles. A 2020 study of 1,206 contentious articles found that editors form clear in-groups and out-groups, readily identifying friends—those who share their views—and enemies—those who do not. In a detailed analysis of 5,414 editor profiles, researchers found that editors who "won" edit wars were more likely to ban opponents, revert edits, remove links to competing perspectives, and invoke Wikipedia policy selectively. These editors also tended to control the cited references in contentious articles.
Certain topics are consistently contentious across languages and cultures. As mentioned earlier, "Israel," "Adolf Hitler," "The Holocaust," and "God" top the list in studies comparing ten different language versions of Wikipedia. Some disputes, however, are more pronounced in one language Wikipedia than another. Other studies have shown that the most reputable editors—those whose contributions tend to last—are less involved in disputes, and their reputation can persuade others to adopt their viewpoint.
In an analysis of disputes about post-colonial topics, such as Algeria versus France or Gran Colombia versus Spain, the editors from the more dominant groups suffered the fewest deletions of their writing. Editors from France and Gran Colombia were more likely to delete contributions from Algerian and Spanish editors, indicating how power dynamics play out in content moderation.
The focus of disputes can shift over time. In French Wikipedia, contested articles about the Shroud of Turin and Sigmund Freud evolved from disagreements about the topic to arguments about the reliability of sources and the epistemologies of the editors themselves. In the case of Freud, disagreements reflected competing scientific and philosophical perspectives. In the case of the Shroud of Turin, editors with religious commitments used argumentation as a way to keep alternative views in play, sometimes producing an endless chain of rebuttals.
Historically, Wikipedia’s first major internal dispute was an argument over advertising, involving Larry Sanger and a group of Spanish editors. This debate in 2002 led to a "fork," where the Spanish Wikipedia split off from the main site. The controversy over whether to display images from the Jyllands-Posten Muhammad cartoons in 2005–2006 is another early example of a dispute with wide-reaching impact. Another well-known internal dispute, the 2006 userbox controversy, was resolved in part by restricting certain templates to personal user pages, and in part by direct intervention from Wikipedia co-founder Jimmy Wales.
Over the years, Wikipedia has experimented with a range of dispute resolution mechanisms. Early on, editors could appeal to the Mediation Committee for binding decisions, but this process was discontinued in 2018 due to inactivity. Informal groups like the "Mediation Cabal" also provided mediation services. A 2010 study found that mediators could de-escalate disputes by editing Talk page discussions, clarifying ambiguous statements, distinguishing personal from substantive arguments, and helping editors frame their exchanges more constructively.
Wikipedia enforces a Universal Code of Conduct, designed to restrain egregious behaviors that sometimes arise during disputes. Algorithmic governance, often using bots, has become a significant part of enforcing Wikipedia policies during conflicts. Automated systems flag suspicious edits and enforce the core principles of Wikipedia, like "Neutral point of view" and "No original research."
A 2020 study found that editors frequently align themselves with like-minded peers, creating distinct camps over controversial articles. Discourse analysis revealed that contentious Wikipedia articles tend to divide editors into clear friend and enemy groups. This dynamic can be traced through deletion patterns, revert histories, and discourse on Talk pages.
A case study by David Moats followed the life cycle of disputes over the Fukushima nuclear disaster article. He observed that, in the early phase, editors focused on finding and evaluating reliable news sources. As the article evolved, disputes shifted towards disagreements about which sources should be included and how information should be framed.
Among the most active editors, reputation and influence play an outsized role. Editors with high social capital are more likely to prevail in disputes, ban opposing editors, and steer article content in their preferred direction. ArbCom decisions have been criticized for favoring contributors who are skilled at social navigation, rather than those necessarily providing the most accurate information.
On average, contentious topics not only foster more disputes, but also attract more editors. A study involving the German Wikipedia found that controversial articles drew higher engagement, even when the public debate on the topic was less intense. This suggests that the drive to resolve—or escalate—conflict is a key motivator for many contributors.
In an effort to facilitate disagreement without derailing collaboration, some researchers have investigated the effectiveness of specific rebuttal and coordination strategies. High-quality rebuttals, such as directly countering an argument, correlate with more constructive dispute outcomes. Deferential, hedging language reduces friction by signaling a willingness to compromise, while overt policy citations can either help or hinder depending on context.
A 2019 incident drew significant attention when internal debates over a user ban led to widespread media coverage and the resignation of 21 administrators from English Wikipedia. This episode underscored how internal conflicts can have far-reaching impacts, not just on Wikipedia’s governance but on its public legitimacy.
During a seven-year period, over 7,300 requests for comment were initiated, but many failed to reach consensus. In the majority of Dispute Resolution Noticeboard cases—over 85%—no resolution was reached. This highlights the limits of voluntary, consensus-driven processes in a large, decentralized community.
A 2023 literature review cataloged 217 studies about Wikipedia’s collaborative processes, 34 of which focused specifically on conflict causes, impact, and resolution mechanisms. The review found that attention to Wikipedia disputes peaked in 2012, paralleling a peak in overall editing activity in 2007.
According to a 2020 analysis, the single longest documented edit war on Wikipedia saw 105 reverts among 20 users, all fighting over the biography of Mustafa Kemal Atatürk.

Hear the full story.
Listen in PodCats.

The full episode, all the chapters, your own library — and a feed of voices worth following.

Download on theApp Store
Hear the full episode Open in PodCats