Automatic Vandalism Detection in Wikipedia.

Bibliographic Details
Title: Automatic Vandalism Detection in Wikipedia.
Authors: Potthast, Martin, Stein, Benno, Gerling, Robert
Source: Advances in Information Retrieval: 30th European Conference on Ir Research, Ecir 2008, Glasgow, Uk, March 30-april 3, 2008. Proceedings; 2008, p663-668, 6p
Abstract: We present results of a new approach to detect destructive article revisions, so-called vandalism, in Wikipedia. Vandalism detection is a one-class classification problem, where vandalism edits are the target to be identified among all revisions. Interestingly, vandalism detection has not been addressed in the Information Retrieval literature by now. In this paper we discuss the characteristics of vandalism as humans recognize it and develop features to render vandalism detection as a machine learning task. We compiled a large number of vandalism edits in a corpus, which allows for the comparison of existing and new detection approaches. Using logistic regression we achieve 83% precision at 77% recall with our model. Compared to the rule-based methods that are currently applied in Wikipedia, our approach increases the F-Measure performance by 49% while being faster at the same time. [ABSTRACT FROM AUTHOR]
Copyright of Advances in Information Retrieval: 30th European Conference on Ir Research, Ecir 2008, Glasgow, Uk, March 30-april 3, 2008. Proceedings is the property of Springer Nature / Books and its content may not be copied or emailed to multiple sites or posted to a listserv without the copyright holder's express written permission. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
DOI: 10.1007/978-3-540-78646-7_75
Database: Complementary Index