What's in the Box? An Analysis of Undesirable Content in the Common Crawl Corpus
Authors
Alexandra Sasha Luccioni, Joseph D. Viviano
Venue
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 2: Short Papers)