Data Leverage References

← Back to browse

What's in the Box? An Analysis of Undesirable Content in the Common Crawl Corpus

2021 inproceedings luccioni2021box Confirmed 2025-12-30 (openalex)

All fields match the external database.

Authors
Alexandra Sasha Luccioni, Joseph D. Viviano
Venue
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 2: Short Papers)

Citations

Cited in projects (1)

BibTeX

Local Entry
@inproceedings{luccioni2021box,
  title = {What's in the Box? An Analysis of Undesirable Content in the Common Crawl Corpus},
  author = {Alexandra Sasha Luccioni and Joseph D. Viviano},
  year = {2021},
  booktitle = {Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 2: Short Papers)},
  doi = {10.18653/v1/2021.acl-short.24},
  url = {https://aclanthology.org/2021.acl-short.24/}
}
From AUTO:OPENALEX
@inproceedings{luccioni2021box,
  title = {What’s in the Box? An Analysis of Undesirable Content in the Common Crawl Corpus},
  author = {Alexandra Sasha Luccioni and Joseph D. Viviano},
  year = {2021},
  doi = {10.18653/v1/2021.acl-short.24}
}