Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

Common Crawl Foundation

Team
non-profit
Verified
https://commoncrawl.org
commoncrawl
commoncrawl
Activity Feed

AI & ML interests

Crawled data and metadata

Recent Activity

malteos  updated a bucket 2 days ago
commoncrawl/commoncrawl
malteos  updated a bucket 5 days ago
commoncrawl/test-bucket
lfoppiano  updated a dataset 6 days ago
commoncrawl/statistics
View all activity

Thom Vaughan's profile picturePedro Ortiz Suarez's profile picturePaul Lazar's profile pictureGreg Lindahl's profile pictureFord H's profile pictureJen English's profile pictureSebastian Nagel's profile pictureLaurie Burchell's profile pictureHande Celikkanat's profile picturemalteos's profile pictureThijs Dalhuijsen's profile pictureLuca's profile pictureCatherine Arnett's profile picture

commoncrawl 's datasets 7

commoncrawl/statistics

Viewer • Updated 6 days ago • 626k • 414 • 26

commoncrawl/citations

Viewer • Updated Apr 2 • 9.18k • 77 • 2

commoncrawl/CommonLID

Viewer • Updated Feb 10 • 373k • 173 • 52

commoncrawl/gneissweb-annotation-host-testing-v1

Viewer • Updated Dec 11, 2025 • 617M • 103

commoncrawl/gneissweb-annotation-url-testing-v1

Viewer • Updated Dec 10, 2025 • 11.5B • 77

commoncrawl/host-index-testing-v2

Preview • Updated Nov 10, 2025 • 1.52k

commoncrawl/eot2024_hostlevel_logs

Viewer • Updated Oct 9, 2024 • 271k • 5 • 1
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs