Skip to content
@internetarchive

Internet Archive

The Internet Archive is "the library of the Internet", and a big supporter of Free Software.

Pinned Loading

  1. openlibrary openlibrary Public

    One webpage for every book ever published!

    Python 5.3k 1.4k

  2. bookreader bookreader Public

    The Internet Archive BookReader

    JavaScript 1k 421

  3. heritrix3 heritrix3 Public

    Heritrix is the Internet Archive's open-source, extensible, web-scale, archival-quality web crawler project.

    Java 2.9k 762

  4. cicd cicd Public

    build & test using github registry; deploy to nomad clusters

    14

Repositories

Showing 10 of 248 repositories
  • openlibrary Public

    One webpage for every book ever published!

    internetarchive/openlibrary’s past year of commit activity
    Python 5,284 AGPL-3.0 1,418 812 (30 issues need help) 166 Updated Dec 22, 2024
  • iaux-notification-toast Public

    displays notifications and automatically clears them

    internetarchive/iaux-notification-toast’s past year of commit activity
    TypeScript 0 AGPL-3.0 0 1 11 Updated Dec 22, 2024
  • Zeno Public

    State-of-the-art web crawler 🔱

    internetarchive/Zeno’s past year of commit activity
    HTML 84 AGPL-3.0 12 27 (5 issues need help) 4 Updated Dec 21, 2024
  • iaux-donation-form Public

    The Internet Archive Donation Form

    internetarchive/iaux-donation-form’s past year of commit activity
    TypeScript 4 0 0 31 Updated Dec 21, 2024
  • rclone Public Forked from rclone/rclone

    [vault fork] of "rsync for cloud storage" - Google Drive, S3, Dropbox, Backblaze B2, One Drive, Swift, Hubic, Wasabi, Google Cloud Storage, Yandex Files

    internetarchive/rclone’s past year of commit activity
    Go 2 MIT 4,419 0 0 Updated Dec 20, 2024
  • esbuild_es5 Public

    minify JS/TS files using `esbuild` and `swc` down to ES5 (uses `deno`)

    internetarchive/esbuild_es5’s past year of commit activity
    TypeScript 6 AGPL-3.0 0 0 1 Updated Dec 20, 2024
  • heritrix3 Public

    Heritrix is the Internet Archive's open-source, extensible, web-scale, archival-quality web crawler project.

    internetarchive/heritrix3’s past year of commit activity
    Java 2,860 762 35 5 Updated Dec 20, 2024
  • internetarchive/internetarchivebot’s past year of commit activity
    PHP 129 AGPL-3.0 35 0 2 Updated Dec 19, 2024
  • ads-common Public

    Common components and utilities for the Archiving & Data Services (ADS) team at the Internet Archive

    internetarchive/ads-common’s past year of commit activity
    TypeScript 2 MIT 0 0 0 Updated Dec 19, 2024
  • bookreader Public

    The Internet Archive BookReader

    internetarchive/bookreader’s past year of commit activity
    JavaScript 1,003 AGPL-3.0 421 134 (3 issues need help) 87 Updated Dec 18, 2024