    Repositories list

    • python-zyte-api

      Public
      Python client for Zyte API
      Python
      BSD 3-Clause "New" or "Revised" License
      62974 · Updated Apr 23, 2026
    • sphinx-markdown-builder

      Public
      Temporary fork of https://github.com/liran-funaro/sphinx-markdown-builder until #49 is merged and released.
      Python
      MIT License
      76000 · Updated Apr 22, 2026
    • extract-summit-contest-solutions

      Public
      Example solutions for the practice and contest websites of the code contest of Web Data Extraction Summit.
      Python
      MIT License
      2504 · Updated Apr 21, 2026
    • onefile

      Public
      Merge multiple files into one!
      HTML
      MIT License
      0103 · Updated Apr 21, 2026
    • Django Cloud Task Queue. Integrate your Django Application with Google Cloud Task from Google Cloud Platform
      Python
      MIT License
      4000 · Updated Apr 18, 2026
    • web-snap

      Public
      Create "perfect" snapshots of web pages
      JavaScript
      MIT License
      43402 · Updated Apr 10, 2026
    • sphinx-llms-txt

      Public
      Temporary fork of https://github.com/jdillard/sphinx-llms-txt until #69 is merged and released upstream.
      Python
      MIT License
      7000 · Updated Apr 2, 2026
    • zyte-common-items

      Public
      Contains the common item definitions used in Zyte.
      Python
      BSD 3-Clause "New" or "Revised" License
      111056 · Updated Mar 31, 2026
    • hbase-operator-tools

      Public
      Apache HBase Operator Tools
      Java
      Apache License 2.0
      150100 · Updated Mar 26, 2026
    • Spider templates for automatic crawlers.
      Python
      BSD 3-Clause "New" or "Revised" License
      434149 · Updated Mar 26, 2026
    • URL matching library that relates URLs with resources
      Python
      BSD 3-Clause "New" or "Revised" License
      2911 · Updated Mar 23, 2026
    • Python
      BSD 3-Clause "New" or "Revised" License
      112112 · Updated Mar 18, 2026
    • https://docs.zyte.com/web-scraping/tutorial/index.html
      Python
      BSD 3-Clause "New" or "Revised" License
      2600 · Updated Mar 18, 2026
    • x402

      Public
      A payments protocol for the internet. Built on HTTP.
      TypeScript
      Other
      1.5k1016 · Updated Feb 11, 2026
    • HTML
      MIT License
      31211 · Updated Oct 28, 2025
    • Remove DIVs, style stuff and normalize HTML preserving structure information
      Python
      MIT License
      21400 · Updated Oct 24, 2025
    • hetzner

      Public
      A high-level Python API for accessing the Hetzner robot.
      Python
      Other
      41000 · Updated Oct 9, 2025
    • html-text

      Public
      HTML
      MIT License
      11930 · Updated Oct 6, 2025
    • 0220 · Updated Sep 23, 2025
    • Python
      MIT License
      2571 · Updated Sep 5, 2025
    • Bash scripts to universally deploy various distributions
      Shell
      Other
      155200 · Updated Aug 4, 2025
    • Python
      0000 · Updated Jul 17, 2025
    • Websites for testing spiders
      Python
      MIT License
      0300 · Updated May 15, 2025
    • A stub implementation of a subset of Zyte API
      Python
      MIT License
      0200 · Updated Apr 22, 2025
    • Python
      0000 · Updated Mar 18, 2025
    • Contains rules for https://github.com/zytedata/duplicate-url-discarder.
      Python
      MIT License
      1000 · Updated Feb 5, 2025
    • http-parser

      Public archive
      Fork of 'https://github.com/benoitc/http-parser'
      C
      Other
      96000 · Updated Nov 14, 2024
    • Example site for web scraping tutorials
      Julia
      BSD 3-Clause "New" or "Revised" License
      173132 · Updated Oct 9, 2024
    • rrweb

      Public archive
      record and replay the web
      TypeScript
      MIT License
      1.7k003 · Updated Sep 14, 2024
    • geventhttpclient

      Public archive
      A high performance, concurrent http client library for python with gevent
      Python
      Other
      137001 · Updated Sep 3, 2024