Skip to content
Change the repository type filter

All

    Repositories list

    • Apify ESLint preset to be shared between projects
      JavaScript
      β€’
      Apache License 2.0
      β€’0β€’2β€’1β€’2β€’Updated Feb 3, 2025Feb 3, 2025
    • crawlee

      Public
      Crawleeβ€”A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Puppeteer, Playwright, Cheerio, JSDOM, and raw HTTP. Both headful and headless mode. With proxy rotation.
      TypeScript
      β€’
      Apache License 2.0
      β€’742β€’17kβ€’130β€’22β€’Updated Feb 3, 2025Feb 3, 2025
    • actor-cmd

      Public
      TypeScript
      β€’0β€’1β€’0β€’0β€’Updated Feb 3, 2025Feb 3, 2025
    • Crawleeβ€”A web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with BeautifulSoup, Playwright, and raw HTTP. Both headful and headless mode. With proxy rotation.
      Python
      β€’
      Apache License 2.0
      β€’343β€’5.2kβ€’81β€’14β€’Updated Feb 3, 2025Feb 3, 2025
    • apify-cli

      Public
      Apify command-line interface helps you create, develop, build and run Apify actors, and manage the Apify cloud platform.
      TypeScript
      β€’20β€’128β€’39β€’6β€’Updated Feb 3, 2025Feb 3, 2025
    • Browser fingerprinting tools for anonymizing your scrapers. Developed by Apify.
      TypeScript
      β€’
      Apache License 2.0
      β€’119β€’1.2kβ€’21β€’9β€’Updated Feb 3, 2025Feb 3, 2025
    • Apify API client for Python
      Python
      β€’
      Apache License 2.0
      β€’12β€’53β€’10β€’5β€’Updated Feb 3, 2025Feb 3, 2025
    • The Apify SDK for Python is the official library for creating Apify Actors in Python. It provides useful features like actor lifecycle management, local storage emulation, and actor event handling.
      Python
      β€’
      Apache License 2.0
      β€’10β€’122β€’13β€’2β€’Updated Feb 3, 2025Feb 3, 2025
    • Model Context Protocol (MCP) Client for Apify's Actors
      TypeScript
      β€’
      Apache License 2.0
      β€’0β€’12β€’0β€’1β€’Updated Feb 2, 2025Feb 2, 2025
    • Apify LangChain integration
      Apache License 2.0
      β€’0β€’1β€’0β€’1β€’Updated Feb 2, 2025Feb 2, 2025
    • This project is the home of Apify's documentation.
      API Blueprint
      β€’
      Apache License 2.0
      β€’81β€’32β€’76β€’19β€’Updated Feb 1, 2025Feb 1, 2025
    • Documentation site for the Actor Programming Model – a fresh take on serverless microapps. Built with Astro.
      MDX
      β€’
      MIT License
      β€’0β€’1β€’3β€’8β€’Updated Feb 1, 2025Feb 1, 2025
    • Apify SDK monorepo
      TypeScript
      β€’
      Apache License 2.0
      β€’41β€’129β€’11β€’9β€’Updated Feb 1, 2025Feb 1, 2025
    • workflows

      Public
      Apify's reusable github workflows
      Python
      β€’4β€’7β€’4β€’7β€’Updated Jan 31, 2025Jan 31, 2025
    • Model Context Protocol (MCP) Server for Apify's Actors
      TypeScript
      β€’
      Apache License 2.0
      β€’3β€’13β€’0β€’2β€’Updated Jan 31, 2025Jan 31, 2025
    • Utilities and constants shared across Apify projects.
      TypeScript
      β€’
      Apache License 2.0
      β€’11β€’12β€’5β€’1β€’Updated Jan 29, 2025Jan 29, 2025
    • impit

      Public
      impit | rust library for browser impersonation
      Rust
      β€’0β€’13β€’1β€’3β€’Updated Jan 29, 2025Jan 29, 2025
    • Node.js implementation of a proxy server (think Squid) with support for SSL, authentication and upstream proxy chaining.
      JavaScript
      β€’
      Apache License 2.0
      β€’147β€’875β€’7β€’11β€’Updated Jan 29, 2025Jan 29, 2025
    • JavaScript
      β€’1β€’0β€’0β€’1β€’Updated Jan 28, 2025Jan 28, 2025
    • rustls

      Public
      Patched fork of `ruslts` for `impit`
      Rust
      β€’
      Other
      β€’673β€’0β€’0β€’0β€’Updated Jan 28, 2025Jan 28, 2025
    • Apify API client for JavaScript / Node.js.
      TypeScript
      β€’
      Apache License 2.0
      β€’28β€’67β€’18β€’5β€’Updated Jan 28, 2025Jan 28, 2025
    • The /llms.txt Generator Actor πŸ•ΈοΈπŸ“„ extracts website content to create an llms.txt file for AI apps πŸ€–βœ¨ like LLM fine-tuning and indexing. Output is available πŸ“₯ in the Key-Value Store for easy download and integration into workflows. πŸš€
      Python
      β€’
      Apache License 2.0
      β€’1β€’3β€’1β€’1β€’Updated Jan 27, 2025Jan 27, 2025
    • Transfer data from Apify Actors to vector databases (Chroma, Milvus, Pinecone, PostgreSQL (PG-Vector), Qdrant, and Weaviate)
      Python
      β€’
      Apache License 2.0
      β€’4β€’6β€’2β€’0β€’Updated Jan 25, 2025Jan 25, 2025
    • This project is the 🏠 home of Apify Actor templates to help users quickly get started. Contributions welcome!
      Python
      β€’18β€’26β€’10β€’1β€’Updated Jan 23, 2025Jan 23, 2025
    • Apify's fork of `docusaurus-plugin-typedoc-api`, customized for our Python documentation.
      TypeScript
      β€’28β€’0β€’0β€’0β€’Updated Jan 22, 2025Jan 22, 2025
    • RAG Web Browser is an Apify Actor to feed your LLM applications and RAG pipelines with up-to-date text content scraped from the web.
      TypeScript
      β€’
      Apache License 2.0
      β€’5β€’24β€’4β€’0β€’Updated Jan 17, 2025Jan 17, 2025
    • A Homebrew tap for Apify tools
      Ruby
      β€’1β€’8β€’0β€’4β€’Updated Jan 16, 2025Jan 16, 2025
    • Base Docker images for Apify actors.
      Dockerfile
      β€’
      Apache License 2.0
      β€’24β€’72β€’9β€’4β€’Updated Jan 14, 2025Jan 14, 2025
    • h2

      Public
      Patched fork of h2 for impit
      Rust
      β€’
      MIT License
      β€’290β€’0β€’0β€’0β€’Updated Jan 14, 2025Jan 14, 2025
    • A GitHub Action to push an Actor the the Apify platform
      Apache License 2.0
      β€’0β€’15β€’0β€’0β€’Updated Jan 14, 2025Jan 14, 2025