Name	Name	Last commit message	Last commit date
Latest commit ManiMozaffar Fix compatibility to support python >=3.9 (#1 ) Dec 2, 2023 96f95ae · Dec 2, 2023 History 3 Commits
.github	.github	Fix compatibility to support python >=3.9 (#1 )	Dec 2, 2023
cfcrawler	cfcrawler	Fix compatibility to support python >=3.9 (#1 )	Dec 2, 2023
docs	docs	🎉 Add project boilerplate	Nov 29, 2023
tests	tests	🎉 Add project boilerplate	Nov 29, 2023
.gitignore	.gitignore	🎉 Add project boilerplate	Nov 29, 2023
.pre-commit-config.yaml	.pre-commit-config.yaml	🎉 Add project boilerplate	Nov 29, 2023
CONTRIBUTING.rst	CONTRIBUTING.rst	🎉 Add project boilerplate	Nov 29, 2023
LICENSE	LICENSE	🎉 Add project boilerplate	Nov 29, 2023
Makefile	Makefile	🎉 Add project boilerplate	Nov 29, 2023
README.md	README.md	📝 Update docs & CI	Nov 29, 2023
codecov.yaml	codecov.yaml	🎉 Add project boilerplate	Nov 29, 2023
mkdocs.yml	mkdocs.yml	🎉 Add project boilerplate	Nov 29, 2023
poetry.lock	poetry.lock	Fix compatibility to support python >=3.9 (#1 )	Dec 2, 2023
poetry.toml	poetry.toml	🎉 Add project boilerplate	Nov 29, 2023
pyproject.toml	pyproject.toml	Fix compatibility to support python >=3.9 (#1 )	Dec 2, 2023
tox.ini	tox.ini	Fix compatibility to support python >=3.9 (#1 )	Dec 2, 2023

Repository files navigation

cfcrawler

Cloudflare scraper and cralwer written in Async, In-place library for HTTPX. Crawl website that has cloudflare enabled, easier than ever!

To use library, simply replace your aiohttp client with ours!

from cfcrawler import AsyncClient

async def get(url):
    client = AsyncClient()
    await client.get(url)

You can also rotate user agents

from cfcrawler import AsyncClient

client = AsyncClient()
client.rotate_useragent()

You can also specify which browser you want to use

from cfcrawler.types import Browser
from cfcrawler import AsyncClient

AsyncClient(browser=Browser.CHROME)

You can also use asyncer to syncify the implementation

from cfcrawler import AsyncClient
from asyncer import syncify

def get(url):
    client = AsyncClient()
    syncify(client.get)(url)

I'll work on this library in few months, I don't have free time right now, but feel free to contribute. I'll check and test the PRs myself!