Skip to content

Redscrap is a simple web scraper for various local job boards, designed to collect job postings efficiently.

License

Notifications You must be signed in to change notification settings

hurby24/redscrap

Repository files navigation

Redscrap

Redscrap is a simple web scraper for various local job boards, designed to collect job postings efficiently.

Features

  • Scrapes job postings from multiple local job boards
  • Supports database storage for structured data
  • Uses proxy support for better reliability

Installation

Prerequisites

  • Node.js (Recommended: LTS version)
  • pnpm (Install with npm install -g pnpm)
  • A PostgreSQL database

Clone the Repository

git clone https://github.com/yourusername/redscrap.git
cd redscrap

Install Dependencies

pnpm install

Configuration

Before running Redscrap, you need to configure environment variables.

  1. Create a .env file in the project root:

    touch .env
  2. Add the following variables to .env:

    DATABASE_URL=your_postgres_connection_string
    PROXY_URLS="proxy1,proxy2,proxy3"  # Comma-separated list of proxies
    

Running Locally

Once configured, start the scraper with:

pnpm run start

License

This project is licensed under MIT.

About

Redscrap is a simple web scraper for various local job boards, designed to collect job postings efficiently.

Resources

License

Stars

Watchers

Forks

Releases

No releases published