We invite you to explore our blog for more details.
Before building the application, you’ll need to set up a basic Python environment. Follow these steps to get started:
- Install Python 3 on your system.
- Install the required dependencies by running:
python -m pip install -r requirements.txt
- To make the webhook publicly accessible to Crawlbase servers for demonstration purposes, install and configure ngrok.
- Sign up for a Crawlbase account and log in.
- Upon registration, you’ll receive 5,000 free requests to get started.
- Navigate to your Account Docs and copy your Crawling API token (Normal or JavaScript requests).
- Create a new Crawler to start configuring your crawl tasks.
Before running the examples, ensure that you replace all instances of the following placeholders:
<Normal or Javascript requests token>
- Replace this with your Crawling API requests token.<Crawler name>
- Replace this with the name of your newly created crawler. You can create or view it here.
- Start the ngrok tunnel:
ngrok http 5768
- Set the callback URL:
Copy the forwarding URL provided by ngrok and paste it into the Callback URL field of your Crawler settings.
Example:
https://xxxx-xxx-xxx-xxx-xx.ngrok-free.app/webhook
- Run the Webhook HTTP server:
python webhook_http_server.py
- Send a crawl request (in a separate terminal):
python crawl.py
🛡 Disclaimer This repository is for educational purposes only. Please make sure you comply with the Terms of Service of any website you scrape. Use this responsibly and only where permitted.
Copyright 2025 Crawlbase