Skip to content

Example Jupyter notebooks and documentation for the xReverseLabs Domain Dataset, part of the AWS Open Data Sponsorship Program.

Notifications You must be signed in to change notification settings

xReverseLabs/open-data-examples

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

3 Commits
Β 
Β 
Β 
Β 

Repository files navigation

Open Data Examples – xReverseLabs

This repository contains example notebooks and tutorials for working with the xReverseLabs Domain Dataset, which is openly available through the AWS Open Data Sponsorship Program.

πŸ“– Dataset Overview

The xReverseLabs Domain Dataset provides multiple collections of domain and DNS-related data:

  1. Daily Domain Dump Dataset

    • Plain text files (YYYY-MM-DD_domains.txt)
    • Each line is a domain, updated daily
  2. Domain By Date Full Data (Historical)

    • Daily new/unique domains (all_new_domains_YYYY-MM-DD.txt)
  3. Domain By Extension Dataset

    • Domains grouped by TLD/extension (com.txt.gz, org.txt.gz, etc.)
  4. Forward DNS (FDNS) Dataset

    • JSON records of DNS resolution results (status, timestamp, IP addresses)

πŸ“š Full dataset documentation: https://opendata.xreverselabs.org/about.php


πŸš€ Tutorials

▢️ Get To Know A Dataset

Notebook: xreverselabs/get-to-know-a-dataset.ipynb

This notebook demonstrates how to:

  • Access the dataset from AWS S3
  • Load daily domain dump files
  • Analyze new domains by date
  • Explore TLD-specific data
  • Parse Forward DNS (FDNS) JSON records

πŸ› οΈ Requirements

Install dependencies with:

pip install boto3 pandas
Optional (for FDNS exploration and compressed TLD files):

pip install gzip json

πŸ“œ License This repository and example notebooks are provided under the MIT License. The dataset itself is distributed under CC-BY 4.0 license.

πŸ“¬ Contact For questions about the dataset or tutorials:

Website: https://opendata.xreverselabs.org

Email: opendata@xreverselabs.org

About

Example Jupyter notebooks and documentation for the xReverseLabs Domain Dataset, part of the AWS Open Data Sponsorship Program.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published