Skip to content

chumphrey-cmd/PDF-Extractor

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Simple PDF Text Extractor

Overview

Simple tool designed to extract text from standard or password-protected PDFs. Prompts the user to input the password securely or press Enter if there is no password. It then outputs the extracted text into a specified file or directory of the users choice.

Prerequisites

  • Python 3.10+
  • Required Python packages listed in requirements.txt

Installation

  1. Create a Virtual Environment:

    python -m venv venv
  2. Activate the Virtual Environment:

    • On Windows:

      .\venv\Scripts\activate
    • On macOS/Linux:

      source venv/bin/activate
  3. Install Required Packages:

    pip install -r requirements.txt

Usage

  • With directory
python extractor.py -i '.\path\to\file.pdf' -o '.\path\to\output.txt'
  • Without directory
python extractor.py -i '.\file.pdf' -o 'output.txt'

Command Line Arguments

  • -i, --input: Path to the input PDF file.
  • -o, --output: Path to the output text file.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages