A simple tool for extracting text from standard or password-protected PDFs. It prompts the user to enter the password securely, or to press Enter if the PDF has no password, and then writes the extracted text to a file or directory of the user's choice.
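The prompt-then-write flow described above can be sketched with the standard library alone. This is a minimal illustration, not the code in extractor.py: the helper names `prompt_password`, `resolve_output`, and `save_text` are hypothetical, and naming the file after the input PDF when the output is a directory is an assumed convention. The PDF extraction step itself is left out because it depends on whichever package requirements.txt installs.

```python
from getpass import getpass
from pathlib import Path
from typing import Callable, Optional


def prompt_password(ask: Callable[[str], str] = getpass) -> Optional[str]:
    """Ask for the PDF password without echoing it.

    Pressing Enter alone returns None, meaning "no password".
    `ask` is injectable so the prompt can be replaced in tests.
    """
    entered = ask("PDF password (press Enter if none): ")
    return entered or None


def resolve_output(input_pdf: Path, output: Path) -> Path:
    """Return the file the text should be written to.

    Assumption: when `output` is an existing directory, the text goes to
    <input file name>.txt inside it; otherwise `output` is used as given.
    """
    if output.is_dir():
        return output / (input_pdf.stem + ".txt")
    return output


def save_text(text: str, input_pdf: Path, output: Path) -> Path:
    """Write the extracted text as UTF-8 and return the path that was written."""
    target = resolve_output(input_pdf, output)
    target.parent.mkdir(parents=True, exist_ok=True)  # create missing folders
    target.write_text(text, encoding="utf-8")
    return target
```

Using `getpass` keeps the password out of the terminal scrollback and shell history, which is the usual reason to prompt for it rather than accept it as a command-line argument.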
- Python 3.10+
- Required Python packages listed in requirements.txt
- Create a Virtual Environment:
  python -m venv venv
- Activate the Virtual Environment:
  - On Windows:
    .\venv\Scripts\activate
  - On macOS/Linux:
    source venv/bin/activate
- Install Required Packages:
  pip install -r requirements.txt
- With directory paths:
  python extractor.py -i '.\path\to\file.pdf' -o '.\path\to\output.txt'
- With files in the current directory:
  python extractor.py -i '.\file.pdf' -o 'output.txt'
-i, --input: Path to the input PDF file.
-o, --output: Path to the output text file.
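For readers curious how two options like these are typically declared, here is a minimal `argparse` sketch. It is an illustration only: `build_parser` is a hypothetical name, and marking both options as required is an assumption rather than something extractor.py is known to do.

```python
import argparse
from pathlib import Path


def build_parser() -> argparse.ArgumentParser:
    """Build a parser exposing the -i/--input and -o/--output options."""
    parser = argparse.ArgumentParser(
        prog="extractor.py",
        description="Extract text from a PDF.",
    )
    # Assumption: both options are mandatory.
    parser.add_argument("-i", "--input", type=Path, required=True,
                        help="Path to the input PDF file.")
    parser.add_argument("-o", "--output", type=Path, required=True,
                        help="Path to the output text file.")
    return parser


# Parse an explicit argument list, equivalent to:
#   python extractor.py -i file.pdf -o output.txt
args = build_parser().parse_args(["-i", "file.pdf", "-o", "output.txt"])
```

Converting both values to `Path` at parse time lets the rest of the script handle Windows and POSIX paths the same way.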