Skip to content

Intelligence tool for Security Document Retrieval and Filtering through LLMs

Notifications You must be signed in to change notification settings

andreitrcv/Intel-LLM

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

8 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Intel-LLM: (Cyber-)Security Document Retrieval and Filtering

Intel-LLM

Intel-LLM is a Python tool that identifies and download relevant files from the Internet that cover topics such as cyber threat intelligence, Advanced Persistent Threats (APT), military and government-backed operations, cybersecurity research, cyber warfare, and related topics. It uses Google's search engine and a (Large) Language Model (LLM) to filter files and find the most pertinent results based on user-defined queries.

  • The user can specify a target language or region to search for documents using government and military domains.
  • The tool fetches search results from Google based on the query.
  • The results are filtered and analyzed by an LLM, which ranks files by relevance to cyber threat intelligence and other specified topics.
  • The most relevant files are automatically downloaded for further analysis.


Installation:

Environment Setup: Ensure you have Python 3.x installed. Install the required packages using pip:

  • pip install requests googlesearch-python google-generativeai
    

API Key Configuration: Set up the environment variable for the Google Generative AI (LLM) API key:

  • export API_KEY="your-google-api-key"
    


Usage:

Run the Script: Start by running the main Python script:

  •  python Intel-LLM.py
    

Select a Country Query: You will be prompted to choose from a list of languages and country-specific queries (e.g., English, German, Russian, Chinese...).

image

Choose how many documents you want to filter. All of them will be given to the LLM and filtered for downloading.

Releases

No releases published

Packages

No packages published

Languages