scrapinghub/shub-workflow


A set of tools for controlling processing workflows with spiders and scripts running in Scrapinghub ScrapyCloud.

Installation

pip install shub-workflow

If you want S3 tools support:

pip install shub-workflow[with-s3-tools]

For Google Cloud Storage tools support:

pip install shub-workflow[with-gcs-tools]

Usage

Check the project Wiki for documentation. The code tests also provide many usage examples.
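To give a flavor of what "controlling a processing workflow" means, here is a minimal, self-contained sketch of the general idea: jobs (spiders or scripts) declared with dependencies and executed in dependency order. This is NOT shub-workflow's actual API; the Task and WorkflowManager names below are hypothetical illustrations only. See the Wiki for the real interfaces.

```python
from collections import deque

# Hypothetical names for illustration; shub-workflow's real API differs.
class Task:
    def __init__(self, name, depends_on=()):
        self.name = name
        self.depends_on = tuple(depends_on)  # names of tasks that must finish first

class WorkflowManager:
    def __init__(self, tasks):
        self.tasks = {t.name: t for t in tasks}

    def run_order(self):
        """Return task names in a valid execution order (topological sort)."""
        indegree = {name: len(t.depends_on) for name, t in self.tasks.items()}
        dependents = {name: [] for name in self.tasks}
        for t in self.tasks.values():
            for dep in t.depends_on:
                dependents[dep].append(t.name)
        ready = deque(sorted(n for n, d in indegree.items() if d == 0))
        order = []
        while ready:
            name = ready.popleft()
            order.append(name)
            for child in dependents[name]:
                indegree[child] -= 1
                if indegree[child] == 0:
                    ready.append(child)
        if len(order) != len(self.tasks):
            raise ValueError("dependency cycle detected")
        return order

manager = WorkflowManager([
    Task("crawl"),
    Task("deliver", depends_on=["crawl"]),
    Task("clean", depends_on=["deliver"]),
])
print(manager.run_order())  # → ['crawl', 'deliver', 'clean']
```

In the real library, each task would correspond to a ScrapyCloud spider or script job rather than a local function call.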

Note

The requirements for this library are defined in setup.py, as usual. The Pipfile files in the repository don't define dependencies; they are only used to set up a development environment for shub-workflow library development and testing.

For developers

To set up a development environment for shub-workflow, the package comes with Pipfile and Pipfile.lock files. Clone or fork the repository and run:

> pipenv install --dev
> cp pre-commit .git/hooks/

to install the environment, and:

> pipenv shell

to activate it.

There is a script, lint.sh, that you can run from the repo root folder whenever needed; it is also executed on each git commit (provided you installed the pre-commit hook during the setup step described above). It checks PEP 8 compliance and typing integrity via flake8 and mypy.

> ./lint.sh