Filtering runtime comparisons of Seurat added (figure coming sooon)
Filtering memory comparisons for AnnSQL, AnnData, Seurat
Benchmark dataset generation using Splatter added for sparser filtering runtimes and memory profiles.

Considerations

Importing h5ad files with columns > 30k. This is an issue related to the db engine we're working to mitigate. The current work around to use the make buffer file parameter in the MakeDb class.
PCA runtime is slow; however, is memory efficient for larger datasets. We currently do not have plans to optimize this as we consider it to be highly experiment functionality. Currently, no PCA implementations exist using SQL and this is a hybrid SQL/Python approach. Additionally, the PCA method is resource intensive and will use all threads available to the system. We will release an update which limits thread usage in the near future.
Differential expression is memory respectful and implemented as a ttest in SQL, however, there may be slower performance when comparing to other methods. This is due to the lack of matrix operation support in SQL. It will run though with limited resources, where other packages may fail.

Assets 2

14 Feb 20:28

kennypavan

v0.9.9.1

01e51f4

v0.9.9.1

Extended functionality added:

Filter by cell counts
Filter by gene counts
Save expression to raw layer
Raw layer to main layer
Save highly variable genes to main layer (X)
Impose memory limits when instantiation of AnnSQL class
PCA (highly experimental)

Analysis Benchmarks added

Filtering runtime comparisons of Seurat added (figure coming sooon)
Filtering memory comparisons for AnnSQL, AnnData, Seurat
Benchmark dataset generation using Splatter added for sparser filtering runtimes and memory profiles.

Known Issues

Importing h5ad files with columns > 30k. We are addressing this issue in the next release
PCA runtime is slow; however, is memory efficient for larger datasets. We currently do not have plans to optimize this as we consider it to be highly experiment functionality. Currently, no PCA implementations exist using SQL and this is a hybrid SQL/Python approach. Additionally, the PCA method is resource intensive and will use all threads available to the system. We will release an update which limits thread usage in the near future.

Forward Functionality We will be developing extended functionality for the following below. These methods will allow users to complete a very basic full preprocessing single-cell/nuclei workflow.

Nearest neighbors
Leiden clustering
Umap
Differential expression

Assets 2

02 Nov 20:23

kennypavan

v0.9.8

1bd3353

v0.9.8

dependency updates

Assets 2

02 Nov 20:02

kennypavan

v0.9.6

243a370

v0.9.6

version v0.9.6 release

Assets 2

02 Nov 19:44

kennypavan

v0.9.5

bee19bc

v0.9.5

Readme updates

Assets 2

01 Nov 23:16

kennypavan

v0.9.4

8db1c96

v0.9.4 Pre-release

Pre-release

typo fixed

Assets 2

01 Nov 15:41

kennypavan

v0.9.3

17bf045

v0.9.3 Pre-release

Pre-release

Code cleanup

Assets 2

30 Oct 21:10

kennypavan

v0.9.2

9ff2e68

v0.9.2 Pre-release

Pre-release

Added chunk size parameter to MakeDb for backed mode.

Assets 2

29 Oct 22:30

kennypavan

v0.9.1

9faa85d

v0.9.1 Pre-release

Pre-release

Package updated

Assets 2

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Major release

Full Preprocessing example:

AnnSQL & MakeDb API

Extended functionality added:

Analysis Benchmarks added

Considerations

Releases: ArpiarSaundersLab/annsql

v1.0.1

v1.0.0

Major release

Full Preprocessing example:

AnnSQL & MakeDb API

Extended functionality added:

Analysis Benchmarks added

Considerations

v0.9.9.1

v0.9.8

v0.9.6

v0.9.5

v0.9.4

v0.9.3

v0.9.2

v0.9.1