NikolaosPapailiou commented Jul 15, 2024

FIRST_N sampling can cause search-quality and ingestion partition-balancing problems for users who are experimenting with TileDB vector search and are not aware that it is in use.

FIRST_N can give some ingestion performance boost, but it is better for users to opt into it explicitly than to run into hidden quality issues.
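
To illustrate the concern, here is a minimal sketch in plain Python. It does not use the TileDB vector search API. The dataset layout, the `group_coverage` helper, and the sample size are all hypothetical, and it assumes the input is stored in a non-random order (here, group by group). Under that assumption, taking the first N items only ever sees the first group, while a random sample of the same size draws from the whole dataset.

```python
import random


def first_n_sample(data, n):
    """Take the first n items in storage order (FIRST_N-style sampling)."""
    return data[:n]


def random_sample(data, n, seed=0):
    """Take n items uniformly at random without replacement."""
    rng = random.Random(seed)
    return rng.sample(data, n)


def group_coverage(sample):
    """Count how many distinct groups appear in a sample (hypothetical helper)."""
    return len({group for group, _ in sample})


# Hypothetical dataset: 10 groups of 100 items each, stored group by group.
data = [(group, item) for group in range(10) for item in range(100)]

first = first_n_sample(data, 100)
rand = random_sample(data, 100)

print("groups seen by first-N sample:", group_coverage(first))
print("groups seen by random sample:", group_coverage(rand))
```

If a sample like this were used to choose partitions, the first-N sample would describe only one region of the data, which is one way partitions could end up unbalanced. Reading the first N items is cheaper than sampling across the input, which matches the ingestion performance trade-off described above.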

sc-50492
