feat(Google BigQuery Node): Add parameterized query support #14302
+82
−0
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Summary
This PR adds a basic support for BigQuery's Parameterized Queries feature. This important feature helps prevent SQL injections that are otherwise widespread in n8n workflows due to wide use of string concatenation.
The implementation is a stripped down subset of what it could be if implemented correctly, and should be treated as a PoC. I'm more than happy to implement a few changes along the way! I've been using the parameterized query implementation from the PostgreSQL node as a reference.
This is my first PR, so I left out documentation and tests until the general direction of implementation is confirmed.
Named parameters vs positional vs both
For simplicity I decided to only support named parameters. It won't be hard to support both modes, we just need to decide on the UX. Ideally, I'd avoid the solution that the PostgreSQL node uses:
Having to pre-process the input to strip down commas, sanitize and extract the values defeats some of the security benefits that parameterized queries are designed to add in the first place.
Configuration visibility
I decided to hide the named parameters into the
option
dropdown. This is based on an assumption that not a lot of users would care enough about this feature to bring it to the top level, and this would work have better backwards compatibility (existing nodes won't show the new options unless the user explicitly enables them).The feature is incompatible with Legacy SQL, so the new inputs will only show up when the "Legacy SQL" toggle is switched off. This can be confusing for people who still use Legacy SQL and would discourage them from using a better secure default, so I would rather look for a different UX solution for disabling this option in legacy mode.
Advanced type descriptors
The BigQuery API enforces strict typing of parameters in parameterized queries. Full support of the BQ type system in this feature is out of question, as this would result in a complex UX. It would be a stretch to support all the various ways the types can be defined, so I decided to only supported the "STRING" type.
This can be improved in a few ways:
Using @google-cloud/bigquery package
If we switch this node to the official JS SDK from Google, we should be able to use the getTypeDescriptorFromValue helper at runtime, which would infer BQ type descriptors of any valid JS value automatically, allowing for complex javascript values.
Manual type definitions
Another, less user-friendly option, would be to allow users to specify the types manually by providing a JSON describing the type:
SQL Editor autocompletion
Unfortunately, this feature doesn't play well with the SQL editor. It doesn't break the highlighting of the queries, but autocompletion doesn't know about the parameterized value placeholders such as
@name
. I don't think it's a major blocker, but we could look into it further.Related Linear tickets, Github issues, and Community forum posts
Review / Merge checklist
release/backport
(if the PR is an urgent fix that needs to be backported)