-
there must be a better way ?
|
Beta Was this translation helpful? Give feedback.
Replies: 9 comments 6 replies
-
Perhaps this? TABLE_NAMES = ["lineitem", "nation", "region", "customer", "supplier", "orders", "partsupp", "part"]
table_dfs = {
name: daft.read_parquet(f"{path}/{sf}/{name}/*.parquet") for name in TABLE_NAMES
} |
Beta Was this translation helpful? Give feedback.
-
that does not help much
I am getting Table not found: lineitem |
Beta Was this translation helpful? Give feedback.
-
in polars as an example, I can just do this
|
Beta Was this translation helpful? Give feedback.
-
Oh! I see what you're trying to do now. @universalmind303 can probably advise better, but you might want to use our SQLCatalog abstraction here. from daft.sql import SQLCatalog
TABLE_NAMES = ["lineitem", "nation", "region", "customer", "supplier", "orders", "partsupp", "part"]
table_dfs = {
name: daft.read_parquet(f"{path}/{sf}/{name}/*.parquet") for name in TABLE_NAMES
}
catalog = SQLCatalog(table_dfs)
daft.sql("SELECT * FROM lineitem", catalog=catalog) Do bear with us as we work on a better catalog API. We have some proposals in the works here for unifying the story around iceberg/delta/HMS etc. |
Beta Was this translation helpful? Give feedback.
-
please can we have this
|
Beta Was this translation helpful? Give feedback.
-
that's perfect !!! |
Beta Was this translation helpful? Give feedback.
-
can we register iceberg catalog please
|
Beta Was this translation helpful? Give feedback.
-
@jaychia maybe just start with register_iceberg_rest first then figure out HMS later, the combo daft/polaris is very interesting actually |
Beta Was this translation helpful? Give feedback.
-
It is !! I am in the next stage 😛 |
Beta Was this translation helpful? Give feedback.
I'm trying to move towards #3036 which is more like:
This also opens up opportunities for something like
Which lets us register entire catalog services to start reading tables from