You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
whereas in 1.12.1 it fails. From the description of #1113 it seems this omission was intentional, but I don't see why it's necessary to remove support for this (extremely fundamental and ubiquitous) type of partitioning: the old special casing for DATE seems like it would apply just as well here and IMO should be restored. Same goes for the change #1116 which was similarly not addressed.
The text was updated successfully, but these errors were encountered:
@bnaul: apologize for the delay and any confusion.
I suspect that I fell afoul of a discrepancy in the "documentation" and ended up going with the most restrictive. Let me talk to some folks to figure what is supposed to be the approved and supported API behavior.
Context
I may not have fully recognized the discrepancy when I originally approved PR #1074.
I got more insight when I was rooting through all the internals to enable the follow-on PR ##1113
What discrepancy? We are provided different guidance in the error message returned as a DatabaseError (which implies DATE_TRUNC should only handle month/year) and the guidance found in the online BigQuery Documentation. See below (which implies DATE_TRUNC can handle 'day', 'month', 'year'):
DatabaseError
DatabaseError: (google.cloud.bigquery.dbapi.exceptions.DatabaseError) 400
PARTITION BY expression must be
_PARTITIONDATE,
DATE(_PARTITIONTIME),
DATE(<timestamp_column>),
DATE(<datetime_column>),
DATETIME_TRUNC(<datetime_column>, DAY/HOUR/MONTH/YEAR),
a DATE column,
TIMESTAMP_TRUNC(<timestamp_column>, DAY/HOUR/MONTH/YEAR),
DATE_TRUNC(<date_column>, MONTH/YEAR),
or RANGE_BUCKET(<int64_column>, GENERATE_ARRAY(<int64_value>, <int64_value>[, <int64_value>]))
Time-unit column partitioning
You can partition a table on a DATE,TIMESTAMP, or DATETIME column in the table. When you write data to the table, BigQuery automatically puts the data into the correct partition, based on the values in the column.
For TIMESTAMP and DATETIME columns, the partitions can have either hourly, daily, monthly, or yearly granularity. For DATE columns, the partitions can have daily, monthly, or yearly granularity. Partitions boundaries are based on UTC time.
It appears my fix from #1074 was undone by #1113. In 1.12.0 the following works:
whereas in 1.12.1 it fails. From the description of #1113 it seems this omission was intentional, but I don't see why it's necessary to remove support for this (extremely fundamental and ubiquitous) type of partitioning: the old special casing for DATE seems like it would apply just as well here and IMO should be restored. Same goes for the change #1116 which was similarly not addressed.
The text was updated successfully, but these errors were encountered: