[Bug]: Managed BQ read transform fails to attach config row #33864

Open
1 of 17 tasks
chamikaramj opened this issue Feb 5, 2025 · 5 comments

@chamikaramj
Contributor

What happened?

I'm trying out a Managed BQ read pipeline, and it seems to be failing to attach the config row, which prevents the Dataflow upgrade.

WARNING: Could not attach the config row for transform BigQueryIO.TypedRead: java.lang.ClassCastException: class [B cannot be cast to class java.lang.String ([B and java.lang.String are in module java.base of loader 'bootstrap')
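
For readers unfamiliar with the message: `[B` is the JVM's internal name for `byte[]`, so the warning says that a byte array was cast to `String` somewhere while the config row was being built. The snippet below is only a standalone sketch of that failure mode, not Beam code. The class `ConfigValueCastDemo` and both helpers are hypothetical names, and nothing here claims to show where the cast happens in Beam.

```java
import java.nio.charset.StandardCharsets;

public class ConfigValueCastDemo {

    // Hypothetical helper: accepts a value that may arrive as String or byte[]
    // and converts it to a String without a blind cast.
    static String asString(Object value) {
        if (value == null) {
            return null;
        }
        if (value instanceof String) {
            return (String) value;
        }
        if (value instanceof byte[]) {
            // Assumption for this sketch: the bytes are UTF-8 encoded text.
            return new String((byte[]) value, StandardCharsets.UTF_8);
        }
        throw new IllegalArgumentException(
            "Unsupported config value type: " + value.getClass().getName());
    }

    // Returns true when a blind (String) cast on the value throws
    // ClassCastException, which is the kind of failure the warning reports.
    static boolean blindCastFails(Object value) {
        try {
            String unused = (String) value;
            return false;
        } catch (ClassCastException e) {
            return true;
        }
    }

    public static void main(String[] args) {
        Object raw = "SELECT 1".getBytes(StandardCharsets.UTF_8);
        System.out.println(blindCastFails(raw)); // the byte[] cannot be cast to String
        System.out.println(asString(raw));       // type-checked conversion succeeds
    }
}
```

Checking the runtime type before casting, as `asString` does, is one common way to avoid this class of error when a value can arrive either as text or as raw bytes.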

The pipeline is given below.

    String table =
        String.format(
            "%s.%s.%s",
            options.as(GcpOptions.class).getProject(), BIG_QUERY_DATASET_ID, options.getTable());
    Pipeline pipeline = Pipeline.create(options);
    Map<String, Object> readConfig =
        ImmutableMap.of("query", String.format("SELECT * FROM `%s`", table));
    // read and validate
    PCollection<Row> outputRows =
        pipeline
            .apply(Managed.read(Managed.BIGQUERY).withConfig(readConfig))
            .getSinglePCollection();
    pipeline.run().waitUntilFinish();

Ahmed, any idea what's going on?

Issue Priority

Priority: 1 (data loss / total loss of function)

Issue Components

  • Component: Python SDK
  • Component: Java SDK
  • Component: Go SDK
  • Component: Typescript SDK
  • Component: IO connector
  • Component: Beam YAML
  • Component: Beam examples
  • Component: Beam playground
  • Component: Beam katas
  • Component: Website
  • Component: Infrastructure
  • Component: Spark Runner
  • Component: Flink Runner
  • Component: Samza Runner
  • Component: Twister2 Runner
  • Component: Hazelcast Jet Runner
  • Component: Google Cloud Dataflow Runner
@chamikaramj
Contributor Author

It would be good if we could get this bug fixed in the 2.63.0 release.

@chamikaramj
Contributor Author

cc: @jrmccluskey

@jrmccluskey
Contributor

Noted. Do you have an idea of how long this will take to resolve? If it's < 1 day, I can hold the branch cut until it's fixed; otherwise we can aim for a cherry-pick.

@chamikaramj
Contributor Author

Thanks! It should be fine to cut the branch and cherry-pick the fix.

@chamikaramj
Contributor Author

We determined that this warning and the Dataflow upgrade issue are unrelated, so we can just move this out of the release-blocking queue.

Thanks @jrmccluskey

@chamikaramj chamikaramj self-assigned this Feb 5, 2025
@chamikaramj chamikaramj added P2 and removed P1 labels Feb 5, 2025
@chamikaramj chamikaramj removed this from the 2.63.0 Release milestone Feb 5, 2025