Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

50395 2 2 #1

Open
wants to merge 42 commits into
base: temp-pr-target
Choose a base branch
from
Open
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
42 commits
Select commit Hold shift + click to select a range
4cdc87c
Implement https://github.com/airbytehq/airbyte/issues/50395 discussed…
rpopov Jan 13, 2025
92bb031
Polish
rpopov Jan 18, 2025
7b6ce9b
Merge branch 'main' into 50395-2
rpopov Jan 18, 2025
71e3c70
Merged from main, updated dependencies
Jan 21, 2025
8ff298e
Merge branch 'main' into 50395-2-2
Jan 21, 2025
071a839
Created a DPath Enhancing Extractor
Jan 21, 2025
2aa3a52
Working on tests
Jan 21, 2025
3d4ffe6
Split the tests of DPathExtractor and DPathEnhancingExtractor
Jan 22, 2025
2381916
poetry run ruff check --fix
Jan 22, 2025
c986bdc
Merge remote-tracking branch 'remotes/airbyte/main' into 50395-2-2
Jan 23, 2025
241cdab
Merge branch 'airbytehq:main' into 50395-2-2
rpopov Jan 23, 2025
145a43a
Added instructions on activating the proper env.
rpopov Jan 24, 2025
542e194
Removed temporary files
rpopov Jan 24, 2025
55e160b
Merge branch 'main' into 50395-2-2
rpopov Jan 24, 2025
bcfa6a2
Restore the original test logic (still excluding the tests of the int…
rpopov Jan 25, 2025
3ed8ae7
Fix the failing tests:
rpopov Jan 26, 2025
588ca53
Integrate the DpathEnhancingExtractor in the UI of Airbyte.
rpopov Jan 26, 2025
0eba8a9
ci: migrate to Poetry 2.0 and remove `--no-update` references (#276)
devin-ai-integration[bot] Jan 28, 2025
d9d93ab
ci: fix poetry lock workflow for missing/blank file (#284)
devin-ai-integration[bot] Jan 28, 2025
d318618
feat(low-code concurrent): Add `use_global_cursor` flag to Concurrent…
tolik0 Jan 28, 2025
4f8e9d8
chore: add missing poetry-core constraint (<2.0), tighten python vers…
aaronsteers Jan 28, 2025
0b7d976
fix(low-code CDK): fix checkpointing for declarative streams (#177)
lazebnyi Jan 28, 2025
c0ba13f
ci: add pre-commit config for airbyte-python-cdk (#270)
devin-ai-integration[bot] Jan 29, 2025
3729574
chore: remove deprecated `distutils` references (#292)
aaronsteers Jan 29, 2025
128b678
feat: expand pre-commit config (#259)
devin-ai-integration[bot] Jan 29, 2025
ad20739
chore: remove pendulum dependency (#258)
devin-ai-integration[bot] Jan 29, 2025
3af96dc
ci: add Python 3.12 support (#291)
devin-ai-integration[bot] Jan 29, 2025
4e7f94a
feat(low-code): add use check availability flag to dynamic check (#293)
lazebnyi Jan 30, 2025
0b14379
ci: skip py3.12 in test matrix (#300)
aaronsteers Jan 30, 2025
33e9f5e
fix(concurrent-partition-cursor): Fix cursor comparison error (#298)
tolik0 Jan 30, 2025
a6d55be
fix: make new datetime parse functions more permissive (#296)
aaronsteers Jan 30, 2025
dea2cc9
feat(low-code): added json.loads to jwt authenticator (#301)
lazebnyi Jan 30, 2025
ee537af
feat: use create_concurrent_cursor_from_perpartition_cursor (#286)
artem1205 Jan 30, 2025
65e6a0d
fix: (OAuthAuthenticator) - get the `access_token`, `refresh_token`, …
bazarnov Jan 30, 2025
10a7a87
fix: Relocates `self._extract_slice_fields(stream_slice=stream_slice)…
pnilan Jan 31, 2025
e57d38a
docs: add migration note for pendulum removal (#308)
devin-ai-integration[bot] Feb 1, 2025
426ab5b
fix(low-code): add wrong dynamic stream name type validation (#305)
lazebnyi Feb 1, 2025
ef97304
feat(low-code): improve logging on async retriever errors (#307)
maxi297 Feb 2, 2025
979598c
chore: use python 3.11 for connector tests in CI (#313)
ChristoGrab Feb 4, 2025
126e233
feat: enable handling of nested fields when injecting request_option …
ChristoGrab Feb 4, 2025
ca68c5c
fix(concurrent cdk): Properly call set_initial_state() on the cursor …
brianjlai Feb 4, 2025
2e416f2
Merge branch 'main' into 50395-2-2
rpopov Feb 5, 2025
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
2 changes: 1 addition & 1 deletion .github/workflows/autofix-command.yml
Original file line number Diff line number Diff line change
Expand Up @@ -75,7 +75,7 @@ jobs:
- name: Set up Poetry
uses: Gr1N/setup-poetry@v9
with:
poetry-version: "1.8.4"
poetry-version: "2.0.1"
- name: Set up Python
uses: actions/setup-python@v5
with:
Expand Down
2 changes: 1 addition & 1 deletion .github/workflows/connector-tests.yml
Original file line number Diff line number Diff line change
Expand Up @@ -128,7 +128,7 @@ jobs:
- name: Set up Python
uses: actions/setup-python@v5
with:
python-version: "3.10"
python-version: "3.11"
# Create initial pending status for test report
- name: Create Pending Test Report Status
if: steps.no_changes.outputs.status != 'cancelled'
Expand Down
2 changes: 1 addition & 1 deletion .github/workflows/pdoc_preview.yml
Original file line number Diff line number Diff line change
Expand Up @@ -16,7 +16,7 @@ jobs:
- name: Set up Poetry
uses: Gr1N/setup-poetry@v9
with:
poetry-version: "1.8.4"
poetry-version: "2.0.1"
- name: Set up Python
uses: actions/setup-python@v5
with:
Expand Down
2 changes: 1 addition & 1 deletion .github/workflows/pdoc_publish.yml
Original file line number Diff line number Diff line change
Expand Up @@ -33,7 +33,7 @@ jobs:
- name: Set up Poetry
uses: Gr1N/setup-poetry@v9
with:
poetry-version: "1.8.4"
poetry-version: "2.0.1"
- name: Set up Python
uses: actions/setup-python@v5
with:
Expand Down
13 changes: 12 additions & 1 deletion .github/workflows/poetry-lock-command.yml
Original file line number Diff line number Diff line change
Expand Up @@ -36,6 +36,17 @@ jobs:
with:
pr: ${{ github.event.inputs.pr }}

- name: Check for blank or missing poetry.lock
run: |
if [ ! -s "poetry.lock" ]; then
echo "poetry.lock missing or blank. Fetching from main branch..."
git fetch origin main
git checkout origin/main -- poetry.lock
echo "Lock file restored from main."
else
echo "poetry.lock found. Proceeding."
fi

- name: Get PR info
id: pr-info
run: |
Expand Down Expand Up @@ -72,7 +83,7 @@ jobs:
- name: Set up Poetry
uses: Gr1N/setup-poetry@v9
with:
poetry-version: "1.8.4"
poetry-version: "2.0.1"
- name: Set up Python
uses: actions/setup-python@v5
with:
Expand Down
4 changes: 2 additions & 2 deletions .github/workflows/pytest_fast.yml
Original file line number Diff line number Diff line change
Expand Up @@ -17,7 +17,7 @@ jobs:
- name: Set up Poetry
uses: Gr1N/setup-poetry@v9
with:
poetry-version: "1.8.4"
poetry-version: "2.0.1"

- name: Check Poetry lock file is current
run: poetry check
Expand All @@ -44,7 +44,7 @@ jobs:
- name: Set up Poetry
uses: Gr1N/setup-poetry@v9
with:
poetry-version: "1.8.4"
poetry-version: "2.0.1"
- name: Set up Python
uses: actions/setup-python@v5
with:
Expand Down
4 changes: 2 additions & 2 deletions .github/workflows/pytest_matrix.yml
Original file line number Diff line number Diff line change
Expand Up @@ -30,7 +30,7 @@ jobs:
python-version: [
"3.10",
"3.11",
#'3.12', # Currently blocked by Pendulum
# "3.12", # `requests-cache` blocker: https://github.com/airbytehq/airbyte-python-cdk/issues/299
]
os: [
Ubuntu,
Expand Down Expand Up @@ -61,7 +61,7 @@ jobs:
uses: Gr1N/setup-poetry@v9
if: steps.changes.outputs.src == 'true'
with:
poetry-version: "1.8.4"
poetry-version: "2.0.1"
- name: Set up Python
uses: actions/setup-python@v5
if: steps.changes.outputs.src == 'true'
Expand Down
2 changes: 1 addition & 1 deletion .github/workflows/python_lint.yml
Original file line number Diff line number Diff line change
Expand Up @@ -17,7 +17,7 @@ jobs:
- name: Set up Poetry
uses: Gr1N/setup-poetry@v9
with:
poetry-version: "1.8.4"
poetry-version: "2.0.1"
- name: Set up Python
uses: actions/setup-python@v5
with:
Expand Down
2 changes: 1 addition & 1 deletion .github/workflows/test-command.yml
Original file line number Diff line number Diff line change
Expand Up @@ -85,7 +85,7 @@ jobs:
- name: Set up Poetry
uses: Gr1N/setup-poetry@v9
with:
poetry-version: "1.8.4"
poetry-version: "2.0.1"
- name: Set up Python
uses: actions/setup-python@v5
with:
Expand Down
63 changes: 63 additions & 0 deletions .pre-commit-config.yaml
Original file line number Diff line number Diff line change
@@ -0,0 +1,63 @@
# See https://pre-commit.com for more information
# See https://pre-commit.com/hooks.html for more hooks
exclude: |
(?x)(
# Python/system files
^.*/__init__\.py$|
^.*?/\.venv/.*$|
^.*?/node_modules/.*$|

# Generated/test files
^.*?/\.pytest_cache/.*$|
^.*?/__pycache__/.*$|
^.*?/\.mypy_cache/.*$|
^.*?/\.ruff_cache/.*$

# Package management
^.*?/poetry\.lock$|
^.*?/package-lock\.json$|
^.*?/pnpm-lock\.yaml$|

# Build and test artifacts
^.*?/build/.*$|
^.*?/dist/.*$|
^.*?/\.coverage$|
^.*?/coverage\.xml$|
)

repos:
- repo: https://github.com/pre-commit/pre-commit-hooks
rev: v4.5.0
hooks:
- id: trailing-whitespace
- id: end-of-file-fixer
- id: check-yaml
- id: check-added-large-files
- id: check-toml

- repo: https://github.com/astral-sh/ruff-pre-commit
rev: v0.8.3
hooks:
# Run the linter with repo-defined settings
- id: ruff
args: [--fix]

# Run the formatter with repo-defined settings
- id: ruff-format

- repo: https://github.com/pre-commit/mirrors-prettier
rev: v3.0.3
hooks:
- id: prettier
types_or: [json, yaml]
additional_dependencies:
- [email protected]

- repo: local
hooks:
- id: addlicense
name: Add license headers
entry: addlicense -c "Airbyte, Inc." -l apache -v -f LICENSE_SHORT
language: golang
additional_dependencies: [github.com/google/[email protected]]
files: \.py$
1 change: 1 addition & 0 deletions LICENSE_SHORT
Original file line number Diff line number Diff line change
@@ -0,0 +1 @@
Copyright (c) 2025 Airbyte, Inc., all rights reserved.
3 changes: 2 additions & 1 deletion airbyte_cdk/__init__.py
Original file line number Diff line number Diff line change
Expand Up @@ -92,7 +92,7 @@
from .sources.declarative.declarative_stream import DeclarativeStream
from .sources.declarative.decoders import Decoder, JsonDecoder
from .sources.declarative.exceptions import ReadException
from .sources.declarative.extractors import DpathExtractor, RecordSelector
from .sources.declarative.extractors import DpathExtractor, DpathEnhancingExtractor, RecordSelector
from .sources.declarative.extractors.record_extractor import RecordExtractor
from .sources.declarative.extractors.record_filter import RecordFilter
from .sources.declarative.incremental import DatetimeBasedCursor
Expand Down Expand Up @@ -234,6 +234,7 @@
"DefaultPaginator",
"DefaultRequestOptionsProvider",
"DpathExtractor",
"DpathEnhancingExtractor",
"FieldPointer",
"HttpMethod",
"HttpRequester",
Expand Down
6 changes: 3 additions & 3 deletions airbyte_cdk/cli/source_declarative_manifest/_run.py
Original file line number Diff line number Diff line change
Expand Up @@ -21,7 +21,6 @@
import sys
import traceback
from collections.abc import Mapping
from datetime import datetime
from pathlib import Path
from typing import Any, cast

Expand All @@ -44,6 +43,7 @@
)
from airbyte_cdk.sources.declarative.yaml_declarative_source import YamlDeclarativeSource
from airbyte_cdk.sources.source import TState
from airbyte_cdk.utils.datetime_helpers import ab_datetime_now


class SourceLocalYaml(YamlDeclarativeSource):
Expand Down Expand Up @@ -101,7 +101,7 @@ def _get_local_yaml_source(args: list[str]) -> SourceLocalYaml:
type=Type.TRACE,
trace=AirbyteTraceMessage(
type=TraceType.ERROR,
emitted_at=int(datetime.now().timestamp() * 1000),
emitted_at=ab_datetime_now().to_epoch_millis(),
error=AirbyteErrorTraceMessage(
message=f"Error starting the sync. This could be due to an invalid configuration or catalog. Please contact Support for assistance. Error: {error}",
stack_trace=traceback.format_exc(),
Expand Down Expand Up @@ -191,7 +191,7 @@ def create_declarative_source(
type=Type.TRACE,
trace=AirbyteTraceMessage(
type=TraceType.ERROR,
emitted_at=int(datetime.now().timestamp() * 1000),
emitted_at=ab_datetime_now().to_epoch_millis(),
error=AirbyteErrorTraceMessage(
message=f"Error starting the sync. This could be due to an invalid configuration or catalog. Please contact Support for assistance. Error: {error}",
stack_trace=traceback.format_exc(),
Expand Down
4 changes: 2 additions & 2 deletions airbyte_cdk/connector_builder/connector_builder_handler.py
Original file line number Diff line number Diff line change
Expand Up @@ -3,7 +3,6 @@
#

import dataclasses
from datetime import datetime
from typing import Any, List, Mapping

from airbyte_cdk.connector_builder.message_grouper import MessageGrouper
Expand All @@ -21,6 +20,7 @@
ModelToComponentFactory,
)
from airbyte_cdk.utils.airbyte_secrets_utils import filter_secrets
from airbyte_cdk.utils.datetime_helpers import ab_datetime_now
from airbyte_cdk.utils.traced_exception import AirbyteTracedException

DEFAULT_MAXIMUM_NUMBER_OF_PAGES_PER_SLICE = 5
Expand Down Expand Up @@ -114,4 +114,4 @@ def resolve_manifest(source: ManifestDeclarativeSource) -> AirbyteMessage:


def _emitted_at() -> int:
return int(datetime.now().timestamp()) * 1000
return ab_datetime_now().to_epoch_millis()
14 changes: 7 additions & 7 deletions airbyte_cdk/sources/declarative/async_job/job_orchestrator.py
Original file line number Diff line number Diff line change
Expand Up @@ -437,10 +437,10 @@ def create_and_get_completed_partitions(self) -> Iterable[AsyncPartition]:
yield from self._process_running_partitions_and_yield_completed_ones()
self._wait_on_status_update()
except Exception as exception:
LOGGER.warning(
f"Caught exception that stops the processing of the jobs: {exception}. Traceback: {traceback.format_exc()}"
)
if self._is_breaking_exception(exception):
LOGGER.warning(
f"Caught exception that stops the processing of the jobs: {exception}"
)
self._abort_all_running_jobs()
raise exception

Expand Down Expand Up @@ -482,16 +482,16 @@ def _is_breaking_exception(self, exception: Exception) -> bool:
and exception.failure_type == FailureType.config_error
)

def fetch_records(self, partition: AsyncPartition) -> Iterable[Mapping[str, Any]]:
def fetch_records(self, async_jobs: Iterable[AsyncJob]) -> Iterable[Mapping[str, Any]]:
"""
Fetches records from the given partition's jobs.
Fetches records from the given jobs.

Args:
partition (AsyncPartition): The partition containing the jobs.
async_jobs Iterable[AsyncJob]: The list of AsyncJobs.

Yields:
Iterable[Mapping[str, Any]]: The fetched records from the jobs.
"""
for job in partition.jobs:
for job in async_jobs:
yield from self._job_repository.fetch_records(job)
self._job_repository.delete(job)
28 changes: 17 additions & 11 deletions airbyte_cdk/sources/declarative/auth/jwt.py
Original file line number Diff line number Diff line change
Expand Up @@ -3,6 +3,7 @@
#

import base64
import json
from dataclasses import InitVar, dataclass
from datetime import datetime
from typing import Any, Mapping, Optional, Union
Expand Down Expand Up @@ -104,21 +105,21 @@ def __post_init__(self, parameters: Mapping[str, Any]) -> None:
)

def _get_jwt_headers(self) -> dict[str, Any]:
""" "
"""
Builds and returns the headers used when signing the JWT.
"""
headers = self._additional_jwt_headers.eval(self.config)
headers = self._additional_jwt_headers.eval(self.config, json_loads=json.loads)
if any(prop in headers for prop in ["kid", "alg", "typ", "cty"]):
raise ValueError(
"'kid', 'alg', 'typ', 'cty' are reserved headers and should not be set as part of 'additional_jwt_headers'"
)

if self._kid:
headers["kid"] = self._kid.eval(self.config)
headers["kid"] = self._kid.eval(self.config, json_loads=json.loads)
if self._typ:
headers["typ"] = self._typ.eval(self.config)
headers["typ"] = self._typ.eval(self.config, json_loads=json.loads)
if self._cty:
headers["cty"] = self._cty.eval(self.config)
headers["cty"] = self._cty.eval(self.config, json_loads=json.loads)
headers["alg"] = self._algorithm
return headers

Expand All @@ -130,18 +131,19 @@ def _get_jwt_payload(self) -> dict[str, Any]:
exp = now + self._token_duration if isinstance(self._token_duration, int) else now
nbf = now

payload = self._additional_jwt_payload.eval(self.config)
payload = self._additional_jwt_payload.eval(self.config, json_loads=json.loads)
if any(prop in payload for prop in ["iss", "sub", "aud", "iat", "exp", "nbf"]):
raise ValueError(
"'iss', 'sub', 'aud', 'iat', 'exp', 'nbf' are reserved properties and should not be set as part of 'additional_jwt_payload'"
)

if self._iss:
payload["iss"] = self._iss.eval(self.config)
payload["iss"] = self._iss.eval(self.config, json_loads=json.loads)
if self._sub:
payload["sub"] = self._sub.eval(self.config)
payload["sub"] = self._sub.eval(self.config, json_loads=json.loads)
if self._aud:
payload["aud"] = self._aud.eval(self.config)
payload["aud"] = self._aud.eval(self.config, json_loads=json.loads)

payload["iat"] = now
payload["exp"] = exp
payload["nbf"] = nbf
Expand All @@ -151,7 +153,7 @@ def _get_secret_key(self) -> str:
"""
Returns the secret key used to sign the JWT.
"""
secret_key: str = self._secret_key.eval(self.config)
secret_key: str = self._secret_key.eval(self.config, json_loads=json.loads)
return (
base64.b64encode(secret_key.encode()).decode()
if self._base64_encode_secret_key
Expand All @@ -176,7 +178,11 @@ def _get_header_prefix(self) -> Union[str, None]:
"""
Returns the header prefix to be used when attaching the token to the request.
"""
return self._header_prefix.eval(self.config) if self._header_prefix else None
return (
self._header_prefix.eval(self.config, json_loads=json.loads)
if self._header_prefix
else None
)

@property
def auth_header(self) -> str:
Expand Down
Loading
Loading