Work in progress - switch to eventstreaming #77

marcbowes · 2021-10-27T23:59:01Z

No description provided.

andrewt-amzn

Some superficial comments and questions mostly on the builder. I will take a second pass at grokking the session pool management and error handling later.

andrewt-amzn · 2021-11-01T20:13:19Z

amazon-qldb-driver-core/src/driver.rs

-    pub async fn build_with_client<C>(self, client: C) -> QldbResult<QldbDriver<C>>
+    /// Builds a `QldbDriver` using the AWS SDK for Rust.
+    ///
+    /// Note that `config` is the service-specific (QldbSession) config. For


Is there a way to alias the import to make that clear in the code? Or is the generic sounding Config type just something to get used to?

I'm not sure, this is a newly introduced change in the SDK. I think there might be something clever I can do with generics to make it so there is only 1 API that can accept either.

andrewt-amzn · 2021-11-01T20:18:12Z

amazon-qldb-driver-core/src/driver.rs

+    ///
+    /// Note that `config` is the service-specific (QldbSession) config. For
+    /// shared config, see [`sdk_config`].
+    pub async fn config(self, config: Config) -> QldbResult<QldbDriver<DynConnector>> {


Why do these functions need to be async?

Setting up the bb8 pool launches a background coroutine that manages connections. In Kotlin speak, this needs to be in the same coroutine context, and thus would have been modeled as an extension function.

One nice thing about Rust coroutines vs Kotlin coroutines is all the await points are obvious:

pub async fn config(self, config: Config) -> QldbResult<QldbDriver<DynConnector>> { let client = Client::from_conf(config); self.build_with_client(client).await // ^^^^^ }

pub async fn build_with_client<C>(self, client: Client<C>) -> QldbResult<QldbDriver<C>> where C: SmithyConnector, { let ledger_name = self .ledger_name .ok_or(error::usage_error("ledger_name must be initialized"))?; let transaction_retry_policy = Arc::new(Mutex::new(self.transaction_retry_policy)); let session_pool = Pool::builder() .test_on_check_out(false) .max_lifetime(None) .max_size(self.max_concurrent_transactions) .connection_timeout(Duration::from_secs(10)) .error_sink(Box::new(QldbErrorLoggingErrorSink::new())) .build(QldbSessionV2Manager::new(client, ledger_name.clone())) .await // <------- // ^^^^^ .map_err(|_| error::todo_stable_error_api())?; Ok(QldbDriver { ledger_name: Arc::new(ledger_name.clone()), session_pool: Arc::new(session_pool), transaction_retry_policy, }) }

andrewt-amzn · 2021-11-01T20:25:35Z

amazon-qldb-driver-core/src/driver.rs

-            })
-            .await?;
+            .build(QldbSessionV2Manager::new(client, ledger_name.clone()))
+            .await


The bb8 doc says that the future completes when the pool has the requested number of connections open. Since you're not specifying min_idle, we will have a pool with zero sessions initially, right?

I can't remember now, but yeah I think so.

I was thinking about making APIs that mirror the bb8 pool (i.e. exposing min_idle on our builder), but I'm not sure. Got any thoughts?

andrewt-amzn · 2021-11-01T20:35:53Z

amazon-qldb-driver-core/src/driver.rs

-            .await?;
+            .build(QldbSessionV2Manager::new(client, ledger_name.clone()))
+            .await
+            .map_err(|_| error::todo_stable_error_api())?;


This is mapping the error variant from bb8 Builder.build, not an aws sdk error, right?

And map_err results in another Result, rather than the success variant of the prior Result rather than being some macro which can return an error? I'm a bit lost about how we get from here to building the Ok result with a concrete session_pool.

Yeah. So..

pub async fn build(self, manager: M) -> Result<Pool<M>, M::Error> {

The error variant is the associated Error type, i.e. this one:

#[async_trait] impl<C> ManageConnection for QldbSessionV2Manager<C> where C: SmithyConnector, { type Connection = QldbHttp2Connection; type Error = ConnectionError;

and ConnectionError is defined by us to be:

pub type ConnectionError = SdkError<SendCommandError>;

So it's just the underlying SdkError that represents the issue - connection, credentials, invalid endpoint etc.

Previously, I had the carrier infrastructure (? operator) turn that into QldbError::SdkError. As per our discussion last week, you convinced me to abstract the SDK errors away. So I just put a todo there to remind myself.

map_err is a function that turns Result<T, E1> to Result<T, E2> (as you said). So it doesn't mess with the pool at all, just the error. The T itself is the pool, and the carrier trait essentially expands to:

let pool = match stuff() { Ok(it) => it, Err(err) => return Err(err).into(), };

andrewt-amzn · 2021-11-01T20:43:12Z

amazon-qldb-driver-core/src/error.rs

 use aws_smithy_http::operation::BuildError;
 use thiserror::Error;

 pub type QldbResult<T> = std::result::Result<T, QldbError>;
+pub type BoxError = Box<dyn std::error::Error + Send + Sync + 'static>;


Why include the word Box in the type name? Isn't the type signature enough for the reader to know this is a Box, if that is important for them to know?

This has become idiomatic in libraries. I'm not sure what a better name might be, so I'm copying a pattern I've seen elsewhere. One example is the aws sdk itself.

andrewt-amzn · 2021-11-01T20:46:40Z

amazon-qldb-driver-core/src/driver.rs

-        ))?;
+        let ledger_name = self
+            .ledger_name
+            .ok_or(error::usage_error("ledger_name must be initialized"))?;


If the None variant is illegal, why use an Option at all?

This is a standard idiom with builders. Otherwise you need to make loads of builder structs to model a finite state machine:

struct QldbDriverBuilder; impl QldbDriverBuilder { fn new() -> QldbDriverBuilder { QldbDriverBuilder {} } fn ledger_name(ledger_name: impl Into<String>) -> QldbDriverBuilderStep1 { QldbDriverBuilderStep1 { ledger_name: ledger_name.into() } } } struct QldbDriverBuilderStep1 { ledger_name: String } // ...

Because this isn't a "true runtime" error (i.e. something that randomly happens), it seems OK to allow for some misuse to escape the compiler.

There are several broad strokes made in this commit: 1. QldbSession v1 is gone entirely. I initially made an attempt to have these live side-by-side, but they're just so different. If there is a desire to allow runtime switching of underlying drivers, we can bring that back in a future commit. 2. In particular, having QldbError::SdkError was a huge mistake. Going forward, we're going to abstract the SDK errors entirely rather than couple the transport and application layers. 3. Eventstreaming is used in a 1:1 fashion only - 1 request, 1 response. In the future, we may have concurrent request-responses, but we will need correlation ids to support that. 4. I've entirely punted on the user errors and retries for now. 5. The support for dynamic SDKs has been removed. I think it's useful to get this back, but I want to try get it done using the Smithy patterns rather than the trait stuff. As a result, a bunch of indirection is gone which made this much easiser to write. Of note, I really like how the `<C>` parameter just went away - especially in the user-facing `TransactionAttempt<C>`. This is really nice. I was also pleased how the bb8 pool abstraction actually made a very natural place to stash the input/output channels. There is no testing yet.

The driver no longer needs this as eventstreaming takes care of these concerns. We may need to offer QldbHash as its own library, e.g. to offer client-side computation of the hash chain.

The doctests don't pass :(

This lets us move the PooledConnection into TransactionAttempt. Having that type have a lifetime is really painful. At this point, the doctest passes, but it's ugly!

The test fails with a timeout because the DVR isn't loaded with any precanned request-response pairs!

Breaking change - call `bufferred().await?` to explicitly load all values into memory.

This extracts the interaction logic to results.rs and makes poll_next drag in stats under the covers. Lots of commenting too!

Use buffered(), move into -core

This commit removes the distinction between the -core crate and the driver facade. Initially the idea was to have a "pure" implementation that didn't have any networking stuff so that we could compile to wasm and dependency inject different SDKs. However, we're not actually doing that yet and this indirection wasn't helping us move quickly. Furthermore, the new SDK has taken the directly where the toplevel client isn't a trait. Rather, SmithyConnector (1 layer down) allows for pluggable connectivity. This is really nice, and means that when we re-pursue the wasm directly we might not even need this two-crate approach anyway!

unique.rs showed that when using tokio::spawn, the future needs to be Send, so any suspend points (resume types) need to be Send too. This meant that statement results (which are generic over E) failed the bounds checks.

Flesh out integration test to support recording and include an example that worked for me. (The test currently fails, not sure why yet.)

I think the reason this test is failing is related to awslabs/aws-sdk-rust#296

I think it's going to be common for applications to have a single error type. The new pattern makes Infallible the default variant (read: "no custom user error"), and `.with_user_error` can be used to customize the variant.

marcbowes marked this pull request as draft October 27, 2021 23:59

andrewt-amzn reviewed Nov 1, 2021

View reviewed changes

marcbowes added 11 commits November 1, 2021 14:11

Generate a streaming sdk off v0.25.1

d7ca601

New client that uses restJson1

5153ec1

New client based off 0.27.0-alpha

8a36632

Update to .22

ba16dbc

Remove QldbHash

bafc4ed

The driver no longer needs this as eventstreaming takes care of these concerns. We may need to offer QldbHash as its own library, e.g. to offer client-side computation of the hash chain.

Comment why the pooled connection isn't owned

154b32e

Working on errors

f57cd5b

Add another layer of errors!

2e95e7d

More work on errors

0b0f09e

The doctests don't pass :(

Erase SmithyConnector

7687ecf

This lets us move the PooledConnection into TransactionAttempt. Having that type have a lifetime is really painful. At this point, the doctest passes, but it's ugly!

marcbowes force-pushed the streaming branch from 4701be8 to 7687ecf Compare November 4, 2021 03:17

marcbowes added 13 commits November 4, 2021 10:45

Fix compile of examples

027a810

doctest to use Infallible

9db1688

Add a TransactionResult alias

2fb6e09

Add a framework for testing streaming end-to-end

6e00d1e

The test fails with a timeout because the DVR isn't loaded with any precanned request-response pairs!

Results are now streamed by default

73ed424

Breaking change - call `bufferred().await?` to explicitly load all values into memory.

Extract results mod

b291f72

Fix execution stats for execute statement

2cac889

This extracts the interaction logic to results.rs and makes poll_next drag in stats under the covers. Lots of commenting too!

Doctests for builder

820230c

Basic usage doctests

856e85b

Start to fix up examples

690f4d3

Use buffered(), move into -core

TransactError<E> now requires Send+Sync

684037f

unique.rs showed that when using tokio::spawn, the future needs to be Send, so any suspend points (resume types) need to be Send too. This meant that statement results (which are generic over E) failed the bounds checks.

Document results API

ad2479b

marcbowes mentioned this pull request Nov 12, 2021

'error writing body to connection' after dropping eventstreaming connection awslabs/aws-sdk-rust#296

Closed

marcbowes added 2 commits November 12, 2021 16:31

Fix v2 Uri

1185c98

Flesh out integration test to support recording and include an example that worked for me. (The test currently fails, not sure why yet.)

Improvements to debugability of integration.rs

9a687b1

I think the reason this test is failing is related to awslabs/aws-sdk-rust#296

Move <E> into the driver

2782bac

I think it's going to be common for applications to have a single error type. The new pattern makes Infallible the default variant (read: "no custom user error"), and `.with_user_error` can be used to customize the variant.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Work in progress - switch to eventstreaming #77

Work in progress - switch to eventstreaming #77

Uh oh!

marcbowes commented Oct 27, 2021

Uh oh!

andrewt-amzn left a comment

Uh oh!

andrewt-amzn Nov 1, 2021

Uh oh!

marcbowes Nov 1, 2021

Uh oh!

andrewt-amzn Nov 1, 2021

Uh oh!

marcbowes Nov 1, 2021 •

edited

Loading

Uh oh!

andrewt-amzn Nov 1, 2021

Uh oh!

marcbowes Nov 1, 2021

Uh oh!

andrewt-amzn Nov 1, 2021

Uh oh!

marcbowes Nov 1, 2021

Uh oh!

andrewt-amzn Nov 1, 2021

Uh oh!

marcbowes Nov 1, 2021

Uh oh!

andrewt-amzn Nov 1, 2021

Uh oh!

marcbowes Nov 1, 2021

Uh oh!

Uh oh!

Work in progress - switch to eventstreaming #77

Are you sure you want to change the base?

Work in progress - switch to eventstreaming #77

Uh oh!

Conversation

marcbowes commented Oct 27, 2021

Uh oh!

andrewt-amzn left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

marcbowes Nov 1, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

marcbowes Nov 1, 2021 •

edited

Loading