Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

implement loading by primary key #69

Merged

Conversation

NathanQingyangXu
Copy link
Contributor

@NathanQingyangXu NathanQingyangXu commented Mar 17, 2025

https://jira.mongodb.org/browse/HIBERNATE-22

The first "select statement" translation ticket, though the simplest one (where clause only contains primary key equalness comparison). Very business logic intensitve PR.

I added comments to explain the various edge cases of business logic and hope that would help.

@NathanQingyangXu NathanQingyangXu requested a review from jyemin March 17, 2025 01:19
@NathanQingyangXu NathanQingyangXu force-pushed the HIBERNATE-22-load-translation-new branch from 182d8fc to 9cb4599 Compare March 17, 2025 01:28
@stIncMale stIncMale self-requested a review March 17, 2025 02:13
var astElements = new ArrayList<AstElement>(tableInsert.getNumberOfValueBindings());
for (var columnValueBinding : tableInsert.getValueBindings()) {
var columnExpression = columnValueBinding.getColumnReference().getColumnExpression();

var valueExpression = columnValueBinding.getValueExpression();
if (valueExpression == null) {
throw new FeatureNotSupportedException();
}
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

after further investigation, it seems we can rule out the above possibility (only table updating has such concern for some reason), so I deleted it to avoid long-term potential confusion.

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

only table updating has such concern

What is that "table updating", and why it can never happen to us? Also, if we can see why it can never happen, but it's a thing nevertheless generally speaking, we should leave an explicit assertion, informing a reader that we did not miss the case when columnValueBinding.getValueExpression() is null, but thought about it and claim that it will never happen to us.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Below is the scenario that columnValueBinding.getValueExpression() could be null (https://github.com/hibernate/hibernate-orm/blob/main/hibernate-core/src/main/java/org/hibernate/sql/ast/spi/AbstractSqlAstTranslator.java#L8745C6-L8745C58):

                       if ( tableUpdate.getNumberOfOptimisticLockBindings() > 0 ) {
				tableUpdate.forEachOptimisticLockBinding( (position, columnValueBinding) -> {
					sqlBuffer.append( " and " );
					sqlBuffer.append( columnValueBinding.getColumnReference().getColumnExpression() );
					if ( columnValueBinding.getValueExpression() == null
							|| columnValueBinding.getValueExpression().getFragment() == null ) {
						sqlBuffer.append( " is null" );
					}
					else {
						sqlBuffer.append( "=" );
						columnValueBinding.getValueExpression().accept( this );
					}
				} );
			}

The reason is the requirement of SQL's special nullness syntax (i.e. xxx is null), so if columnValueBinding.getValueExpression() is a parameter placeholder, there is no accommodation of the nullness SQL requirement or not easy; for insertion case, there seems no such possibility, so that is why I claimed that we might be able to remove the unnecessary nullness checking.

So it seems it doesn't hurt to retain the nullness checking in insertion case. I reverted it back.

|| selectStatement.getCteObjects() != null
&& !selectStatement.getCteObjects().isEmpty()) {
throw new FeatureNotSupportedException("CTE feature not supported");
}
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

CTE (Common Table Expression; See https://hightouch.com/sql-dictionary/sql-common-table-expression-cte for a gentle introduction to CTE) is about creating some temporary table. Mongo aggregate command seems to support it, but for this PR let us throw exception for now.

&& !selectStatement.getCteObjects().isEmpty()) {
throw new FeatureNotSupportedException("CTE feature not supported");
}
selectStatement.getQueryPart().accept(this);
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

QueryPart has two child classes:

  • QueryGroup
  • QuerySpec

The latter will be covered in the visitor method below; QueryGroup is used for a set of QuerySpec and how to combine them together. A typical example is as follows:

SELECT column_name(s) FROM table1
UNION
SELECT column_name(s) FROM table2;

throw new FeatureNotSupportedException("TODO-HIBERNATE-22 https://jira.mongodb.org/browse/HIBERNATE-22");
logSqlAst(selectStatement);
if (!selectStatement.getQueryPart().isRoot()) {
throw new FeatureNotSupportedException("Subquery not supported");
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

a subquery example:

select d from Department as d where d in (select s.department from Salesperson s where s.name = ?1)"

public void visitFromClause(FromClause fromClause) {
if (fromClause.getRoots().size() != 1) {
throw new FeatureNotSupportedException();
}
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

An example HQL in which there are multipe roots:

select li.id from LineItem li, Product as p where p.id = li.product.id and li.quantity >= ?1 and p.name = ?2


if (!(tableGroup.getModelPart() instanceof EntityPersister entityPersister)
|| entityPersister.getQuerySpaces().length != 1) {
throw new FeatureNotSupportedException();
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

tableGroup's modelPart doesn't necessarily corresponds to an entity table (EntityPersister class corresponds to it), for instance, for @OneToMany association, the collection field will have its SQL translated and its model part will be a special PluralAttributeMappingImpl class object containing foreign key specific logic.

When model part is an EntityPersister corresponding to an entity, not entity collection, there could be the possibility that the entity class hierarchy ends up with multiple tables (getQuerySpaces() will return all the relevant table names during query) corresponding to the "joinied table" mapping option (see https://docs.jboss.org/hibernate/orm/6.6/userguide/html_single/Hibernate_User_Guide.html#entity-inheritance for details)


for (SqlSelection sqlSelection : selectClause.getSqlSelections()) {
if (sqlSelection.isVirtual()) {
continue;
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

There is some edge case that some SqlSelection entry exists for some reason but are not meant to be used in SQL. For such virtual entries, we simply ignore during SQL translation (a pattern also in AbstractSqlTranslator).

Copy link
Collaborator

@jyemin jyemin left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Just did an initial pass of the core functionality, and had a few comments.

@NathanQingyangXu NathanQingyangXu requested a review from jyemin March 19, 2025 14:48
Copy link
Member

@stIncMale stIncMale left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Partial review results. I am submitting them only because it's the only way to leave a comment in an existing discussion.

@jyemin jyemin self-requested a review March 19, 2025 22:53
Copy link
Member

@stIncMale stIncMale left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The review hasn't been completed. Leaving partial results again to comment on an existing discussion.

# Conflicts:
#	src/main/java/com/mongodb/hibernate/internal/translate/AbstractMqlTranslator.java
#	src/main/java/com/mongodb/hibernate/internal/translate/mongoast/command/AstUpdateCommand.java
@@ -143,6 +159,8 @@ abstract class AbstractMqlTranslator<T extends JdbcOperation> implements SqlAstT

private final List<JdbcParameterBinder> parameterBinders = new ArrayList<>();

private final Set<String> affectedTableNames = new HashSet<>();
Copy link
Member

@stIncMale stIncMale Mar 22, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I started thinking on which of the classes we plug in Hibernate ORM have to be thread-safe, and which are used multiple times - it's not obvious, and is, of course, not documented by Hibernate ORM.

  1. MongoDialect - only one instance is created. Does this mean it has to be thread-safe, or does Hibernate ORM guarantee not to use it concurrently?
  2. MongoTranslatorFactory - essentially, Hibernate ORM uses a single instance, but creates two due to this sloppy code. As far as I understand, this instance should be thread-safe.
  3. SqlAstTranslators created by MongoTranslatorFactory
    3.1. It looks like a new instance of SelectStatementMqlTranslator is created for each query that is different in any way from those previously translated, even if the difference is in QueryOptions. Furthermore, SelectStatementMqlTranslator.translate is called only once per instance of SelectStatementMqlTranslator. That is, it looks like SelectStatementMqlTranslator does not need to be thread-safe, nor does it need to be reusable, i.e., we are fine with making it stateful and not resetting the state before returning from the SelectStatementMqlTranslator.translate method.
    3.2. Is the same true about all translators created by MongoTranslatorFactory?
  4. StandardServiceRegistryScopedState.ServiceContributor.contribute - may be called multiple times if multiple instances of StandardServiceRegistry are built.
  5. MongoAdditionalMappingContributor - TODO for @stIncMale.

We should document (for ourselves) the above, provided that the above is correct. @NathanQingyangXu, do you think the above assessment is correct?


A similar thing is about the SPI we expose:

  1. MongoConfigurationContributor - its method configure is called once per building an instance of StandardServiceRegistry. We should document this, because this affects both reusability requirements applied to implementations, and only an application author may figure out based on the application whether his MongoConfigurationContributor has to be reusable.

I created PR 76: Add MongoConfigurationContributor tests and document it better to address this part.


With our JDBC implementations things are simple, because we are familiar with the JDBC API, and are not familiar with the Hibernate ORM extension API, which is not documented extensively - none of them are used concurrently, i.e., they don't need to be thread-safe.

Copy link
Contributor Author

@NathanQingyangXu NathanQingyangXu Mar 24, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think what you think is correct, except for the point to create multiple StandardServiceRegistry instances (I explained about it below).

As a general rule, all those class related to global configuration will end up with singleton instance (similar to Spring bean). Technically these classes are not thread-safe and it is developer's responsibility to ensure thread-safety (e.g. Dialect singleton instance could be accessed from different concurrent sessions at the same time), but usually these classes don't maintain their own state and is read-only in essence, so it is very rare to see concurrency related stuff is there.

Only for the global SessionFactory scope, concurrency is a relevant concern (like QueryPlan cache, which is based on double lock checking to ensure thread-safety).

Yeah, JDBC APIs will be used in Session which is thread-safe, so we don't need bother with concurrency issue in JDBC layer.

The various translators would be created dynamically as the following javadoc implies (by single-use verbiage):

/**
 * Factory for obtaining single-use SQL AST translators
 *
 * @author Steve Ebersole
 */
public interface SqlAstTranslatorFactory {
	/**
	 * Builds a single-use select translator
	 */
	SqlAstTranslator<JdbcOperationQuerySelect> buildSelectTranslator(SessionFactoryImplementor sessionFactory, SelectStatement statement);

	/**
	 * Builds a single-use mutation translator
	 */
	SqlAstTranslator<? extends JdbcOperationQueryMutation> buildMutationTranslator(SessionFactoryImplementor sessionFactory, MutationStatement statement);

	/**
	 * Builds a single-use delete translator
	 *
	 * @deprecated Use {@link #buildMutationTranslator(SessionFactoryImplementor, MutationStatement)} instead
	 */
	@Deprecated(forRemoval = true)
	default SqlAstTranslator<JdbcOperationQueryDelete> buildDeleteTranslator(SessionFactoryImplementor sessionFactory, DeleteStatement statement) {
		//noinspection unchecked
		return (SqlAstTranslator<JdbcOperationQueryDelete>) buildMutationTranslator( sessionFactory, statement );
	}

	/**
	 * Builds a single-use insert-select translator
	 *
	 * @deprecated Use {@link #buildMutationTranslator(SessionFactoryImplementor, MutationStatement)} instead
	 */
	@Deprecated(forRemoval = true)
	default SqlAstTranslator<JdbcOperationQueryInsert> buildInsertTranslator(SessionFactoryImplementor sessionFactory, InsertStatement statement) {
		//noinspection unchecked
		return (SqlAstTranslator<JdbcOperationQueryInsert>) buildMutationTranslator( sessionFactory, statement );
	}

	/**
	 * Builds a single-use update translator
	 *
	 * @deprecated Use {@link #buildMutationTranslator(SessionFactoryImplementor, MutationStatement)} instead
	 */
	@Deprecated(forRemoval = true)
	default SqlAstTranslator<JdbcOperationQueryUpdate> buildUpdateTranslator(SessionFactoryImplementor sessionFactory, UpdateStatement statement) {
		//noinspection unchecked
		return (SqlAstTranslator<JdbcOperationQueryUpdate>) buildMutationTranslator( sessionFactory, statement );
	}

	/**
	 * Builds a single-use translator for dealing with model mutations
	 */
	<O extends JdbcMutationOperation> SqlAstTranslator<O> buildModelMutationTranslator(TableMutation<O> mutation, SessionFactoryImplementor sessionFactory);

}

I failed to see the necessity of multiple instances of StandardServiceRegistry. It should be created as singleton for one SessionFactory as well.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

If our user is tech-savy, I think such doc might only end up with confusion.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

sorry, if the doc is for our own side, go ahead

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks for raising the awareness of the duplicated SqlAstFactory creation issue in JdbcEnvironmentImpl. I'd created a PR and it got merged at hibernate/hibernate-orm#9900 already.

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Factory for obtaining single-use SQL AST translators

I was looking for a piece of doc like that, but missed it. Thank you for pointing to it!

I'd created a PR and it got merged at hibernate/hibernate-orm#9900 already.

Great! I'll create a ticket making MongoTranslatorFactory a singleton, as we won't get that fix in Hibernate ORM 6.x.

I'll also create a PR addressing the rest of internal documentation changes mentioned here.

Copy link
Member

@stIncMale stIncMale left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The last reviewed commit is 4fca70d.

@NathanQingyangXu
Copy link
Contributor Author

P.S. I struggle to understand why situations like this keep happening quite often. Was the first comment I left actually so indecipherable, or was it not read carefully?

It is mainly due to the knowledge gap from my side. I didn't know of the subtlety of contact.phone.number until I experiemented locally and afterwards read the relevant mongo doc.

Copy link
Member

@stIncMale stIncMale left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The last reviewed commit is 4a165c6.

Copy link
Member

@stIncMale stIncMale left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The last reviewed commit is c779cfd.

@NathanQingyangXu NathanQingyangXu requested a review from jyemin March 26, 2025 15:30
@NathanQingyangXu NathanQingyangXu merged commit 011e6e4 into mongodb:main Mar 26, 2025
6 checks passed
@NathanQingyangXu NathanQingyangXu deleted the HIBERNATE-22-load-translation-new branch March 26, 2025 15:43
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants