Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

ESQL: Document warnings behavior in CsvTests #125441

Merged
merged 2 commits into from
Mar 24, 2025
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
Original file line number Diff line number Diff line change
Expand Up @@ -37,7 +37,17 @@
import java.util.Objects;

/**
* Finds all fields with a single-value. If a field has a multi-value, it emits a {@link Warnings}.
* Finds all fields with a single-value. If a field has a multi-value, it emits
* a {@link Warnings warning}.
* <p>
* Warnings are only emitted if the {@link TwoPhaseIterator#matches}. Meaning that,
* if the other query skips the doc either because the index doesn't match or because it's
* {@link TwoPhaseIterator#matches} doesn't match, then we won't log warnings. So it's
* most safe to say that this will emit a warning if the document would have
* matched but for having a multivalued field. If the document doesn't match but
* "almost" matches in some fairly lucene-specific ways then it *might* emit
* a warning.
* </p>
*/
public final class SingleValueMatchQuery extends Query {

Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -30,6 +30,7 @@
import org.elasticsearch.compute.operator.DriverRunner;
import org.elasticsearch.compute.operator.exchange.ExchangeSinkHandler;
import org.elasticsearch.compute.operator.exchange.ExchangeSourceHandler;
import org.elasticsearch.compute.querydsl.query.SingleValueMatchQuery;
import org.elasticsearch.core.Releasables;
import org.elasticsearch.core.Tuple;
import org.elasticsearch.index.IndexMode;
Expand Down Expand Up @@ -153,6 +154,17 @@
* it’s creating its own Source physical operator, aggregation operator (just a tiny bit of it) and field extract operator.
* <p>
* To log the results logResults() should return "true".
* <p>
* This test never pushes to Lucene because there isn't a Lucene index to push to. It always runs everything in
* the compute engine. This yields the same results modulo a few things:
* <ul>
* <li>Warnings for multivalued fields: See {@link SingleValueMatchQuery} for an in depth discussion, but the
* short version is this class will always emit warnings on multivalued fields but tests that run against
* a real index are only guaranteed to emit a warning if the document would match all filters <strong>except</strong>
* it has a multivalue field.</li>
* <li>Sorting: This class emits values in the order they appear in the {@code .csv} files that power it. A real
* index emits documents a fair random order. Multi-shard and multi-node tests doubly so.</li>
* </ul>
*/
// @TestLogging(value = "org.elasticsearch.xpack.esql:TRACE,org.elasticsearch.compute:TRACE", reason = "debug")
public class CsvTests extends ESTestCase {
Expand Down