Support CODDTest in sqlancer #1054

DerZc

No description provided.

mrigger

Thanks for the PR! Added some initial comments. I think it's quite challenging to review such a large PR. Do you think we could split it into multiple smaller ones (referencing this as the main PR)? For example, as a first step, we could add some of the new expression nodes and the visit methods.

The initial CERT and CODDTest implementations diverged from their papers in ways that defeated the test signal:

CERT was using actual row counts from running the queries and a bidirectional mutator framework. Per Ba and Rigger, ICSE 2024 the property under test is `EstCard(Q', D) <= EstCard(Q, D)` -- the *estimator's* projection, with Q' strictly more restrictive than Q, and "CERT eschews executing queries". This rewrite:

* Reads cardinality from `EXPLAIN ESTIMATE`, summing `rows` across the per-table tuples it returns. The query is never executed.
* Restricts mutations to one direction. `mutateWhere`/`mutateAnd` always AND-tighten or introduce a WHERE; `mutateOr` drops an OR operand (per the paper's restrictive-OR rule) or falls back to AND; `mutateDistinct` only promotes ALL -> DISTINCT. All return `increase=false`.
* Skips runs where the estimator returns nothing (non-MergeTree engines, `ORDER BY tuple()`, unsupported expressions), and skips runs where the structural-similarity gate on `EXPLAIN PLAN` shows too much drift.

CODDTest was toggling random optimizer flags via per-query `SETTINGS` clauses and comparing results. Per Zhang and Rigger, SIGMOD 2025 the oracle is constant-folding-driven: take a subexpression E in Q, evaluate E to a value V via an auxiliary query A, build a folded query F by substituting V for E, then assert results of Q and F are identical. This rewrite implements the scalar-subquery variant (same as DuckDBCODDTestOracle in the upstream PR sqlancer#1054):

    aux: SELECT min(c)/max(c) FROM t  ->  V
    Q:   SELECT * FROM t WHERE col op (SELECT min/max(c) FROM t)
    F:   SELECT * FROM t WHERE col op V

Only `Int32`/`String` columns are folded since they are the only types the existing schema generator and `ClickHouseSchema.getConstant` support; NULL auxiliary results are skipped (NULL-propagation would make the predicate UNKNOWN for every row and the equivalence does not hold).
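The aux/Q/F construction described above can be sketched as plain string building. This is only an illustration under stated assumptions: `CoddFoldSketch` and all of its methods are hypothetical names, not SQLancer classes, the literal escaping is deliberately simplified, and no database is contacted.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;
import java.util.Optional;

// Hypothetical helper, not part of SQLancer: builds the three queries of the
// scalar-subquery CODDTest variant as strings and compares result multisets.
final class CoddFoldSketch {

    // Render an auxiliary result V as a SQL literal. Only integers and strings
    // are handled; quote doubling is a simplification (assumption) and a real
    // dialect may need additional escaping.
    static String literal(Object value) {
        if (value instanceof Integer || value instanceof Long) {
            return value.toString();
        }
        if (value instanceof String) {
            return "'" + ((String) value).replace("'", "''") + "'";
        }
        throw new IllegalArgumentException("unsupported value: " + value);
    }

    // Auxiliary query A: evaluates the subexpression E on its own.
    static String auxiliaryQuery(String agg, String aggColumn, String table) {
        return "SELECT " + agg + "(" + aggColumn + ") FROM " + table;
    }

    // Original query Q: E appears as a scalar subquery in the predicate.
    static String originalQuery(String table, String column, String op,
                                String agg, String aggColumn) {
        return "SELECT * FROM " + table + " WHERE " + column + " " + op
                + " (" + auxiliaryQuery(agg, aggColumn, table) + ")";
    }

    // Folded query F: E replaced by V. A NULL auxiliary result yields an
    // empty Optional, meaning the run should be skipped.
    static Optional<String> foldedQuery(String table, String column, String op,
                                        Object auxValue) {
        if (auxValue == null) {
            return Optional.empty();
        }
        return Optional.of("SELECT * FROM " + table + " WHERE " + column + " "
                + op + " " + literal(auxValue));
    }

    // Oracle check: Q and F must return the same rows, ignoring order.
    static boolean sameRows(List<String> rowsOfQ, List<String> rowsOfF) {
        List<String> a = new ArrayList<>(rowsOfQ);
        List<String> b = new ArrayList<>(rowsOfF);
        Collections.sort(a);
        Collections.sort(b);
        return a.equals(b);
    }
}
```

In a real oracle the caller would run the auxiliary query, pass its single result value to `foldedQuery`, execute Q and F, and report a bug when `sameRows` is false. Returning an empty `Optional` for NULL mirrors the skip rule stated above.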
Verified locally against a release ClickHouse 26.5 server:

* CERT: ~6 q/s effective (most attempts skip because no estimate responds to the random mutation), 0 false positives in a 30s window.
* CODDTest: ~22 q/s, 96-97% successful statements, 0 false positives.

`mvn checkstyle:check` clean, `mvn package -Dmaven.test.skip=true` succeeds.

Papers:

    CERT     https://doi.org/10.1145/3597503.3639076
    CODDTest https://doi.org/10.1145/3709674
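The CERT property stated in this message, `EstCard(Q', D) <= EstCard(Q, D)`, together with the skip rule for missing estimates, could be checked roughly as follows. This is a sketch under assumptions: `CertCheckSketch` and `Verdict` are hypothetical names, and the per-table `rows` values are passed in as lists instead of being read from `EXPLAIN ESTIMATE` over a live connection.

```java
import java.util.List;
import java.util.OptionalLong;

// Hypothetical helper, not part of SQLancer: applies the CERT monotonicity
// check to already-collected per-table row estimates.
final class CertCheckSketch {

    enum Verdict { SKIP, PASS, VIOLATION }

    // Sum the `rows` estimates across the per-table tuples. An empty list
    // models an estimator that returned nothing.
    static OptionalLong estimatedRows(List<Long> perTableRows) {
        if (perTableRows.isEmpty()) {
            return OptionalLong.empty();
        }
        long sum = 0;
        for (long rows : perTableRows) {
            sum += rows;
        }
        return OptionalLong.of(sum);
    }

    // Q' is assumed to be more restrictive than Q, so its estimate must not
    // exceed the estimate for Q. Missing estimates lead to a skipped run.
    static Verdict check(List<Long> estimateForQ, List<Long> estimateForRestrictedQ) {
        OptionalLong original = estimatedRows(estimateForQ);
        OptionalLong restricted = estimatedRows(estimateForRestrictedQ);
        if (!original.isPresent() || !restricted.isPresent()) {
            return Verdict.SKIP;
        }
        return restricted.getAsLong() <= original.getAsLong()
                ? Verdict.PASS
                : Verdict.VIOLATION;
    }
}
```

Because every mutation only tightens the query, one comparison direction is enough; a bidirectional framework would need the `increase` flag to pick the direction.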

DerZc added 11 commits January 3, 2025 20:02

add the implementation of CODDTest for SQLite3

6b86175

fix a typo and remove some redundant code of CODDTest for sqlite3

c6f2c4a

add CODDTest for DuckDB

7a754ba

Merge remote-tracking branch 'upstream/main'

f18638e

enable CTE and insert test in CODDTest for DuckDB

5db6e3c

support CODDTest for CockroachDB

dbd8c89

fix some false positive in CODDTest for DuckDB

0a8f9a1

fix the false positive in CODDTest for SQLite

9596426

remove a piece of redundant code

d4e0f8f

remove some redundant log code in CODDTest for CockroachDB

4122731

support CODDTest for TiDB

bc53382

mrigger reviewed Jan 19, 2025


DerZc added 8 commits March 4, 2025 15:45

support mysql

e9f1418

fix a typo

412f69e

add comments for ExpressionBag

2d74254

remove deleteCharAt to simplify code logic

9db8ae0

modify DuckDBConstantWithType to DuckDBTypeCast

9581dbb

remove an internal error from the ignore list

f99cc40

handle conflict in MySQL

6343041

Merge remote-tracking branch 'upstream/main'

3045bad

DerZc mentioned this pull request Mar 22, 2025

Support CODDTest for SQLite #1181

Merged

DerZc mentioned this pull request Apr 30, 2025

Support CODDTest for DuckDB #1224

Open

fm4v mentioned this pull request May 15, 2026

ClickHouse: wrong-result oracle suite (31 oracles), recursive type system, and generator expansion ClickHouse/sqlancer#4

Open

DerZc wants to merge 19 commits into sqlancer:main from DerZc:main

DerZc commented Jan 5, 2025


mrigger left a comment


2 participants
