Add lazy round-trip benchmark (case 09) by ghostiee-11 · Pull Request #178 · alxmrs/xarray-sql

ghostiee-11 · 2026-06-24T18:05:32Z

For #177 , Adds 09_lazy_roundtrip.py: the same air[t0] slab is 1,325 rows / ~0.7 MB via lazy to_dataset(chunks={"time":1}) + .sel(time=t0) (one WHERE pushed down) vs 3.86M rows / ~161 MB eager, all asserted equal to the xarray reference.
Also hardens _harness.py (grid-coverage guard in assert_grid_close, tracemalloc finally in measured); runs green locally via uv.

Add benchmarks/geospatial/09_lazy_roundtrip.py showing the SQL to xarray round-trip (to_dataset) is lazy: a chunked to_dataset plus .sel(time=t0) pushes a single WHERE into SQL (1,325 vs 3,869,000 rows; ~0.7 vs ~161 MB) and asserts equal to the xarray reference. Also harden _harness.py: assert_grid_close fails on a partial grid, and measured() stops tracemalloc in a finally. Document case 09 in the suite README.

alxmrs · 2026-06-24T19:55:30Z

+#
+# [tool.uv.sources]
+# xarray-sql = { path = "../../", editable = true }
+# ///


I see this as a good unit test or property that cross cuts all the other geo benchmarks, but I don't think it alone makes for a good benchmark example.

Okay, makes sense

alxmrs

A few other notes. If you removed case 09 and we got to the bottom of the fixes, I'd be happy to merge this.

alxmrs · 2026-06-24T20:21:35Z

        t0 = time.perf_counter()
-        yield
-        elapsed = time.perf_counter() - t0
-        _, peak = tracemalloc.get_traced_memory()
-        tracemalloc.stop()
+        try:
+            yield
+        finally:
+            elapsed = time.perf_counter() - t0
+            _, peak = tracemalloc.get_traced_memory()
+            tracemalloc.stop()


This fix seems like a good idea

alxmrs · 2026-06-24T20:23:45Z

+    short = {
+        d: (got.sizes[d], ref.sizes[d])
+        for d in ref.dims
+        if d in got.sizes and got.sizes[d] != ref.sizes[d]
+    }
+    if short:
+        raise AssertionError(
+            f"{name}: SQL result does not cover the reference grid "
+            f"(dim: got vs ref = {short}); the comparison would be partial"
+        )


I'm surprised that Xarray's all close doesn't cover this case. Are you sure this is necessary?

good catch, you're right allclose would catch it on its own. it's the reindex_like(got) one line up that hides it: it shrinks ref down to got's coords first, so a result missing cells still passes on the subset.

got = ref.isel(lat=[0, 1, 2]) # 2 cells dropped xr.testing.assert_allclose(got, ref.reindex_like(got)) # passes, silently xr.testing.assert_allclose(got, ref) # raises

so the guard just restores the check reindex_like removes. could also drop reindex_like for xr.align(..., join="exact"), but that line handles label ordering so the guard felt smaller. either works.

ghostiee-11 mentioned this pull request Jun 24, 2026

Geospatial SQL benchmark suite: prove core geo/climate ops are relational #177

Open

alxmrs reviewed Jun 24, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add lazy round-trip benchmark (case 09)#178

Add lazy round-trip benchmark (case 09)#178
ghostiee-11 wants to merge 1 commit into
alxmrs:geospatial-sql-benchmarksfrom
ghostiee-11:geobench-lazy-roundtrip

ghostiee-11 commented Jun 24, 2026 •

edited

Loading

Uh oh!

alxmrs Jun 24, 2026

Uh oh!

ghostiee-11 Jun 24, 2026

Uh oh!

alxmrs left a comment

Uh oh!

alxmrs Jun 24, 2026

Uh oh!

alxmrs Jun 24, 2026

Uh oh!

ghostiee-11 Jun 24, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

ghostiee-11 commented Jun 24, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

alxmrs Jun 24, 2026

Choose a reason for hiding this comment

Uh oh!

ghostiee-11 Jun 24, 2026

Choose a reason for hiding this comment

Uh oh!

alxmrs left a comment

Choose a reason for hiding this comment

Uh oh!

alxmrs Jun 24, 2026

Choose a reason for hiding this comment

Uh oh!

alxmrs Jun 24, 2026

Choose a reason for hiding this comment

Uh oh!

ghostiee-11 Jun 24, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

ghostiee-11 commented Jun 24, 2026 •

edited

Loading