Learn mode

Mode disclosure

All modes use one coherent workspace; only disclosure and guidance change. Learn mode keeps theory, concept names, full schema help, progressive hints, and solution review available.

GROUP BY and Report Grain / write query

M14-A04 - Write - calculate a metric by two dimensions

M14-A04 - Write - calculate a metric by two dimensions. Create one summary row per requested dimension and predict output grain.

Result grain: one revenue row per order status and product category combination
Exact columns: status; category; revenue

Engine cold - Draft idle

Unchanged since last Run/CheckNo Run or Check yet

14px

Loading editor

Scenario

Create grouped reports at the requested grain: predict row meaning before grouping, group by every selected dimension, and avoid accidental extra grouping columns.

GROUP BY and Report Grain / write query

One-sentence task

M14-A04 - Write - calculate a metric by two dimensions. Create one summary row per requested dimension and predict output grain.

Learn mode disclosure

Theory, concept names, full schema help, and progressive hints are available.

Structured output contract

Result grain: one revenue row per order status and product category combination
Exact columns: status; category; revenue
Source population: Use the prompt setup plus FROM, JOIN, WHERE, and subquery predicates as the source population. Visible rows are only examples.
Grouping: Group only at the requested output grain: one revenue row per order status and product category combination.
Ordering: order by status then category nulls last
Validation: select-only; hidden deterministic variants.

Relevant tables

order_items orders products

Time and difficulty

Estimated time: 8 minutes
Difficulty: 3/5

Objective and concepts

State the requested SQL output contract for group by and report grain using source grain, columns, ordering, and edge-case evidence.

Glossary links

Concept material

SQL Trail treats every query as an evidence trail: identify source grain, transform rows deliberately, then compare output to a shared contract.

A passing query must handle hidden nulls, ties, boundaries, and no-match rows when the contract makes them relevant.

Syntax card

SELECT <requested_columns>
FROM <source_table>
WHERE <source_population_filter>
GROUP BY <result_grain_columns>
ORDER BY <deterministic_tie_breakers>;

<requested_columns> means the exact output columns, aliases, and order from the visible contract.
<source_population_filter> means the row population definition, not a copied visible-row value.
<deterministic_tie_breakers> means all ordering and tie rules needed for repeatable output.

Why this works

GROUP BY status changes the row meaning from individual orders to one summary row per distinct status value.

Edge cases

Hidden variants preserve nulls, ties, duplicates, boundaries, no-match rows, and alternate row order when those risks apply.

PostgreSQL note

The local engine uses PostgreSQL-compatible syntax, including explicit NULL predicates, deterministic ORDER BY clauses, and transactional grading.

Worked example

SELECT status, 'one row per order status' AS row_meaning, COUNT(*)::int AS order_count FROM orders GROUP BY status ORDER BY status;

Assumptions, dialect notes, and common traps

Duplicate policy: Preserve duplicate facts unless the prompt explicitly asks for distinct tuples or set semantics.
Null policy: Preserve NULL, empty string, zero, and false as distinct values unless the contract says to display a fallback.
Tie-breakers: Use every ordering rule in the contract and end tied business metrics with deterministic secondary keys when needed.
Zero-related entities: Do not invent zero rows unless the contract asks for preserved parents, missing entities, or complete periods.
Numeric tolerance: Round only at the requested final stage; hidden checks use the contract precision rather than visible formatting luck.

PostgreSQL-compatible local checks

Queries run in a local PGlite worker with PostgreSQL-style syntax and transactional grading.

Grouped rows have a new meaning: A grouped result is read as if each output row still represented one source order or product. Repair: State the grouped row grain before writing or interpreting aggregate values.
Every selected dimension defines grain: An extra selected column is added to GROUP BY and fragments the report into too many rows. Repair: Keep only the requested dimensions in SELECT and GROUP BY.
GROUP BY is not DISTINCT with math: The grouping column is treated as duplicate suppression instead of the row promise for aggregate metrics. Repair: Use GROUP BY to define one output row per dimension value, then aggregate all input rows inside each group.
Grouped reports still need deterministic order: Rows tied on metrics or containing NULL dimensions move around across hidden variants. Repair: Order by the grouped dimensions and state NULLS FIRST or NULLS LAST when null groups can appear.

Opened hints

No hints opened yet.

Learn mode

Mode disclosure

All modes use one coherent workspace; only disclosure and guidance change. Learn mode keeps theory, concept names, full schema help, progressive hints, and solution review available.

GROUP BY and Report Grain / write query

M14-A04 - Write - calculate a metric by two dimensions

M14-A04 - Write - calculate a metric by two dimensions. Create one summary row per requested dimension and predict output grain.

Result grain: one revenue row per order status and product category combination
Exact columns: status; category; revenue

Engine cold - Draft idle

Unchanged since last Run/CheckNo Run or Check yet

14px

Loading editor

Scenario

Create grouped reports at the requested grain: predict row meaning before grouping, group by every selected dimension, and avoid accidental extra grouping columns.

GROUP BY and Report Grain / write query

One-sentence task

M14-A04 - Write - calculate a metric by two dimensions. Create one summary row per requested dimension and predict output grain.

Learn mode disclosure

Theory, concept names, full schema help, and progressive hints are available.

Structured output contract

Result grain: one revenue row per order status and product category combination
Exact columns: status; category; revenue
Source population: Use the prompt setup plus FROM, JOIN, WHERE, and subquery predicates as the source population. Visible rows are only examples.
Grouping: Group only at the requested output grain: one revenue row per order status and product category combination.
Ordering: order by status then category nulls last
Validation: select-only; hidden deterministic variants.

Relevant tables

order_items orders products

Time and difficulty

Estimated time: 8 minutes
Difficulty: 3/5

Objective and concepts

State the requested SQL output contract for group by and report grain using source grain, columns, ordering, and edge-case evidence.

Glossary links

Concept material

SQL Trail treats every query as an evidence trail: identify source grain, transform rows deliberately, then compare output to a shared contract.

A passing query must handle hidden nulls, ties, boundaries, and no-match rows when the contract makes them relevant.

Syntax card

SELECT <requested_columns>
FROM <source_table>
WHERE <source_population_filter>
GROUP BY <result_grain_columns>
ORDER BY <deterministic_tie_breakers>;

<requested_columns> means the exact output columns, aliases, and order from the visible contract.
<source_population_filter> means the row population definition, not a copied visible-row value.
<deterministic_tie_breakers> means all ordering and tie rules needed for repeatable output.

Why this works

GROUP BY status changes the row meaning from individual orders to one summary row per distinct status value.

Edge cases

Hidden variants preserve nulls, ties, duplicates, boundaries, no-match rows, and alternate row order when those risks apply.

PostgreSQL note

The local engine uses PostgreSQL-compatible syntax, including explicit NULL predicates, deterministic ORDER BY clauses, and transactional grading.

Worked example

SELECT status, 'one row per order status' AS row_meaning, COUNT(*)::int AS order_count FROM orders GROUP BY status ORDER BY status;

Assumptions, dialect notes, and common traps

Duplicate policy: Preserve duplicate facts unless the prompt explicitly asks for distinct tuples or set semantics.
Null policy: Preserve NULL, empty string, zero, and false as distinct values unless the contract says to display a fallback.
Tie-breakers: Use every ordering rule in the contract and end tied business metrics with deterministic secondary keys when needed.
Zero-related entities: Do not invent zero rows unless the contract asks for preserved parents, missing entities, or complete periods.
Numeric tolerance: Round only at the requested final stage; hidden checks use the contract precision rather than visible formatting luck.

PostgreSQL-compatible local checks

Queries run in a local PGlite worker with PostgreSQL-style syntax and transactional grading.

Grouped rows have a new meaning: A grouped result is read as if each output row still represented one source order or product. Repair: State the grouped row grain before writing or interpreting aggregate values.
Every selected dimension defines grain: An extra selected column is added to GROUP BY and fragments the report into too many rows. Repair: Keep only the requested dimensions in SELECT and GROUP BY.
GROUP BY is not DISTINCT with math: The grouping column is treated as duplicate suppression instead of the row promise for aggregate metrics. Repair: Use GROUP BY to define one output row per dimension value, then aggregate all input rows inside each group.
Grouped reports still need deterministic order: Rows tied on metrics or containing NULL dimensions move around across hidden variants. Repair: Order by the grouped dimensions and state NULLS FIRST or NULLS LAST when null groups can appear.

Opened hints

No hints opened yet.

customer_id	customer_name	city	loyalty_tier	created_at
1	Avery Stone	Austin	gold	2025-12-01T00:00:00.000Z
2	Jordan Lee	Chicago	NULL	2026-01-03T00:00:00.000Z
3	Casey Quinn	Austin	silver	2026-01-04T00:00:00.000Z
4	Morgan Park	Seattle	gold	2026-01-09T00:00:00.000Z
5	Jordan Lee	Denver	NULL	2026-01-10T00:00:00.000Z

order_item_id	order_id	product_id	quantity	unit_price
1000	100	1	2	6.50
1001	100	2	1	14.00
1002	101	3	4	4.25
1003	102	1	1	6.50
1004	103	4	1	28.00

order_id	customer_id	staff_id	ordered_at	status	coupon_code
100	1	1	2026-01-02T15:15:00.000Z	completed	WELCOME
101	2	1	2026-01-05T20:30:00.000Z	completed	NULL
102	1	2	2026-01-05T20:30:00.000Z	returned	NULL
103	4	2	2026-02-01T14:00:00.000Z	pending	GEAR10

product_id	product_name	category	price	discontinued	stock_count	restock_date
1	Trail Mix	Snacks	6.50	false	18	2026-01-15T00:00:00.000Z
2	Camp Mug	Gear	14.00	false	8	2026-01-20T00:00:00.000Z
3	Notebook	Stationery	4.25	false	0	NULL
4	Lantern	Gear	28.00	true	3	2026-02-01T00:00:00.000Z
5	Tea Tin	NULL	9.75	false	11	2026-01-31T00:00:00.000Z

staff_id	staff_name	role
1	Riley Chen	floor
2	Sam Patel	support

Mode disclosure

M14-A04 - Write - calculate a metric by two dimensions

Scenario

One-sentence task

Structured output contract

Relevant tables

Time and difficulty

Objective and concepts

Glossary links

Assumptions, dialect notes, and common traps

Opened hints

Results

Feedback

Solution Studio

Hints

Grain Guard

Query X-Ray

Statement type

Clauses

Sources

Aliases

Output expressions

Aggregate usage

Window usage

Grouping dimensions

Order keys

Frame status

Logical processing order

Join X-Ray

Input row counts

Solution Studio

Mode disclosure

M14-A04 - Write - calculate a metric by two dimensions

Scenario

One-sentence task

Structured output contract

Relevant tables

Time and difficulty

Objective and concepts

Glossary links

Assumptions, dialect notes, and common traps

Opened hints

Results

Feedback

Solution Studio

Hints

Grain Guard

Query X-Ray

Statement type

Clauses

Sources

Aliases

Output expressions

Aggregate usage

Window usage

Grouping dimensions

Order keys

Frame status

Logical processing order

Join X-Ray

Input row counts

Solution Studio