name: ecto-thinking description: Use when the user works with Ecto, database schemas, changesets, migrations, Repo operations, preloading, query composition, contexts/bounded-contexts, Ecto.Multi, or multi-tenancy. Covers context design and cross-context patterns. Do NOT use for Ash.Resource definitions (use ash-thinking) or Phoenix LiveView (use phoenix-thinking).
Ecto Thinking
Mental shifts for Ecto and data layer design. These insights challenge typical ORM patterns.
Iron Law: NEVER Use :float for Money
Floats can't represent decimal cents exactly (0.1 + 0.2 != 0.3). Balances drift, reconciliation fails, audits find pennies that don't exist.
# BAD — silent corruption
add :price, :float
# GOOD — exact decimal (Postgres numeric)
add :price, :decimal, precision: 19, scale: 4
# OR — integer cents, format at the edge
add :price_cents, :integer
Arithmetic on :decimal MUST use the Decimal module, never +/*:
# BAD
total = price + tax # works on floats, breaks on Decimal
# GOOD
total = Decimal.add(price, tax)
Never cast user-supplied money through a float on the way in — keep it :decimal (or integer cents) end to end.
Tidewave MCP: Database at Your Fingertips
If Tidewave MCP tools are available, use them for all database and schema work:
| Task | Tidewave Tool |
|---|---|
| List all schemas in the project | get_ecto_schemas — discovers all modules with __changeset__/0, shows file paths |
| Run a query against the database | execute_sql_query — executes SQL through your Ecto repo (supports multiple repos, parameterized queries, 50-row limit) |
| Test a changeset or Repo operation | project_eval — run Repo.all(Post), test changesets, verify migrations in the running app |
| Check Ecto docs for your version | get_docs — e.g., get_docs("Ecto.Changeset.cast/4") for your exact Ecto version |
Prefer execute_sql_query over manual psql — it uses your app's Ecto connection, respects your repo config, and works with any database adapter.
Prefer project_eval over writing test scripts — test Repo.aggregate, changeset pipelines, or Ecto.Multi operations directly in the running app.
Not installed? → mix igniter.install tidewave then claude mcp add --transport http tidewave http://localhost:4000/tidewave/mcp
Context = Setting That Changes Meaning
Context isn't just a namespace—it changes what words mean. "Product" means different things in Checkout (SKU, name), Billing (SKU, cost), and Fulfillment (SKU, warehouse). Each bounded context may have its OWN Product schema/table.
Think top-down: Subdomain → Context → Entity. Not "What context does Product belong to?" but "What is a Product in this business domain?"
Cross-Context References: IDs, Not Associations
schema "cart_items" do
field :product_id, :integer # Reference by ID
# NOT: belongs_to :product, Catalog.Product
end
Query through the context, not across associations. Keeps contexts independent and testable.
DDD Patterns as Pipelines
def create_product(params) do
params
|> Products.build() # Factory: unstructured → domain
|> Products.validate() # Aggregate: enforce invariants
|> Products.insert() # Repository: persist
end
Use events (as data structs) to compose bounded contexts with minimal coupling.
Schema ≠ Database Table
| Use Case | Approach |
|---|---|
| Database table | Standard schema/2 |
| Form validation only | embedded_schema/1 |
| API request/response | Embedded schema or schemaless |
Multiple Changesets per Schema
def registration_changeset(user, attrs) # Full validation + password
def profile_changeset(user, attrs) # Name, bio only
def admin_changeset(user, attrs) # Role, verified_at
Different operations = different changesets.
Multi-Tenancy: Composite Foreign Keys
add :post_id, references(:posts, with: [org_id: :org_id], match: :full)
Use prepare_query/3 for automatic scoping. Raise if org_id missing.
Preload vs Join Trade-offs
| Approach | Best For |
|---|---|
| Separate preloads | Has-many with many records (less memory) |
| Join preloads | Belongs-to, has-one (single query) |
Join preloads can use 10x more memory for has-many.
CRUD Contexts Are Fine
"If you have a CRUD bounded context, go for it. No need to add complexity."
Use generators for simple cases. Add DDD patterns only when business logic demands it.
Gotchas from Core Team
CTE Queries Don't Inherit Schema Prefix
In multi-tenant apps, CTEs don't get the parent query's prefix.
Fix: Explicitly set prefix: %{recursive_query | prefix: "tenant"}
Parameterized Queries ≠ Prepared Statements
- Parameterized queries:
WHERE id = $1— always used by Ecto - Prepared statements: Query plan cached by name — can be disabled
pgbouncer: Use prepare: :unnamed (disables prepared statements, keeps parameterized queries).
pool_count vs pool_size
More pools with fewer connections = better for benchmarks. But with mixed fast/slow queries, a single larger pool gives better latency.
Rule: pool_count for uniform workloads, larger pool_size for real apps.
Sandbox Mode Doesn't Work With External Processes
Cachex, separate GenServers, or anything outside the test process won't share the sandbox transaction.
Fix: Make the external service use the test process, or accept it's not in the same transaction.
Null Bytes Crash Postgres
PostgreSQL rejects null bytes even though they're valid UTF-8.
Fix: Sanitize at boundaries: String.replace(string, "\x00", "")
No Implicit Cross Joins
from(a in A, b in B, ...) with no on: or where-correlation is a Cartesian product — every row of A × every row of B.
# BAD — accidental cross join (every a paired with every b)
from a in Author,
b in Book,
select: {a.name, b.title}
# GOOD — explicit, correlated join
from a in Author,
join: b in Book, on: b.author_id == a.id,
select: {a.name, b.title}
Always express joins with join: + on:. Never list a second source in the from and rely on a where clause to "fix" it later.
Dedup Shared Children Before cast_assoc
When multiple parents reference the SAME child data (e.g. shared tags), passing duplicate child params inserts duplicates or trips a unique constraint.
Fix: dedup child params before the changeset, or persist children up front with Repo.insert + on_conflict: / Ecto.Multi, then associate by ID. cast_assoc assumes each param is a distinct row.
preload_order for Association Sorting
has_many :comments, Comment, preload_order: [desc: :inserted_at]
Note: Doesn't work for through associations.
Runtime Migrations Use List API
Ecto.Migrator.run(Repo, [{0, Migration1}, {1, Migration2}], :up, opts)
Idioms
- Prefer
Repo.insert/1overRepo.insert!/1—handle{:ok, _}/{:error, _}explicitly - Use
Repo.transact/1(Ecto 3.12+) for simple transactions instead ofEcto.Multi
Red Flags - STOP and Reconsider
- belongs_to pointing to another context's schema
- Single changeset for all operations
- Preloading has-many with join
- CTEs in multi-tenant apps without explicit prefix
- Using pgbouncer without
prepare: :unnamed - Testing with Cachex/GenServers assuming sandbox shares transactions
- Accepting user input without null byte sanitization
:float(or float arithmetic) anywhere near money- A
fromlisting two sources with no correlatingon:/where (cross join) cast_assocover child params that contain duplicate shared records
Any of these? Re-read the Gotchas section.