Indexes & Collections

This guide covers how to define access patterns using indexes, how key generation works, and how to use collections for cross-entity queries.

Entity Primary Key

Entities define their primary key and GSI indexes together. Collections are auto-discovered from entities sharing the same collection name on a GSI.

const Tasks = Entity.make({
  model: Task,
  entityType: "Task",
  primaryKey: {
    pk: { field: "pk", composite: ["taskId"] },
    sk: { field: "sk", composite: [] },
  },
  indexes: {
    byProject: {
      name: "gsi1",
      pk: { field: "gsi1pk", composite: ["projectId"] },
      sk: { field: "gsi1sk", composite: ["priority"] },
    },
    byAssignee: {
      name: "gsi2",
      pk: { field: "gsi2pk", composite: ["employeeId"] },
      sk: { field: "gsi2sk", composite: ["priority"] },
    },
  },
  timestamps: true,
})

Anatomy of a Primary Key

primaryKey: {
  pk: {
    field: "pk",                         // Physical DynamoDB attribute
    composite: ["taskId"],               // Model attributes -> partition key
  },
  sk: {
    field: "sk",
    composite: [],                       // Empty = one item per partition key
  },
}

Property	Type	Required	Description
`pk.field`	`string`	Yes	Physical DynamoDB attribute name
`pk.composite`	`string[]`	Yes	Model attributes composing the partition key
`sk.field`	`string`	Yes	Physical DynamoDB attribute name
`sk.composite`	`string[]`	Yes	Model attributes composing the sort key

Access Patterns via the Primary Index

Every entity exposes a .primary(...) query accessor on the bound client — the primary index is treated symmetrically with GSI indexes. Pass the required PK composites (and optionally one or more SK composites) to return a BoundQuery:

// `Memberships` primary key: pk = orgId, sk = userId
//
// List all items under a shared primary partition key (PK only)
const allMembers = yield* db.entities.Memberships.primary({
  orgId: "org-acme",
}).collect()

// Narrow the partition's results with a filter (here, role = "admin")
const acmeAdmins = yield* db.entities.Memberships.primary({
  orgId: "org-acme",
})
  .filter({ role: "admin" })
  .collect()

This is the natural access pattern for shared-PK single-table designs — for example a join table that holds many channel grants under one accountId. Use .get(fullKey) when you want a strongly-consistent single-item fetch by the full composite key; use .primary(partialKey) when you want a Query that returns every item in the partition (or a prefix-matched subset).
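In raw DynamoDB terms, the distinction above roughly corresponds to GetItem versus Query. The sketch below builds both request shapes as plain objects. It is an illustration under assumptions, not the requests effect-dynamodb actually issues: the table name, the helper names `getItemInput` and `queryInput`, and the placeholder key strings are all hypothetical.

```typescript
// Plain-object sketches of the two DynamoDB request shapes. The field names
// (TableName, Key, ConsistentRead, KeyConditionExpression, ...) are standard
// DynamoDB API parameters; everything else here is hypothetical.
type S = { S: string }

const getItemInput = (table: string, pk: string, sk: string) => ({
  TableName: table,
  Key: { pk: { S: pk }, sk: { S: sk } },
  ConsistentRead: true, // single item, strongly consistent read
})

const queryInput = (table: string, pk: string, skPrefix?: string) => {
  const names: Record<string, string> = { "#pk": "pk" }
  const values: Record<string, S> = { ":pk": { S: pk } }
  let condition = "#pk = :pk"
  if (skPrefix !== undefined) {
    // Partial sort key: prefix match within the partition
    condition += " AND begins_with(#sk, :sk)"
    names["#sk"] = "sk"
    values[":sk"] = { S: skPrefix }
  }
  return {
    TableName: table,
    KeyConditionExpression: condition,
    ExpressionAttributeNames: names,
    ExpressionAttributeValues: values,
  }
}

// Placeholder key strings; real values would come from the key generator.
console.log(getItemInput("app-table", "<composed pk>", "<composed sk>"))
console.log(queryInput("app-table", "<composed pk>"))
console.log(queryInput("app-table", "<composed pk>", "<composed sk prefix>"))
```

The first request can return at most one item. The second and third return every item in the partition, or the prefix-matched subset.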

GSI Access Patterns via Entity Indexes

GSI access patterns are defined directly on the entity via the indexes property. Each index specifies the physical GSI, PK composites, and SK composites — see the task-entity block above for the canonical shape.

Logical Names as API Surface

Logical index names become query accessors on the typed client:

// Logical names become query accessors; no physical index names in code
const alphaTasks = yield* basic.entities.Tasks.byProject({ projectId: "proj-alpha" }).collect()

const aliceTasks = yield* basic.entities.Tasks.byAssignee({ employeeId: "emp-alice" }).collect()

You never reference physical index names (gsi1, gsi2) in application code.

Key Generation

effect-dynamodb generates DynamoDB key values automatically from your composite declarations. You declare which attributes compose each key — the system handles how.

Format

$<schema>#v<version>#<prefix>#<attrName>_<value>#<attrName>_<value>

Each composite value is prefixed with its attribute name (separated by _) and segments are joined with #. Given DynamoSchema({ name: "myapp", version: 1 }) and entityType: "Task":

Composite	Generated Key
`["taskId"]` with value `"t-001"`	`$myapp#v1#task#taskid_t-001`
`["projectId", "status"]` with values `"proj-alpha"`, `"active"`	`$myapp#v1#task#projectid_proj-alpha#status_active`
`[]` (empty)	`$myapp#v1#task`
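The format above can be sketched in a few lines. This is a minimal illustration of the documented layout, not the library's implementation: `composeKey` and `KeyParts` are hypothetical names, and the sketch assumes string-valued composites and the default "lowercase" casing.

```typescript
// Hypothetical helper mirroring the documented key layout; not part of effect-dynamodb.
interface KeyParts {
  schema: string                      // DynamoSchema name, e.g. "myapp"
  version: number                     // DynamoSchema version, e.g. 1
  prefix: string                      // entity type segment, e.g. "Task"
  composite: ReadonlyArray<string>    // attribute names, in declaration order
  values: Readonly<Record<string, string>>
}

const composeKey = ({ schema, version, prefix, composite, values }: KeyParts): string => {
  // Each composite becomes "<attrName>_<value>"
  const segments = composite.map((name) => `${name}_${values[name]}`)
  // Assumption: default "lowercase" casing applied to the whole key
  return [`$${schema}`, `v${version}`, prefix, ...segments].join("#").toLowerCase()
}

const base = { schema: "myapp", version: 1, prefix: "Task" }

console.log(composeKey({ ...base, composite: ["taskId"], values: { taskId: "t-001" } }))
console.log(composeKey({
  ...base,
  composite: ["projectId", "status"],
  values: { projectId: "proj-alpha", status: "active" },
}))
console.log(composeKey({ ...base, composite: [], values: {} }))
```

Run as written, the three calls print the three generated keys from the table.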

Empty Composites

Both pk.composite and sk.composite can be empty arrays. The entity type prefix guarantees no collisions between entity types.

pk	sk	Use Case
`[]`	`[]`	Singleton (config, counters)
`["userId"]`	`[]`	One item per user
`[]`	`["userId"]`	All users in one partition
`["tenantId"]`	`["userId"]`	Users partitioned by tenant

Attribute Serialization

Type	Key Value
`string`	As-is
`number`	Zero-padded
`DateTime.Utc`	ISO 8601
`boolean`	`"true"` / `"false"`
Branded string	Underlying value
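As a rough sketch of the table above, a serializer could look like the following. It is not the library's code. The pad width is an assumption (the actual width is not stated in this guide), the sketch handles only non-negative integers, and a JavaScript `Date` stands in for `DateTime.Utc`.

```typescript
// Hypothetical serializer following the table above.
// Assumption: numbers are non-negative integers padded to PAD_WIDTH digits.
const PAD_WIDTH = 10

type KeyValue = string | number | boolean | Date

const serializeKeyValue = (value: KeyValue): string => {
  if (typeof value === "string") return value            // also covers branded strings at runtime
  if (typeof value === "number") return String(value).padStart(PAD_WIDTH, "0")
  if (typeof value === "boolean") return value ? "true" : "false"
  return value.toISOString()                             // ISO 8601, UTC
}

// Zero-padding keeps string order aligned with numeric order
console.log([10, 9, 100].map(serializeKeyValue).sort())
console.log(serializeKeyValue(new Date(Date.UTC(2024, 0, 15))))
console.log(serializeKeyValue(false))
```

Zero-padding matters because DynamoDB compares string sort keys lexicographically: without it, "10" would sort before "9".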

Casing Resolution

Casing is applied uniformly across the entire generated key — schema name, version prefix, entity type, collection name, and composite attribute values. Resolution order:

Index-level casing (highest priority)
Schema-level casing (default: "lowercase")

This means "Emp-Alice" and "emp-alice" produce identical keys under the default "lowercase" setting. The original attribute value casing is preserved on the stored attribute itself — only the composed key string is cased.

// With casing: "lowercase", entityType: "Employee", employeeId: "Emp-Alice"
// Composed PK:      $myapp#v1#employee#employeeid_emp-alice
// Stored attribute: employeeId = "Emp-Alice"  (original casing preserved)
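The resolution order can be pictured as a simple fallback chain. The sketch below is illustrative only: `resolveCasing` and `applyCasing` are hypothetical helpers, and the "uppercase" and "preserve" options are assumptions made for the example, since only "lowercase" is named in this guide.

```typescript
// Hypothetical casing options; only "lowercase" is confirmed by this guide.
type Casing = "lowercase" | "uppercase" | "preserve"

// Index-level casing wins, then schema-level casing, then the default.
const resolveCasing = (indexCasing?: Casing, schemaCasing?: Casing): Casing =>
  indexCasing ?? schemaCasing ?? "lowercase"

// Casing applies to the composed key string only, never to stored attributes.
const applyCasing = (key: string, casing: Casing): string =>
  casing === "lowercase" ? key.toLowerCase()
  : casing === "uppercase" ? key.toUpperCase()
  : key

const raw = "$myapp#v1#Employee#employeeId_Emp-Alice"

console.log(applyCasing(raw, resolveCasing()))
console.log(applyCasing(raw, resolveCasing("preserve", "lowercase")))
```

With no overrides the key is lowercased in full. With an index-level override, the schema-level setting is ignored for that index.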

Collections

Collections group multiple entity types that share a partition key, enabling cross-entity queries. Two modes are supported: isolated (default) and clustered. Isolated keeps each entity’s sort-key namespace separate so single-entity scans remain efficient; clustered places the collection name above each entity’s sort key so all members are physically interleaved (required for sub-collections).

Isolated Collections

Each entity owns its sort key prefix. Collection queries use only the partition key — no sort key condition.

const IsolatedEmployees = Entity.make({
  model: Employee,
  entityType: "Employee",
  primaryKey: {
    pk: { field: "pk", composite: ["employeeId"] },
    sk: { field: "sk", composite: [] },
  },
  indexes: {
    departmentStaff: {
      collection: "departmentStaff",
      name: "gsi1",
      pk: { field: "gsi1pk", composite: ["department"] },
      sk: { field: "gsi1sk", composite: ["hireDate"] },
    },
  },
  timestamps: true,
})

class Equipment extends Schema.Class<Equipment>("Equipment")({
  equipmentId: Schema.String,
  department: Schema.String,
  name: Schema.String,
  purchaseDate: Schema.String,
}) {}

const Equipments = Entity.make({
  model: Equipment,
  entityType: "Equipment",
  primaryKey: {
    pk: { field: "pk", composite: ["equipmentId"] },
    sk: { field: "sk", composite: [] },
  },
  indexes: {
    departmentStaff: {
      collection: "departmentStaff",
      name: "gsi1",
      pk: { field: "gsi1pk", composite: ["department"] },
      sk: { field: "gsi1sk", composite: ["purchaseDate"] },
    },
  },
  timestamps: true,
})

Sort key format (isolated):

$myapp#v1#employee_1#hiredate_2020-01-15        <- Employee entity owns its SK
$myapp#v1#equipment_1#purchasedate_2023-06-01   <- Equipment entity owns its SK

Query behavior:

Query	Mechanism	Returns
Everything	PK only (no SK condition)	Employee + Equipment
Employees only	PK + `begins_with(SK, "$myapp#v1#employee_1")`	Employee

Best for: High-volume single-entity queries where cross-entity queries are occasional.

Clustered Collections

The collection name sits at the top of every entity’s sort key. All entities share this prefix.

const ClusteredEmployees = Entity.make({
  model: Employee,
  entityType: "Employee",
  primaryKey: {
    pk: { field: "pk", composite: ["employeeId"] },
    sk: { field: "sk", composite: [] },
  },
  indexes: {
    tenantMembers: {
      collection: "tenantMembers",
      name: "gsi1",
      pk: { field: "gsi1pk", composite: ["tenantId"] },
      sk: { field: "gsi1sk", composite: ["department", "hireDate"] },
      type: "clustered",
    },
  },
  timestamps: true,
})

const ClusteredTasks = Entity.make({
  model: Task,
  entityType: "Task",
  primaryKey: {
    pk: { field: "pk", composite: ["taskId"] },
    sk: { field: "sk", composite: [] },
  },
  indexes: {
    tenantMembers: {
      collection: "tenantMembers",
      name: "gsi1",
      pk: { field: "gsi1pk", composite: ["tenantId"] },
      sk: { field: "gsi1sk", composite: ["projectId", "taskId"] },
      type: "clustered",
    },
  },
  timestamps: true,
})

Sort key format (clustered):

$myapp#v1#tenantmembers#employee_1#department_engineering#hiredate_2020-01-15
$myapp#v1#tenantmembers#task_1#projectid_proj-alpha#taskid_t-001
^^^^^^^^^^^^^^^^^^^^^^^
collection prefix (shared)

Query behavior:

Query	Mechanism	Returns
Everything	PK + `begins_with(SK, "$myapp#v1#tenantmembers")`	Employee + Task
Employees only	PK + `begins_with(SK, "$myapp#v1#tenantmembers#employee_1")`	Employee
Tasks only	PK + `begins_with(SK, "$myapp#v1#tenantmembers#task_1")`	Task

Best for: Cross-entity queries, relationship-dense data.

Hierarchical Sub-Collections

True parent/child sub-collections nest the collection hierarchy in the sort key so a begins_with query at the parent level returns the parent’s items and every descendant. Two requirements:

Use the array form on the entity index: collection: ["parent", "child"] (deeper levels add more elements).
Use type: "clustered" so the SK puts the hierarchy above the entity-type prefix.

In the example below, Employee lives at the parent level (["contributions"]) while Task and ProjectMember live one level deeper (["contributions", "assignments"]). All three share gsi2 keyed on employeeId:

// True hierarchical sub-collections via the array form `collection: ["parent", "child"]`
// + `type: "clustered"`. The full hierarchy is written into the SK so a begins_with
// query at the parent level matches every descendant: querying "contributions"
// returns SubEmployee + SubTasks + SubProjectMembers, while "assignments" returns
// only SubTasks + SubProjectMembers.
const SubEmployee = Entity.make({
  model: Employee,
  entityType: "Employee",
  primaryKey: {
    pk: { field: "pk", composite: ["employeeId"] },
    sk: { field: "sk", composite: [] },
  },
  indexes: {
    contributions: {
      collection: ["contributions"],
      type: "clustered",
      name: "gsi2",
      pk: { field: "gsi2pk", composite: ["employeeId"] },
      sk: { field: "gsi2sk", composite: ["department"] },
    },
  },
  timestamps: true,
})

const SubTasks = Entity.make({
  model: Task,
  entityType: "Task",
  primaryKey: {
    pk: { field: "pk", composite: ["taskId"] },
    sk: { field: "sk", composite: [] },
  },
  indexes: {
    assignments: {
      collection: ["contributions", "assignments"],
      type: "clustered",
      name: "gsi2",
      pk: { field: "gsi2pk", composite: ["employeeId"] },
      sk: { field: "gsi2sk", composite: ["projectId", "taskId"] },
    },
  },
  timestamps: true,
})

const SubProjectMembers = Entity.make({
  model: ProjectMember,
  entityType: "ProjectMember",
  primaryKey: {
    pk: { field: "pk", composite: ["employeeId", "projectId"] },
    sk: { field: "sk", composite: [] },
  },
  indexes: {
    assignments: {
      collection: ["contributions", "assignments"],
      type: "clustered",
      name: "gsi2",
      pk: { field: "gsi2pk", composite: ["employeeId"] },
      sk: { field: "gsi2sk", composite: ["projectId"] },
    },
  },
  timestamps: true,
})

SK shape written by each entity:

Entity	SK
`SubEmployee`	`$myapp#v1#contributions#employee_1#department_engineering`
`SubTasks`	`$myapp#v1#contributions#assignments#task_1#projectid_p-α#taskid_t-001`
`SubProjectMembers`	`$myapp#v1#contributions#assignments#projectmember_1#projectid_p-α`

Both descendant SKs start with the parent prefix $myapp#v1#contributions, so a query at the parent level returns all three. The child query uses the longer prefix $myapp#v1#contributions#assignments and returns only the two descendant entities.

Query at any depth:

Collection	begins_with prefix	Returns
`contributions`	`$myapp#v1#contributions`	`Employee` + `Task` + `ProjectMember`
`assignments`	`$myapp#v1#contributions#assignments`	`Task` + `ProjectMember`
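The depth-based matching can be checked with a few lines of plain string logic. This sketch is illustrative: `collectionPrefix` and `entitiesAt` are hypothetical helpers, and the sort key values follow the SK shapes listed above with made-up attribute values.

```typescript
// Hypothetical helper: joins a collection path into a begins_with prefix.
const collectionPrefix = (path: ReadonlyArray<string>): string =>
  ["$myapp", "v1", ...path].join("#")

// Sort keys following the documented SK shapes (illustrative values)
const written = [
  { entity: "SubEmployee", sk: "$myapp#v1#contributions#employee_1#department_engineering" },
  { entity: "SubTasks", sk: "$myapp#v1#contributions#assignments#task_1#projectid_proj-alpha#taskid_t-001" },
  { entity: "SubProjectMembers", sk: "$myapp#v1#contributions#assignments#projectmember_1#projectid_proj-alpha" },
]

// Entities whose SK starts with the prefix for the given collection path
const entitiesAt = (path: ReadonlyArray<string>): string[] =>
  written
    .filter((item) => item.sk.startsWith(collectionPrefix(path)))
    .map((item) => item.entity)

console.log(entitiesAt(["contributions"]))
console.log(entitiesAt(["contributions", "assignments"]))
```

Adding a deeper level only lengthens the prefix, so each level's query is a strict subset of its parent's.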

// Parent: includes everything
const contributions = yield* db.collections
  .contributions({ employeeId: "emp-alice" })
  .collect()
// { SubEmployee: Employee[], SubTasks: Task[], SubProjectMembers: ProjectMember[] }

// Child: only descendants
const assignments = yield* db.collections
  .assignments({ employeeId: "emp-alice" })
  .collect()
// { SubTasks: Task[], SubProjectMembers: ProjectMember[] }

Defining Collections

Collections are defined by adding a collection property to entity indexes. Entities sharing the same collection name on the same physical GSI are automatically grouped into a collection by DynamoClient.make():

// On EmployeeEntity
indexes: {
  tenantMembers: {
    collection: "tenantMembers",
    name: "gsi1",
    pk: { field: "gsi1pk", composite: ["tenantId"] },
    sk: { field: "gsi1sk", composite: ["department", "hireDate"] },
    type: "clustered",
  },
}

// On TaskEntity: same collection name + same GSI
indexes: {
  tenantMembers: {
    collection: "tenantMembers",
    name: "gsi1",
    pk: { field: "gsi1pk", composite: ["tenantId"] },
    sk: { field: "gsi1sk", composite: ["projectId", "taskId"] },
    type: "clustered",
  },
}

When you call DynamoClient.make({ entities, tables }), the client auto-discovers that both entities share the tenantMembers collection and exposes db.collections.tenantMembers(...) as a cross-entity query accessor.
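Conceptually, auto-discovery amounts to grouping index declarations by physical GSI and collection name. The sketch below shows that grouping under stated assumptions. It does not reflect how `DynamoClient.make()` is implemented: `IndexDecl` and `discoverCollections` are hypothetical, and registering an entity at every level of an array-form path is an assumption chosen to match the sub-collection behavior described earlier.

```typescript
// Hypothetical, simplified view of one entity index declaration
interface IndexDecl {
  entity: string
  gsi: string
  collection: string | ReadonlyArray<string>
}

// Group entity names by "<gsi>:<collection>". With the array form, an entity is
// registered at every level of its path, so parent collections include descendants.
const discoverCollections = (decls: ReadonlyArray<IndexDecl>): Map<string, string[]> => {
  const found = new Map<string, string[]>()
  for (const decl of decls) {
    const path = typeof decl.collection === "string" ? [decl.collection] : decl.collection
    for (const name of path) {
      const key = `${decl.gsi}:${name}`
      found.set(key, [...(found.get(key) ?? []), decl.entity])
    }
  }
  return found
}

const discovered = discoverCollections([
  { entity: "Employee", gsi: "gsi1", collection: "tenantMembers" },
  { entity: "Task", gsi: "gsi1", collection: "tenantMembers" },
  { entity: "Task", gsi: "gsi2", collection: ["contributions", "assignments"] },
])

console.log(discovered.get("gsi1:tenantMembers"))
console.log(discovered.get("gsi2:contributions"))
```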

Validation Rules

All entities in a collection must share the same PK composite on that index
All entities sharing an index must agree on the type (no mixing isolated/clustered)
Sub-collections sharing an index must use the same PK and SK fields
No duplicate entity types within a single collection
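To make the rules concrete, here is a minimal checker for three of them: shared PK composite, consistent type, and no duplicate entity types. It is a sketch, not the library's validation code. `Member` and `validateCollection` are hypothetical names, the error messages are invented, and the sub-collection field rule is left out for brevity.

```typescript
// Hypothetical shape of one entity's membership in a collection
interface Member {
  entityType: string
  type: "isolated" | "clustered"
  pkComposite: ReadonlyArray<string>
}

// Returns human-readable violations; an empty array means the collection is valid.
const validateCollection = (members: ReadonlyArray<Member>): string[] => {
  const errors: string[] = []
  const first = members[0]
  if (first === undefined) return errors
  const seen = new Set<string>()
  for (const m of members) {
    if (m.pkComposite.join(",") !== first.pkComposite.join(","))
      errors.push(`${m.entityType}: PK composite differs from ${first.entityType}`)
    if (m.type !== first.type)
      errors.push(`${m.entityType}: mixes ${m.type} with ${first.type}`)
    if (seen.has(m.entityType))
      errors.push(`${m.entityType}: duplicate entity type`)
    seen.add(m.entityType)
  }
  return errors
}

// Valid: same PK composite, same type, distinct entity types
console.log(validateCollection([
  { entityType: "Employee", type: "clustered", pkComposite: ["tenantId"] },
  { entityType: "Task", type: "clustered", pkComposite: ["tenantId"] },
]))

// Invalid: Task disagrees on both the PK composite and the type
console.log(validateCollection([
  { entityType: "Employee", type: "clustered", pkComposite: ["tenantId"] },
  { entityType: "Task", type: "isolated", pkComposite: ["projectId"] },
]))
```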

Worked Example: Multi-Tenant Project Management

Access Patterns

Pattern	Collection	PK	SK
Get employee by ID	primary	`employeeId`	—
Get task by ID	primary	`taskId`	—
Employees in tenant	TenantMembers (gsi1)	`tenantId`	`department`, `hireDate`
Tasks in tenant	TenantMembers (gsi1)	`tenantId`	`projectId`, `taskId`
All tenant items	TenantMembers (gsi1)	`tenantId`	—
Employee by email	ByEmail (gsi2)	`email`	—
Tasks by assignee	ByAssignee (gsi2)	`employeeId`	`priority`

DynamoDB Items

Each composite value in a key is prefixed with its attribute name (e.g., employeeid_emp-alice). Structural parts (schema, entity type, collection) are cased according to the schema/index casing rules.

pk	sk	gsi1pk	gsi1sk	edd_e
`$myapp#v1#employee#employeeid_emp-alice`	`$myapp#v1#employee`	`$myapp#v1#tenantmembers#tenantid_t-acme`	`$myapp#v1#tenantmembers#employee_1#department_engineering#hiredate_2024-01-15`	Employee
`$myapp#v1#task#taskid_t-001`	`$myapp#v1#task`	`$myapp#v1#tenantmembers#tenantid_t-acme`	`$myapp#v1#tenantmembers#task_1#projectid_proj-alpha#taskid_t-001`	Task
`$myapp#v1#employee#employeeid_emp-bob`	`$myapp#v1#employee`	`$myapp#v1#tenantmembers#tenantid_t-acme`	`$myapp#v1#tenantmembers#employee_1#department_sales#hiredate_2023-06-01`	Employee

What’s Next?

indexPolicy — Per-Half GSI Membership Rules — Per-half sparse/preserve declaration, per-half evaluation gate, structural composition with the unified per-half can’t-compose rule, set/remove asymmetry, per-half cascade, EDD-9025 invariant
Queries — Query by index, filter by sort key composites, paginate results
Data Integrity — Unique constraints and optimistic concurrency
Lifecycle — Soft delete and version retention