name: adding-personhog-rpc description: > Guide for adding a new RPC to personhog-replica and personhog-router. Covers eligibility checks, proto definition, code generation for Python and Node.js clients, Rust implementation (storage trait, postgres queries, service handler, router wiring), and index compatibility validation. Use when adding a new gRPC endpoint to personhog, migrating a Django ORM query to personhog, or extending the personhog service API.
Adding a personhog RPC
This skill walks through adding a new RPC end-to-end: proto definition, code generation, Rust implementation, and client updates.
Before you start: eligibility check
Personhog serves person, distinct ID, group, group type mapping, cohort membership, and feature flag hash key override data. If the data being accessed doesn't live in one of these tables, this RPC doesn't belong in personhog:
| Table | Data category | Routing |
|---|---|---|
posthog_person |
PersonData | Reads: replica (eventual) or leader (strong). Writes: leader |
posthog_persondistinctid |
PersonData | Same as person |
posthog_group |
NonPersonData | All ops: replica |
posthog_grouptypemapping |
NonPersonData | All ops: replica |
posthog_cohortpeople |
NonPersonData | All ops: replica |
posthog_featureflaghashkeyoverride |
NonPersonData | All ops: replica |
posthog_personoverride |
PersonData | Reads/writes follow person routing |
posthog_personlessdistinctid |
PersonData | Same as person |
If the table is not listed above, stop — this data should not go through personhog.
Step 0: design the data access pattern
Before writing any proto, figure out the SQL query you need. Then validate it against the available indexes — see references/database-indexes.md.
Key questions:
- What table(s) does this query hit?
- Does the WHERE clause match an existing index? Every query must be an index scan, never a sequential scan.
- Is this a read or write? This determines routing (see table above).
- For reads: does the caller need strong consistency (primary pool) or is eventual (replica pool) acceptable?
- For batch lookups: is the batch within a single team or cross-team?
Step 1: define proto messages and RPC
All proto files live in proto/personhog/.
See references/proto-conventions.md for message conventions and a worked example.
Where to add what
- Message types →
proto/personhog/types/v1/<domain>.proto(person.proto, group.proto, cohort.proto, feature_flag.proto, or common.proto) - Service RPC →
proto/personhog/service/v1/service.proto(the public API clients call) - Replica RPC →
proto/personhog/replica/v1/replica.proto(the internal API the router delegates to) - Leader RPC →
proto/personhog/leader/v1/leader.proto(only if this is a person-data write routed to leader)
The service and replica protos must both declare the RPC with identical signature. The router delegates from service → replica (or leader) transparently.
Step 2: generate client stubs
Python
bin/generate_personhog_proto.sh
Then update three files:
posthog/personhog_client/proto/__init__.py— add re-exports for new request/response message typesposthog/personhog_client/client.py— add a wrapper method matching the pattern of existing methodsposthog/personhog_client/fake_client.py— implement the method for test use
Node.js
cd nodejs && pnpm run generate:personhog-proto
Then update:
nodejs/src/ingestion/personhog/client.ts— add a wrapper method matching the pattern of existing methodsnodejs/src/ingestion/personhog/client.test.ts— add a default stub toSERVICE_DEFAULTSfor the new RPC
Rust
No generation step needed — tonic regenerates on cargo build.
But you must implement the RPC (next step), or the build will fail.
Step 3: implement in Rust
The compiler guides you — once the proto is defined, cargo build errors tell you exactly which trait methods are missing.
3a. Storage layer (personhog-replica)
- Add a trait method in
rust/personhog-replica/src/storage/traits/<domain>.rs - Implement the query in
rust/personhog-replica/src/storage/postgres/<domain>.rs- Use
sqlx::query_as!orsqlx::query!macros - Add timing instrumentation via
DB_QUERY_DURATIONandDB_ROWS_RETURNEDmetrics - Use
self.replica_poolfor reads,self.primary_poolfor writes - Return early for empty batch inputs
- Use
- Add storage tests in
rust/personhog-replica/tests/storage_tests.rs
3b. Service layer (personhog-replica)
- Add the RPC handler in
rust/personhog-replica/src/service/mod.rs- Extract fields from the proto request
- Call the storage trait method
- Convert storage results to proto responses
- Map storage errors to tonic
Statuscodes
- Add service tests in
rust/personhog-replica/tests/service_tests.rs
3c. Router wiring (personhog-router)
- Add the method to
rust/personhog-router/src/router/mod.rs- Use the
route_requestfunction (imported fromrouting.rs) with the correctDataCategoryandOperationType - Call the replica (or leader) backend
- Use the
call_backend!macro for instrumentation
- Use the
- Add the service impl to
rust/personhog-router/src/service/mod.rs- Invoke the
route_request!macro (defined at the top of this file) to delegate to the router
- Invoke the
- Add to the backend trait in
rust/personhog-router/src/backend/mod.rsand implement inreplica.rs - Add router tests in
rust/personhog-router/tests/
Use rstest parameterized tests where multiple variations of the same behavior are being tested.
Step 4: verify
cargo build -p personhog-proto
cargo build -p personhog-replica
cargo build -p personhog-router
cargo test -p personhog-replica
cargo test -p personhog-router
Checklist
- Query uses an existing index (no seq scans)
- Proto messages added to
types/v1/<domain>.proto - RPC added to both
service.protoandreplica.proto(andleader.protoif needed) - Python stubs generated,
proto/__init__.pyupdated,client.pymethod added,fake_client.pyupdated - Node.js stubs generated,
client.test.tsSERVICE_DEFAULTS updated - Rust storage trait + postgres impl + service handler + router wiring all implemented
- Tests added at storage, service, and router layers
-
cargo buildandcargo testpass for all three crates