A progressive data-assembly framework for Python in Clean Architecture style — adopt each layer as you need it.
Requirements: Python 3.10+, Pydantic v2
pydantic-resolve is a complete framework for defining and assembling your data layer.
- Define entities and relationships — Pydantic models + ER Diagram as the single source of truth.
- Assemble response trees —
resolve_*/post_*+ batch loaders, recursive and N+1-safe. - Expose the same graph to REST, GraphQL, and AI agents (MCP) without rewriting.
These are progressive layers, not a package deal — most users stay on resolve_* and post_* forever, reach for ER Diagram only when relationships start repeating, and add GraphQL/MCP only when those surfaces are actually needed.
from typing import Optional
from pydantic import BaseModel
from pydantic_resolve import Loader, Resolver
class TaskView(BaseModel):
id: int
title: str
owner_id: int
owner: Optional[UserView] = None
def resolve_owner(self, loader=Loader(user_loader)):
return loader.load(self.owner_id)
tasks = await Resolver().resolve(tasks) # one query, no N+1The snippet above is the assembly step — describe what's missing, the framework fetches it. ER Diagram and GraphQL/MCP integrations build on top of the same model graph.
In most FastAPI projects, you define SQLAlchemy ORM models first, then create Pydantic schemas that mirror them. As the project grows, data-assembly logic ends up scattered across Repository / Service / Route:
@router.get("/tasks")
async def get_tasks():
tasks = await task_service.get_tasks()
user_ids = list({t.owner_id for t in tasks})
users = await user_service.get_users_by_ids(user_ids)
user_map = {u.id: u for u in users}
return [
TaskResponse(**{**t.model_dump(), 'owner': user_map.get(t.owner_id)})
for t in tasks
]This pattern couples your API contract to the database layout and gives business logic no stable home. pydantic-resolve provides that home. For the full architectural analysis, see Clean Architecture for Python.
pip install pydantic-resolve
pip install pydantic-resolve[mcp] # with MCP supportThroughout the Quick Start, we build one API:
Sprinthas manyTaskTaskhas oneowner(aUser)- The API also needs derived fields like
task_countandcontributors
Each step adds one concept on top of the previous code.
Every response model has some fields already filled (from the database, from user input) and some fields that need to be fetched separately. resolve_* is how you declare those missing fields.
from typing import Optional
from pydantic import BaseModel
from pydantic_resolve import Loader, Resolver, build_object
class UserView(BaseModel):
id: int
name: str
async def user_loader(user_ids: list[int]):
users = await db.query(User).filter(User.id.in_(user_ids)).all()
return build_object(users, user_ids, lambda user: user.id)
class TaskView(BaseModel):
id: int
title: str
owner_id: int
owner: Optional[UserView] = None
def resolve_owner(self, loader=Loader(user_loader)):
return loader.load(self.owner_id)
tasks = [TaskView.model_validate(task) for task in raw_tasks]
tasks = await Resolver().resolve(tasks)A useful mental model: resolve_* means "this field needs data from outside the current node." The framework collects every loader.load(...) call across the tree, batches one query per loader, and maps the results back.
Real APIs rarely have just one relationship. When Sprint contains many Tasks, and each Task already knows how to load its owner, the resolver walks the tree and batch-loads everything recursively.
from typing import List
from pydantic_resolve import build_list
async def task_loader(sprint_ids: list[int]):
tasks = await db.query(Task).filter(Task.sprint_id.in_(sprint_ids)).all()
return build_list(tasks, sprint_ids, lambda task: task.sprint_id)
class SprintView(BaseModel):
id: int
name: str
tasks: List[TaskView] = []
def resolve_tasks(self, loader=Loader(task_loader)):
return loader.load(self.id)
sprints = [SprintView.model_validate(sprint) for sprint in raw_sprints]
sprints = await Resolver().resolve(sprints)Result: one query per loader, regardless of how many sprints or tasks you load.
task_count and contributor_names don't come from a query — they're derived from data already on the model. post_* handles these: it runs after all nested resolve_* calls have finished.
class SprintView(BaseModel):
id: int
name: str
tasks: List[TaskView] = []
task_count: int = 0
contributor_names: list[str] = []
def resolve_tasks(self, loader=Loader(task_loader)):
return loader.load(self.id)
def post_task_count(self):
return len(self.tasks)
def post_contributor_names(self):
return sorted({task.owner.name for task in self.tasks if task.owner})Execution order:
resolve_tasksloads the sprint's tasks.- Each
TaskView.resolve_ownerloads its owner. post_task_countandpost_contributor_namesrun after those nested fields are ready.
Putting it all together, the response looks like:
{
"id": 1,
"name": "Sprint 1",
"tasks": [
{
"id": 101,
"title": "Implement login",
"owner_id": 1,
"owner": { "id": 1, "name": "Alice" }
}
],
"task_count": 1,
"contributor_names": ["Alice"]
}resolve_* |
post_* |
|
|---|---|---|
| Needs external IO? | Yes | Usually no |
| Runs before descendants ready? | Yes | No |
| Good for counts, sums, formatting? | Sometimes | Yes |
| Return value resolved again? | Yes | No |
These two patterns cover most API endpoints. The next section covers cross-tree coordination — skip it if your tree is simple enough with resolve_* and post_*.
When parent and child nodes need to share data without hard-coding references to each other, two helpers cover the two directions.
Send a value from an ancestor down to its descendants.
from typing import Annotated
from pydantic_resolve import ExposeAs
class SprintView(BaseModel):
id: int
name: Annotated[str, ExposeAs('sprint_name')] # visible to all descendants
tasks: List[TaskView] = []
def resolve_tasks(self, loader=Loader(task_loader)):
return loader.load(self.id)
class TaskView(BaseModel):
id: int
title: str
owner_id: int
owner: Optional[UserView] = None
full_title: str = ""
def resolve_owner(self, loader=Loader(user_loader)):
return loader.load(self.owner_id)
def post_full_title(self, ancestor_context):
return f"{ancestor_context['sprint_name']} / {self.title}"Use this when a child needs context from an ancestor (sprint name, permissions, locale).
Aggregate values from many descendants up to one ancestor.
from typing import Annotated
from pydantic_resolve import Collector, SendTo
class SprintView(BaseModel):
id: int
name: str
tasks: List[TaskView] = []
contributors: list[UserView] = []
def resolve_tasks(self, loader=Loader(task_loader)):
return loader.load(self.id)
def post_contributors(self, collector=Collector('contributors')):
return collector.values()
class TaskView(BaseModel):
id: int
title: str
owner_id: int
owner: Annotated[Optional[UserView], SendTo('contributors')] = None
def resolve_owner(self, loader=Loader(user_loader)):
return loader.load(self.owner_id)Use this when a parent needs to aggregate values from many descendants (all contributors, all tags, all attachments).
Three mechanisms cover the whole library:
| What you need | What you write | What the framework does |
|---|---|---|
| Load related data | resolve_* + Loader(...) |
Batch lookups and map results back |
| Compute derived fields | post_* |
Run after descendants are fully resolved |
| Reuse relationship declarations | ER Diagram + AutoLoad (see below) |
Centralize relationship wiring for many models |
flowchart LR
subgraph FW["Frameworks & Interfaces"]
R["Response<br/>FastAPI routes"]
end
subgraph APP["Application Business Rules"]
RV["Resolver<br/>resolve / post"]
end
subgraph ENT["Enterprise Business Rules"]
E["Entity + ER Diagram"]
end
subgraph ADP["Interface Adapters"]
L["Loader"]
end
FW --> APP --> ENT --> ADP
The dependency direction always points inward: Entity doesn't know about Loader. Loader doesn't know about FastAPI. FastAPI doesn't know about the database.
pydantic-resolve is built around Clean Architecture. Its components map 1:1 to the layers:
| Clean Architecture Layer | pydantic-resolve Component |
|---|---|
| Enterprise Business Rules | Entity + ER Diagram |
| Application Business Rules | Resolver + resolve/post |
| Interface Adapters | Loader (data access) |
| Frameworks & Interfaces | FastAPI routes + GraphQL + MCP |
This mapping is what makes the library more than a DataLoader helper — it gives data assembly a stable home that survives framework churn.
For the full architectural analysis, see Clean Architecture for Python.
Optional. The Core API above (
resolve_*/post_*+Loader) covers most use cases. Read this section only when you notice the same relationship being declared repeatedly across response models.
A common signal is when you see the same relation described again and again:
TaskCard.resolve_ownerTaskDetail.resolve_ownerSprintBoard.resolve_tasksSprintReport.resolve_tasks
At that point, the problem is no longer "how do I load this field?" but "where is the source of truth for relationships?" ER Diagram + AutoLoad is the answer.
| Question | Hand-written Core API | ER Diagram + AutoLoad |
|---|---|---|
| First endpoint | Faster | Slower |
| Upfront setup | Low | Medium |
| Reusing the same relation in many models | Repetitive | Centralized |
| Changing a relationship later | Update many resolve_* methods |
Update one ERD declaration |
| GraphQL / MCP generation | Separate work | Natural extension |
ERD mode asks for more discipline up front:
- Define entity classes.
- Declare relationships explicitly.
- Create
AutoLoadfrom the samediagramused by the resolver.
That setup cost is real. The payoff is that relationship knowledge converges into one place — every Response is just a different view of the same Entity graph. The same ERD also powers GraphQL queries, MCP services, and admin tools.
from typing import Optional
from pydantic import BaseModel
from pydantic_resolve import Relationship, base_entity, config_global_resolver
BaseEntity = base_entity()
class UserEntity(BaseModel, BaseEntity):
id: int
name: str
class TaskEntity(BaseModel, BaseEntity):
__relationships__ = [
Relationship(fk='owner_id', name='owner', target=UserEntity, loader=user_loader)
]
id: int
title: str
owner_id: int
class SprintEntity(BaseModel, BaseEntity):
__relationships__ = [
Relationship(fk='id', name='tasks', target=list[TaskEntity], loader=task_loader)
]
id: int
name: str
diagram = BaseEntity.get_diagram()
AutoLoad = diagram.create_auto_load()
config_global_resolver(diagram)
class TaskView(TaskEntity):
# Field name matches Relationship(name='owner') → AutoLoad is implicit
owner: Optional[UserEntity] = None
class SprintView(SprintEntity):
# Field name matches Relationship(name='tasks') → AutoLoad is implicit
tasks: list[TaskView] = []
task_count: int = 0
def post_task_count(self):
return len(self.tasks)Compared with the Core API version:
resolve_ownerdisappears.resolve_tasksdisappears.- The relationship definitions live in one place.
post_*still works exactly the same.- When a View field's name matches a relationship name,
Annotated[..., AutoLoad()]is optional (implicit AutoLoad). Use the explicit form only when the field name differs from the relationship name.
If you want to hide internal FK fields such as owner_id, add DefineSubset on top of the ERD setup:
from pydantic_resolve import DefineSubset
class TaskSummary(DefineSubset):
__subset__ = (TaskEntity, ('id', 'title'))
owner: Optional[UserEntity] = None # implicit AutoLoadOnce ERD mode makes sense conceptually, you can let the ORM describe the relationships for you and import them into the Enterprise layer:
from pydantic_resolve import ErDiagram
from pydantic_resolve.integration.mapping import Mapping
from pydantic_resolve.integration.sqlalchemy import build_relationship
entities = build_relationship(
mappings=[
Mapping(entity=SprintEntity, orm=SprintORM),
Mapping(entity=TaskEntity, orm=TaskORM),
Mapping(entity=UserEntity, orm=UserORM),
],
session_factory=session_factory,
)
diagram = ErDiagram(entities=[]).add_relationship(entities)
AutoLoad = diagram.create_auto_load()
config_global_resolver(diagram)build_relationship supports SQLAlchemy, Django, and Tortoise ORM. This is a good later optimization when your ORM metadata is already stable and you want to avoid duplicating relationship declarations.
Start with resolve_* and post_* on one endpoint. You gain immediate N+1 protection without changing your architecture.
When relationships start repeating across models, move them into ERD. This is the step where you establish your Enterprise layer.
When your ORM is stable, use build_relationship() to import existing relationship knowledge from the database layer.
ERD mode is a good fit when:
- The project has 3+ related entities reused across multiple response models.
- The team wants one shared place to inspect and discuss relationships.
- You want GraphQL or MCP generated from the same model graph.
- You want to hide FK fields while keeping relationship definitions centralized.
Core API is usually enough when:
- You only have a few loading requirements.
- You want each endpoint to stay maximally explicit.
- The response shape is still changing quickly.
The library exposes your data through two entry points — ERD mode (data-model-first) and UseCase mode (operation-first). Both can power GraphQL, MCP, and admin tools:
flowchart LR
entity["Entity + ERD<br/>Enterprise Business Rules"]
graphql["GraphQL"]
usecase["UseCase Service<br/>business operations"]
graphql_uc["GraphQL"]
api["REST API"]
mcp_uc["MCP Service"]
mcp_gen["MCP Service"]
entity --> graphql
entity --> usecase
usecase --> api
usecase --> graphql_uc
graphql_uc --> mcp_uc
graphql --> mcp_gen
Generate GraphQL schema from ERD and execute queries:
from pydantic_resolve.graphql import GraphQLHandler
handler = GraphQLHandler(diagram)
result = await handler.execute("{ users { id name posts { title } } }")
# result.data == {"users": [{"id": 1, "name": "Alice", "posts": [{"title": "Hello"}]}, ...]}Compose GraphQL queries over UseCaseService classes — the API surface is a set of business operations, not a graph of entities:
from pydantic_resolve import query
from pydantic_resolve.use_case import UseCaseService
from pydantic_resolve.use_case.manager import UseCaseAppConfig, UseCaseManager
class UserService(UseCaseService):
"""User management."""
@query
async def list_users(cls) -> list[UserSummary]:
"""Get all users."""
...
manager = UseCaseManager(
apps=[UseCaseAppConfig(name="blog", services=[UserService])]
)
app = manager.get_app("blog")
result = await app.compose("{ listUsers { id name } }")Use this when the API is operation-first (RPC-style) rather than entity-graph-first.
Expose GraphQL APIs to AI agents (requires pip install pydantic-resolve[mcp]):
from pydantic_resolve import AppConfig, create_mcp_server
mcp = create_mcp_server(apps=[AppConfig(name="blog", er_diagram=diagram)])
mcp.run()
# Agents can then query: "list all posts by user Alice" → translated to GraphQL against your ERDExpose UseCase operations to AI agents via the same compose surface:
from pydantic_resolve.use_case import (
UseCaseAppConfig,
create_use_case_graphql_mcp_server,
)
mcp = create_use_case_graphql_mcp_server(
apps=[UseCaseAppConfig(name="blog", services=[UserService, PostService])],
)
mcp.run()The MCP server uses a 4-layer progressive disclosure (list_apps → describe_compose_schema → describe_compose_method → compose_query) so the agent can discover operations and shape queries without flooding its tool list.
Interactive ERD exploration with fastapi-voyager:
from fastapi_voyager import create_voyager
app.mount('/voyager', create_voyager(app, er_diagram=diagram))| Dimension | ORM-First | Entity-First |
|---|---|---|
| Type source of truth | ORM model | Entity (Pydantic) |
| Relationship wiring | Repeated per endpoint | Centralized in ERD |
| Data assembly | Manual in Service/Route | Automatic via Resolver |
| N+1 prevention | Manual eager loading | Built-in DataLoader batching |
| Multi-data source | Scattered conversion code | Unified Loader interface |
| API contract stability | Tied to DB schema | Independent of DB |
| Feature | GraphQL | pydantic-resolve |
|---|---|---|
| N+1 Prevention | Manual DataLoader setup | Built-in automatic batching |
| Type Safety | Separate schema files | Native Pydantic types |
| Learning Curve | Steep (Schema, Resolvers, Loaders) | Moderate (Loader/batch pattern required) |
| Debugging | Complex introspection | Standard Python debugging |
| Integration | Requires dedicated server | Works with any framework |
| Query Flexibility | Any client can query anything | Explicit API contracts |
Note: pydantic-resolve borrows the DataLoader batch pattern from GraphQL ecosystems but stays inside your existing REST framework. If you already use strawberry or ariadne and are happy with it, pydantic-resolve may be redundant for you.
- Full Documentation
- Clean Architecture for Python (full paper)
- Example Project
- Live Demo
- Live Demo - GraphQL
- API Reference
MIT License
tangkikodo (allmonday@126.com)