* WIP * WIP - make test_model_definition tests pass * WIP - make test_model_methods pass * WIP - make whole test suit at least run - failing 49/443 tests * WIP fix part of the getting pydantic tests as types of fields are now kept in core schema and not on fieldsinfo * WIP fix validation in update by creating individual fields validators, failing 36/443 * WIP fix __pydantic_extra__ in intializing model, fix test related to pydantic config checks, failing 32/442 * WIP - fix enum schema in model_json_schema, failing 31/442 * WIP - fix copying through model, fix setting pydantic fields on through, fix default config and inheriting from it, failing 26/442 * WIP fix tests checking pydantic schema, fix excluding parent fields, failing 21/442 * WIP some missed files * WIP - fix validators inheritance and fix validators in generated pydantic, failing 17/442 * WIP - fix through models setting - only on reverse side of relation, but always on reverse side, failing 15/442 * WIP - fix through models setting - only on reverse side of relation, but always on reverse side, failing 15/442 * WIP - working on proper populating __dict__ for relations for new schema dumping, some work on openapi docs, failing 13/442 * WIP - remove property fields as pydantic has now computed_field on its own, failing 9/442 * WIP - fixes in docs, failing 8/442 * WIP - fix tests for largebinary schema, wrapped bytes fields fail in pydantic, will be fixed in pydantic-core, remaining is circural schema for related models, failing 6/442 * WIP - fix to pk only models in schemas * Getting test suites to pass (#1249) * wip, fixing tests * iteration, fixing some more tests * iteration, fixing some more tests * adhere to comments * adhere to comments * remove unnecessary dict call, re-add getattribute for testing * todo for reverse relationship * adhere to comments, remove prints * solve circular refs * all tests pass 🎉 * remove 3.7 from tests * add lint and type check jobs * reforat with ruff, fix jobs * rename jobs * fix imports * fix evaluate in py3.8 * partially fix coverage * fix coverage, add more tests * fix test ids * fix test ids * fix lint, fix docs, make docs fully working scripts, add test docs job * fix pyproject * pin py ver in test docs * change dir in test docs * fix pydantic warning hack * rm poetry call in test_docs * switch to pathlib in test docs * remove coverage req test docs * fix type check tests, fix part of types * fix/skip next part of types * fix next part of types * fix next part of types * fix coverage * fix coverage * fix type (bit dirty 🤷) * fix some code smells * change pre-commit * tweak workflows * remove no root from tests * switch to full python path by passing sys.executable * some small refactor in new base model, one sample test, change makefile * small refactors to reduce complexity of methods * temp add tests for prs against pydantic_v2 * remove all references to __fields__ * remove all references to construct, deprecate the method and update model_construct to be in line with pydantic * deprecate dict and add model_dump, todo switch to model_dict in calls * fix tests * change to union * change to union * change to model_dump and model_dump_json from dict and json deprecated methods, deprecate them in ormar too * finish switching dict() -> model_dump() * finish switching json() -> model_dump_json() * remove fully pydantic_only * switch to extra for payment card, change missed json calls * fix coverage - no more warnings internal * fix coverage - no more warnings internal - part 2 * split model_construct into own and pydantic parts * split determine pydantic field type * change to new field validators * fix benchmarks, add codspeed instead of pytest-benchmark, add action and gh workflow * restore pytest-benchmark * remove codspeed * pin pydantic version, restore codspeed * change on push to pydantic_v2 to trigger first one * Use lifespan function instead of event (#1259) * check return types * fix imports order, set warnings=False on json that passes the dict, fix unnecessary loop in one of the test * remove references to model's meta as it's now ormar config, rename related methods too * filter out pydantic serializer warnings * remove choices leftovers * remove leftovers after property_fields, keep only enough to exclude them in initialization * add migration guide * fix meta references * downgrade databases for now * Change line numbers in documentation (#1265) * proofread and fix the docs, part 1 * proofread and fix the docs for models * proofread and fix the docs for fields * proofread and fix the docs for relations * proofread and fix rest of the docs, add release notes for 0.20 * create tables in new docs src * cleanup old deps, uncomment docs publish on tag * fix import reorder --------- Co-authored-by: TouwaStar <30479449+TouwaStar@users.noreply.github.com> Co-authored-by: Goran Mekić <meka@tilda.center>
494 lines
17 KiB
Markdown
494 lines
17 KiB
Markdown
# Joins and subqueries
|
|
|
|
To join one table to another, so load also related models you can use following methods.
|
|
|
|
* `select_related(related: Union[List, str]) -> QuerySet`
|
|
* `select_all(follow: bool = True) -> QuerySet`
|
|
* `prefetch_related(related: Union[List, str]) -> QuerySet`
|
|
|
|
|
|
* `Model`
|
|
* `Model.load()` method
|
|
|
|
|
|
* `QuerysetProxy`
|
|
* `QuerysetProxy.select_related(related: Union[List, str])` method
|
|
* `QuerysetProxy.select_all(follow: bool=True)` method
|
|
* `QuerysetProxy.prefetch_related(related: Union[List, str])` method
|
|
|
|
## select_related
|
|
|
|
`select_related(related: Union[List, str]) -> QuerySet`
|
|
|
|
Allows to prefetch related models during the same query.
|
|
|
|
**With `select_related` always only one query is run against the database**, meaning
|
|
that one (sometimes complicated) join is generated and later nested models are processed in
|
|
python.
|
|
|
|
To fetch related model use `ForeignKey` names.
|
|
|
|
To chain related `Models` relation use double underscores between names.
|
|
|
|
!!!note
|
|
If you are coming from `django` note that `ormar` `select_related` differs ->
|
|
in `django` you can `select_related`
|
|
only single relation types, while in `ormar` you can select related across `ForeignKey`
|
|
relation, reverse side of `ForeignKey` (so virtual auto generated keys) and `ManyToMany`
|
|
fields (so all relations as of current version).
|
|
|
|
!!!tip
|
|
To control which model fields to select use `fields()`
|
|
and `exclude_fields()` `QuerySet` methods.
|
|
|
|
!!!tip
|
|
To control order of models (both main or nested) use `order_by()` method.
|
|
|
|
```python
|
|
class Album(ormar.Model):
|
|
ormar_config = base_ormar_config.copy()
|
|
|
|
id: int = ormar.Integer(primary_key=True)
|
|
name: str = ormar.String(max_length=100)
|
|
is_best_seller: bool = ormar.Boolean(default=False)
|
|
|
|
class Track(ormar.Model):
|
|
ormar_config = base_ormar_config.copy()
|
|
|
|
id: int = ormar.Integer(primary_key=True)
|
|
album: Optional[Album] = ormar.ForeignKey(Album)
|
|
title: str = ormar.String(max_length=100)
|
|
position: int = ormar.Integer()
|
|
play_count: int = ormar.Integer(nullable=True)
|
|
```
|
|
|
|
```python
|
|
# Django style
|
|
album = await Album.objects.select_related("tracks").all()
|
|
|
|
# Python style
|
|
album = await Album.objects.select_related(Album.tracks).all()
|
|
|
|
# will return album with all columns tracks
|
|
```
|
|
|
|
You can provide a string or a list of strings (or a field/ list of fields)
|
|
|
|
```python
|
|
class SchoolClass(ormar.Model):
|
|
ormar_config = base_ormar_config.copy(tablename="schoolclasses")
|
|
|
|
id: int = ormar.Integer(primary_key=True)
|
|
name: str = ormar.String(max_length=100)
|
|
department: Optional[Department] = ormar.ForeignKey(Department, nullable=False)
|
|
|
|
|
|
class Category(ormar.Model):
|
|
ormar_config = base_ormar_config.copy(tablename="categories")
|
|
|
|
id: int = ormar.Integer(primary_key=True)
|
|
name: str = ormar.String(max_length=100)
|
|
|
|
|
|
class Student(ormar.Model):
|
|
ormar_config = base_ormar_config.copy()
|
|
|
|
id: int = ormar.Integer(primary_key=True)
|
|
name: str = ormar.String(max_length=100)
|
|
schoolclass: Optional[SchoolClass] = ormar.ForeignKey(SchoolClass)
|
|
category: Optional[Category] = ormar.ForeignKey(Category, nullable=True)
|
|
|
|
|
|
class Teacher(ormar.Model):
|
|
ormar_config = base_ormar_config.copy()
|
|
|
|
id: int = ormar.Integer(primary_key=True)
|
|
name: str = ormar.String(max_length=100)
|
|
schoolclass: Optional[SchoolClass] = ormar.ForeignKey(SchoolClass)
|
|
category: Optional[Category] = ormar.ForeignKey(Category, nullable=True)
|
|
```
|
|
|
|
```python
|
|
# Django style
|
|
classes = await SchoolClass.objects.select_related(
|
|
["teachers__category", "students"]).all()
|
|
|
|
# Python style
|
|
classes = await SchoolClass.objects.select_related(
|
|
[SchoolClass.teachers.category, SchoolClass.students]).all()
|
|
|
|
# will return classes with teachers and teachers categories
|
|
# as well as classes students
|
|
```
|
|
|
|
Exactly the same behavior is for Many2Many fields, where you put the names of Many2Many
|
|
fields and the final `Models` are fetched for you.
|
|
|
|
!!!warning
|
|
If you set `ForeignKey` field as not nullable (so required) during all
|
|
queries the not nullable `Models` will be auto prefetched, even if you do not include
|
|
them in select_related.
|
|
|
|
!!!note
|
|
All methods that do not return the rows explicitly returns a QuerySet instance so
|
|
you can chain them together
|
|
|
|
So operations like `filter()`, `select_related()`, `limit()` and `offset()` etc. can be chained.
|
|
|
|
Something like `Track.object.select_related("album").filter(album__name="Malibu").offset(1).limit(1).all()`
|
|
|
|
## select_all
|
|
|
|
`select_all(follow: bool = False) -> QuerySet`
|
|
|
|
By default when you select `all()` none of the relations are loaded, likewise,
|
|
when `select_related()` is used you need to explicitly specify all relations that should
|
|
be loaded. If you want to include also nested relations this can be cumberstone.
|
|
|
|
That's why `select_all()` was introduced, so by default load all relations of a model
|
|
(so kind of opposite as with `all()` approach).
|
|
|
|
By default adds only directly related models of a parent model (from which the query is run).
|
|
|
|
If `follow=True` is set it adds also related models of related models.
|
|
|
|
!!!info
|
|
To not get stuck in an infinite loop as related models also keep a relation
|
|
to parent model visited models set is kept.
|
|
|
|
That way already visited models that are nested are loaded, but the load do not
|
|
follow them inside. So Model A -> Model B -> Model C -> Model A -> Model X
|
|
will load second Model A but will never follow into Model X.
|
|
Nested relations of those kind need to be loaded manually.
|
|
|
|
With sample date like follow:
|
|
|
|
```python
|
|
base_ormar_config = OrmarConfig(
|
|
database=databases.Database(DATABASE_URL, force_rollback=True),
|
|
metadata=sqlalchemy.MetaData(),
|
|
)
|
|
|
|
|
|
class Address(ormar.Model):
|
|
ormar_config = base_ormar_config.copy(tablename="addresses")
|
|
|
|
id: int = ormar.Integer(primary_key=True)
|
|
street: str = ormar.String(max_length=100, nullable=False)
|
|
number: int = ormar.Integer(nullable=False)
|
|
post_code: str = ormar.String(max_length=20, nullable=False)
|
|
|
|
|
|
|
|
class Branch(ormar.Model):
|
|
ormar_config = base_ormar_config.copy(tablename="branches")
|
|
|
|
id: int = ormar.Integer(primary_key=True)
|
|
name: str = ormar.String(max_length=100, nullable=False)
|
|
address = ormar.ForeignKey(Address)
|
|
|
|
|
|
class Company(ormar.Model):
|
|
ormar_config = base_ormar_config.copy(tablename="companies")
|
|
|
|
id: int = ormar.Integer(primary_key=True)
|
|
name: str = ormar.String(max_length=100, nullable=False, name="company_name")
|
|
founded: int = ormar.Integer(nullable=True)
|
|
branches = ormar.ManyToMany(Branch)
|
|
```
|
|
|
|
To select all `Companies` with all `Branches` and `Addresses` you can simply query:
|
|
|
|
```python
|
|
companies = await Company.objects.select_all(follow=True).all()
|
|
|
|
# which is equivalent to:
|
|
companies = await Company.objects.select_related('branches__address').all()
|
|
```
|
|
|
|
Of course in this case it's quite easy to issue explicit relation names in `select_related`,
|
|
but the benefit of `select_all()` shows when you have multiple relations.
|
|
|
|
If for example `Company` would have 3 relations and all of those 3 relations have it's own
|
|
3 relations you would have to issue 9 relation strings to `select_related`, `select_all()`
|
|
is also resistant to change in names of relations.
|
|
|
|
!!!note
|
|
Note that you can chain `select_all()` with other `QuerySet` methods like `filter`, `exclude_fields` etc.
|
|
To exclude relations use `exclude_fields()` call with names of relations (also nested) to exclude.
|
|
|
|
## prefetch_related
|
|
|
|
`prefetch_related(related: Union[List, str]) -> QuerySet`
|
|
|
|
Allows to prefetch related models during query - but opposite to `select_related` each
|
|
subsequent model is fetched in a separate database query.
|
|
|
|
**With `prefetch_related` always one query per Model is run against the database**,
|
|
meaning that you will have multiple queries executed one after another.
|
|
|
|
To fetch related model use `ForeignKey` names.
|
|
|
|
To chain related `Models` relation use double underscores between names.
|
|
|
|
!!!tip
|
|
To control which model fields to select use `fields()`
|
|
and `exclude_fields()` `QuerySet` methods.
|
|
|
|
!!!tip
|
|
To control order of models (both main or nested) use `order_by()` method.
|
|
|
|
```python
|
|
class Album(ormar.Model):
|
|
ormar_config = base_ormar_config.copy()
|
|
|
|
id: int = ormar.Integer(primary_key=True)
|
|
name: str = ormar.String(max_length=100)
|
|
is_best_seller: bool = ormar.Boolean(default=False)
|
|
|
|
class Track(ormar.Model):
|
|
ormar_config = base_ormar_config.copy()
|
|
|
|
id: int = ormar.Integer(primary_key=True)
|
|
album: Optional[Album] = ormar.ForeignKey(Album)
|
|
title: str = ormar.String(max_length=100)
|
|
position: int = ormar.Integer()
|
|
play_count: int = ormar.Integer(nullable=True)
|
|
```
|
|
|
|
```python
|
|
# Django style
|
|
album = await Album.objects.prefetch_related("tracks").all()
|
|
|
|
# Python style
|
|
album = await Album.objects.prefetch_related(Album.tracks).all()
|
|
|
|
|
|
# will return album will all columns tracks
|
|
```
|
|
|
|
You can provide a string, or a list of strings
|
|
|
|
```python
|
|
class SchoolClass(ormar.Model):
|
|
ormar_config = base_ormar_config.copy(tablename="schoolclasses")
|
|
|
|
id: int = ormar.Integer(primary_key=True)
|
|
name: str = ormar.String(max_length=100)
|
|
department: Optional[Department] = ormar.ForeignKey(Department, nullable=False)
|
|
|
|
|
|
class Category(ormar.Model):
|
|
ormar_config = base_ormar_config.copy(tablename="categories")
|
|
|
|
id: int = ormar.Integer(primary_key=True)
|
|
name: str = ormar.String(max_length=100)
|
|
|
|
|
|
class Student(ormar.Model):
|
|
ormar_config = base_ormar_config.copy()
|
|
|
|
id: int = ormar.Integer(primary_key=True)
|
|
name: str = ormar.String(max_length=100)
|
|
schoolclass: Optional[SchoolClass] = ormar.ForeignKey(SchoolClass)
|
|
category: Optional[Category] = ormar.ForeignKey(Category, nullable=True)
|
|
|
|
|
|
class Teacher(ormar.Model):
|
|
ormar_config = base_ormar_config.copy()
|
|
|
|
id: int = ormar.Integer(primary_key=True)
|
|
name: str = ormar.String(max_length=100)
|
|
schoolclass: Optional[SchoolClass] = ormar.ForeignKey(SchoolClass)
|
|
category: Optional[Category] = ormar.ForeignKey(Category, nullable=True)
|
|
```
|
|
|
|
```python
|
|
# Django style
|
|
classes = await SchoolClass.objects.prefetch_related(
|
|
["teachers__category", "students"]).all()
|
|
|
|
# Python style
|
|
classes = await SchoolClass.objects.prefetch_related(
|
|
[SchoolClass.teachers.category, SchoolClass.students]).all()
|
|
# will return classes with teachers and teachers categories
|
|
# as well as classes students
|
|
```
|
|
|
|
Exactly the same behavior is for Many2Many fields, where you put the names of Many2Many
|
|
fields and the final `Models` are fetched for you.
|
|
|
|
!!!warning
|
|
If you set `ForeignKey` field as not nullable (so required) during all
|
|
queries the not nullable `Models` will be auto prefetched, even if you do not include
|
|
them in select_related.
|
|
|
|
!!!note
|
|
All methods that do not return the rows explicitly returns a QuerySet instance so
|
|
you can chain them together
|
|
|
|
So operations like `filter()`, `select_related()`, `limit()` and `offset()` etc. can be chained.
|
|
|
|
Something like `Track.object.select_related("album").filter(album__name="Malibu").offset(1).limit(1).all()`
|
|
|
|
## select_related vs prefetch_related
|
|
|
|
Which should you use -> `select_related` or `prefetch_related`?
|
|
|
|
Well, it really depends on your data. The best answer is try yourself and see which one
|
|
performs faster/better in your system constraints.
|
|
|
|
What to keep in mind:
|
|
|
|
### Performance
|
|
|
|
**Number of queries**:
|
|
`select_related` always executes one query against the database,
|
|
while `prefetch_related` executes multiple queries. Usually the query (I/O) operation is
|
|
the slowest one but it does not have to be.
|
|
|
|
**Number of rows**:
|
|
Imagine that you have 10 000 object in one table A and each of those objects have 3
|
|
children in table B, and subsequently each object in table B has 2 children in table C.
|
|
Something like this:
|
|
|
|
```
|
|
Model C
|
|
/
|
|
Model B - Model C
|
|
/
|
|
Model A - Model B - Model C
|
|
\ \
|
|
\ Model C
|
|
\
|
|
Model B - Model C
|
|
\
|
|
Model C
|
|
```
|
|
|
|
That means that `select_related` will always return 60 000 rows (10 000 * 3 * 2) later
|
|
compacted to 10 000 models.
|
|
|
|
How many rows will return `prefetch_related`?
|
|
|
|
Well, that depends, if each of models B and C is unique it will return 10 000 rows in
|
|
first query, 30 000 rows
|
|
(each of 3 children of A in table B are unique) in second query and 60 000 rows (each of
|
|
2 children of model B in table C are unique) in 3rd query.
|
|
|
|
In this case `select_related` seems like a better choice, not only it will run one query
|
|
comparing to 3 of
|
|
`prefetch_related` but will also return 60 000 rows comparing to 100 000
|
|
of `prefetch_related` (10+30+60k).
|
|
|
|
But what if each Model A has exactly the same 3 models B and each models C has exactly
|
|
same models C? `select_related`
|
|
will still return 60 000 rows, while `prefetch_related` will return 10 000 for model A,
|
|
3 rows for model B and 2 rows for Model C. So in total 10 006 rows. Now depending on the
|
|
structure of models (i.e. if it has long Text() fields etc.) `prefetch_related`
|
|
might be faster despite it needs to perform three separate queries instead of one.
|
|
|
|
#### Memory
|
|
|
|
`ormar` is does not keep a registry of already loaded models.
|
|
|
|
That means that in `select_related` example above you will always have 10 000 Models A,
|
|
30 000 Models B
|
|
(even if the unique number of rows in db is 3 - processing of `select_related` spawns
|
|
**new** child models for each parent model). And 60 000 Models C.
|
|
|
|
If the same Model B is shared by rows 1, 10, 100 etc. and you update one of those, the
|
|
rest of rows that share the same child will **not** be updated on the spot. If you
|
|
persist your changes into the database the change **will be available only after reload
|
|
(either each child separately or the whole query again)**. That means
|
|
that `select_related` will use more memory as each child is instantiated as a new object
|
|
- obviously using it's own space.
|
|
|
|
!!!note
|
|
This might change in future versions if we decide to introduce caching.
|
|
|
|
!!!warning
|
|
By default all children (or event the same models loaded 2+ times) are
|
|
completely independent, distinct python objects, despite that they represent the same
|
|
row in db.
|
|
|
|
They will evaluate to True when compared, so in example above:
|
|
|
|
```python
|
|
# will return True if child1 of both rows is the same child db row
|
|
row1.child1 == row100.child1
|
|
|
|
# same here:
|
|
model1 = await Model.get(pk=1)
|
|
model2 = await Model.get(pk=1) # same pk = same row in db
|
|
# will return `True`
|
|
model1 == model2
|
|
```
|
|
|
|
but
|
|
|
|
```python
|
|
# will return False (note that id is a python `builtin` function not ormar one).
|
|
id(row1.child1) == id(ro100.child1)
|
|
|
|
# from above - will also return False
|
|
id(model1) == id(model2)
|
|
```
|
|
|
|
On the contrary - with `prefetch_related` each unique distinct child model is
|
|
instantiated only once and the same child models is shared across all parent models.
|
|
That means that in `prefetch_related` example above if there are 3 distinct models in
|
|
table B and 2 in table C, there will be only 5 children nested models shared between all
|
|
model A instances. That also means that if you update any attribute it will be updated
|
|
on all parents as they share the same child object.
|
|
|
|
|
|
## Model methods
|
|
|
|
Each model instance have a set of methods to `save`, `update` or `load` itself.
|
|
|
|
### load
|
|
|
|
You can load the `ForeignKey` related model by calling `load()` method.
|
|
|
|
`load()` can be used to refresh the model from the database (if it was changed by some other process).
|
|
|
|
!!!tip
|
|
Read more about `load()` method in [models methods](../models/methods.md#load)
|
|
|
|
## QuerysetProxy methods
|
|
|
|
When access directly the related `ManyToMany` field as well as `ReverseForeignKey`
|
|
returns the list of related models.
|
|
|
|
But at the same time it exposes subset of QuerySet API, so you can filter, create,
|
|
select related etc related models directly from parent model.
|
|
|
|
### select_related
|
|
|
|
Works exactly the same as [select_related](./#select_related) function above but allows you to fetch related
|
|
objects from other side of the relation.
|
|
|
|
!!!tip
|
|
To read more about `QuerysetProxy` visit [querysetproxy][querysetproxy] section
|
|
|
|
### select_all
|
|
|
|
Works exactly the same as [select_all](./#select_all) function above but allows you to fetch related
|
|
objects from other side of the relation.
|
|
|
|
!!!tip
|
|
To read more about `QuerysetProxy` visit [querysetproxy][querysetproxy] section
|
|
|
|
|
|
### prefetch_related
|
|
|
|
Works exactly the same as [prefetch_related](./#prefetch_related) function above but allows you to fetch related
|
|
objects from other side of the relation.
|
|
|
|
!!!tip
|
|
To read more about `QuerysetProxy` visit [querysetproxy][querysetproxy] section
|
|
|
|
|
|
[querysetproxy]: ../relations/queryset-proxy.md
|