Multipleobjectsreturned With Get_or_create
Solution 1:
Example code
Imagine you have the following model:
class DictionaryEntry(models.Model):
name = models.CharField(max_length=255, null=False, blank=False)
definition = models.TextField(null=True, blank=False)
and the following code:
obj, created = DictionaryEntry.objects.get_or_create(
name='apple', definition='some kind of fruit')
get_or_create
In case you have not seen the code for get_or_create
:
# simplifieddefget_or_create(cls, **kwargs):
try:
instance, created = cls.get(**kwargs), Falseexcept cls.DoesNotExist:
instance, created = cls.create(**kwargs), Truereturn instance, created
about webservers...
Now imagine that you have a webserver with 2
worker processes that both have their own concurrent access to the database.
# simplifieddefget_or_create(cls, **kwargs):
try:
instance, created = cls.get(**kwargs), False# <===== nope not there...except cls.DoesNotExist:
instance, created = cls.create(**kwargs), Truereturn instance, created
If the timing goes right (or wrong depending on how you want to phrase this), both processes can do the lookup and not find the item. They may both create the item. Everything is fine...
MultipleObjectsReturned: get() returned more than one KeyValue -- it returned 2!
Everything is fine... until you call get_or_create
a third time, "third time is a charm" they say.
# simplifieddefget_or_create(cls, **kwargs):
try:
instance, created = cls.get(**kwargs), False# <==== kaboom, 2 objects.except cls.DoesNotExist:
instance, created = cls.create(**kwargs), Truereturn instance, created
unique_together
How could you solve this? Maybe enforce a constraint at the database level:
classDictionaryEntry(models.Model):
name = models.CharField(max_length=255, null=False, blank=False)
definition = models.TextField(null=True, blank=False)
classMeta:
unique_together = (('name', 'definition'),)
back to the function:
# simplifieddefget_or_create(cls, **kwargs):
try:
instance, created = cls.get(**kwargs), Falseexcept cls.DoesNotExist:
instance, created = cls.create(**kwargs), True# <==== this handles IntegrityErrorreturn instance, created
Say you have the same race as before, and they both did not find the item and proceed to the insert; doing so they will start a transaction and one of them is going to win the race while the other will see an IntegrityError
.
mysql ?
The example uses a TextField
, which for mysql
translates to a LONGTEXT
(in my case). Adding the unique_together
constraint fails the syncdb
.
django.db.utils.InternalError: (1170, u"BLOB/TEXT column 'definition' used in key specification without a key length")
So, no luck, you may have to deal with MultipleObjectsReturned
manually.
- https://code.djangoproject.com/ticket/2495
- https://code.djangoproject.com/ticket/12579
- http://django.readthedocs.org/en/latest/topics/db/transactions.html#using-a-high-isolation-level
- https://docs.djangoproject.com/en/dev/topics/db/transactions/#django.db.transaction.atomic
possible solutions
- It may be possible to replace the
TextField
with aCharField
. - It may be possible to add a
CharField
which may be a strong hash of theTextField
, that you can compute inpre_save
and use in aunique_together
.
Solution 2:
As the name implies, get_or_create
model.objects.get()
s or model.objects.create()
s.
It's conceptually equivalent to:
try:
model.objects.get(pk=1)
except model.DoesNotExist:
model.objects.create(pk=1)
The source is where you find definitive answers to these types of questions. Hint: search def get_or_create
.
As you can see, this function only catches DoesNotExist
in the try/except.
defget_or_create(self, **kwargs):
"""
Looks up an object with the given kwargs, creating one if necessary.
Returns a tuple of (object, created), where created is a boolean
specifying whether an object was created.
"""assert kwargs, \
'get_or_create() must be passed at least one keyword argument'
defaults = kwargs.pop('defaults', {})
lookup = kwargs.copy()
for f in self.model._meta.fields:
if f.attname in lookup:
lookup[f.name] = lookup.pop(f.attname)
try:
self._for_write = Truereturn self.get(**lookup), Falseexcept self.model.DoesNotExist:
Solution 3:
Another situation that could cause MultipleObjectsReturned error with get_or_create() API seems to be if there are multiple threads calling this API at the same time with the same set of query parameters.
Solely relying on try... catch... to create a unique row in Python wouldn't work. If you are trying to use this API, I think you should have a matching uniqueness constraint on the appropriate columns in the database.
Solution 4:
Warning
This method is atomic assuming that the database enforces uniqueness of the keyword arguments (see unique or unique_together). If the fields used in the keyword arguments do not have a uniqueness constraint, concurrent calls to this method may result in multiple rows with the same parameters being inserted.
Post a Comment for "Multipleobjectsreturned With Get_or_create"