Union and intersection in Django

from django.db import models

class Tag(models.Model):
  name = models.CharField(max_length=100)  # max_length replaced the old maxlength

class Blog(models.Model):
  name = models.CharField(max_length=100)
  tags = models.ManyToManyField(Tag)

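These are simple models, just to ask a question.

I'd like to know how to query blogs by their tags in three different ways.

  • Blog entries tagged with "tag1" or "tag2": Blog.objects.filter(tags__in=[1, 2]).distinct()
  • Blog objects tagged with both "tag1" and "tag2": ?
  • Blog objects tagged with exactly "tag1" and "tag2", and nothing else: ?

Tag and Blog are only used as an example.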
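To pin down what each of the three queries should return, here is a small plain-Python sketch using sets. The sample data and variable names are made up for illustration and are not part of the models above.

```python
# Hypothetical sample data: blog name -> set of tag names.
blog_tags = {
    "a": {"tag1"},
    "b": {"tag1", "tag2"},
    "c": {"tag1", "tag2", "tag3"},
    "d": {"tag3"},
}
wanted = {"tag1", "tag2"}

# 1. "or": the blog shares at least one tag with the wanted set.
any_match = sorted(b for b, tags in blog_tags.items() if tags & wanted)

# 2. "and": every wanted tag is present (extra tags are allowed).
all_match = sorted(b for b, tags in blog_tags.items() if wanted <= tags)

# 3. "exactly": the blog's tags equal the wanted set, nothing more.
exact_match = sorted(b for b, tags in blog_tags.items() if tags == wanted)

print(any_match)    # ['a', 'b', 'c']
print(all_match)    # ['b', 'c']
print(exact_match)  # ['b']
```

Blog "c" is the interesting one: it satisfies the "and" query but not the "exactly" query, because it also carries "tag3".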

Solution

You could use Q objects for #1:

# Blogs that have either the hockey or the django tag.
from django.db.models import Q
Blog.objects.filter(
    Q(tags__name__iexact='hockey') | Q(tags__name__iexact='django')
).distinct()  # the tag join can return a blog once per matching tag

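I believe unions and intersections are somewhat outside the scope of the Django ORM, but they can still be done. The following examples come from django-tagging, a Django application that provides this functionality.

For the "or" case, we are basically looking for a union of two queries: objects that carry any of the given tags. See line 346 of models.py: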

def get_union_by_model(self, queryset_or_model, tags):
    """
    Create a ``QuerySet`` containing instances of the specified
    model associated with *any* of the given list of tags.
    """
    tags = get_tag_list(tags)
    tag_count = len(tags)
    queryset, model = get_queryset_and_model(queryset_or_model)

    if not tag_count:
        return model._default_manager.none()

    model_table = qn(model._meta.db_table)
    # This query selects the ids of all objects which have any of
    # the given tags.
    query = """
    SELECT %(model_pk)s
    FROM %(model)s, %(tagged_item)s
    WHERE %(tagged_item)s.content_type_id = %(content_type_id)s
      AND %(tagged_item)s.tag_id IN (%(tag_id_placeholders)s)
      AND %(model_pk)s = %(tagged_item)s.object_id
    GROUP BY %(model_pk)s""" % {
        'model_pk': '%s.%s' % (model_table, qn(model._meta.pk.column)),
        'model': model_table,
        'tagged_item': qn(self.model._meta.db_table),
        'content_type_id': ContentType.objects.get_for_model(model).pk,
        'tag_id_placeholders': ','.join(['%s'] * tag_count),
    }

    cursor = connection.cursor()
    cursor.execute(query, [tag.pk for tag in tags])
    object_ids = [row[0] for row in cursor.fetchall()]
    if len(object_ids) > 0:
        return queryset.filter(pk__in=object_ids)
    else:
        return model._default_manager.none()

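For the "and" case, I believe we are looking for an intersection: objects that carry all of the given tags. Note that this does not exclude objects that also carry other tags, so on its own it does not answer the "exactly" case. See line 307 of models.py: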

def get_intersection_by_model(self, queryset_or_model, tags):
    """
    Create a ``QuerySet`` containing instances of the specified
    model associated with *all* of the given list of tags.
    """
    tags = get_tag_list(tags)
    tag_count = len(tags)
    queryset, model = get_queryset_and_model(queryset_or_model)

    if not tag_count:
        return model._default_manager.none()

    model_table = qn(model._meta.db_table)
    # This query selects the ids of all objects which have all the
    # given tags.
    query = """
    SELECT %(model_pk)s
    FROM %(model)s, %(tagged_item)s
    WHERE %(tagged_item)s.content_type_id = %(content_type_id)s
      AND %(tagged_item)s.tag_id IN (%(tag_id_placeholders)s)
      AND %(model_pk)s = %(tagged_item)s.object_id
    GROUP BY %(model_pk)s
    HAVING COUNT(%(model_pk)s) = %(tag_count)s""" % {
        'model_pk': '%s.%s' % (model_table, qn(model._meta.pk.column)),
        'model': model_table,
        'tagged_item': qn(self.model._meta.db_table),
        'content_type_id': ContentType.objects.get_for_model(model).pk,
        'tag_id_placeholders': ','.join(['%s'] * tag_count),
        'tag_count': tag_count,
    }

    cursor = connection.cursor()
    cursor.execute(query, [tag.pk for tag in tags])
    object_ids = [row[0] for row in cursor.fetchall()]
    if len(object_ids) > 0:
        return queryset.filter(pk__in=object_ids)
    else:
        return model._default_manager.none()
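The only difference between the two functions is the HAVING clause. To see that in isolation, here is a minimal sketch using Python's built-in sqlite3 module. The `tagged_item` table is a simplified stand-in with made-up rows, and `ids_with_tags` is a hypothetical helper, not part of django-tagging.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Simplified stand-in for the tagged_item table: one row per (object, tag).
conn.execute("CREATE TABLE tagged_item (object_id INTEGER, tag_id INTEGER)")
conn.executemany(
    "INSERT INTO tagged_item VALUES (?, ?)",
    [(1, 10), (2, 10), (2, 20), (3, 10), (3, 20), (3, 30), (4, 30)],
)


def ids_with_tags(tag_ids, require_all):
    """Return ids of objects tagged with any (or, if require_all, all) of tag_ids.

    Assumes tag_ids contains no duplicates.
    """
    if not tag_ids:
        return []
    placeholders = ",".join(["?"] * len(tag_ids))
    sql = (
        "SELECT object_id FROM tagged_item "
        f"WHERE tag_id IN ({placeholders}) GROUP BY object_id"
    )
    params = list(tag_ids)
    if require_all:
        # Keep only objects that matched every one of the given tags.
        sql += " HAVING COUNT(DISTINCT tag_id) = ?"
        params.append(len(tag_ids))
    sql += " ORDER BY object_id"
    return [row[0] for row in conn.execute(sql, params)]


union_ids = ids_with_tags([10, 20], require_all=False)
intersection_ids = ids_with_tags([10, 20], require_all=True)
print(union_ids)         # [1, 2, 3]
print(intersection_ids)  # [2, 3]
```

Object 3 also carries tag 30 and still shows up in the intersection, which is why this query answers "all of these tags" but not "exactly these tags".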

I've tested these with Django 1.0:

The "or" queries:

Blog.objects.filter(tags__name__in=['tag1', 'tag2']).distinct()

Or you could use the Q class:

Blog.objects.filter(Q(tags__name='tag1') | Q(tags__name='tag2')).distinct()

The "and" query:

Blog.objects.filter(tags__name='tag1').filter(tags__name='tag2')

I'm not sure about the third one; you will probably need to drop down to SQL for it.
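For what it's worth, one way the "exactly these tags" query could look in raw SQL is sketched below with Python's built-in sqlite3 module. The `blog_tags` table and its rows are made up for the demo and are not the schema Django generates; the sketch also assumes each (blog, tag) pair is stored only once.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Hypothetical join table in the spirit of the Blog.tags relation.
conn.execute("CREATE TABLE blog_tags (blog_id INTEGER, tag_name TEXT)")
conn.executemany(
    "INSERT INTO blog_tags VALUES (?, ?)",
    [(1, "tag1"), (2, "tag1"), (2, "tag2"),
     (3, "tag1"), (3, "tag2"), (3, "tag3")],
)

wanted = ["tag1", "tag2"]
placeholders = ",".join(["?"] * len(wanted))

# A blog qualifies when its total number of tags equals the number of
# wanted tags AND every one of its tags is in the wanted list.
sql = f"""
    SELECT blog_id
    FROM blog_tags
    GROUP BY blog_id
    HAVING COUNT(*) = ?
       AND SUM(CASE WHEN tag_name IN ({placeholders}) THEN 1 ELSE 0 END) = ?
    ORDER BY blog_id
"""
params = [len(wanted), *wanted, len(wanted)]
exact_ids = [row[0] for row in conn.execute(sql, params)]
print(exact_ids)  # [2]
```

Blog 1 is rejected for having too few tags and blog 3 for having an extra one, so only blog 2 remains.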

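Please don't reinvent the wheel; use the django-tagging application, which was made exactly for this use case. It can do all the queries you describe, and much more.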

If you need to add custom fields to your Tag model, you can also take a look at the branch of django-tagging.