ruby — Setting the Elasticsearch limit to "unlimited"

Disclaimer: this page is a Chinese-English translation of a popular StackOverflow question, provided under the CC BY-SA 4.0 license. If you reuse or share it, you must do so under the same license, link to the original, and attribute the original authors (not the translator). Original question: http://stackoverflow.com/questions/14396582/

Date: 2020-09-06 05:40:12  Source: igfitidea

Setting Elastic search limit to "unlimited"

ruby elasticsearch

Asked by Sumit Rai

How can I get all the results from Elasticsearch? The results are limited to 10 by default. I have a query like:


@data = Athlete.search :load => true do
          size 15
          query do
            boolean do
              must { string q, {:fields => ["name", "other_names", "nickname", "short_name"], :phrase_slop => 5} }
              unless conditions.blank?
                conditions.each do |condition|
                  must { eval(condition) }
                end
              end
              unless excludes.blank?
                excludes.each do |exclude|
                  must_not { eval(exclude) }
                end
              end
            end
          end
          sort do
            by '_score', "desc"
          end
        end

I have set the limit to 15, but I want to make it unlimited so that I can get all the data. I can't set a fixed limit because my data keeps changing, and I want all of it.


Accepted answer by Zach

You can use the from and size parameters to page through all your data. This could be very slow depending on your data and how much is in the index.


http://www.elastic.co/guide/en/elasticsearch/reference/current/search-request-from-size.html

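The from/size paging loop can be sketched as follows. This is a minimal, self-contained illustration: fetch_page is a hypothetical stand-in for a real Elasticsearch request that would pass :from and :size in the query body.

```ruby
# Paging through all results with from/size.
# ALL_DOCS and fetch_page are stand-ins for an index and a search call.
ALL_DOCS = (1..42).to_a

fetch_page = lambda do |from, size|
  # A real implementation would send :from and :size to Elasticsearch.
  ALL_DOCS[from, size] || []
end

def fetch_all(fetch_page, page_size: 10)
  results = []
  from = 0
  loop do
    page = fetch_page.call(from, page_size)
    break if page.empty?          # an empty page means we are past the end
    results.concat(page)
    from += page_size             # advance the window for the next request
  end
  results
end

puts fetch_all(fetch_page).size   # collects every document, page by page
```

Each iteration issues one request, so retrieving a large index this way means many round trips, which is the slowness the answer warns about.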

Answered by David

Another approach is to first do a search with searchType: 'count', and then do a normal search with size set to results.count.


The advantage here is that it avoids depending on a magic number for UPPER_BOUND, as suggested in this similar SO question, and avoids the extra overhead of building too large a priority queue, which Shay Banon describes here. It also lets you keep your results sorted, unlike scan.


The biggest disadvantage is that it requires two requests. Depending on your circumstance, this may be acceptable.

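The two-request idea can be sketched like this. StubClient is a hypothetical stand-in for a real search client, used only to show the shape of the calls:

```ruby
# Count-then-search: request the total hit count first, then issue a
# single search sized to exactly that count.
class StubClient
  def initialize(docs)
    @docs = docs
  end

  # A real client would issue a count (searchType: 'count') request here.
  def count
    @docs.size
  end

  # A real client would pass :size in the search body here.
  def search(size:)
    @docs.first(size)
  end
end

client = StubClient.new(%w[a b c d e])

total = client.count               # request 1: just the count
hits  = client.search(size: total) # request 2: fetch exactly that many

puts hits.size  # 5
```

Note the count can change between the two requests if the index is being written to, so the second search may return slightly more or fewer documents than the count reported.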

Answered by travelingbones

From the docs: "Note that from + size can not be more than the index.max_result_window index setting which defaults to 10,000". So my admittedly very ad-hoc solution is to just pass size: 10000, or 10,000 minus from if I use the from argument.

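The "10,000 minus from" clamping can be written as a small helper. This is a sketch; clamped_size is a hypothetical name, and 10,000 is only the default value of index.max_result_window (it can be changed per index):

```ruby
# Clamp a requested page size so that from + size never exceeds
# index.max_result_window (10,000 by default).
MAX_RESULT_WINDOW = 10_000

def clamped_size(from, requested_size, window: MAX_RESULT_WINDOW)
  [[window - from, requested_size].min, 0].max
end

puts clamped_size(0, 10_000)   # 10000 - a full window is allowed
puts clamped_size(9_990, 50)   # 10    - only 10 slots left in the window
puts clamped_size(10_000, 50)  # 0     - from is already at the limit
```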

Note that, following Matt's comment below, the proper way to do this if you have a larger number of documents is to use the scroll API. I have used this successfully, but only with the Python interface.

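The scroll pattern boils down to: open a cursor, then repeatedly pull fixed-size batches until an empty batch comes back. ScrollStub below is a hypothetical local stand-in that mimics that contract, just to show the loop structure:

```ruby
# Scroll-style retrieval: pull batches through a server-held cursor
# until the server returns an empty batch.
class ScrollStub
  def initialize(docs, batch_size)
    @docs = docs
    @batch_size = batch_size
    @cursor = 0
  end

  # In real Elasticsearch this would be a request to /_search/scroll,
  # carrying the scroll_id returned by the previous response.
  def next_batch
    batch = @docs[@cursor, @batch_size] || []
    @cursor += batch.size
    batch
  end
end

scroll = ScrollStub.new((1..25).to_a, 10)
all = []
while (batch = scroll.next_batch).any?
  all.concat(batch)
end
puts all.size  # 25
```

Unlike from/size, scroll does not rebuild a growing priority queue on every request, which is why it stays efficient for deep result sets.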

Answered by Rachel Gallen

Use the scan method, e.g.:


 curl -XGET 'localhost:9200/_search?search_type=scan&scroll=10m&size=50' -d '
 {
    "query" : {
       "match_all" : {}
    }
 }'

see here

这里