如何使用 urllib2 在 python 中下载 zip 文件？

Question

提问by Justjoe

Two part question. I am trying to download multiple archived Cory Doctorow podcasts from the internet archive. The old one's that do not come into my iTunes feed. I have written the script but the downloaded files are not properly formatted.

两部分问题。我正在尝试从互联网存档下载多个存档的 Cory Doctorow 播客。旧的没有进入我的 iTunes 提要。我已经编写了脚本，但下载的文件格式不正确。

Q1 - What do I change to download the zip mp3 files? Q2 - What is a better way to pass the variables into URL?

Q1 - 下载 zip mp3 文件需要做哪些更改？Q2 - 将变量传递到 URL 的更好方法是什么？

 # and the base url.

def dlfile(file_name,file_mode,base_url):
    from urllib2 import Request, urlopen, URLError, HTTPError

    #create the url and the request
    url = base_url + file_name + mid_url + file_name + end_url 
    req = Request(url)

    # Open the url
    try:
        f = urlopen(req)
        print "downloading " + url

        # Open our local file for writing
        local_file = open(file_name, "wb" + file_mode)
        #Write to our local file
        local_file.write(f.read())
        local_file.close()

    #handle errors
    except HTTPError, e:
        print "HTTP Error:",e.code , url
    except URLError, e:
        print "URL Error:",e.reason , url

# Set the range 
var_range = range(150,153)

# Iterate over image ranges
for index in var_range:

    base_url = 'http://www.archive.org/download/Cory_Doctorow_Podcast_'
    mid_url = '/Cory_Doctorow_Podcast_'
    end_url = '_64kb_mp3.zip'
    #create file name based on known pattern
    file_name =  str(index) 
    dlfile(file_name,"wb",base_url

This script was adapted from here

该脚本改编自here

Answer 1

采纳答案by dcolish

Here's how I'd deal with the url building and downloading. I'm making sure to name the file as the basename of the url (the last bit after the trailing slash) and I'm also using the withclause for opening the file to write to. This uses a ContextManagerwhich is nice because it will close that file when the block exits. In addition, I use a template to build the string for the url. urlopendoesn't need a request object, just a string.

这是我如何处理 url 构建和下载。我确保将文件命名为 url 的基本名称（尾部斜杠后的最后一位），并且我还使用with子句打开要写入的文件。这使用了一个很好的ContextManager，因为它会在块退出时关闭该文件。此外，我使用模板来构建 url 的字符串。urlopen不需要请求对象，只需要一个字符串。

import os
from urllib2 import urlopen, URLError, HTTPError


def dlfile(url):
    # Open the url
    try:
        f = urlopen(url)
        print "downloading " + url

        # Open our local file for writing
        with open(os.path.basename(url), "wb") as local_file:
            local_file.write(f.read())

    #handle errors
    except HTTPError, e:
        print "HTTP Error:", e.code, url
    except URLError, e:
        print "URL Error:", e.reason, url


def main():
    # Iterate over image ranges
    for index in range(150, 151):
        url = ("http://www.archive.org/download/"
               "Cory_Doctorow_Podcast_%d/"
               "Cory_Doctorow_Podcast_%d_64kb_mp3.zip" %
               (index, index))
        dlfile(url)

if __name__ == '__main__':
    main()

Answer 2

回答by pyfunc

An older solution on SO along the lines of what you want:

沿着您想要的路线的 SO 上的旧解决方案：

如何使用 urllib2 在 python 中下载 zip 文件？

提问by Justjoe

采纳答案by dcolish

回答by pyfunc

相关推荐

最近更新

标签

如何使用 urllib2 在 python 中下载 zip 文件？

提问by Justjoe

采纳答案by dcolish

回答by pyfunc

相关推荐

Python 使用for循环时如何获取前一个元素？

Python：在元组列表中查找最小值、最大值

Python中的拉格朗日插值

Python 没有这样的文件或目录错误

相关推荐

最近更新

标签