bash: Download images from website

Disclaimer: This page is a translation of a popular StackOverflow question, provided under the CC BY-SA 4.0 license. If you use or share it, you must do so under the same license and attribute it to the original authors (not this site). Original question: http://stackoverflow.com/questions/10442841/


Download images from website

Tags: image, bash, wget

Asked by kev

I want to have a local copy of a gallery on a website. The gallery shows the pictures at domain.com/id/1 (id increases in increments of 1), and the image itself is stored at pics.domain.com/pics/original/image.format. The exact line for the image in the HTML is:


<div id="bigwall" class="right"> 
    <img border=0 src='http://pics.domain.com/pics/original/image.jpg' name='pic' alt='' style='top: 0px; left: 0px; margin-top: 50px; height: 85%;'> 
</div>

So I want to write a script that does something like this (in pseudo-code):


for(id = 1; id <= 151468; id++) {
     page = "http://domain.com/id/" + id.toString();
     src = returnSrc(); // Searches the html for img with name='pic' and saves the image location as a string
     getImg(); // Downloads the file named in src
}

I'm not sure exactly how to do this, though. I suppose I could do it in bash: use wget to download the html, search the html manually for http://pics.domain.com/pics/original/*.*, then use wget again to save the file, remove the html file, increment the id, and repeat. The only thing is I'm not good at handling strings, so if anyone could tell me how to search for the url and replace the *s with the file name and format, I should be able to get the rest going. Or if my method is stupid and you have a better one, please share.

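A minimal bash sketch of the loop described above might look like the following; the URL scheme and the pics.domain.com/pics/original/ pattern come from the question, while the use of curl for fetching and grep for extracting the src attribute is an assumption about the page markup:

# loop over every gallery id, pull out the image URL, and download it
for id in $(seq 1 151468); do
    page="http://domain.com/id/${id}"
    # take the first URL under pics.domain.com/pics/original/ found in the page
    src=$(curl -s "$page" | grep -o "http://pics.domain.com/pics/original/[^'\"]*" | head -n 1)
    # download the image only if a URL was actually found
    [ -n "$src" ] && wget -q "$src"
done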

Answered by kev

# get all pages; curl expands the [1-151468] range and substitutes the
# current number for '#1' in -o, producing 1.html ... 151468.html
curl 'http://domain.com/id/[1-151468]' -o '#1.html'

# extract all image URLs from the downloaded pages
grep -oh 'http://pics.domain.com/pics/original/.*jpg' *.html >urls.txt

# de-duplicate the URLs and download all images (wget -i- reads the list from stdin)
sort -u urls.txt | wget -i-
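Note that the greedy .*jpg in the grep above can match too much if a page ever has more than one URL on the same line; a tighter variant (a sketch only, not verified against the real markup) stops at the closing quote:

# match up to the closing quote instead of greedily up to the last "jpg"
grep -oh "http://pics.domain.com/pics/original/[^'\"]*\.jpg" *.html | sort -u >urls.txt
wget -i urls.txt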