java 网页抓取java初学者
声明:本页面是StackOverFlow热门问题的中英对照翻译,遵循CC BY-SA 4.0协议,如果您需要使用它,必须同样遵循CC BY-SA许可,注明原文地址和作者信息,同时你必须将它归于原作者(不是我):StackOverFlow
原文地址: http://stackoverflow.com/questions/6446356/
Warning: these are provided under cc-by-sa 4.0 license. You are free to use/share it, But you must attribute it to the original authors (not me):
StackOverFlow
web scraping java beginner
提问by user807593
I am new to Java, I would like to become really good in web scraping and parsing data
我是 Java 新手,我想成为非常擅长网络抓取和解析数据的人
Are there any sites related to web scraping that would help me understand the how the APIs like htmcleaner, web-harvest, htmlparser work??
是否有任何与网络抓取相关的网站可以帮助我了解 htmcleaner、web-harvest、htmlparser 等 API 的工作原理?
I'm still not proficient enough in Java to look at their Javadocs and understand how all their methods work, and cannot find Java code examples(tutorials) on the web that would help me.
我对 Java 仍然不够精通,无法查看他们的 Javadoc 并了解他们所有方法的工作原理,并且无法在网络上找到对我有帮助的 Java 代码示例(教程)。
回答by Marsellus Wallace
回答by aemus
Have you tried using the examples at:
您是否尝试过使用以下示例:
- http://htmlcleaner.sourceforge.net/javause.php
- http://web-harvest.sourceforge.net/usage.php
- http://chasethedevil.blogspot.com/2006/05/java-html-parsing-example-with.html
- http://htmlcleaner.sourceforge.net/javause.php
- http://web-harvest.sourceforge.net/usage.php
- http://chasethedevil.blogspot.com/2006/05/java-html-parsing-example-with.html
Maybe those can be of some help?
也许这些可以有所帮助?