Searching web page content with C#

Question

Asked by localhost

How do you search a website's source code with C#? It's hard to explain, so here's the source for doing it in Python:

import urllib2, re
word = "How to ask"
source = urllib2.urlopen("http://stackoverflow.com").read()
if re.search(word,source):
     print "Found it "+word

Answer 1

Answered by Canavar

Here is the code for getting the HTML of a page; you can add your search method afterwards:

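// Requires: using System.IO; using System.Net;
string url = "http://someurl.com/default.aspx";
WebRequest webRequest = WebRequest.Create(url);
string source;

// Dispose the response, stream and reader once the page has been read.
using (WebResponse response = webRequest.GetResponse())
using (Stream stream = response.GetResponseStream())
using (StreamReader reader = new StreamReader(stream))
{
    source = reader.ReadToEnd();
}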
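
The search step left open above could be sketched as follows. This is only one possible approach, assuming you want the same regular-expression behaviour as Python's re.search; the class name, the ContainsPattern helper and the sample HTML string are hypothetical and stand in for the page source read from the response.

```csharp
using System;
using System.Text.RegularExpressions;

class SearchSketch
{
    // Hypothetical helper: true when the pattern matches anywhere in the page source.
    static bool ContainsPattern(string source, string pattern)
    {
        return Regex.IsMatch(source, pattern);
    }

    static void Main()
    {
        // Stand-in for the string read from the response stream.
        string source = "<html><body><a href=\"/help\">How to ask</a></body></html>";
        string word = "How to ask";

        if (ContainsPattern(source, word))
            Console.WriteLine("Found it " + word);
    }
}
```

Because the search text is treated as a regular expression, characters such as "." or "?" have special meaning. If the text should be matched literally, passing it through Regex.Escape first, or using string.Contains, is probably the simpler choice.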

Hope this helps.

Answer 2

Answered by Wolfwyrd

If you want to access the raw HTML from a web page you need to do the following:

Use an HttpWebRequest to connect to the file
Open the connection and read the response stream into a string
Search the response for your content

So the code looks something like:

// Requires: using System.IO; using System.Net;
string pageContent = null;
HttpWebRequest myReq = (HttpWebRequest)WebRequest.Create("http://example.com/page.html");

// Dispose the response and the reader once the page has been read.
using (HttpWebResponse myres = (HttpWebResponse)myReq.GetResponse())
using (StreamReader sr = new StreamReader(myres.GetResponseStream()))
{
    pageContent = sr.ReadToEnd();
}

if (pageContent.Contains("YourSearchWord"))
{
    // Found it
}

Answer 3

Answered by JohannesH

I guess this is as close as you'll get in C# to your Python code.

using System;
using System.Net;

class Program
{
    static void Main()
    {
        string word = "How to ask";
        string source = (new WebClient()).DownloadString("http://stackoverflow.com/");
        if(source.Contains(word))
            Console.WriteLine("Found it " + word);
    }
}

I'm not sure whether re.search(#, #) is case sensitive or not. If it's not, you could use...

if(source.IndexOf(word, StringComparison.InvariantCultureIgnoreCase) > -1)

instead.
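
For reference, Python's re.search is case sensitive unless a flag such as re.IGNORECASE is passed, so the plain Contains call matches the original script's behaviour. The two options can be compared side by side in a small sketch like the one below; the class name and the sample string are made up for illustration, and StringComparison.OrdinalIgnoreCase is used here as a culture-independent alternative to the InvariantCultureIgnoreCase value shown above.

```csharp
using System;

class CaseSketch
{
    static void Main()
    {
        // Stand-in for downloaded page source; differs from the search word only in case.
        string source = "<title>HOW TO ASK</title>";
        string word = "How to ask";

        // Case-sensitive comparison, like re.search without flags.
        bool exact = source.IndexOf(word, StringComparison.Ordinal) > -1;

        // Case-insensitive, culture-independent comparison.
        bool ignoreCase = source.IndexOf(word, StringComparison.OrdinalIgnoreCase) > -1;

        Console.WriteLine("exact: " + exact + ", ignoreCase: " + ignoreCase);
    }
}
```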
