PayPal 的 Python 接口 - urllib.urlencode 非 ASCII 字符失败

声明:本页面是StackOverFlow热门问题的中英对照翻译,遵循CC BY-SA 4.0协议,如果您需要使用它,必须同样遵循CC BY-SA许可,注明原文地址和作者信息,同时你必须将它归于原作者(不是我):StackOverFlow 原文地址: http://stackoverflow.com/questions/787935/
Warning: these are provided under cc-by-sa 4.0 license. You are free to use/share it, But you must attribute it to the original authors (not me): StackOverFlow

提示:将鼠标放在中文语句上可以显示对应的英文。显示中英文
时间:2020-11-03 20:51:45  来源:igfitidea点击:

Python interface to PayPal - urllib.urlencode non-ASCII characters failing

pythonunicodepaypalurllib2urllib

提问by Krystian Cybulski

I am trying to implement PayPal IPN functionality. The basic protocol is as such:

我正在尝试实现 PayPal IPN 功能。基本协议如下:

  1. The client is redirected from my site to PayPal's site to complete payment. He logs into his account, authorizes payment.
  2. PayPal calls a page on my server passing in details as POST. Details include a person's name, address, and payment info etc.
  3. I need to call a URL on PayPal's site internally from my processing page passing back all the params that were passed in abovem and an additional one called 'cmd' with a value of '_notify-validate'.
  1. 客户从我的网站重定向到 PayPal 的网站以完成付款。他登录他的帐户,授权付款。
  2. 贝宝调用我服务器上的一个页面,以 POST 的形式传递详细信息。详细信息包括一个人的姓名、地址和付款信息等。
  3. 我需要从我的处理页面内部调用 PayPal 站点上的 URL,将所有在上面传递的参数和一个名为“cmd”的附加参数传回,其值为“_notify-validate”。

When I try to urllib.urlencode the params which PayPal has sent to me, I get a:

当我尝试 urllib.urlencode PayPal 发送给我的参数时,我得到一个:

While calling send_response_to_paypal. Traceback (most recent call last):
  File "<snip>/account/paypal/views.py", line 108, in process_paypal_ipn
    verify_result = send_response_to_paypal(params)
  File "<snip>/account/paypal/views.py", line 41, in send_response_to_paypal
    params = urllib.urlencode(params)
  File "/usr/local/lib/python2.6/urllib.py", line 1261, in urlencode
    v = quote_plus(str(v))
UnicodeEncodeError: 'ascii' codec can't encode character u'\ufffd' in position 9: ordinal not in range(128)

I understand that urlencode does ASCII encoding, and in certain cases, a user's contact info can contain non-ASCII characters. This is understandable. My question is, how do I encode non-ASCII characters for POSTing to a URL using urllib2.urlopen(req) (or other method)

我知道 urlencode 进行 ASCII 编码,在某些情况下,用户的联系信息可以包含非 ASCII 字符。这是可以理解的。我的问题是,如何使用 urllib2.urlopen(req) (或其他方法)对非 ASCII 字符进行编码以发布到 URL

Details:

细节:

I read the params in PayPal's original request as follows (the GET is for testing):

我阅读了 PayPal 原始请求中的参数如下(GET 用于测试):

def read_ipn_params(request):
    if request.POST:  
        params= request.POST.copy()  
        if "ipn_auth" in request.GET:
            params["ipn_auth"]=request.GET["ipn_auth"]
        return params
    else:  
        return request.GET.copy()  

The code I use for sending back the request to PayPal from the processing page is:

我用于从处理页面向 PayPal 发送请求的代码是:

def send_response_to_paypal(params):
    params['cmd']='_notify-validate'  
    params = urllib.urlencode(params)  
    req = urllib2.Request(PAYPAL_API_WEBSITE, params)  
    req.add_header("Content-type", "application/x-www-form-urlencoded") 
    response = urllib2.urlopen(req)  
    status = response.read()  
    if not status == "VERIFIED":  
        logging.warn("PayPal cannot verify IPN responses: " + status)
        return False

    return True

Obviously, the problem only arises if someone's name or address or other field used for the PayPal payment does not fall into the ASCII range.

显然,只有当某人的姓名或地址或用于 PayPal 付款的其他字段不属于 ASCII 范围时,才会出现问题。

回答by Jarret Hardie

Try converting the params dictionary to utf-8 first... urlencode seems to like that better than unicode:

首先尝试将 params 字典转换为 utf-8 ...... urlencode 似乎比 unicode 更喜欢:

params = urllib.urlencode(dict([k, v.encode('utf-8')] for k, v in params.items()))

Of course, this assumes your input is unicode. If your input is something other than unicode, you'll want to decode it to unicode first, then encode it:

当然,这假设您的输入是 unicode。如果您的输入不是 unicode,则您需要先将其解码为 un​​icode,然后对其进行编码:

params['foo'] = my_raw_input.decode('iso-8859-1')
params = urllib.urlencode(dict([k, v.encode('utf-8')] for k, v in params.items()))

回答by Grego

Instead of encoding to utf-8, one should encode to what ever the paypal is using for the post. It is available under key 'charset' in the form paypal sends.

与其编码为utf-8,不如编码为贝宝用于帖子的内容。它可以在 paypal 发送的表单中的键“字符集”下使用。

So the following code worked for me:

所以下面的代码对我有用:

data = dict([(k, v.encode(data['charset'])) for k, v in data.items()])

data = dict([(k, v.encode(data['charset'])) for k, v in data.items()])

回答by KFB

I know it's a bit late to chime in here, but the best solution I found was to not even parse what they were giving back. In django (don't know what you're using) I was able to get the raw request they sent, which I passed back verbatim. Then it was just a matter of putting the cmd key onto that.

我知道在这里加入有点晚了,但我发现的最佳解决方案是甚至不解析他们回馈的内容。在 django 中(不知道你在使用什么)我能够得到他们发送的原始请求,我逐字传回。然后只需将 cmd 键放在上面即可。

This way it never matters what encoding they send you, you're just sending it right back.

这样一来,他们向您发送什么编码就无关紧要了,您只需将其发送回去即可。