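python - Posting to Google Analytics works from Node but fails from Python... why?
Problem description
I am trying to post events to Google Analytics. The NodeJS code below works, but the Python code below fails. Both return HTTP 200, and even when posting to the debug URL ( https://www.google-analytics.com/debug/collect ), Google Analytics returns success details in both cases (see "valid": true in the response below). The problem is that hits posted from NodeJS show up on the GA website, while hits posted from Python never appear. I compared the two requests and could not spot a difference.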
{
  "hitParsingResult": [ {
    "valid": true,
    "parserMessage": [ ],
    "hit": "/debug/collect?v=1\u0026t=event\u0026tid=XXXXXXX\u0026cid=YYYYYY\u0026ec=Slack\u0026ea=SlashCommand\u0026el=whowasat-curl\u0026an=staging.Whereis-Everybody?\u0026aid=staging.whereis-everybody.com"
  } ],
  "parserMessage": [ {
    "messageType": "INFO",
    "description": "Found 1 hit in the request."
  } ]
}
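For reference, a response like the one above can be checked programmatically. The following is a minimal sketch using only the standard library; the helper name `summarize_debug_response` and the shortened sample body are illustrative, not part of the original code. Keep in mind that, as this question shows, "valid": true only tells you the hit was parsed, not that it will appear in the reports.

```python
import json


def summarize_debug_response(body: str) -> list:
    """Return a (valid, parser message descriptions) pair for each hit in a debug reply."""
    parsed = json.loads(body)
    summary = []
    for hit in parsed.get("hitParsingResult", []):
        messages = [m.get("description", "") for m in hit.get("parserMessage", [])]
        summary.append((bool(hit.get("valid")), messages))
    return summary


# Sample shaped like the response above; the "hit" string is shortened for readability.
SAMPLE = """
{
  "hitParsingResult": [ {
    "valid": true,
    "parserMessage": [ ],
    "hit": "/debug/collect?v=1&t=event"
  } ],
  "parserMessage": [ {
    "messageType": "INFO",
    "description": "Found 1 hit in the request."
  } ]
}
"""

print(summarize_debug_response(SAMPLE))  # [(True, [])]
```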
The NodeJS code (its results do show up in Google Analytics):
'use strict';
var request = require('request');
require('request-debug')(request);

function postEventToGA(category, action, label) {
    var options = {
        v: '1',
        t: 'event',
        tid: process.env.GOOGLEANALYTICS_TID,
        cid: process.env.GOOGLEANALYTICS_CID,
        ec: category,
        ea: action,
        el: label,
        an: process.env.STAGE_INFIX + "appname",
        aid: process.env.STAGE_INFIX + "appname"
    };
    console.log("payload: " + JSON.stringify(options))
    request.post({ url: 'https://www.google-analytics.com/collect', form: options }, function (err, response, body) {
        console.log(request)
        if (err) {
            console.log("Failed to post event to Google Analytics, error: " + err);
        } else {
            if (200 != response.statusCode) {
                console.log("Failed to post event to Google Analytics, response code: " + response.statusCode + " error: " + err);
            }
        }
    });
}

postEventToGA("some-category", "some-action", "some-label")
The Python code (its results do not show up in Google Analytics):
import json
import logging
import os

import requests

LOGGER = logging.getLogger()
LOGGER.setLevel(logging.INFO)

# Tracking ID, client ID, and stage prefix are read from the environment.
GOOGLEANALYTICS_TID = os.environ["GOOGLEANALYTICS_TID"]
GOOGLEANALYTICS_CID = os.environ["GOOGLEANALYTICS_CID"]
STAGE_INFIX = os.environ["STAGE_INFIX"]


def post_event(category, action, label):
    # Event hit parameters, sent as a form-encoded POST body.
    payload = {
        "v": "1",
        "t": "event",
        "tid": GOOGLEANALYTICS_TID,
        "cid": GOOGLEANALYTICS_CID,
        "ec": category,
        "ea": action,
        "el": label,
        "an": STAGE_INFIX + "appname",
        "aid": STAGE_INFIX + "appname",
    }
    response = requests.post("https://www.google-analytics.com/collect", data=payload)

    # Dump the outgoing request and the response to compare with the NodeJS version.
    print(response.request.method)
    print(response.request.path_url)
    print(response.request.url)
    print(response.request.body)
    print(response.request.headers)
    print(response.status_code)
    print(response.text)
    if response.status_code != 200:
        LOGGER.warning(
            "Got non 200 response code (%s) while posting to GA.", response.status_code
        )


post_event("some-category", "some-action", "some-label")
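Any idea why the NodeJS post shows up in Google Analytics while the Python post does not? (Both return HTTP 200.)
Solution
After more testing, I found that the User-Agent HTTP header was causing the problem. When I set it to an empty string in the Python code, it works. Like this: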
headers = {"User-Agent": ""}
response = requests.post(
    "https://www.google-analytics.com/collect", payload, headers=headers
)
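The documentation at https://developers.google.com/analytics/devguides/collection/protocol/v1/reference does say that the User-Agent is used, but it does not state explicitly what the requirements are. "python-requests/2.22.0" (the default of the python-requests library) is apparently not accepted.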
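To see exactly which User-Agent a hit would carry before anything goes over the wire, it can help to build the request without sending it. The sketch below uses the standard library's urllib instead of requests so that it runs without extra dependencies; the helper name `build_ga_request` and the placeholder tid/cid values are made up for illustration. The empty-string default mirrors the fix above; which other User-Agent values Google Analytics accepts is not something I have verified.

```python
from urllib.parse import urlencode
from urllib.request import Request

GA_COLLECT_URL = "https://www.google-analytics.com/collect"


def build_ga_request(payload: dict, user_agent: str = "") -> Request:
    """Build (but do not send) a form-encoded POST with an explicit User-Agent."""
    body = urlencode(payload).encode("utf-8")
    return Request(
        GA_COLLECT_URL,
        data=body,
        headers={"User-Agent": user_agent},
        method="POST",
    )


# Placeholder values; a real hit needs your own tid and cid.
req = build_ga_request(
    {
        "v": "1",
        "t": "event",
        "tid": "XXXXXXX",
        "cid": "YYYYYY",
        "ec": "some-category",
        "ea": "some-action",
        "el": "some-label",
    }
)

# Inspect what would be sent; nothing is transmitted here.
print(req.get_method())
print(req.header_items())
print(req.data.decode("utf-8"))
```

Passing the resulting object to `urllib.request.urlopen` would actually send the hit. With requests, the equivalent is the `headers=` argument shown in the fix above.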