首页 > 解决方案 > 发布到 Google Analytics,在 Node 上工作,在 Python 上失败......为什么?

问题描述

我正在尝试将事件发布到 Google Analytics。当我使用下面的 NodeJS 代码时它工作正常,但是当我使用下面的 Python 代码时它会失败。两者都返回 HTTP 200,即使在发布到调试 URL ( https://www.google-analytics.com/debug/collect ) 时,Google Analytics 在这两种情况下都会返回成功详细信息(请参阅下面的响应中的有效:真)。问题是,当从 NodeJS 发布时,结果会显示在 GA 网站上,而从 Python 发布时,它永远不会出现。我确实比较了两者的请求,但未能发现差异。

{
  "hitParsingResult": [ {
    "valid": true,
    "parserMessage": [ ],
    "hit": "/debug/collect?v=1\u0026t=event\u0026tid=XXXXXXX\u0026cid=YYYYYYu0026ec=Slack\u0026ea=SlashCommand\u0026el=whowasat-curl\u0026an=staging.Whereis-Everybody?\u0026aid=staging.whereis-everybody.com"
  } ],
  "parserMessage": [ {
    "messageType": "INFO",
    "description": "Found 1 hit in the request."
  } ]
} 

NodeJS 代码是(结果确实显示在 Google Analytics 中):

'use strict';

var request = require('request');
require('request-debug')(request);

function postEventToGA(category, action, label) {

    var options = {
        v: '1',
        t: 'event',
        tid: process.env.GOOGLEANALYTICS_TID,
        cid: process.env.GOOGLEANALYTICS_CID,
        ec: category,
        ea: action,
        el: label,
        an: process.env.STAGE_INFIX + "appname",
        aid: process.env.STAGE_INFIX + "appname"
    };

    console.log("payload: " + JSON.stringify(options))
    request.post({ url: 'https://www.google-analytics.com/collect', form: options }, function (err, response, body) {
        console.log(request)
        if (err) {
            console.log("Failed to post event to Google Analytics, error: " + err);
        } else {
            if (200 != response.statusCode) {
                console.log("Failed to post event to Google Analytics, response code: " + response.statusCode + " error: " + err);
            }
        }
    });

}

postEventToGA("some-category", "some-action", "some-label")

Python 代码是(结果未显示在 Google Analytics 中):

import json
import logging
import os
import requests

LOGGER = logging.getLogger()
LOGGER.setLevel(logging.INFO)

GOOGLEANALYTICS_TID = os.environ["GOOGLEANALYTICS_TID"]
GOOGLEANALYTICS_CID = os.environ["GOOGLEANALYTICS_CID"]
STAGE_INFIX = os.environ["STAGE_INFIX"]

def post_event(category, action, label):

    payload = {
        "v": "1",
        "t": "event",
        "tid": GOOGLEANALYTICS_TID,
        "cid": GOOGLEANALYTICS_CID,
        "ec": category,
        "ea": action,
        "el": label,
        "an": STAGE_INFIX + "appname,
        "aid": STAGE_INFIX + "appname",
    }

    response = requests.post("https://www.google-analytics.com/collect", payload)

    print(response.request.method)
    print(response.request.path_url)
    print(response.request.url)
    print(response.request.body)
    print(response.request.headers)

    print(response.status_code)
    print(response.text)

    if response.status_code != 200:
        LOGGER.warning(
            "Got non 200 response code (%s) while posting to GA.", response.status_code
        )


post_event("some-category", "some-action", "some-label")

知道为什么 NodeJS 帖子会出现在 Google Analytics 中而 Python 帖子不会吗?(虽然两者都返回 HTTP200)

标签: pythonnode.jsgoogle-analytics

解决方案


进行了更多测试,发现用户代理 HTTP 标头导致了问题。当我在 Python 代码中将其设置为空字符串时,它可以工作。像这样:

headers = {"User-Agent": ""}
response = requests.post(
    "https://www.google-analytics.com/collect", payload, headers=headers
)

https://developers.google.com/analytics/devguides/collection/protocol/v1/reference上的文档确实说明使用了用户代理,但没有明确说明要求是什么。“python-requests/2.22.0”(python-requests lib 的默认值)显然不被接受。


推荐阅读