python - 带有 info_types 的 Python 错误中的 Google Cloud DLP API 图像编辑
问题描述
您好,我正在尝试使用 google 的示例代码从 python 中的图像中编辑信息,我成功地检查了字符串中的 info_types,但是当我尝试使用 Anaconda 分发中的图像检查示例代码时,它给了我以下错误:
---------------------------------------------------------------------------
TypeError Traceback (most recent call last)
<ipython-input-44-d8dba8a7c706> in <module>()
----> 1 redact_image('python-pdf-redaction', 'redaction.jpg', 'redaction_out.jpg',info_types, min_likelihood='LIKELY', mime_type=None)
<ipython-input-41-fc41518240a7> in redact_image(project, filename, output_filename, info_types, min_likelihood, mime_type)
71 parent, inspect_config=inspect_config,
72 image_redaction_configs=image_redaction_configs,
---> 73 byte_item=byte_item)
74
75 # Write out the results.
~/anaconda3/lib/python3.6/site-packages/google/cloud/dlp_v2/gapic/dlp_service_client.py in redact_image(self, parent, inspect_config, image_redaction_configs, include_findings, byte_item, retry, timeout, metadata)
431 image_redaction_configs=image_redaction_configs,
432 include_findings=include_findings,
--> 433 byte_item=byte_item,
434 )
435 return self._inner_api_calls["redact_image"](
TypeError: {'name': 'FIRST_NAME'} has type dict, but expected one of: bytes, unicode
这是我的代码:
import mimetypes
def redact_image(project, filename, output_filename,
info_types, min_likelihood=None, mime_type=None):
"""Uses the Data Loss Prevention API to redact protected data in an image.
Args:
project: The Google Cloud project id to use as a parent resource.
filename: The path to the file to inspect.
output_filename: The path to which the redacted image will be written.
info_types: A list of strings representing info types to look for.
A full list of info type categories can be fetched from the API.
min_likelihood: A string representing the minimum likelihood threshold
that constitutes a match. One of: 'LIKELIHOOD_UNSPECIFIED',
'VERY_UNLIKELY', 'UNLIKELY', 'POSSIBLE', 'LIKELY', 'VERY_LIKELY'.
mime_type: The MIME type of the file. If not specified, the type is
inferred via the Python standard library's mimetypes module.
Returns:
None; the response from the API is printed to the terminal.
"""
# Import the client library
import google.cloud.dlp
# Instantiate a client.
dlp = google.cloud.dlp.DlpServiceClient()
# Prepare info_types by converting the list of strings into a list of
# dictionaries (protos are also accepted).
info_types = [{'name': info_type} for info_type in info_types]
# Prepare image_redaction_configs, a list of dictionaries. Each dictionary
# contains an info_type and optionally the color used for the replacement.
# The color is omitted in this sample, so the default (black) will be used.
image_redaction_configs = []
if info_types is not None:
for info_type in info_types:
image_redaction_configs.append({'info_type': info_type})
# Construct the configuration dictionary. Keys which are None may
# optionally be omitted entirely.
inspect_config = {
'min_likelihood': min_likelihood,
'info_types': info_types,
}
# If mime_type is not specified, guess it from the filename.
if mime_type is None:
mime_guess = mimetypes.MimeTypes().guess_type(filename)
mime_type = mime_guess[0] or 'application/octet-stream'
# Select the content type index from the list of supported types.
supported_content_types = {
None: 1, # "Unspecified"
'image/jpeg': 1,
'image/bmp': 2,
'image/png': 3,
'image/svg': 4,
'text/plain': 5,
}
content_type_index = supported_content_types.get(mime_type, 0)
# Construct the byte_item, containing the file's byte data.
with open(filename, mode='rb') as f:
byte_item = {'type': content_type_index, 'data': f.read()}
# Convert the project id into a full resource id.
parent = dlp.project_path(project)
# Call the API.
response = dlp.redact_image(
parent, inspect_config=inspect_config,
image_redaction_configs=image_redaction_configs,
byte_item=byte_item)
# Write out the results.
with open(output_filename, mode='wb') as f:
f.write(response.redacted_image)
print("Wrote {byte_count} to {filename}".format(
byte_count=len(response.redacted_image), filename=output_filename))
redact_image('python-pdf-redaction', 'redaction.jpg', 'redaction_out.jpg',info_types, min_likelihood='LIKELY', mime_type=None)
我不确定是否必须更改数据类型或其他内容,我找不到任何有关此问题的参考。你的反应会很受欢迎。
编辑:我已经解决了我的错误,我必须制作一个 info_types 列表来指定我想要编辑的 info_types。即 info_types = ['FIRST_NAME', 'LAST_NAME', 'EMAIL_ADDRESS']
解决方案
代码示例需要改进的一件事:
byte_item = {'type': content_type_index, 'data': f.read()}
可以改为
byte_item = {'type': 'IMAGE', 'data': f.read()}
然后你可以摆脱那个 mimetype foo 假设你知道你只发送支持的图像类型。
推荐阅读
- c++ - 64 位平台上的 WinAPI IcmpSendEcho
- flutter - Flutter – 连续两个文本,左边一个优雅地溢出
- android - Glide for Android - 没有发出 URL 请求,也不例外
- firebase - 如何区分活跃用户和非活跃用户
- sikuli - 如何在 sikuli 脚本中实现链式搜索
- excel - Excel TFS 加载项连接到 TFS 时崩溃
- windows - Windows UWP:单进程后台蓝牙任务未触发
- java - 在java中执行“fetchall”语句的更好方法是如何在json响应中进行转换?
- html - 发布缩略图未显示在主页上并出现错误
有人可以帮忙吗?缩略图未显示在我的博客文章中,并且在检查元素中显示以下错误:找不到标签的替换 [post.thumbnailUrl]
我尝试了我的方法,但我无法成功。
如何根据
- ios - 如何根据 Swift 4 中的发件人更改在动作功能中使用的属性