首页 > 解决方案 > 如何在 DynamoDB 中扫描列表中的文本?

问题描述

在 DynamoDB 中,我有一个具有以下结构的表。
actions字段”包含所有信息(这是我要搜索的字段),orderId它是主键

{
  "actions": [
    {
      "actionDescription": "8f23029def1d6baa4",
      "actionTitle": "UNDEFINED_ACTION",
      "timestamp": 1533730680,
      "user": {
        "fullName": "XXXXX",
        "userName": "xxxxx@xxxx.xxx",
      }
    },
    {
      "actionDescription": "21857e61037bc29ec",
      "actionTitle": "UNDEFINED_ACTION",
      "timestamp": 1533731788,
      "user": {
        "fullName": "XXXXX",
        "userName": "xxxxx@xxxx.xxx",
      }
    },
    {
      "actionDescription": "cf10abd44e24cef56",
      "actionTitle": "UNDEFINED_ACTION",
      "timestamp": 1533731788,
      "user": {
        "fullName": "XXXXX",
        "userName": "xxxxx@xxxx.xxx",
      }
    },
    {
      "actionDescription": "7787fe7a5bf4d22de",
      "actionTitle": "UNDEFINED_ACTION",
      "timestamp": 1533731789,
      "user": {
        "fullName": "OOOOOO",
        "userName": "ooooo@oooo.ooo",
      }
    },
    {
      "actionDescription": "9528c439021f504bf",
      "actionTitle": "UNDEFINED_ACTION",
      "timestamp": 1533731789,
      "user": {
        "fullName": "XXXXX",
        "userName": "xxxxx@xxxx.xxx",
      }
    },
    {
      "actionDescription": "bfba100e0e54934b2",
      "actionTitle": "UNDEFINED_ACTION",
      "timestamp": 1533731789,
      "user": {
        "fullName": "XXXXX",
        "userName": "xxxxx@xxxx.xxx",
      }
    },
    {
      "actionDescription": "f789dc12f1dbe3be2",
      "actionTitle": "UNDEFINED_ACTION",
      "timestamp": 1533731789,
      "user": {
        "fullName": "OOOOOO",
        "userName": "ooooo@oooo.ooo",
      }
    },
    {
      "actionDescription": "4cd6b68dfea7cf8ee",
      "actionTitle": "UNDEFINED_ACTION",
      "timestamp": 1533731789,
      "user": {
        "fullName": "XXXXX",
        "userName": "xxxxx@xxxx.xxx",
      }
    },
    {
      "actionDescription": "1e3a0e95f8e5106d7",
      "actionTitle": "UNDEFINED_ACTION",
      "timestamp": 1533731790,
      "user": {
        "fullName": "OOOOOO",
        "userName": "ooooo@oooo.ooo",
      }
    }
  ],
  "orderId": "13aae31"
}

我想做的是使 PHP 中的扫描项能够通过userName. 或通过操作数组中的任何字段(时间戳、actionTitle 等)。
Bellow 这是我尝试使用的众多术语之一,但我无法取得任何结果

$params = [
 'TableName'                 => $this->tableName,
 'FilterExpression'          => "userName = :searchTerm",
 'ExpressionAttributeValues' => [
     ':searchTerm' => 'ooooo@oooo.ooo',
  ],
 'ReturnConsumedCapacity'    => 'TOTAL',
];
$results = $this->dynamoDbClient->scan($params);

你能告诉我我缺少什么来指导我吗?
另外,请注意:我不想获得特定的orderId,我想获得orderIds包含 searchTerm 的 ALL (在这种情况下userName

标签: phpamazon-dynamodbdynamodb-queries

解决方案


使用此项目架构的最佳选择是自己过滤表格项目。也就是扫描没有过滤表达式的表,自己写代码过滤结果。没有过滤器表达式的扫描将消耗相同数量的读取容量单位。

您可以将过滤器表达式设置为类似的内容,但这是不可扩展的,并且仅当您在操作列表中有固定数量的项目时才有效。

  actions[0].user.userName == :searchTerm OR actions[1].user.userName == :searchTerm OR actions[2].user.userName == :searchTerm OR ....

如果您需要复杂的搜索能力,最好使用专用的搜索数据库。AWS 围绕此提供了两种服务,AWS CloudSearch 和 AWS ElasticSearch。您可以使用 DynamoDB 流来使您的搜索索引保持最新。

如果您设置为使用过滤器扫描 DynamoDB 表,您可以重构您的结构以包含在集合(或连接字符串)中具有所有可搜索信息的其他属性

{
  "actions": [....],
  "actionsDescriptions": Set["8f23029def1d6baa4", "21857e61037bc29ec", "cf10abd44e24cef56", "7787fe7a5bf4d22de", "9528c439021f504bf", "bfba100e0e54934b2", "f789dc12f1dbe3be2", "4cd6b68dfea7cf8ee", "1e3a0e95f8e5106d7"],
  "actionTitles": Set["UNDEFINED_ACTION"],
  "timestamps": Set[1533730680, 1533731788, 1533731789, 1533731790],
  "user_fullNames": Set["XXXXX"],
  "user_userNames": Set["ooooo@oooo.ooo", "xxxxx@xxxx.xxx"],
  "orderId": "13aae31"
}

请注意,您必须使用 Set(或将所有值连接成字符串),因为这些contains函数仅适用于字符串和集合。

然后你可以使用这样的过滤器表达式

contains(user_userNames, :searchTerm)

推荐阅读