首页 > 解决方案 > Tensorflow2 形状不匹配警告,仍在训练中

问题描述

我正在尝试使用 Keras/TF2.3.0 进行多标签分类,其中我有 50 个特征并在五个类别之间进行分类。我收到以下警告,尽管模型仍在训练,这让我感到困惑。


>>> model.fit(train_dataset, epochs=5, validation_data=val_dataset)

Epoch 1/5 WARNING:tensorflow:Model 是用形状 (128, 1, 50) 构造的输入 Tensor("input_1:0", shape=(128, 1, 50), dtype=float32),但它被调用了形状不兼容的输入(无,50)。

WARNING:tensorflow:Model 是用形状 (128, 1, 50) 构造的输入 Tensor("input_1:0", shape=(128, 1, 50), dtype=float32),但它是在不兼容的输入上调用的形状(无,50)。

1/5 [.......................] - ETA:0s - 损失:0.6996WARNING:tensorflow:模型已构建输入 Tensor("input_1:0", shape=(128, 1, 50), dtype=float32) 的形状为 (128, 1, 50),但在形状不兼容的输入 (None, 50) 上调用了它。59/59 [==============================] - 0s 2ms/步 - 损失:0.6941 - val_loss:0.6935

纪元 2/5 59/59 [===============================]...

下面是我的完整代码,其中包含用于重现错误的随机数据。我在弄乱我的 NN 架构(或者我的dfs_to_tfds函数?)以接受具有分布在 TF 中的类之间的num_vars特征和输出值的输入记录?num_classes

import numpy as np
import pandas as pd
from sklearn.model_selection import train_test_split
from tensorflow.keras.layers import Input, Dense, Flatten, Conv1D, AveragePooling1D
from tensorflow.keras.models import Model
import tensorflow as tf

# setup example input data and labels
num_rows = 10_000
num_vars = 50
num_classes = 5
data = np.random.rand(num_rows, num_vars)
labels = np.random.rand(num_rows, num_classes)

# convert input data to TF.data datasets
bs=128
def dfs_to_tfds(features, targets, bs):
  return tf.data.Dataset.from_tensor_slices((features, targets)).batch(bs)

X_train, X_val, y_train, y_val = train_test_split(data, labels)

train_dataset = dfs_to_tfds(X_train, y_train, bs)
val_dataset = dfs_to_tfds(X_val, y_val, bs)

# setup model
inputs = Input(shape = (1, num_vars), batch_size=bs)
h = Dense(units=32, activation='relu')(inputs)
h = Dense(units=32, activation='relu')(h)
h = Dense(units=32, activation='relu')(h)
outputs = Dense(units=num_classes, activation='sigmoid')(h)

model = Model(inputs=inputs, outputs=outputs)

model.compile(optimizer='rmsprop', 
              loss=['binary_crossentropy'], #tf.keras.losses.MSLE
              metrics=None, 
              loss_weights=None, 
              run_eagerly=None)

# train model
model.fit(train_dataset, epochs=5, validation_data=val_dataset)

标签: pythontensorflowkerasmultilabel-classification

解决方案


利用

inputs = Input(shape=num_vars)

并在拟合模型时指定批量大小:

model.fit(train_dataset, epochs=5, validation_data=val_dataset, batch_size=bs)

您的数据不是在子批次中预先组织的,因此您不必与输入形状一起指定它,而是在拟合时指定它。因此,model.fitbatch_size在拟合模型时会自动从您的输入数据中提取批次


推荐阅读