首页 > 解决方案 > 如何编译具有 cpu sse 支持的 tensorflow 静态库?

问题描述

简单来说,我需要将 tensorflow 构建成一个静态库,通过 tensorflow/contrib/makefile/build_all_linux.sh 下的脚本,我已经成功实现了这个目标。但是在将这个 libtensorflow-core.a 链接到我的测试程序后,它表明性能出奇的差。我发现存在如下一堆日志。静态库中似乎没有内置 cpu sse 支持。任何帮助,将不胜感激。

2018-05-04 13:45:44.304314: E tensorflow/core/framework/op_kernel.cc:1242] OpKernel ('op: "EncodeProto" device_type: "CPU"') for unknown op: EncodeProto
2018-05-04 13:45:44.304648: E tensorflow/core/framework/op_kernel.cc:1242] OpKernel ('op: "DecodeProtoV2" device_type: "CPU"') for unknown op: DecodeProtoV2
2018-05-04 13:45:44.304720: E tensorflow/core/framework/op_kernel.cc:1242] OpKernel ('op: "PopulationCount" device_type: "CPU" constraint { name: "T" allowed_values { list { type: DT_INT64 } } }') for unknown op: PopulationCount 
2018-05-04 13:45:44.304732: E tensorflow/core/framework/op_kernel.cc:1242] OpKernel ('op: "PopulationCount" device_type: "CPU" constraint { name: "T" allowed_values { list { type: DT_INT32 } } }') for unknown op: PopulationCount 
2018-05-04 13:45:44.304741: E tensorflow/core/framework/op_kernel.cc:1242] OpKernel ('op: "PopulationCount" device_type: "CPU" constraint { name: "T" allowed_values { list { type: DT_INT16 } } }') for unknown op: PopulationCount 
2018-05-04 13:45:44.304750: E tensorflow/core/framework/op_kernel.cc:1242] OpKernel ('op: "PopulationCount" device_type: "CPU" constraint { name: "T" allowed_values { list { type: DT_UINT16 } } }') for unknown op: PopulationCount 
2018-05-04 13:45:44.304759: E tensorflow/core/framework/op_kernel.cc:1242] OpKernel ('op: "PopulationCount" device_type: "CPU" constraint { name: "T" allowed_values { list { type: DT_INT8 } } }') for unknown op: PopulationCount 
2018-05-04 13:45:44.304768: E tensorflow/core/framework/op_kernel.cc:1242] OpKernel ('op: "PopulationCount" device_type: "CPU" constraint { name: "T" allowed_values { list { type: DT_UINT8 } } }') for unknown op: PopulationCount 
2018-05-04 13:45:44.304910: E tensorflow/core/framework/op_kernel.cc:1242] OpKernel ('op: "MutableDenseHashTable" device_type: "CPU" constraint { name: "key_dtype" allowed_values { list { type: DT_INT64 } } } constraint { name: "value_dtype" allowed_values { list { type: DT_VARIANT } } }') for unknown op: MutableDenseHashTable

标签: c++tensorflowsse

解决方案


推荐阅读