hadoop - 无法从配置单元加载数据:-chgrp:'LONEWOLF\Sudarshan' 与组的预期模式不匹配
问题描述
我很新Hadoop
。我已经创建了员工表Hive
并使用文本文件在其中加载了数据现在我正在尝试加载保存在表中但得到关注的数据。我到处寻找,但一无所获。有人可以帮我弄这个吗?
hive> select * from employee;
-chgrp: 'LONEWOLF\Sudarshan' does not match expected pattern for group
Usage: hadoop fs [generic options]
[-appendToFile <localsrc> ... <dst>]
[-cat [-ignoreCrc] <src> ...]
[-checksum <src> ...]
[-chgrp [-R] GROUP PATH...]
[-chmod [-R] <MODE[,MODE]... | OCTALMODE> PATH...]
[-chown [-R] [OWNER][:[GROUP]] PATH...]
[-copyFromLocal [-f] [-p] [-l] [-d] <localsrc> ... <dst>]
[-copyToLocal [-f] [-p] [-ignoreCrc] [-crc] <src> ... <localdst>]
[-count [-q] [-h] [-v] [-t [<storage type>]] [-u] [-x] <path> ...]
[-cp [-f] [-p | -p[topax]] [-d] <src> ... <dst>]
[-createSnapshot <snapshotDir> [<snapshotName>]]
[-deleteSnapshot <snapshotDir> <snapshotName>]
[-df [-h] [<path> ...]]
[-du [-s] [-h] [-x] <path> ...]
[-expunge]
[-find <path> ... <expression> ...]
[-get [-f] [-p] [-ignoreCrc] [-crc] <src> ... <localdst>]
[-getfacl [-R] <path>]
[-getfattr [-R] {-n name | -d} [-e en] <path>]
[-getmerge [-nl] [-skip-empty-file] <src> <localdst>]
[-help [cmd ...]]
[-ls [-C] [-d] [-h] [-q] [-R] [-t] [-S] [-r] [-u] [<path> ...]]
[-mkdir [-p] <path> ...]
[-moveFromLocal <localsrc> ... <dst>]
[-moveToLocal <src> <localdst>]
[-mv <src> ... <dst>]
[-put [-f] [-p] [-l] [-d] <localsrc> ... <dst>]
[-renameSnapshot <snapshotDir> <oldName> <newName>]
[-rm [-f] [-r|-R] [-skipTrash] [-safely] <src> ...]
[-rmdir [--ignore-fail-on-non-empty] <dir> ...]
[-setfacl [-R] [{-b|-k} {-m|-x <acl_spec>} <path>]|[--set <acl_spec> <path>]]
[-setfattr {-n name [-v value] | -x name} <path>]
[-setrep [-R] [-w] <rep> <path> ...]
[-stat [format] <path> ...]
[-tail [-f] <file>]
[-test -[defsz] <path>]
[-text [-ignoreCrc] <src> ...]
[-touchz <path> ...]
[-truncate [-w] <length> <path> ...]
[-usage [cmd ...]]
Generic options supported are
-conf <configuration file> specify an application configuration file
-D <property=value> use value for given property
-fs <file:///|hdfs://namenode:port> specify default filesystem URL to use, overrides 'fs.defaultFS'
property from configurations.
-jt <local|resourcemanager:port> specify a ResourceManager
-files <comma separated list of files> specify comma separated files to be copied to the map
reduce cluster
-libjars <comma separated list of jars> specify comma separated jar files to include in the
classpath.
-archives <comma separated list of archives> specify comma separated archives to be unarchived on
the compute machines.
The general command line syntax is
command [genericOptions] [commandOptions]
Usage: hadoop fs [generic options] -chgrp [-R] GROUP PATH...
OK
4 rows selected (3.61 seconds)
hive>
奇怪的是它占用了 4 行空间并将其显示为空白。
无法理解发生了什么?
表创建:
create table employee (Id int, Name string , Salary float)
row format delimited
fields terminated by ',' ;
加载表中的数据:
load data local inpath 'C:\employee.txt' into table employee;
employee.txt
文件
1 "Anurag" 40000.0
2 "Ayush" 42000.0
3 "Akhil" 44000.0
4 "Aarav" 46000.0
解决方案
当您创建了一个带有 的表时','
delimited fields
,这意味着源必须具有字段由 ',' 分隔的行
你的源文件
1 "Anurag" 40000.0
2 "Ayush" 42000.0
3 "Akhil" 44000.0
4 "Aarav" 46000.0
是一个空格分隔的字段,应该是
1,"Anurag",40000.0
2,"Ayush",42000.0
3,"Akhil",44000.0
4,"Aarav",46000.0
或者你创建的表应该是
create table employee (Id int, Name string , Salary float)
row format delimited
fields terminated by ' ' ;
推荐阅读
- sass - OptionParser::InvalidOption: 无效选项:--watch:css 你的意思是?使用 --trace 进行回溯
- javascript - 如何在内部执行 GQL 查询?
- python - 融化/取消透视具有多组值的数据集
- javascript - React - 数字 0 不保存到本地存储所有其他数字保持同步
- android - 找不到 attr 时如何使自定义视图回退到主题?
- deep-learning - 如何在使用 pytorch_geometric Data 对象时使 DataLoader 加载正确的 batch_size 数据?
- python - matplotlib 表中的单元格。部分文本不同的颜色
- java - Eclipse Java 外部 jar 不编译(但运行)
- c++ - 在 Win CE 环境中使用 C++ 运行 .bat 文件
- arrays - 如何使用变量的值作为标识符来声明/修改 BASH 中的数组?