python - 无法启动 MapReduce 作业
问题描述
当我尝试开始简单的 wordcount mapreduce 示例时,我得到了这个
[cloudera-scm@ivana-namenode2 /opt/MapReduce/wordcount]$ pydoop script wc.py /user/cloudera-scm/Mapduce/wordcount/data/text /user/cloudera-scm/MapReduce/wordcount/output
18/08/14 14:11:49 INFO Configuration.deprecation: mapred.cache.files is deprecated. Instead, use mapreduce.job.cache.files
18/08/14 14:11:49 INFO Configuration.deprecation: mapred.job.name is deprecated. Instead, use mapreduce.job.name
18/08/14 14:11:49 INFO Configuration.deprecation: mapred.reduce.tasks is deprecated. Instead, use mapreduce.job.reduces
18/08/14 14:11:49 INFO Configuration.deprecation: hadoop.pipes.java.recordreader is deprecated. Instead, use mapreduce.pipes.isjavarecordreader
18/08/14 14:11:49 INFO Configuration.deprecation: mapred.cache.archives is deprecated. Instead, use mapreduce.job.cache.archives
18/08/14 14:11:49 INFO Configuration.deprecation: hadoop.pipes.java.recordwriter is deprecated. Instead, use mapreduce.pipes.isjavarecordwriter
18/08/14 14:11:49 INFO Configuration.deprecation: mapred.textoutputformat.separator is deprecated. Instead, use mapreduce.output.textoutputformat.separator
18/08/14 14:11:49 INFO Configuration.deprecation: mapred.compress.map.output is deprecated. Instead, use mapreduce.map.output.compress
18/08/14 14:11:50 INFO client.ConfiguredRMFailoverProxyProvider: Failing over to rm59
18/08/14 14:11:51 WARN mapreduce.JobResourceUploader: No job jar file set. User classes may not be found. See Job or Job#setJar(String).
18/08/14 14:11:51 INFO input.FileInputFormat: Total input paths to process : 1
18/08/14 14:11:52 INFO mapreduce.JobSubmitter: number of splits:1
18/08/14 14:11:52 INFO mapreduce.JobSubmitter: Submitting tokens for job: job_1534248543348_0007
18/08/14 14:11:52 INFO mapred.YARNRunner: Job jar is not present. Not adding any jar to the list of resources.
18/08/14 14:11:52 INFO impl.YarnClientImpl: Submitted application application_1534248543348_0007
18/08/14 14:11:52 INFO mapreduce.Job: The url to track the job: http://ivana-namenode2:8088/proxy/application_1534248543348_0007/
18/08/14 14:11:52 INFO mapreduce.Job: Running job: job_1534248543348_0007
18/08/14 14:15:07 INFO mapreduce.Job: Job job_1534248543348_0007 running in uber mode : false
18/08/14 14:15:07 INFO mapreduce.Job: map 0% reduce 0%
18/08/14 14:15:07 INFO mapreduce.Job: Job job_1534248543348_0007 failed with state KILLED due to: Application killed by user.
18/08/14 14:15:07 INFO mapreduce.Job: Counters: 0
18/08/14 14:15:07 INFO util.ExitUtil: Exiting with status 1
Traceback (most recent call last):
File "/usr/local/bin/pydoop", line 11, in <module>
sys.exit(main())
File "/usr/local/lib/python2.7/site-packages/pydoop/app/main.py", line 81, in main
args.func(args, unknown)
File "/usr/local/lib/python2.7/site-packages/pydoop/app/script.py", line 130, in run
scripter.run()
File "/usr/local/lib/python2.7/site-packages/pydoop/app/script.py", line 119, in run
submitter.run()
File "/usr/local/lib/python2.7/site-packages/pydoop/app/submit.py", line 376, in run
logger=self.logger, keep_streams=False)
File "/usr/local/lib/python2.7/site-packages/pydoop/hadut.py", line 302, in run_class
keep_streams=keep_streams)
File "/usr/local/lib/python2.7/site-packages/pydoop/hadut.py", line 167, in run_cmd
raise RunCmdError(p.returncode, ' '.join(_args), error)
pydoop.hadut.RunCmdError: command exited with 1 status
此外,当我尝试启动 jar 示例时,我遇到了同样的问题,所以这与代码无关。
下面,您有指向我的 main.py、script.py 和 submit.py 的链接。我不能把它们放在这里,因为堆栈溢出限制最多 30000 个字符
https://www.dropbox.com/sh/1mabmgfywa38xib/AAB946SUK8-A8VfWfieQG_4ha?dl=0
解决方案
推荐阅读
- javascript - 即使在 AVA 单元测试中选择退出,Mixpanel 也会返回响应对象而不是未定义
- node.js - 主题 gmail api node.js 中的特殊字符
- javascript - 为什么我的触发箭头键事件在 Java Script 中不起作用?
- jquery - 使用 Express 数据库结果更新下拉列表
- c# - 层次结构问题 Unity
- python - time.sleep() 是否足以安全地为简单的网络爬虫创建延迟?
- node.js - 无法在 fetch 调用中使用成员函数
- android - 结合android ViewModel和数据绑定的最佳实践
- android - 如何找出我的视图使用哪些主题颜色?
- c - 为什么使用 SDL I/O 函数而不是 stdio 函数?