kylin-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "fengYu (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (KYLIN-1280) Convert Cuboid Data to HFile failed when hbase in different HDFS
Date Fri, 22 Jan 2016 14:04:39 GMT

    [ https://issues.apache.org/jira/browse/KYLIN-1280?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15112429#comment-15112429
] 

fengYu commented on KYLIN-1280:
-------------------------------

[~liyang.gmt8@gmail.com]This is caused by KYLIN-957, because I deploy kylin with two HDFS,
one for hadoop(hdfs://A) and another is for hbase(hdfs://B), in step 'Convert Cuboid Data
to HFile', the mr job input is at hdfs://A and output is to hdfs://B, But in most hadoop cluster,
it can not recognize hdfs://B, so the job always failed.
One way to solve it just redeploy hadoop cluster and make it recognize hdfs://B, or use this
patch to change name service to current master name node before execute the job.


> Convert Cuboid Data to HFile failed when hbase in different HDFS
> ----------------------------------------------------------------
>
>                 Key: KYLIN-1280
>                 URL: https://issues.apache.org/jira/browse/KYLIN-1280
>             Project: Kylin
>          Issue Type: Bug
>    Affects Versions: 2.0
>            Reporter: fengYu
>         Attachments: 0001-transform-path-in-other-HDFS-to-real-name-node-path.patch
>
>
> I deploy kylin-2.0 with hbase which rely on a different HDFS with hadoop cluster, so
I config this property 'kylin.hbase.cluster.fs' = hdfs://A, the name service is different
with 'fs.defaultFS' in hadoop cluster which is hdfs://B.
> In the step 'Convert Cuboid Data to HFile' execute failed, error log is :
> java.io.IOException: Failed to run job : Unable to map logical nameservice URI 'hdfs://A'
to a NameNode. Local configuration does not have a failover proxy provide
> r configured.
>         at org.apache.hadoop.mapred.YARNRunner.submitJob(YARNRunner.java:300)
>         at org.apache.hadoop.mapreduce.JobSubmitter.submitJobInternal(JobSubmitter.java:432)
>         at org.apache.hadoop.mapreduce.Job$10.run(Job.java:1285)
>         at org.apache.hadoop.mapreduce.Job$10.run(Job.java:1282)
>         at java.security.AccessController.doPrivileged(Native Method)
>         at javax.security.auth.Subject.doAs(Subject.java:415)
>         at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1614)
>         at org.apache.hadoop.mapreduce.Job.submit(Job.java:1282)
>         at org.apache.kylin.engine.mr.common.AbstractHadoopJob.waitForCompletion(AbstractHadoopJob.java:129)
>         at org.apache.kylin.storage.hbase.steps.CubeHFileJob.run(CubeHFileJob.java:93)
>         at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:70)
>         at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:84)
>         at org.apache.kylin.engine.mr.common.MapReduceExecutable.doWork(MapReduceExecutable.java:119)
>         at org.apache.kylin.job.execution.AbstractExecutable.execute(AbstractExecutable.java:107)
>         at org.apache.kylin.job.execution.DefaultChainedExecutable.doWork(DefaultChainedExecutable.java:50)
>         at org.apache.kylin.job.execution.AbstractExecutable.execute(AbstractExecutable.java:107)
>         at org.apache.kylin.job.impl.threadpool.DefaultScheduler$JobRunner.run(DefaultScheduler.java:124)
>         at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
>         at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
>         at java.lang.Thread.run(Thread.java:745)
> I think it is because node manager in hadoop cluster can not recognition hdfs://A in
they config. So, I have to tranform the path hdfs://A/path/to/hfile to hdfs://namenode_ip:port/path/to/hfile
before execute this step. and it works for me.  
> Here is my patch.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message