chukwa-user mailing list archives

From Corbin Hoenes <>
Date Thu, 28 Jan 2010 19:15:21 GMT
I'm having some difficulty with the demux part of setting up Chukwa. I assume I am supposed
to run the script to start up all the map reduce jobs that handle demux and archiving.

My goal is to pull the logs we are collecting out of the sink files and into something we
can start to run our pig scripts on.

When I run start-data-processors, though, it gives me this:

org.apache.hadoop.mapred.InvalidInputException: Input path does not exist: file:/chukwa/demuxProcessing/mrInput
	at org.apache.hadoop.mapred.FileInputFormat.listStatus(
	at org.apache.hadoop.mapred.SequenceFileInputFormat.listStatus(
	at org.apache.hadoop.mapred.FileInputFormat.getSplits(
	at org.apache.hadoop.mapred.JobClient.writeOldSplits(
	at org.apache.hadoop.mapred.JobClient.submitJobInternal(
	at org.apache.hadoop.mapred.JobClient.submitJob(
	at org.apache.hadoop.mapred.JobClient.runJob(

This makes it seem like I need to configure it to connect to HDFS rather than file:/
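
For reference, here is a minimal sketch of the kind of setting that usually controls this. It assumes the demux job resolves unqualified paths against Hadoop's default filesystem, which in Hadoop 0.20 is set by the fs.default.name property (typically in core-site.xml), and that Chukwa picks up that Hadoop configuration directory. The hostname and port are placeholders, not values from my setup.

```xml
<?xml version="1.0"?>
<!-- Sketch only: assumes this file is the Hadoop config that the demux job loads. -->
<configuration>
  <property>
    <name>fs.default.name</name>
    <!-- Placeholder NameNode address; substitute the real host and port. -->
    <value>hdfs://namenode.example.com:9000</value>
  </property>
</configuration>
```

If that property is missing or the file is not on the classpath, Hadoop falls back to the local filesystem, which would be consistent with the file:/ prefix in the error above.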

The only docs I've found are here:
Is there a guide to configuring chukwa-demux-conf.xml?

I also noticed tries to start which doesn't exist
for me. Do I need to get this script?
