Reran the tests last night and only one (new) test failed:
[junit] Test org.apache.hadoop.metrics2.impl.TestMetricsSystemImpl FAILED
So I assume the GangliaMetrics test was fixed by the hosts file too.
I found MAPREDUCE-3894, which seems to explain the failure and suggests it's intermittent. I tested this by rerunning just that test and, lo and behold, it passed:
# ant test -Dtestcase=TestMetricsSystemImpl
# BUILD SUCCESSFUL
# Total time: 24 seconds
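A single pass doesn't say much about an intermittent test, so rerunning it a few times and counting the failures gives a better picture. Below is a minimal sketch of that idea. The `rerun` helper is a hypothetical name of my own (it is not part of the Hadoop build), and it assumes the command exits non-zero when the test fails.

```shell
# Hypothetical helper: run a command N times and report how many runs failed.
# Assumes the command signals failure through a non-zero exit status.
rerun() {
  runs=$1
  shift
  failures=0
  i=1
  while [ "$i" -le "$runs" ]; do
    # Discard the command's output; only the exit status is counted.
    if ! "$@" >/dev/null 2>&1; then
      failures=$((failures + 1))
    fi
    i=$((i + 1))
  done
  echo "$failures of $runs runs failed"
  # Succeed only if every run passed.
  [ "$failures" -eq 0 ]
}

# Example invocation (left commented out here):
# rerun 5 ant test -Dtestcase=TestMetricsSystemImpl
```

If some runs pass and some fail with no code changes in between, that fits the intermittent behaviour described in the JIRA.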
Finally, a clean test run (even with my patch attached). Now I just need someone to look at / approve my patch:
https://issues.apache.org/jira/browse/MAPREDUCE-3807
Any clue as to how I draw someone's attention to this?
Congratulations. It sounds like we have similar jobs. I am actually having some trouble with 1.0.0: I am still getting failing streaming tests in contrib. I got 0.20.203.0 and 0.20.205.0 to build fine, though. Please get in touch if you want to swap notes.
The list of failures I encountered and the seemingly relevant JIRAs, which include fixes that worked for me:
HADOOP-7836 [junit] Test org.apache.hadoop.ipc.TestSaslRPC FAILED
MAPREDUCE-3357 [junit] Test org.apache.hadoop.filecache.TestMRWithDistributedCache FAILED
MAPREDUCE-2073 [junit] Test org.apache.hadoop.filecache.TestTrackerDistributedCacheManager FAILED
HBASE-3285 [junit] Test org.apache.hadoop.hdfs.TestFileAppend4 FAILED
MAPREDUCE-3594 [junit] Test org.apache.hadoop.streaming.TestUlimit FAILED
Too many open files [junit] Test org.apache.hadoop.mapred.TestCapacityScheduler FAILED
HADOOP-7949 [junit] Test org.apache.hadoop.ipc.TestSaslRPC FAILED
HADOOP-7949 [junit] Test org.apache.hadoop.metrics2.impl.TestGangliaMetrics FAILED
MAPREDUCE-3894 [junit] Test org.apache.hadoop.metrics2.impl.TestMetricsSystemImpl FAILED # Works intermittently
That, plus ant 1.7.2 and a umask of 0022, seemed to be a working combination for me on my CENTOS5u4 VM.
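For anyone wanting to compare environments, here is a rough sketch of how the umask could be set and the open-file limit inspected in the shell before starting the build. Linking the "Too many open files" entry to the `ulimit -n` value is my assumption; the list above does not say how that failure was resolved.

```shell
# Set the umask that worked in the combination above (affects this shell only).
umask 0022

# Show the effective umask and the per-process open-file limit.
# A low open-file limit is one possible (assumed) cause of
# "Too many open files" errors during the test run.
echo "umask: $(umask)"
echo "open files limit: $(ulimit -n)"
```

Both settings apply only to the current shell session, so the build has to be started from the same shell.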