A simple MapReduce framework in Java
Modern Java - A Guide to Java 8
C4.5 is a commonly used in decision tree algorithm in data mining for classification. The existing C4.5 algorithm implementation is running in serial way. We are implementing this algorithm using Hadoop MapReduce framework which can run parallel in multiple system.
Used Mapreduce on a Hadoop environment at CCR(University at Buffalo) to compute the monthly volatility of about 3000 stocks with each having data of about three years .There were a total of 40000 files.