Showing posts with label Spark. Show all posts
Showing posts with label Spark. Show all posts

Wednesday, 30 March 2016

Book: Data Algorithms Recipes for Scaling Up with Hadoop and Spark

Very nice book which teaches how to implement mechine learning and data mining techniques such as NBC, recommender, clustering, etc. Implemented in java, the book provides codes in both hadoop mapreduce and apache spark in simple-to-understand and clean manner. Have re-coded most of the algorithms in the book except for chapters dealing with some of the bio stuff which i am not particularly interested at the moment.

Book: Apache Spark Graph Processing

This book provided me guide on how to use apache spark graphx to graph processing in my project. While the book provides only basic implementation and intro to graphx features such as visualization, aggregateMessages, and pregel, I gain some useful insights after re-implemented most of the codes in the book (knowledge about scala required)

Link: https://www.packtpub.com/big-data-and-business-intelligence/apache-spark-graph-processing