Showing posts with label Distributed Computing. Show all posts
Showing posts with label Distributed Computing. Show all posts

Wednesday, 30 March 2016

Book: Data Algorithms Recipes for Scaling Up with Hadoop and Spark

Very nice book which teaches how to implement mechine learning and data mining techniques such as NBC, recommender, clustering, etc. Implemented in java, the book provides codes in both hadoop mapreduce and apache spark in simple-to-understand and clean manner. Have re-coded most of the algorithms in the book except for chapters dealing with some of the bio stuff which i am not particularly interested at the moment.

Tuesday, 19 January 2016

Online Course: Patterns for Building Distributed Systems for The Enterprise

Completed this course. Quite good a mind opener, gain some new perspective of distributed computing, particularly the CQRS via MQ and ESB, as well as the append-only models (esp the event streaming and the historical model). This course gave some good ideas on how to proceed with the refactoring processing of a big data project working on.

Link: https://app.pluralsight.com/library/courses/cqrs-theory-practice/table-of-contents