Big Data: Principles and Best Practices of Scalable Realtime Data Systems
N**S
Deceiving title for an outdated book
The title "Big Data" is totally deceiving. It makes you expect broad coverage of the subject. Instead, the book is exclusively dedicated to the so-called Lambda Architecture, presented as the only solution for handling big data in real time. Too bad: in the meantime, stateful stream processing has become a mature player and the Lambda Architecture has been recognised as obsolete and problematic.
S**N
I really like this book
I really like this book because you learn a lot from the very start. There are not too many programming details, but that is good because it gives you an understanding of the general picture.
P**S
Five Stars
Excellent book; it explains the Lambda Architecture in a clear, concise manner with practical tips, tricks and examples.
C**N
Interesting - worth it
Interesting book providing a high-level intro to Big Data architecture.
J**N
A lot of program code which wasn't necessary. Information was scattered all over the chapters.
All I wanted to know was how to manage data integrity with incremental loads in the Lambda Architecture if you do not need a Speed Layer. The answer is that you do not process data incrementally; you process the entire data set, which obviously isn't practical. I feel that the architecture is probably better suited to solving specific types of problems. The idea that I liked was having an Immutable Layer which you can replay and reprocess to build all the facts when things go wrong.
A**R
Pleasant and interesting
Big Data and the technologies around it can be very hard and low-level to understand. With this book I found the subject clear, concise and explained in such a way that anyone with little or no background in IT can understand it. It offers very good insight into Big Data and is also helpful for understanding which tools are best for achieving good results with Hadoop and other technologies. I found it very interesting, well written and pleasant to read as well. This book helped me a lot and I'm sure it can help a lot of beginners with this subject.
J**N
Help, my entire technology ecosystem has just changed under my feet in the last few years (again).
Help, my entire technology ecosystem has just changed under my feet in the last few years (again). I need to rapidly get up to speed on what the new tools are, how everything fits together, and how to solve common problem types. I need to know where to start learning and exploring this new technology. This book is a comprehensive overview of Lambda Architecture (Batch View, Serving Layer, Speed Layer) and the Big Data ecosystem of tools (Hadoop, MapReduce, JCascalog, Kafka, Trident, Zookeeper, Storm, Cassandra, ElephantDB, HyperLogLog, Bloom Filters, Functional and Distributed Programming). It took a few days to read this book cover to cover. It describes an example real-time data analytics application capable of working on terabyte+ datasets in a clustered server environment. The chapters alternate: one covers the high-level design architecture of each piece, then a second focuses on implementation details with code samples in each of the above technologies. I would recommend additionally reading the website documentation for each tool.
S**Y
It's about Lambda Architecture, not a Big Data technology survey
Very misleading title and blurb. This is not a book to teach the principles of Big Data systems, nor an overview of existing solutions. Rather, it is an introduction to the very specific (and limited / flawed) Lambda Architecture created by the author. The text is very wordy; it could easily be 1/4 of its current size, and there is a very depressing lack of any summary of the solutions that are out there. You'll learn a lot more about JCascalog and ElephantDB than you will about the field.