《Big Data》


  • Title: 《Big Data》: Principles and best practices of scalable realtime data systems
  • Category: Programming
  • Authors: Nathan Marz, James Warren
  • Publisher: Manning Publications
  • Publication date: 2015-5-10
  • Price: USD 49.99
  • Binding: Paperback
  • Pages: 328

《Big Data》 Description:

Services like social networks, web analytics, and intelligent e-commerce often need to manage data at a scale too big for a traditional database. Complexity increases with scale and demand, and handling big data is not as simple as just doubling down on your RDBMS or rolling out some trendy new technology. Fortunately, scalability and simplicity are not mutually exclusive; you just need to take a different approach. Big data systems use many machines working in parallel to store and process data, which introduces fundamental challenges unfamiliar to most developers.

Big Data teaches you to build these systems using an architecture that takes advantage of clustered hardware along with new tools designed specifically to capture and analyze web-scale data. It describes a scalable, easy-to-understand approach to big data systems that can be built and run by a small team. Following a realistic example, this book guides readers through the theory of big data systems, how to implement them in practice, and how to deploy and operate them once they're built.

Big Data shows you how to build the back-end for a real-time service called SuperWebAnalytics.com, our version of Google Analytics. As you read, you'll discover that many standard RDBMS practices become unwieldy with large-scale data. To handle the complexities of Big Data and distributed systems, you must drastically simplify your approach. This book introduces a general framework for thinking about big data, and then shows how to apply technologies like Hadoop, Thrift, and various NoSQL databases to build simple, robust, and efficient systems to handle it.

About the authors Nathan Marz, James Warren:

Nathan Marz is an engineer at Twitter. He was previously Lead Engineer at BackType, a marketing intelligence company that was acquired by Twitter in July 2011. He is the author of two major open source projects: Storm, a distributed realtime computation system, and Cascalog, a tool for processing data on Hadoop. He is a frequent speaker and writes a blog at nathanmarz.com.

Sam Ritchie is an engineer at Twitter who uses Cascalog and ElephantDB to process and analyze many terabytes of data in near real-time. He is also the lead developer on FORMA, an open-source deforestation monitoring system in use by a number of top research institutions. He is a committer on Cascalog, ElephantDB, Pallet, and a number of other open source Clojure projects.

《Big Data》 Table of contents:

1. A new paradigm for Big Data
2. Data model for Big Data
3. Data storage on the batch layer
4. MapReduce and batch processing
5. Batch processing with Cascading
6. Basics of the serving layer
7. Storm and the speed layer
8. Incremental batch processing
9. Layered architecture in-depth
10. Piping the system together
11. Future of NoSQL and Big Data processing
Appendix A: Hadoop
Appendix B: Thrift
Appendix C: Storm









“This book introduces Oracle database tuning methods. Modeled on a real large-scale tuning project, it records in diary form how the DBAs of a tuning team...”

Game Programming Patterns

《Game Programming Patterns》

Robert Nystrom.Genever Benning.2014-11-2

“The biggest challenge facing many game programmers is comple...”

Your UNIX/Linux

《Your UNIX/Linux》

Das, Sumitabha..2012-1

“"Your UNIX/Linux: The Ultimate Guide", written with both use...”

Ruby Programming

《Ruby Programming》


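“《Ruby Programming: Learn Programming from the Creator of Ruby (2nd Edition)》 is written so that even readers with no programming experience at all can flexibly use...”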

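Swifter (2nd Edition): 100 Essential Tips for Swift 2 Development

《Swifter (2nd Edition): 100 Essential Tips for Swift 2 Development》


“《Swifter (2nd Edition): 100 Essential Tips for Swift 2 Development》 is aimed at commercial practice and skill advancement; its content and examples are based entirely on Sw...”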

The Game Jam Survival Guide

《The Game Jam Survival Guide》

Christer Kaitila.Packt Publishing.2012-4-2


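Effective Ruby: 48 Suggestions for Improving Ruby Programs

《Effective Ruby: 48 Suggestions for Improving Ruby Programs》

Peter J. Jones.China Machine Press.2016-1

“If you are an experienced Ruby programmer, this book can help you harness the full power of Ruby to write more robust, efficient, maintainable, and easily executed code. Peter ...”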

Building Web Apps for Google TV

《Building Web Apps for Google TV》

Ferrate, Andres; Surya, Amanda; Lee, Daniels..2011-7

“By integrating the Web with traditional TV, Google TV offers...”



Jeffrey Richter.Tsinghua University Press.2003-11


Programming in Objective-C 2.0

《Programming in Objective-C 2.0》

Stephen G. Kochan.China Machine Press.2009-9




[US] John Viega,Gary McGraw.China Machine Press.2014-3-20


Programming iOS 9

《Programming iOS 9》

Matt Neuburg.O'Reilly Media.2015-12-12


500 Lines or Less

《500 Lines or Less》

Amy Brown,Michael DiBernardo..2016-6-28

“This book provides you with the chance to study how 26 exper...”



[US] Stephen D. Huston,James CE Johnson,Umar Syyid.China Electric Power Press.2004-11


Making Isometric Social Real-Time Games with HTML5, CSS3, and Javascript

《Making Isometric Social Real-Time Games with HTML5, CSS3, and Javascript》

Mario Andres Pagella.O'Reilly Media.2011-9-9

“Walk through the process of designing and implementing from ...”

Mastering Python 3 Programming

《Mastering Python 3 Programming》

Mark Summerfield.GOTOP.2010-11


The Go Programming Language

《The Go Programming Language》

Brian W. Kernighan,Alan Donovan.Addison-Wesley Professional.2015-11-23


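React Native Development Guide

《React Native Development Guide》

[US] Bonnie Eisenman.Posts & Telecom Press.2016-6

“Through rich examples and detailed explanations, this book introduces the JavaScript framework React Native. In React Native...”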



Clovis L. Tondo,Scott E. Gimpel.China Machine Press.2004-1-1

“For 《The C Programming...” by Brian W. Kernighan and Dennis M. Ritchie, this book...”