《Big Data》


  • Title: 《Big Data》: Principles and best practices of scalable realtime data systems
  • Category: Programming
  • Authors: Nathan Marz, James Warren
  • Publisher: Manning Publications
  • Publication date: 2015-5-10
  • Price: USD 49.99
  • Binding: Paperback
  • Pages: 328

《Big Data》 Description:

Services like social networks, web analytics, and intelligent e-commerce often need to manage data at a scale too big for a traditional database. Complexity increases with scale and demand, and handling big data is not as simple as just doubling down on your RDBMS or rolling out some trendy new technology. Fortunately, scalability and simplicity are not mutually exclusive; you just need to take a different approach.

Big data systems use many machines working in parallel to store and process data, which introduces fundamental challenges unfamiliar to most developers. Big Data teaches you to build these systems using an architecture that takes advantage of clustered hardware along with new tools designed specifically to capture and analyze web-scale data. It describes a scalable, easy-to-understand approach to big data systems that can be built and run by a small team. Following a realistic example, this book guides readers through the theory of big data systems, how to implement them in practice, and how to deploy and operate them once they're built.

Big Data shows you how to build the back-end for a real-time service called SuperWebAnalytics.com, our version of Google Analytics. As you read, you'll discover that many standard RDBMS practices become unwieldy with large-scale data. To handle the complexities of Big Data and distributed systems, you must drastically simplify your approach. This book introduces a general framework for thinking about big data, and then shows how to apply technologies like Hadoop, Thrift, and various NoSQL databases to build simple, robust, and efficient systems to handle it.

About the authors Nathan Marz and James Warren:

Nathan Marz is an engineer at Twitter. He was previously Lead Engineer at BackType, a marketing intelligence company that was acquired by Twitter in July 2011. He is the author of two major open source projects: Storm, a distributed realtime computation system, and Cascalog, a tool for processing data on Hadoop. He is a frequent speaker and writes a blog at nathanmarz.com.

Sam Ritchie is an engineer at Twitter who uses Cascalog and ElephantDB to process and analyze many terabytes of data in near real-time. He is also the lead developer on FORMA, an open-source deforestation monitoring system in use by a number of top research institutions. He is a committer on Cascalog, ElephantDB, Pallet, and a number of other open source Clojure projects.

《Big Data》 Table of contents:

1. A new paradigm for Big Data
2. Data model for Big Data
3. Data storage on the batch layer
4. MapReduce and batch processing
5. Batch processing with Cascading
6. Basics of the serving layer
7. Storm and the speed layer
8. Incremental batch processing
9. Layered architecture in-depth
10. Piping the system together
11. Future of NoSQL and Big Data processing
Appendix A: Hadoop
Appendix B: Thrift
Appendix C: Storm



You may also like:

Big Data

《Big Data》

Nathan Marz,James Warren.Manning Publications.2015-5-10

“Services like social networks, web analytics, and intelligen...”

Oracle优化日记

《Oracle优化日记》

白鳝.人民邮电出版社.2010-7

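“This book introduces Oracle database optimization methods. Modeled on a real large-scale optimization project, it records in diary form how the DBAs of an optimization team ...”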

Game Programming Patterns

《Game Programming Patterns》

Robert Nystrom.Genever Benning.2014-11-2

“The biggest challenge facing many game programmers is comple...”

Your UNIX/Linux

《Your UNIX/Linux》

Das, Sumitabha..2012-1

“"Your UNIX/Linux: The Ultimate Guide", written with both use...”

Ruby Programming

《Ruby Programming》

高桥征义,后藤裕藏.电子工业出版社.2009-4

“《Ruby Programming:向Ruby之父学程序设计(第2版)》 is written so that even readers with no programming experience at all can flexibly use...”

Swifter(第二版) : 100 个 Swift 2 开发必备 Tip

《Swifter(第二版) : 100 个 Swift 2 开发必备 Tip》

王巍.电子工业出版社.2015-11

“《Swifter(第二版):100个 Swift 2 开发必备 Tip》 targets real-world commercial practice and skill advancement; its content and examples are based entirely on Sw...”

The Game Jam Survival Guide

《The Game Jam Survival Guide》

Christer Kaitila.Packt Publishing.2012-4-2


Effective Ruby:改善Ruby程序的48条建议

《Effective Ruby:改善Ruby程序的48条建议》

Peter J. Jones.机械工业出版社.2016-1

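“If you are an experienced Ruby programmer, this book can help you harness the full power of Ruby to write more robust, efficient, maintainable, and easy-to-run code. Peter ...”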

Building Web Apps for Google TV

《Building Web Apps for Google TV》

Ferrate, Andres; Surya, Amanda; Lee, Daniels..2011-7

“By integrating the Web with traditional TV, Google TV offers...”

Microsoft.NET框架程序设计

《Microsoft.NET框架程序设计》

Jeffrey Richter.清华大学出版社.2003-11

“《Microsoft.NET框架程序设计》 (revised edition) is part of the 《微软.NET程序员系列》 series; it mainly covers how to develop for Micro...”

Objective-C 2.0程序设计

《Objective-C 2.0程序设计》

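Stephen G. Kochan.机械工业出版社.2009-9

“This book is a classic in the Objective-C field, giving a systematic and comprehensive treatment of Objective-C with unquestioned authority. The book is divided into four...”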

安全软件开发之道

《安全软件开发之道》

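[US] John Viega,Gary McGraw.机械工业出版社.2014-3-20

“Hailed as the ‘Huangdi Neijing’ of the security field, this book was written by masters of security technology, is a worldwide bestseller, and is jointly recommended by several security experts. It comprehensively discusses how, in software development...”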

Programming iOS 9

《Programming iOS 9》

Matt Neuburg.O'Reilly Media.2015-12-12


500 Lines or Less

《500 Lines or Less》

Amy Brown,Michael DiBernardo..2016-6-28

“This book provides you with the chance to study how 26 exper...”

ACE程序员指南

《ACE程序员指南》

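[US] Stephen D. Huston,James CE Johnson,Umar Syyid.中国电力出版社.2004-11

“Written by several core ACE developers, this book explains quickly and directly how to make full use of ACE. Starting with an introduction to ACE and proceeding step by step, it discusses in detail ACE's I...”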

Making Isometric Social Real-Time Games with HTML5, CSS3, and Javascript

《Making Isometric Social Real-Time Games with HTML5, CSS3, and Javascript》

Mario Andres Pagella.O'Reilly Media.2011-9-9

“Walk through the process of designing and implementing from ...”

精通Python 3程式設計

《精通Python 3程式設計》

Mark Summerfield.碁峰.2010-11


The Go Programming Language

《The Go Programming Language》

Brian W. Kernighan,Alan Donovan.Addison-Wesley Professional.2015-11-23


React Native开发指南

《React Native开发指南》

[US] Bonnie Eisenman.人民邮电出版社.2016-6

“Through rich examples and detailed explanations, this book introduces the JavaScript framework React Native. In React Native...”

C程序设计语言

《C程序设计语言》

Clovis L. Tondo,Scott E. Gimpel.机械工业出版社.2004-1-1

“Regarding Brian W. Kernighan and Dennis M. Ritchie's 《The C Programming...”