Arjun Srivastava's Library
home

Arjun Srivastava's Library

Advanced Analytics With Spark: Patterns for Learning From Data at Scale
Sandy Ryza and Uri Laserson and Sean Owen and Josh Wills
In the second edition of this practical book, four Cloudera data scientists present a set of self-contained patterns for performing large-scale data analysis with Spark. The authors bring Spark, statistical methods, and real-world data sets together...
Designing Data-Intensive Applications: The Big Ideas Behind Reliable, Scalable, and Maintainable Systems
Martin Kleppmann
Data is at the center of many challenges in system design today. Difficult issues need to be figured out, such as scalability, consistency, reliability, efficiency, and maintainability. In addition, we have an overwhelming variety of tools, includin...
Elasticsearch: The Definitive Guide
Clinton Gormley and Zachary Tong
Whether you need full-text search or real-time analytics of structured data—or both—the Elasticsearch distributed search engine is an ideal way to put your data to work. This practical guide not only shows you how to search, analyze, and explore dat...
The Elements of Statistical Learning: Data Mining, Inference, and Prediction
Trevor Hastie and Robert Tibshirani and Jerome Friedman
During the past decade there has been an explosion in computation and information technology. With it have come vast amounts of data in a variety of fields such as medicine, biology, finance, and marketing. The challenge of understanding these data ...
Go in Action
Brian Ketelsen and Erik St. Martin and William Kennedy
Summary Go in Action introduces the Go language, guiding you from inquisitive developer to Go guru. The book begins by introducing the unique features and concepts of Go. Then, you'll get hands-on experience writing real-world applications includin...
High Performance Spark: Best Practices for Scaling and Optimizing Apache Spark
Holden Karau and Rachel Warren
Apache Spark is amazing when everything clicks. But if you haven’t seen the performance improvements you expected, or still don’t feel confident enough to use Spark in production, this practical book is for you. Authors Holden Karau and Rachel Warre...
How to Win an Indian Election: What Political Parties Don’t Want You to Know
Shivam Shankar Singh
What role do political consultants play in election campaigns? How are political parties using technological tools such as data analytics, surveys and alternative media to construct effective, micro-targeted campaigns? How does the use of money impa...
Machine Learning With R, the Tidyverse, and Mlr
Hefin I. Rhys
Summary Machine learning (ML) is a collection of programming techniques for discovering relationships in data. With ML algorithms, you can cluster and classify data for tasks like making recommendations or fraud detection and make predictions for sa...
Redis in Action
Josiah L. Carlson
SummaryRedis in Action introduces Redis and walks you through examples that demonstrate how to use it effectively. You'll begin by getting Redis set up properly and then exploring the key-value model. Then, you'll dive into real use cases including ...
Spark: The Definitive Guide: Big Data Processing Made Simple
Bill Chambers and Matei Zaharia
Learn how to use, deploy, and maintain Apache Spark with this comprehensive guide, written by the creators of the open-source cluster-computing framework. With an emphasis on improvements and new features in Spark 2.0, authors Bill Chambers and Mate...
Tika in Action
Chris Mattmann and Jukka Zitting
SummaryTika in Action is a hands-on guide to content mining with Apache Tika. The book's many examples and case studies offer real-world experience from domains ranging from search engines to digital asset management and scientific data processing.A...