Arjun Srivastava's Library

Designing Data-Intensive Applications: The Big Ideas Behind Reliable, Scalable, and Maintainable Systems
Martin Kleppmann
Data is at the center of many challenges in system design today. Difficult issues need to be figured out, such as scalability, consistency, reliability, efficiency, and maintainability. In addition, we have an overwhelming variety of tools, includin...
Fundamentals of Data Visualization: A Primer on Making Informative and Compelling Figures
Claus O. Wilke
Effective visualization is the best way to communicate information from the increasingly large and complex datasets in the natural and social sciences. But with the increasing power of visualization software today, scientists, engineers, and business...
High Performance Spark: Best Practices for Scaling and Optimizing Apache Spark
Holden Karau and Rachel Warren
Apache Spark is amazing when everything clicks. But if you haven’t seen the performance improvements you expected, or still don’t feel confident enough to use Spark in production, this practical book is for you. Authors Holden Karau and Rachel Warre...
Tika in Action
Chris Mattmann and Jukka Zitting
Summary: Tika in Action is a hands-on guide to content mining with Apache Tika. The book's many examples and case studies offer real-world experience from domains ranging from search engines to digital asset management and scientific data processing. A...