1st Edition

Scaling Up with R and Apache Arrow: Bigger Data, Easier Workflows

160 Pages 20 B/W Illustrations
by Chapman & Hall

Analyze large datasets directly from R. Scaling Up With R and Arrow provides a guide to working efficiently with larger-than-memory datasets using the arrow R package. As data grows in size and complexity, traditional data analysis methods in R often hit technical limitations. In this book, you'll learn how to overcome these hurdles without needing to set up complex infrastructure.

Acknowledgements
Foreword
1. Introduction
2. Getting Started
3. Data Manipulation
4. Files and Formats
5. Datasets
6. Cloud
7. Advanced Topics
8. Sharing Data and Interoperability
References
Appendices

Biography

Nic Crane is an R developer, educator, and general enthusiast, with a background in data science and software engineering. Nic is a member of the Apache Arrow Project Management Committee (PMC) and part of the team that maintains the arrow R package.

Jonathan Keane is an engineering manager with a background in software engineering and data science. Jonathan is part of the team that maintains the Arrow project, including the arrow R package.

Neal Richardson is an engineering leader focused on building software that helps people work with data. He is a member of the Arrow PMC and one of the top contributors to the project.