2nd Edition
Introduction to Data Science Data Wrangling and Visualization with R
Preface
Acknowledgements
Introduction
Part 1: R
1. Getting started
2. R basics
3. Programming basics
4. The tidyverse
5. data.table
6. Importing data
Part 2: Data Visualization
7. Visualizing data distributions
8. ggplot2
9. Data visualization principles
10. Data visualization in practice
Part 3: Data Wrangling
11. Reshaping data
12. Joining tables
13. Parsing dates and times
14. Locales
15. Extracting data from the web
16. String processing
17. Text analysis
Part 4: Productivity Tools
18. Organizing with Unix
19. Git and GitHub
20. Reproducible projects
Biography
Rafael A. Irizarry is professor and chair of Data Science at the Dana-Farber Cancer Institute, professor of biostatistics at Harvard, and a fellow of the American Statistical Association and the International Society of Computational Biology. Prof. Irizarry is an applied statistician and during the last 25 years has worked in diverse areas, including genomics, sound engineering, and public health surveillance. He disseminates solutions to data analysis challenges as open source software, tools that are widely downloaded and used. Prof. Irizarry has also developed and taught several data science courses at Harvard as well as popular online courses.
Praise for the first edition:
"I think the book would be perfect for schools looking to make a transition to a model where introduction to data science takes the place of introduction to statistics and maybe introductory computer science."
- Arend Kuyper, Northwestern University"A great introduction to data science and modern R programing, with tons of examples of application of the R abilities throughout the whole volume. The book suggests multiple links to the internet websites related to the topics under consideration that makes it an incredibly useful source of contemporary data science and programing, helping to students and researchers in their projects."
- Technometrics"Introduction to Data Science will teach you to juggle with your data and get maximum results from it using R. I highly recommended this book for students and everybody taking the first steps in data science using R."
- Maria Ivanchuk, ISCB News






