Chapman and Hall/CRC
The Essentials of Data Science: Knowledge Discovery Using R presents the concepts of data science through a hands-on approach using free and open source software. It systematically drives an accessible journey through data analysis and machine learning to discover and share knowledge from data.
Building on over thirty years’ experience in teaching and practising data science, the author encourages a programming-by-example approach to ensure students and practitioners attune to the practise of data science while building their data skills. Proven frameworks are provided as reusable templates. Real world case studies then provide insight for the data scientist to swiftly adapt the templates to new tasks and datasets.
The book begins by introducing data science. It then reviews R’s capabilities for analysing data by writing computer programs. These programs are developed and explained step by step. From analysing and visualising data, the framework moves on to tried and tested machine learning techniques for predictive modelling and knowledge discovery. Literate programming and a consistent style are a focus throughout the book.
"I have several books on data science and R, as well as other similar subjects and programming languages, in my personal library. However, this book is a great blend of important data science topics and R programming that will make it a great reference for anyone working in this important and immensely popular area. I highly recommend this book for college students learning what it takes to start their career in data science or even current professionals wanting to make a career change or who just want to know more about the subject (and do some R programming as well)."
~Dean V. Neubauer, Techometrics
"Due to the self-contained introduction to many of the features of R and RStudio, Graham J. Williams The Essentials of Data Science, Knowledge Discovery Using R would make an excellent recommended or supplementary text for a course that plans to use the rattle package. This book would also serve as a great resource for those with an interest in data science who would like a hands-on approach to learning R and gettting a flavor for a handful of topics within data science."
~Katherine M. Kinnaird, Brown University
Part I - An Overview for the Data Scientist. Data Science, Analytics, and Data Mining. From Rattle to R for the Data Scientist. Preparing Data. Building Models. Case Studies. R Basics. Part II - Data Foundations. Reading Data into R. Exploring and Summarising Data. Transforming Data. Presenting Data. Part III - Analytics. Descriptive Analytics. Predictive Analytics. Prescriptive Analytics. Text Analytics. Social Network Analytics. Part IV - Advanced Data Science in R. Dealing with Big Data. Parallel Processing for High Performance Analytics. Ensembles for Big Data.