1st Edition

Demystifying Probability and Statistics for Data Scientists with R

    400 Pages 101 B/W Illustrations
    by Auerbach Publications

    400 Pages 101 B/W Illustrations
    by Auerbach Publications

    Data science is a fast-emerging field of study and research. It mainly leverages integrated data analytics platforms, and in the recent past, the arrival of artificial intelligence (AI) technologies and tools to extract actionable insights out of burgeoning data volumes has totally changed the game of data science. Machine and deep learning (ML/DL) algorithms are the principal technologies of the AI paradigm. There are innovations and improvisations in the AI ecosystem that make sense out of a massive quantity of multi-structured data. The data science domain is acquiring special significance these days as businesses and governments across the globe need to fulfil the long-standing goal of data-driven insights and insight-driven decisions. Transitioning data into information and knowledge plays a pivotal role for worldwide enterprises and establishments in effectively initiating and implementing next-generation digital transformation projects.

    Programming statistics and probability applications are key to this vital transformation of data into knowledge and insight. Probability and statistics are also key to decision-making processes that data science is automating. Demystifying Probability and Statistics for Data Scientists with R illustrates statistics with the R programming language, an emerging tool in data science. Filled with techniques for data science and analytics programming, this book is written for students and professionals. It is focused on learning outcomes and problem solving. Beginning with the basics of R programming, data science, and probability, the book progresses to methods, testing, and experiment design. Highlights of the book include:

    • Types of data and collection methods
    • Data visualization methods
    • Probability basics
    • Random variables and distributions
    • Sampling methods and confidence intervals and hypothesis testing
    • Design of experiments
    • Correlation and regression
    • Chi square test
    • Non-parametric tests

    1. Demystifying the Data Science Paradigm
    2. Illustrating Machine Learning (ML) Algorithms
    3. Data Science with R
    4. Types of Data and Collection Methods
    5. Data Visualization Methods
    6. Data Description
    7. Probability Basics
    8. Random Variables and Distributions
    9. Sampling Methods and Confidence Intervals and Hypothesis Testing
    10. Design of Experiments
    11. Correlation and Regression
    12. Chi Square Test
    13. Non-Parametric Tests


    Dr. Dheva Rajan S. presently works as a professor at University of Technology and Applied Sciences, Al Mussanah, Oman. He has more than 15 years of experience in academics and research.

    Dr. Pethuru Raj is a chief architect at Reliance Jio Platforms Ltd. (JPL) Bangalore.

    Dr. Selvagopal P. presently is the Head of the Department of Mathematics in Anna Vinayagar Arts & Science College, Nagercoil, India.

    Dr. B. Sundaravadivazhagan is a professor in the Department of Information Technology at the University of Technology and Applied Sciences-Al Mussanah, Oman. He is a Senior Member of the Institute of Electrical and Electronics Engineers.