This book aims to compile typical fundamental-to-advanced statistical methods to be used for health data sciences. Although the book promotes applications to health and health-related data, the models in the book can be used to analyze any kind of data. The data are analyzed with the commonly used statistical software of R/SAS (with online supplementary on SPSS/Stata). The data and computing programs will be available to facilitate readers’ learning experience. There has been considerable attention to making statistical methods and analytics available to health data science researchers and students. This book brings it all together to provide a concise point-of-reference for the most commonly used statistical methods from the fundamental level to the advanced level. We envisage this book will contribute to the rapid development in health data science. We provide straightforward explanations of the collected statistical theory and models, compilations of a variety of publicly available data, and illustrations of data analytics using commonly used statistical software of SAS/R. We will have the data and computer programs available for readers to replicate and implement the new methods. The primary readers would be applied data scientists and practitioners in any field of data science, applied statistical analysts and scientists in public health, academic researchers, and graduate students in statistics and biostatistics. The secondary readers would be R&D professionals/practitioners in industry and governmental agencies. This book can be used for both teaching and applied research.
1. Sampling and Data Collection 2. Measures of Tendency, Spread, Relative Standing, Association, Belief 3. Statistical Modeling of Mean of Continuous and Mean of Binary Outcomes 4. Modeling of Continuous and Binary Outcomes with Factors: One-way and Two-way ANOVA Models 5. Statistical Modeling of Continuous Outcomes with Continuous Explanatory Factors Linear Regression Models 6. Modeling Continuous Responses with Categorical and Continuous Covariates: One-Way Analysis of Covariance (ANCOVA) 7. Statistical Modeling of Binary Outcome with One or More Covariates: Standard Logistic Regression Model 8. Generalized Linear Models 9. Modeling Repeated Continuous Observations using GEE 10. Modeling for Correlated Continuous Responses with Random-Effects 11. Modeling Correlated Binary Outcomes through Hierarchical Logistic Regression Models
"In summary, Statistical Analytics for Health Data Science with SAS and R excels in demystifying intricate statistical concepts and offers both theoretical grounding and practical experience. Whether you are an applied data scientist, a graduate student, or a public health researcher, this work by Willson, Chen, and Peace is an invaluable asset for learning and applying statistics in real-world settings."
Ali Rahnavard, The George Washingotn University USA, Journal of the American Statistical Association, 2023.