1st Edition

Applied Machine Learning Using mlr3 in R

    356 Pages 43 Color & 53 B/W Illustrations
    by Chapman & Hall

    356 Pages 43 Color & 53 B/W Illustrations
    by Chapman & Hall

    356 Pages 43 Color & 53 B/W Illustrations
    by Chapman & Hall

    mlr3 is an award-winning ecosystem of R packages that have been developed to enable state-of-the-art machine learning capabilities in R. Applied Machine Learning Using mlr3 in R gives an overview of flexible and robust machine learning methods, with an emphasis on how to implement them using mlr3 in R. It covers various key topics, including basic machine learning tasks, such as building and evaluating a predictive model; hyperparameter tuning of machine learning approaches to obtain peak performance; building machine learning pipelines that perform complex operations such as pre-processing followed by modelling followed by aggregation of predictions; and extending the mlr3 ecosystem with custom learners, measures, or pipeline components.

    Features:

    • In-depth coverage of the mlr3 ecosystem for users and developers
    • Explanation and illustration of basic and advanced machine learning concepts
    • Ready to use code samples that can be adapted by the user for their application
    • Convenient and expressive machine learning pipelining enabling advanced modelling
    • Coverage of topics that are often ignored in other machine learning books

    The book is primarily aimed at researchers, practitioners, and graduate students who use machine learning or who are interested in using it. It can be used as a textbook for an introductory or advanced machine learning class that uses R, as a reference for people who work with machine learning methods, and in industry for exploratory experiments in machine learning.

    1. Introduction and Overview 
    Lars Kotthoff, Raphael Sonabend, Natalie Foss, Bernd Bischl

    2. Data and Basic Modeling
    Natalie Foss, Lars Kotthoff

    3. Evaluation and Benchmarking
    Giuseppe Casalicchio, Lukas Burk

    4. Hyperparameter Optimization 
    Marc Becker, Lennart Schneider, Sebastian Fischer

    5. Advanced Tuning Methods and Black Box Optimization
    Lennart Schneider, Marc Becker

    6. Feature Selection
    Marvin N. Wright

    7. Sequential Pipelines
    Martin Binder, Florian Pfisterer

    8. Non-sequential Pipelines and Tuning
    Martin Binder, Florian Pfisterer, Marc Becker, Marvin N. Wright

    9. Preprocessing
    Janek Thomas

    10. Advanced Technical Aspects of mlr3 
    Michel Lang, Sebastian Fischer, Raphael Sonabend

    11. Large-Scale Benchmarking
    Sebastian Fischer, Michel Lang, Marc Becker

    12. Model Interpretation 
    Susanne Dandl, Przemysław Biecek, Giuseppe Casalicchio, Marvin N. Wright

    13. Beyond Regression and Classification 
    Raphael Sonabend, Patrick Schratz, Damir Pulatov

    14. Algorithmic Fairness 
    Florian Pfisterer

    Biography

    Bernd Bischl is a professor of Statistical Learning and Data Science in LMU Munich and co-director of the Munich Center for Machine Learning. He studied Computer Science, Artificial Intelligence and Data Science and holds a PhD in statistics. His research interests include AutoML, model selection, interpretable ML and the development of statistical software. He wrote the initial version of mlr and still leads the mlr3 developers, now largely focusing on design, code review and strategic development.

    Raphael Sonabend is a founder and director of OSPO Now and a visiting researcher at Imperial College London. They hold a PhD in statistics, specializing in machine learning applications for survival analysis. They wrote the mlr3 packages mlr3proba and mlr3benchmark.

    Lars Kotthoff is an associate professor of Computer Science at the University of Wyoming, US. He has studied and held academic appointments in Germany, UK, Ireland, and Canada. Lars has been contributing to mlr for about a decade. His research aims to automate machine learning and other areas of AI.

    Michel Lang is the scientific coordinator of the Research Center Trustworthy Data Science and Security. He has a PhD in statistics and has been developing statistical software for over a decade. He joined the mlr team in 2014 and wrote the initial version of mlr3.