Buch, Englisch, 480 Seiten, Format (B × H): 178 mm x 254 mm, Gewicht: 453 g
Statistics and Prediction Algorithms Through Case Studies
Buch, Englisch, 480 Seiten, Format (B × H): 178 mm x 254 mm, Gewicht: 453 g
Reihe: Chapman & Hall/CRC Data Science Series
ISBN: 978-1-032-41986-2
Verlag: Taylor & Francis Ltd
Introduction to Data Science: Statistics and Prediction Algorithms Through Case Studies teaches data science as a way of thinking statistically, not just as a collection of computational tools. Building on the topics covered in Introduction to Data Science: Data Wrangling and Visualization with R, this book is designed for students with some programming experience and basic mathematical maturity, this book builds the foundations of probability, statistical inference, regression, high-dimensional data analysis, and machine learning through real data examples and reproducible R code. It is suitable for one-semester course in advanced data science.
The book shows how to reason about variability, uncertainty, prediction error, model assumptions, and validation. Through case studies involving polling, genetics, baseball, recommendation systems, image classification, and other modern datasets, readers learn how to connect probability models to data, summarize complex information, quantify uncertainty, fit and interpret models, evaluate prediction algorithms, and understand the statistical ideas behind machine learning. Each chapter is designed to support classroom teaching, self-study, and hands-on analysis, with exercises and companion web materials available through the book website.
Key Features:
- Includes base R, data.table, and tidyverse code.
- Focuses on the statistical and probabilistic foundations of machine learning.
- Contains real-world case studies.
Rafael A. Irizarry is Professor and Chair of the Department of Data Science at Dana-Farber Cancer Institute and Professor of Applied Statistics at Harvard. His research focuses on Genomics and he has taught several Data Science courses.
Zielgruppe
Postgraduate, Undergraduate Advanced, and Undergraduate Core
Autoren/Hrsg.
Fachgebiete
- Mathematik | Informatik EDV | Informatik Daten / Datenbanken
- Mathematik | Informatik EDV | Informatik Informatik Theoretische Informatik
- Mathematik | Informatik Mathematik Stochastik
- Mathematik | Informatik EDV | Informatik Informatik Künstliche Intelligenz Maschinelles Lernen
- Mathematik | Informatik EDV | Informatik Informatik Mensch-Maschine-Interaktion Informationsvisualisierung
Weitere Infos & Material
Distributions Numerical Summaries Comparing Groups Connecting Data and Probability Discrete Probability Continuous Probability Random Variables Sampling Models and the Central Limit Theorem Estimates and Confidence Intervals Data-Driven Models Bayesian Statistics Hierarchical Models Hypothesis Testing Bootstrap Introduction to Regression The Linear Model Framework Treatment Effect Models Generalized Linear Models Association Is Not Causation Multivariable Regression Working with Matrices in R Applied Linear Algebra Dimension Reduction Regularization Latent Factor Models Notation and Terminology Performance Metrics Conditional Expectations and Smoothing Resampling and Model Assessment Supervised Learning Methods Building Machine Learning Models Unsupervised Learning: Clustering




