Data Science and Predictive Analytics
The first edition of the textbook Data Science and Predictive Analytics: Biomedical and Health Applications using R, authored by Ivo D. Dinov, was published in August 2018 by Springer.[1] The second edition of the book was printed in 2023.[2]
Author | Ivo D. Dinov |
---|---|
Country | Switzerland |
Language | English |
Series | The Springer Series in Applied Machine Learning |
Subject | Computer science, Data science, artificial intelligence |
Publisher | Springer |
Publication date | 2018 (1st ed.), 2023 (2nd edition) |
Media type | Print (hardcover and softcover), electronic (PDF and EPub) |
ISBN | 978-3-031-17483-4 978-3-319-72346-4, 978-3-031-17485-8, 978-3-031-17482-7 |
This textbook covers some of the mathematical foundations, computational techniques, and artificial intelligence approaches used in data science research and applications.[3]
Using the statistical computing platform R and a broad range of biomedical case-studies, the 23 chapters of the book first edition provide explicit examples of importing, exporting, processing, modeling, visualizing, and interpreting large, multivariate, incomplete, heterogeneous, longitudinal, and incomplete datasets (big data).[4]
Structure
First edition table of contents
The first edition of the Data Science and Predictive Analytics (DSPA) textbook [1] is divided into the following 23 chapters, each progressively building on the previous content.
- Motivation
- Foundations of R
- Managing Data in R
- Data Visualization
- Linear Algebra & Matrix Computing
- Dimensionality Reduction
- Lazy Learning: Classification Using Nearest Neighbors
- Probabilistic Learning: Classification Using Naive Bayes
- Decision Tree Divide and Conquer Classification
- Forecasting Numeric Data Using Regression Models
- Black Box Machine-Learning Methods: Neural Networks and Support Vector Machines
- Apriori Association Rules Learning
- k-Means Clustering
- Model Performance Assessment
- Improving Model Performance
- Specialized Machine Learning Topics
- Variable/Feature Selection
- Regularized Linear Modeling and Controlled Variable Selection
- Big Longitudinal Data Analysis
- Natural Language Processing/Text Mining
- Prediction and Internal Statistical Cross Validation
- Function Optimization
- Deep Learning, Neural Networks
Second edition table of contents
The significantly reorganized revised edition of the book (2023) [2] expands and modernizes the presented mathematical principles, computational methods, data science techniques, model-based machine learning and model-free artificial intelligence algorithms. The 14 chapters of the new edition start with an introduction and progressively build foundational skills to naturally reach biomedical applications of deep learning.
- Introduction
- Basic Visualization and Exploratory Data Analytics
- Linear Algebra, Matrix Computing, and Regression Modeling
- Linear and Nonlinear Dimensionality Reduction
- Supervised Classification
- Black Box Machine Learning Methods
- Qualitative Learning Methods—Text Mining, Natural Language Processing, and Apriori Association Rules Learning
- Unsupervised Clustering
- Model Performance Assessment, Validation, and Improvement
- Specialized Machine Learning Topics
- Variable Importance and Feature Selection
- Big Longitudinal Data Analysis
- Function Optimization
- Deep Learning, Neural Networks
Reception
The materials in the Data Science and Predictive Analytics (DSPA) textbook have been peer-reviewed in the International Statistical Institute’s ISI Review Journal[3] and the Journal of the American Library Association.[4] Many scholarly publications reference the DSPA textbook.[5][6]
As of January 17, 2021, the electronic version of the book first edition (ISBN 978-3-319-72347-1) is freely available on SpringerLink[7] and has been downloaded over 6 million times. The textbook is globally available in print (hardcover and softcover) and electronic formats (PDF and EPub) in many college and university libraries[8] and has been used for data science, computational statistics, and analytics classes at various institutions.[9]
References
- Dinov, Ivo (2018). Data Science and Predictive Analytics: Biomedical and Health Applications Using R. Springer.
- Dinov, Ivo (2023). Data Science and Predictive Analytics: Biomedical and Health Applications Using R. The Springer Series in Applied Machine Learning. Springer. doi:10.1007/978-3-031-17483-4. ISBN 978-3-031-17482-7. S2CID 256875731.
- Capaldi, Mindy (April 2019) [05 April 2019]. "(Review) Data Science and Predictive Analytics: Biomedical and Health Applications Using R". International Statistical Review. 87 (1): 181–182. doi:10.1111/insr.12317. S2CID 132379032.
- Saracco, Benjamin (2020) [April 2020]. "Review of Data Science and Predictive Analytics: Biomedical and Health Applications Using R". Journal of the Medical Library Association. 108 (2): 344. doi:10.5195/jmla.2020.901. PMC 7069824. S2CID 214729817.
- "Altmetric – Data Science and Predictive Analytics".
- "Google Scholar".
- Dinov, Ivo D. (2018). Data Science and Predictive Analytics. doi:10.1007/978-3-319-72347-1. ISBN 978-3-319-72346-4. S2CID 52098523.
- Textbook library availability
- Courses using the DSPA textbook