Exploratory Data Analysis Using R provides a classroom-tested introduction to exploratory data analysis (EDA) and introduces the range of "interesting" – good, bad, and ugly – features that can be found in data, and why it is important to find them. It also introduces the mechanics of using R to explore and explain data.The book begins with a detailed overview of data, exploratory analysis, and R, as well as graphics in R. It then explores working with external data, linear regression models, and crafting data stories. The second part of the book focuses on developing R programs, including good programming practices and examples, working with text data, and general predictive models. The book ends with a chapter on "keeping it all together" that includes managing the R installation, managing files, documenting, and an introduction to reproducible computing.The book is designed for both advanced undergraduate, entry-level graduate students, and working professionals with little to no prior exposure to data analysis, modeling, statistics, or programming. it keeps the treatment relatively non-mathematical, even though data analysis is an inherently mathematical subject. Exercises are included at the end of most chapters, and an instructor's solution manual is available.About the Author:Ronald K. Pearson holds the position of Senior Data Scientist with GeoVera, a property insurance company in Fairfield, California, and he has previously held similar positions in a variety of application areas, including software development, drug safety data analysis, and the analysis of industrial process data. He holds a PhD in Electrical Engineering and Computer Science from the Massachusetts Institute of Technology and has published conference and journal papers on topics ranging from nonlinear dynamic model structure selection to the problems of disguised missing data in predictive modeling. Dr. Pearson has authored or co-authored books including Exploring Data in Engineering, the Sciences, and Medicine (Oxford University Press, 2011) and Nonlinear Digital Filtering with Python. He is also the developer of the DataCamp course on base R graphics and is an author of the datarobot and GoodmanKruskal R packages available from CRAN (the Comprehensive R Archive Network).
Les mer
This textbook introduces exploratory data analysis (EDA) and covers the range of interesting features we can expect to find in data. The book also explores the practical mechanics of using R to do EDA.
Les mer
I Analyzing Data Interactively with R 1. Data, Exploratory Analysis, and R 2. Graphics in R 3. Exploratory Data Analysis: A First Look 4. Working with External Data 5. Linear Regression Models 6. Crafting Data Stories II Developing R Programs 7. Programming in R 8. Working with Text Data 9. Exploratory Data Analysis: A Second Look 10. More General Predictive Models 11. Keeping It All Together
Les mer

Produktdetaljer

ISBN
9781138480605
Publisert
2018-05-29
Utgiver
Vendor
CRC Press
Vekt
312 gr
Høyde
234 mm
Bredde
156 mm
Aldersnivå
U, G, 05, 01
Språk
Product language
Engelsk
Format
Product format
Innbundet
Antall sider
548

Forfatter

Om bidragsyterne

Ronald K. Pearson currently works for GeoVera, a property insurance company in Fairfield, California, primarily in the analysis of text data. He holds a PhD in Electrical Engineering and Computer Science from the Massachussetts Institute of Technology and has published conference and journal papers on topics ranging from nonlinear dynamic model structure selection to the problems of disguised missing data in predictive modeling. Dr. Pearson has authored or co-authored books including Exploring Data in Engineering, the Sciences, and Medicine (Oxford University Press, 2011) and Nonlinear Digital Filtering with Python, co-authored with Moncef Gabbouj (CRC Press, 2015). He is also the developer of the DataCamp course on base R graphics.