Exploratory Data Analysis 1st Edition by John Tukey (PDF)

86

 

Ebook Info

  • Published: 2010
  • Number of pages: 712 pages
  • Format: PDF
  • File Size: 7.24 MB
  • Authors: John Tukey

Description

The approach in this introductory book is that of informal study of the data. Methods range from plotting picture-drawing techniques to rather elaborate numerical summaries. Several of the methods are the original creations of the author, and all can be carried out either with pencil or aided by hand-held calculator.

User’s Reviews

Reviews from Amazon users which were colected at the time this book was published on the website:

⭐Tukey looms large over the field of data visualization and computer science generally. He famously coined the term “bit” and invented the box plot. From page one of “EDA” you get a sense of what an incredible person he must have been: there is vivid metaphor and an almost folksy lightness throughout that reminds me of the most generous and patient teachers.Tukey most informed my own book’s tour through data exploration. I like how he exposed me to so many novel techniques (tally boxes, stem-and-leaf diagram, hinge diagram, and many more). Yes, they were were interesting to learn about, but they also showed me how creative you could get while exploring data. As he says in EDA, “don’t expect standard summaries to reveal the unusual.”I most appreciated Tukey’s tour of data transformations, and dedicated one of my own page spreads to re-introducing the “Ladder of Transformation” to a new generation. I consider EDA to be essential reading. Its length may be intimidating, but do not worry. Lots of the content is textbook exercises and examples, printed with multiple-page data tables for reference, all of which can be skipped as you plow through.

⭐This is a classic text book by the famous author, one of the FFT innovators, Tukey. Very fundamental skills how to grasp the numbers easily are shown step by step. Especially the explanations on the counting and plots reminds me, a professor of physics in a college, that the first step of these skills were given and repeatedly trained in a junior high school but are rarely taught nowadays.

⭐What can you say? Tukey is Tukey.

⭐I’d love to see this book updated to make it more user friendly. The formatting, layout and overcrowded pages make it difficult to follow. It’s poorly organized but has an amazing amount of material.

⭐Good price, fast delivery, the Great John Tukey, used book condition described accurately.

⭐Wow. I had assumed this was out of print, but here it is. This is simply a remarkable book about the basics of thinking numerically. Even if you do your stats on a computer I recommend this. I use it to show Arts students that quantitative analysis can be fun, powerful and simple.

⭐In the preface, Tukey writes, “this book … exists to expose its readers and users to a considerable variety of techniques for looking more effectively at one’s data.” It succeeds remarkably well and is still relevant: after over 40 years, many of the techniques still have not been incorporated in commercial software and some of those that have (such as box-and-whisker plots and robust smoothing) are often emasculated.This book has served me well for decades: I have used most of its techniques in my statistical consulting practice and, more recently, have used it as a foundation for courses in data analysis that range from a few hours to an entire semester. Students always appreciate the practical experience and set of tools they acquire. The more experienced ones comment on the insight: “I never fully understood what the box-and-whisker plot really did until now” is a recent example from a mid-career engineering professional.Nevertheless, it is true that some of the material is outmoded due to its focus on manual calculation and some of the rest may be too idiosyncratic for most. What remains–which is plenty–can be studied on its own, because this book is designed for self-study: most of the chapter groups are independent of all but the introductory material, they provide detailed examples, ask many thought-provoking questions, and supply many datasets for practice. Tukey’s methods speak for themselves through the gains in insight they provide, so he is content to show *how* to do them and to provide copious examples. What he does not do is supply the mathematical theory. If you like, you can read about that in Hoaglin, Mosteller, and Tukey’s “Understanding Robust and Exploratory Data Analysis”.The highlights of this book, in terms of techniques, are:* Chapters 1-4 on graphing data and on basic, useful data summaries: stem-and-leaf plots and n-letter summaries. Most statistical software now provides these. They are of primary interest as building blocks in more advanced analyses.* “Straightening out plots:” simple, effective techniques to re-express the independent and dependent variables in a bivariate scatterplot so that the relationship becomes approximately linear. (Chapters 5 and 6.) I am not aware of any software that does this, but the techniques are so simple and elegant you can still carry them out with pencil and paper (or a spreadsheet) even with huge datasets.* Smoothing sequences (chapters 7, 8, and 16). The “3rssh” method has largely been displaced by Lowess smooths in software, but the principles and ideas of smoothing, “roughing,” and “re-roughing” are timeless.* Analyses of two-way tables using median polish. This technique has recently been exploited in various fields, including spatial statistics, but only in the most elementary way. Think of this as a robust version of Analysis of Variance, with a focus on finding the effects, but without any of the mathematical apparatus. (Chapters 10 and 11.) One of the best tools for performing median polish is a spreadsheet.* Advanced fitting of two-way tables: adding an interaction term; transforming the dependent variable; plotting the fits. (Chapters 12 and 13.)* Techniques for re-expressing and analyzing counts and fractions. (Chapters 15, 17, and 18.)Additional material covers “delineations” of scatterplots (chapters 8, 9, and 14) and assessing distributions (chapters 19 and 20). The latter suffers in retrospect from not using probability plotting methods. The former is worth learning but there does not seem to be any simple way to use modern statistical software to create these delineations.In brief, this book requires no more mathematical prerequisite than facility with arithmetic, but after working through it, the diligent reader will come away with a body of techniques for understanding almost any kind of data set, including methods of time series analysis, regression, analysis of variance, and contingency table analysis.

⭐I reviewed this book when it first appeared in 1977 — having been reading preliminary versions for about a decade before that.At the time I offered the not-very-prescient opinion that it would quickly become a classic. It has.The various reviews on this site are all correct. Yes there are still wonderful ideas to be mined (easy univariate transformations to symmetry, transforms to linearity to aid in curve fitting, the crucial importance of robustness, of looking at outliers and fringeliers, and the dominant role that graphics plays in forcing us to see what we never expected).And yes, there is some material that is out-dated (e.g. using break points to estimate logs in your head).But these are beside the point — Scholars still study Talmud and Newton’s Principia. Why? Obviously there are many reasons varying with the work and the reader.For me a principal reason for reading (and rereading) EDA is to get a close look at how a first class mind works, with the hope that when faced with a similar problem we can then try to emulate him. To this day as I read EDA I can see Tukey’s smiling face patiently explaining to me how to look at data and exhorting me not to miss subtle hints.This was a book to be treasured when it first became available, and I see no reason for that judgment to change now, or in the future.

⭐Clássico inestimável para quem trabalha com dados.Bought as a gift. My other half advised a good quality book with lots of great information. Just what was wanted

Keywords

Free Download Exploratory Data Analysis 1st Edition in PDF format
Exploratory Data Analysis 1st Edition PDF Free Download
Download Exploratory Data Analysis 1st Edition 2010 PDF Free
Exploratory Data Analysis 1st Edition 2010 PDF Free Download
Download Exploratory Data Analysis 1st Edition PDF
Free Download Ebook Exploratory Data Analysis 1st Edition

Previous articleRepresentations of Integers as Sums of Squares by E. Grosswald (PDF)
Next articleA Tribute to Emil Grosswald: Number Theory and Related Analysis (Contemporary Mathematics) by Emil Grosswald (PDF)