Statistical Thinking: Introduction

Friday, January 13, 2017

Introduction

Statistics is a science unto itself that benefits all other fields and everyday life. What is unique about statistics is its proven tools for decision making in the face of uncertainty, for understanding sources of variation and bias, and, most importantly, statistical thinking. Statistical thinking is a different way of thinking that is part detective work and part skepticism, and that involves alternate takes on a problem. An excellent example of statistical thinking is statistician Abraham Wald's analysis of British bombers surviving to return to their base in World War II: his conclusion was to reinforce bombers in areas in which no damage was observed, because bombers hit in those areas were presumably the ones that did not return. For other great examples watch my colleague Chris Fonnesbeck's Statistical Thinking for Data Science.
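The reasoning behind Wald's conclusion can be illustrated with a small simulation. This is only a sketch under invented assumptions: each plane takes exactly one hit, every section is equally likely to be hit, and the per-section loss probabilities in `LOSS_PROB` are hypothetical numbers chosen for illustration, not historical data.

```python
import random

# Hypothetical probabilities that a single hit in each section downs the plane.
# These numbers are invented for illustration only.
LOSS_PROB = {"engine": 0.6, "cockpit": 0.4, "fuselage": 0.1, "wings": 0.1}


def simulate_returning_damage(n_planes=100_000, seed=1):
    """Count hits by section among all planes and among survivors only."""
    rng = random.Random(seed)
    sections = list(LOSS_PROB)
    all_hits = dict.fromkeys(sections, 0)
    observed_hits = dict.fromkeys(sections, 0)
    for _ in range(n_planes):
        section = rng.choice(sections)  # every section equally likely to be hit
        all_hits[section] += 1
        survived = rng.random() >= LOSS_PROB[section]
        if survived:
            observed_hits[section] += 1  # only returning planes can be inspected
    return all_hits, observed_hits


all_hits, observed_hits = simulate_returning_damage()
total_observed = sum(observed_hits.values())
for section in LOSS_PROB:
    share = observed_hits[section] / total_observed
    print(f"{section:9s} share of damage seen on returning planes: {share:.2f}")
```

Although hits are spread evenly over the four sections, the returning planes show the least damage in the sections where a hit is most dangerous, which is why the undamaged areas are the ones to reinforce.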

Some of my personal philosophy of statistics can be summed up in the list below:

Statistics needs to be fully integrated into research; experimental design is all-important
Don't be afraid of using modern methods
Preserve all the information in the data; avoid categorizing continuous variables and predicted values at all costs
Don't assume that anything operates linearly
Account for model uncertainty and avoid it when possible by using subject matter knowledge
Use the bootstrap routinely
Make the sample size a random variable when possible
Use Bayesian methods whenever possible
Use excellent graphics, liberally
To be trustworthy, research must be reproducible
All data manipulation and statistical analysis must be reproducible (one ramification being that I advise against the use of point and click software in most cases)
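To make the bootstrap item in the list above concrete, here is a minimal sketch of a nonparametric percentile bootstrap confidence interval for a median. The function name `bootstrap_ci`, the simulated skewed sample, and the choice of 2000 resamples are all assumptions made for illustration; other bootstrap interval types exist and may behave better in small samples.

```python
import random
import statistics


def bootstrap_ci(data, stat=statistics.median, n_boot=2000, level=0.95, seed=0):
    """Percentile bootstrap interval: resample with replacement, recompute stat."""
    rng = random.Random(seed)
    n = len(data)
    estimates = sorted(stat(rng.choices(data, k=n)) for _ in range(n_boot))
    tail = (1 - level) / 2
    lower = estimates[round(tail * n_boot)]
    upper = estimates[round((1 - tail) * n_boot) - 1]
    return lower, upper


# Hypothetical right-skewed sample (exponential), where a normal-theory
# interval for the median would not be the natural tool.
rng = random.Random(123)
sample = [rng.expovariate(1.0) for _ in range(200)]

lower, upper = bootstrap_ci(sample)
print(f"sample median: {statistics.median(sample):.3f}")
print(f"95% percentile bootstrap interval: ({lower:.3f}, {upper:.3f})")
```

Fixing the seed and keeping the whole analysis in a script also serves the reproducibility items in the list: rerunning the file gives the same interval.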

Statistics has multiple challenges today, which I break down into three major sources:

1. Statistics has been and continues to be taught in a traditional way, leading to statisticians believing that our historical approach to estimation, prediction, and inference was good enough.
2. Statisticians do not receive sufficient training in computer science and computational methods, too often leaving those areas to others who get so good at dealing with vast quantities of data that they assume they can be self-sufficient in statistical analysis and not seek involvement of statisticians. Many persons who analyze data do not have sufficient training in statistics.
3. Subject matter experts (e.g., clinical researchers and epidemiologists) try to avoid statistical complexity by "dumbing down" the problem using dichotomization, and statisticians, always trying to be helpful, fail to argue the case that dichotomization of continuous or ordinal variables is almost never an appropriate way to view or analyze data. Statisticians in general do not sufficiently involve themselves in measurement issues.
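The information lost by dichotomization can be seen in a small simulation. This is a sketch under stated assumptions: the predictor is standard normal, the outcome is the predictor plus normal noise (a linear relationship is assumed here only to keep the example short), and the cut is made at the sample median. The helper `pearson` is written out so the block needs only the standard library.

```python
import math
import random


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


rng = random.Random(42)
n = 5000
x = [rng.gauss(0, 1) for _ in range(n)]   # continuous predictor
y = [xi + rng.gauss(0, 1) for xi in x]    # outcome = predictor + noise

# Median split: replace each value by a "high"/"low" indicator.
cut = sorted(x)[n // 2]
x_high = [1.0 if xi > cut else 0.0 for xi in x]

r_continuous = pearson(x, y)
r_dichotomized = pearson(x_high, y)
print(f"correlation using continuous x:   {r_continuous:.3f}")
print(f"correlation using median-split x: {r_dichotomized:.3f}")
print(f"ratio: {r_dichotomized / r_continuous:.3f}")
```

For a normally distributed predictor split at its median, theory gives a ratio of about 0.80 between the two correlations, so roughly a third of the explained variation is discarded before any analysis begins.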

I will be discussing several of these issues in future blog posts, especially item 1 above and items 2 and 4 below.

Complacency in the field of statistics and in statistical education has resulted in

1. reliance on large-sample theory so that inaccurate normal distribution-based tools can be used, as opposed to tailoring the analyses to data characteristics using the bootstrap and semiparametric models
2. belief that null hypothesis significance testing ever answered the scientific question and that p-values are useful
3. avoidance of the likelihood school of inference (relative likelihood, likelihood support intervals, likelihood ratios, etc.)
4. avoidance of Bayesian methods (posterior distributions, credible intervals, predictive distributions, etc.)
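As a rough illustration of the likelihood and Bayesian quantities named in the list above, the sketch below analyzes a single hypothetical binomial result (7 events in 20 trials). The data, the 1/8 relative likelihood cutoff, the uniform Beta(1, 1) prior, and the function names are all assumptions chosen for illustration, not recommendations from the text.

```python
import math
import random


def log_lik(p, y, n):
    """Binomial log-likelihood, up to an additive constant (needs 0 < p < 1)."""
    return y * math.log(p) + (n - y) * math.log(1 - p)


def support_interval(y, n, k=8, grid_size=9999):
    """Range of p whose relative likelihood L(p)/L(mle) is at least 1/k."""
    max_ll = log_lik(y / n, y, n)  # requires 0 < y < n
    grid = [(i + 1) / (grid_size + 1) for i in range(grid_size)]
    inside = [p for p in grid if log_lik(p, y, n) - max_ll >= -math.log(k)]
    return min(inside), max(inside)


def beta_posterior_summary(y, n, a=1.0, b=1.0, draws=20000, level=0.95, seed=0):
    """Posterior mean (exact) and equal-tailed credible interval (by sampling)."""
    rng = random.Random(seed)
    # With a Beta(a, b) prior, the posterior is Beta(a + y, b + n - y).
    sample = sorted(rng.betavariate(a + y, b + n - y) for _ in range(draws))
    tail = (1 - level) / 2
    lower = sample[round(tail * draws)]
    upper = sample[round((1 - tail) * draws) - 1]
    mean = (a + y) / (a + b + n)
    return mean, lower, upper


y_events, n_trials = 7, 20  # hypothetical data
si_low, si_high = support_interval(y_events, n_trials)
post_mean, ci_low, ci_high = beta_posterior_summary(y_events, n_trials)

print(f"1/8 likelihood support interval: ({si_low:.3f}, {si_high:.3f})")
print(f"posterior mean: {post_mean:.3f}")
print(f"95% credible interval: ({ci_low:.3f}, {ci_high:.3f})")
```

Neither summary relies on a large-sample normal approximation: the support interval comes directly from the likelihood function, and the credible interval comes directly from the posterior distribution.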

I was interviewed by Kevin Gray in July 2017; more of my opinions about statistics may be found in that interview.

Comments:

Fr., January 16, 2017 at 12:28 AM
I'm looking forward to reading more on your blog about the many points that you make above, and perhaps especially about the point to do with how statisticians should be engaging more in measurement.
Unknown, January 22, 2017 at 11:10 AM
Well, I am interested. I am an epidemiologist but like statistics A LOT. I would like to learn from you as well.
Unknown, February 3, 2017 at 1:05 AM
Thanks for the good articles. Since I studied all by myself with YouTube, several books, and MOOCs, I have a lot of questions and lack an understanding of context; these articles may be a big help for me.