layout: true --- name: xaringan-title class: inverse, left, middle .pull-left[ # .center[Bad statistics in medical research] ## .center[Adrian Barnett, QUT] ### .center[September 2022] [
@aidybarnett](http://twitter.com/aidybarnett) [
@agbarnett](http://github.com/agbarnett) [
Median Watch](https://medianwatch.netlify.app) [
a.barnett@qut.edu.au](mailto:a.barnett@qut.edu.au) ] .pull-right[ <!--- ![](figures/quality.gif)---> <img src="figures/nature_cover_statistics.jpg" width="150%" style="display: block; margin: auto;" /> (Illustration by David Parkins) ] --- background-image: url(figures/AcknowledgementTraditionalOwners.jpg) background-size: cover --- <img src="figures/lancet_paper.jpg" width="63%" style="display: block; margin: auto;" /> <!--- * Incorrectly used logistic regression instead of survival analysis ---> * 813 hospitalised, 613 (75%) excluded because they were still hospitalised or not confirmed as infected -- ### Two implications: * Bad research is not restricted to "low impact" journals * Has been cited 25,000+ times: Many researchers do not recognise the serious statistical flaw --- <img src="figures/lancet_impact.jpg" width="69%" style="display: block; margin: auto;" /> <!--- https://twitter.com/TheLancet/status/1542088311702962177 ---> <!--- Emphasis on publishing in top-tier journals is a huge driver of the reproducibility crisis ---> --- <img src="figures/scrabble.jpg" width="52%" style="display: block; margin: auto;" /> From [NPR](https://www.npr.org/sections/thetwo-way/2015/07/21/424980378/winner-of-french-scrabble-title-does-not-speak-french) --- class: inverse <img src="figures/gauss.jfif" width="44%" style="display: block; margin: auto;" /> (Image from _Wikipedia_) --- # Continuous data and the normal assumption <img src="slides_files/figure-html/unnamed-chunk-7-1.png" style="display: block; margin: auto;" /> --- # Continuous data and the normal assumption <img src="slides_files/figure-html/unnamed-chunk-8-1.png" style="display: block; margin: auto;" /> --- class: center # P-values <img src="https://imgs.xkcd.com/comics/p_values.png" width="32%" style="display: block; margin: auto;" /> (From _xkcd_) --- <img src="figures/gelman_age.png" width="58%" style="display: block; margin: auto;" /> Hat-tip to [Andrew Gelman](https://statmodeling.stat.columbia.edu/) --- # Quiz question <!--- Add a menti with options ---> #### https://www.menti.com/5624oqzron #### Go to www.menti.com and use the code 3205 7080 The P value is the probability ... * that the null hypothesis is true * that the alternative hypothesis is true * that the study worked * that the results happened by chance * of obtaining the observed or more extreme results if the alternative hypothesis is true --- class:inverse ## The evidence-base is horribly biased <img src="figures/Z_plot.png" width="60%" style="display: block; margin: auto;" /> Z-values extracted from confidence intervals in _Medline_ between 1976 and 2019 --- ## Bad practice in Australia .pull-left[ In the past three years how often have you: * Not attempted to publish a valid "negative" or "neutral" study = 25% * Selected the statistical method that provided the desired result = 8% ... * ... Rises to 30% when asking about witnessing this approach in others From NHMRC [2019 Survey of research culture in Australian NHMRC-funded institutions](https://www.nhmrc.gov.au/file/15321/download?token=PivJ5dGE) ] .pull-right[ <!--- from https://www.craigmarker.com/file-drawer-problem/ ---> ![](figures/filedrawer1-1002x675.jpg) ] --- ## Abandon p-values and statistical significance <img src="figures/pval_journey1.jpg" width="50%" style="display: block; margin: auto;" /> --- ## Abandon p-values and statistical significance <img src="figures/pval_journey2.jpg" width="50%" style="display: block; margin: auto;" /> --- ## Abandon p-values and statistical significance <img src="figures/pval_journey3.jpg" width="50%" style="display: block; margin: auto;" /> --- ## Abandon p-values and statistical significance <img src="figures/pval_journey4.jpg" width="50%" style="display: block; margin: auto;" /> False positive probability = 9 / (9 + 12) = 43%. See [DOI: 10.1098/rsos.140216](https://royalsocietypublishing.org/doi/10.1098/rsos.140216) for details. --- class:inverse ## Abandon p-values and statistical significance .pull-left[ ## "Significance tests are popular with non-statisticians, who like to feel certainty where no certainty exists" (Yates and Healy 1964) DOI: [10.2307/2344003](https://shibbolethsp.jstor.org/start?entityID=https%3A%2F%2Fidp.qut.edu.au%2Fentity&dest=https://www.jstor.org/stable/2344003&site=jstor) ] .pull-right[ <img src="figures/Yates_Fisher_Cochran.jpg" width="60%" style="display: block; margin: auto;" /> Yates, Fisher and Cochran (Source: Barry Eagel, _Wikimedia Commons_) ] --- ## Most studies are under-powered <img src="figures/power_histogram.jpg" width="52%" style="display: block; margin: auto;" /> * Actual power estimated from 20,000 randomised controlled trials in the Cochrane database DOI: [10.1111/1740-9713.01587](https://rss.onlinelibrary.wiley.com/doi/full/10.1111/1740-9713.01587) --- <!--- registered reports ---> <img src="figures/pbio.3000246.g001.png" width="61%" style="display: block; margin: auto;" /> DOI: [https://doi.org/10.1371/journal.pbio.3000246.g001](https://journals.plos.org/plosbiology/article/figure?id=10.1371/journal.pbio.3000246.g001) --- class: inverse, center, middle ![](figures/type_errors.jpg) --- ## Stop developing new diagnostic or prognostic models <img src="figures/Wynants.jpg" width="47%" style="display: block; margin: auto;" /> DOI: [10.1136/bmj.m1328](https://www.bmj.com/content/369/bmj.m1328) <!--- Stop developing prediction models (hard), talk about standard text in justification, talk about stochastic variability, rule out tests might still be useful, e.g. Troponin. Appreciate that data changes. Most predictive value comes from a few variables.---> --- class: inverse <img src="figures/prediction_flow1.png" width="62%" style="display: block; margin: auto;" /> --- class: inverse <img src="figures/prediction_flow2.png" width="68%" style="display: block; margin: auto;" /> From [Maarten Van Smeden](https://twitter.com/MaartenvSmeden/status/1561276708304609281/) --- class:center, inverse, middle ## Medical research funders should no longer fund projects that plan to develop a prediction model unless a qualified statistician is a principal investigator --- class: top, center, inverse background-image: url(figures/podium.jpg) background-size: cover ### .left[Worst ever statistical methods section] -- ## .left[t-test] -- ## .right[SPSS] <!--- http://www.anzctr.org.au/Trial/Registration/TrialReview.aspx?ACTRN=12617001415392 ---> -- ## .center[SSPS] <!--- https://www.anzctr.org.au/Trial/Registration/TrialReview.aspx?id=373697 ---> --- class: inverse .pull-left[ ![](https://media.giphy.com/media/IX2sn8cnqDyRq/giphy.gif) ] .pull-right[ ## "Many people think that all you need to do statistics is a computer and appropriate software." ## Doug Altman ] --- class: center # Figures <img src="https://imgs.xkcd.com/comics/scientific_paper_graph_quality.png" width="65%" style="display: block; margin: auto;" /> (From _xkcd_) --- class: center, middle ![](figures/awful_plot2.jfif) --- class: center, middle ![](figures/awful_plot1.jfif) --- class: center, middle <img src="figures/awful_plot3.jfif" width="98%" style="display: block; margin: auto;" /> --- class: center, middle <img src="figures/awful_plot4.jfif" width="88%" style="display: block; margin: auto;" /> --- ## Stop using pie charts <img src="figures/pie_charts.jfif" width="62%" style="display: block; margin: auto;" /> From https://twitter.com/MaxCRoser/status/857389434756505600 --- class:center, inverse, middle .pull-left[ # This is everybody's problem and nobody's problem ] .pull-right[ ### Grayrigg derailment ![](figures/VirginPendolinoDerailment.jpg) By Lawrence Clift - http://www.lawrencecliftphotography.co.uk/ ] <!--- pathetic amount spent on quality control ---> --- class: center, middle <img src="figures/nature_cover.jpg" width="90%" style="display: block; margin: auto;" /> <!--- journals too often promote publishing “mansions of straw rather than houses of brick.” Bill Kaelin ---> (Illustration by David Parkins) --- # Quality control .pull-left[ ### Randomly audit 100 papers per year from each university & research institute ### Statisticians check the analysis and see if they can reproduce the results <!--- ### Reviewers read the papers and assess originality, reproducibility and translation into practice ---> ] .pull-right[ ![](figures/quality_control.png) ] DOI: [10.1371/journal.pone.0195613](https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0195613) --- class:inverse, center # Statistical robots are coming ![](https://media.giphy.com/media/pIMlKqgdZgvo4/giphy.gif) This is not AI <!--- given up trying to fix things myself ---> --- ### Yuhji Saitoh <img src="figures/saitoh_tables.png" width="75%" style="display: block; margin: auto;" /> --- class:center, middle <img src="figures/smyth.png" width="70%" style="display: block; margin: auto;" /> (from [ABC news](https://www.abc.net.au/news/2022-01-11/qld-cancer-researcher-mark-smyth-fabricated-data-review-finds/100750208)) --- <img src="figures/saitoh_simple.jpg" width="70%" style="display: block; margin: auto;" /> --- class: inverse # The continuing scandal of bad medical research .pull-left[ * 1994: Doug Altman wrote about ‘The scandal of poor medical research’ * 2015: Doug: Wish I'd said 'bad' * 2022: 1013 out of 1659 at high risk of bias. Estimated cost of bad trials was £726 million to £8 billion, DOI: [10.1186/s13063-022-06415-5](https://trialsjournal.biomedcentral.com/articles/10.1186/s13063-022-06415-5) ] .pull-right[ ![](figures/Doug_altman.jpeg) ##### By Oxford Clinical Trials Research Unit ]