Highlights
- In the “big five”.
- Reporting of variable selection methods is insufficient.
- Data-driven methods are not commonly used in causal explanatory models.
- The addition of an adjustment variable is common in sensitivity analyses.
Abstract
Keywords
Journal of Clinical Epidemiology
Article info
Publication history
Footnotes
Author statement: Thibaut Pressat-Laffouilhère: Conceptualization, Data curation, Formal analysis, Methodology, Writing – original draft, Writing – review & editing. Romain Jouffroy: Methodology, Validation, Data curation, Writing – review & editing. Adrien Leguillou: Methodology, Validation, Data curation, Writing – review & editing. André Gillibert: Conceptualization, Methodology, Writing – review & editing. Gaetan Kerdelhue: Conceptualization, Methodology, Writing – review & editing. Jacques Bénichou: Conceptualization, Methodology, Writing – review & editing.
Conflict of interest: None.