Feature selection in clinical proteomics: with great power comes great reproducibility

Wei Wang, Andrew C.H. Sue, Wilson W.B. Goh*

*Corresponding author for this work

Research output: Contribution to journalReview articlepeer-review

29 Citations (Scopus)

Abstract

In clinical proteomics, reproducible feature selection is unattainable given the standard statistical hypothesis-testing framework. This leads to irreproducible signatures with no diagnostic power. Instability stems from high P-value variability (p_var), which is inevitable and insolvable. The impact of p_var can be reduced via power increment, for example increasing sample size and measurement accuracy. However, these are not realistic solutions in practice. Instead, workarounds using existing data such as signal boosting transformation techniques and network-based statistical testing is more practical. Furthermore, it is useful to consider other metrics alongside P-values including confidence intervals, effect sizes and cross-validation accuracies to make informed inferences.

Original languageEnglish
Pages (from-to)912-918
Number of pages7
JournalDrug Discovery Today
Volume22
Issue number6
DOIs
Publication statusPublished - Jun 2017
Externally publishedYes

Bibliographical note

Publisher Copyright:
© 2016 Elsevier Ltd

ASJC Scopus Subject Areas

  • Pharmacology
  • Drug Discovery

Fingerprint

Dive into the research topics of 'Feature selection in clinical proteomics: with great power comes great reproducibility'. Together they form a unique fingerprint.

Cite this