Protocol to identify functional doppelgängers and verify biomedical gene expression data using doppelgangerIdentifier

Li Rong Wang*, Xiuyi Fan, Wilson Wen Bin Goh*

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract

Functional doppelgängers (FDs) are independently derived sample pairs that confound machine learning model (ML) performance when assorted across training and validation sets. Here, we detail the use of doppelgangerIdentifier (DI), providing software installation, data preparation, doppelgänger identification, and functional testing steps. We demonstrate examples with biomedical gene expression data. We also provide guidelines for the selection of user-defined function arguments. For complete details on the use and execution of this protocol, please refer to Wang et al. (2022).

Original languageEnglish
Article number101783
JournalSTAR Protocols
Volume3
Issue number4
DOIs
Publication statusPublished - Dec 16 2022
Externally publishedYes

Bibliographical note

Publisher Copyright:
© 2022 The Author(s)

ASJC Scopus Subject Areas

  • General Neuroscience
  • General Biochemistry,Genetics and Molecular Biology
  • General Immunology and Microbiology

Keywords

  • Bioinformatics
  • Computer sciences

Fingerprint

Dive into the research topics of 'Protocol to identify functional doppelgängers and verify biomedical gene expression data using doppelgangerIdentifier'. Together they form a unique fingerprint.

Cite this