Data pre-processing for analyzing microbiome data – A mini review

Ruwen Zhou, Siu Kin Ng, Joseph Jao Yiu Sung, Wilson Wen Bin Goh*, Sunny Hei Wong*

*Corresponding author for this work

Research output: Contribution to journalShort surveypeer-review

11 Citations (Scopus)

Abstract

The human microbiome is an emerging research frontier due to its profound impacts on health. High-throughput microbiome sequencing enables studying microbial communities but suffers from analytical challenges. In particular, the lack of dedicated preprocessing methods to improve data quality impedes effective minimization of biases prior to downstream analysis. This review aims to address this gap by providing a comprehensive overview of preprocessing techniques relevant to microbiome research. We outline a typical workflow for microbiome data analysis. Preprocessing methods discussed include quality filtering, batch effect correction, imputation of missing values, normalization, and data transformation. We highlight strengths and limitations of each technique to serve as a practical guide for researchers and identify areas needing further methodological development. Establishing robust, standardized preprocessing will be essential for drawing valid biological conclusions from microbiome studies.

Original languageEnglish
Pages (from-to)4804-4815
Number of pages12
JournalComputational and Structural Biotechnology Journal
Volume21
DOIs
Publication statusPublished - Jan 2023
Externally publishedYes

Bibliographical note

Publisher Copyright:
© 2023 The Authors

ASJC Scopus Subject Areas

  • Biotechnology
  • Biophysics
  • Structural Biology
  • Biochemistry
  • Genetics
  • Computer Science Applications

Keywords

  • 16S rRNA Sequencing
  • Batch Effect
  • Data Preprocessing
  • Microbiome Data
  • Normalization

Fingerprint

Dive into the research topics of 'Data pre-processing for analyzing microbiome data – A mini review'. Together they form a unique fingerprint.

Cite this