Methrix: an R/bioconductor package for systematic aggregation and analysis of bisulfite sequencing data

Whole-genome bisulfite sequencing (WGBS) measures DNA methylation at base pair resolution resulting in large bedGraph like coverage files. Current options for processing such files are hindered by discrepancies in file format specification, speed, and memory requirements.We developed methrix, an R p...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Mayakonda Thippeswamy, Anand (VerfasserIn) , Schönung, Maximilian (VerfasserIn) , Hey, Joschka (VerfasserIn) , Batra, Rajbir (VerfasserIn) , Feuerstein-Akgöz, Clarissa (VerfasserIn) , Köhler, Kristin (VerfasserIn) , Lipka, Daniel (VerfasserIn) , Sotillo, Rocio (VerfasserIn) , Plass, Christoph (VerfasserIn) , Lutsik, Pavlo (VerfasserIn) , Toth, Reka (VerfasserIn)
Dokumenttyp: Article (Journal)
Sprache:Englisch
Veröffentlicht: 2020
In: Bioinformatics
Year: 2020, Jahrgang: 36, Heft: 22/23, Pages: 5524-5525
ISSN:1367-4811
DOI:10.1093/bioinformatics/btaa1048
Online-Zugang:Verlag, lizenzpflichtig, Volltext: https://doi.org/10.1093/bioinformatics/btaa1048
Verlag, lizenzpflichtig, Volltext: https://academic.oup.com/bioinformatics/article/36/22-23/5524/6042753
Volltext
Verfasserangaben:Anand Mayakonda, Maximilian Schönung, Joschka Hey, Rajbir Nath Batra, Clarissa Feuerstein-Akgoz, Kristin Köhler, Daniel B Lipka, Rocio Sotillo, Christoph Plass, Pavlo Lutsik and Reka Toth
Beschreibung
Zusammenfassung:Whole-genome bisulfite sequencing (WGBS) measures DNA methylation at base pair resolution resulting in large bedGraph like coverage files. Current options for processing such files are hindered by discrepancies in file format specification, speed, and memory requirements.We developed methrix, an R package, which provides a toolset for systematic analysis of large datasets. Core functionality of the package includes a comprehensive bedGraph or similar tab-separated text file reader—which summarizes methylation calls based on annotated reference indices, infers and collapses strands and handles uncovered reference CpG sites while facilitating a flexible input file format specification. Additional optimized functions for quality control filtering, subsetting and visualization allow user-friendly and effective processing of WGBS results. Easy integration with tools for differentially methylated region (DMR) calling and annotation further eases the analysis of genome-wide methylation data. Overall, methrix enriches established WGBS workflows by bringing together computational efficiency and versatile functionality.Methrix is implemented as an R package, made available under MIT license at https://github.com/CompEpigen/methrix and can be installed from the Bioconductor repository.Supplementary data are available at Bioinformatics online.
Beschreibung:Advance access publication date: 25 December 2020
Gesehen am 13.09.2021
Beschreibung:Online Resource
ISSN:1367-4811
DOI:10.1093/bioinformatics/btaa1048