Methrix: an R/bioconductor package for systematic aggregation and analysis of bisulfite sequencing data

Whole-genome bisulfite sequencing (WGBS) measures DNA methylation at base pair resolution resulting in large bedGraph like coverage files. Current options for processing such files are hindered by discrepancies in file format specification, speed, and memory requirements.We developed methrix, an R p...

Full description

Saved in:
Bibliographic Details
Main Authors: Mayakonda Thippeswamy, Anand (Author) , Schönung, Maximilian (Author) , Hey, Joschka (Author) , Batra, Rajbir (Author) , Feuerstein-Akgöz, Clarissa (Author) , Köhler, Kristin (Author) , Lipka, Daniel (Author) , Sotillo, Rocio (Author) , Plass, Christoph (Author) , Lutsik, Pavlo (Author) , Toth, Reka (Author)
Format: Article (Journal)
Language:English
Published: 2020
In: Bioinformatics
Year: 2020, Volume: 36, Issue: 22/23, Pages: 5524-5525
ISSN:1367-4811
DOI:10.1093/bioinformatics/btaa1048
Online Access:Verlag, lizenzpflichtig, Volltext: https://doi.org/10.1093/bioinformatics/btaa1048
Verlag, lizenzpflichtig, Volltext: https://academic.oup.com/bioinformatics/article/36/22-23/5524/6042753
Get full text
Author Notes:Anand Mayakonda, Maximilian Schönung, Joschka Hey, Rajbir Nath Batra, Clarissa Feuerstein-Akgoz, Kristin Köhler, Daniel B Lipka, Rocio Sotillo, Christoph Plass, Pavlo Lutsik and Reka Toth
Description
Summary:Whole-genome bisulfite sequencing (WGBS) measures DNA methylation at base pair resolution resulting in large bedGraph like coverage files. Current options for processing such files are hindered by discrepancies in file format specification, speed, and memory requirements.We developed methrix, an R package, which provides a toolset for systematic analysis of large datasets. Core functionality of the package includes a comprehensive bedGraph or similar tab-separated text file reader—which summarizes methylation calls based on annotated reference indices, infers and collapses strands and handles uncovered reference CpG sites while facilitating a flexible input file format specification. Additional optimized functions for quality control filtering, subsetting and visualization allow user-friendly and effective processing of WGBS results. Easy integration with tools for differentially methylated region (DMR) calling and annotation further eases the analysis of genome-wide methylation data. Overall, methrix enriches established WGBS workflows by bringing together computational efficiency and versatile functionality.Methrix is implemented as an R package, made available under MIT license at https://github.com/CompEpigen/methrix and can be installed from the Bioconductor repository.Supplementary data are available at Bioinformatics online.
Item Description:Advance access publication date: 25 December 2020
Gesehen am 13.09.2021
Physical Description:Online Resource
ISSN:1367-4811
DOI:10.1093/bioinformatics/btaa1048