vignettes/EpiTxDb-creation.Rmd
EpiTxDb-creation.Rmd
To create an EpiTxDb
object a number of different
functions are available. The most univeral functions are
makeEpiTxDb
and makeEpiTxDbFromGRanges
.
makeEpiTxDb
uses four data.frame
s as input,
whereas makeEpiTxDbFromGRanges
is a wrapper for information
available as a GRanges
object.
The other functions are makeEpiTxDbFromRMBase
and
makeEpiTxDbFromtRNAdb
, which are aimed to make data
available from the RMBase v2.0 database (Xuan et al. 2017; Sun et al. 2015) or the
tRNAdb (Jühling et al.
2009; Sprinzl and Vassilenko 2005). However, before creating
your EpiTxDb
objects, have a look at the already available
resources for H. sapiens
, M. musculus
and
S. cerevisiae.
Additional metadata can be provided as separate
data.frame
for all functions. The data.frame
must have two columns name
and value
.
## Warning: replacing previous import 'GenomicFeatures::makeTxDbPackage' by
## 'txdbmaker::makeTxDbPackage' when loading 'EpiTxDb'
## Warning: replacing previous import
## 'GenomicFeatures::supportedUCSCFeatureDbTables' by
## 'txdbmaker::supportedUCSCFeatureDbTables' when loading 'EpiTxDb'
## Warning: replacing previous import 'GenomicFeatures::makeTxDbFromGFF' by
## 'txdbmaker::makeTxDbFromGFF' when loading 'EpiTxDb'
## Warning: replacing previous import
## 'GenomicFeatures::makeTxDbPackageFromBiomart' by
## 'txdbmaker::makeTxDbPackageFromBiomart' when loading 'EpiTxDb'
## Warning: replacing previous import
## 'GenomicFeatures::supportedMiRBaseBuildValues' by
## 'txdbmaker::supportedMiRBaseBuildValues' when loading 'EpiTxDb'
## Warning: replacing previous import
## 'GenomicFeatures::supportedUCSCFeatureDbTracks' by
## 'txdbmaker::supportedUCSCFeatureDbTracks' when loading 'EpiTxDb'
## Warning: replacing previous import 'GenomicFeatures::browseUCSCtrack' by
## 'txdbmaker::browseUCSCtrack' when loading 'EpiTxDb'
## Warning: replacing previous import 'GenomicFeatures::getChromInfoFromBiomart'
## by 'txdbmaker::getChromInfoFromBiomart' when loading 'EpiTxDb'
## Warning: replacing previous import 'GenomicFeatures::makeTxDbFromGRanges' by
## 'txdbmaker::makeTxDbFromGRanges' when loading 'EpiTxDb'
## Warning: replacing previous import 'GenomicFeatures::supportedUCSCtables' by
## 'txdbmaker::supportedUCSCtables' when loading 'EpiTxDb'
## Warning: replacing previous import 'GenomicFeatures::makeTxDbFromBiomart' by
## 'txdbmaker::makeTxDbFromBiomart' when loading 'EpiTxDb'
## Warning: replacing previous import 'GenomicFeatures::UCSCFeatureDbTableSchema'
## by 'txdbmaker::UCSCFeatureDbTableSchema' when loading 'EpiTxDb'
## Warning: replacing previous import 'GenomicFeatures::makeTxDbFromEnsembl' by
## 'txdbmaker::makeTxDbFromEnsembl' when loading 'EpiTxDb'
## Warning: replacing previous import 'GenomicFeatures::makeTxDb' by
## 'txdbmaker::makeTxDb' when loading 'EpiTxDb'
## Warning: replacing previous import 'GenomicFeatures::makeFDbPackageFromUCSC' by
## 'txdbmaker::makeFDbPackageFromUCSC' when loading 'EpiTxDb'
## Warning: replacing previous import 'GenomicFeatures::makeTxDbFromUCSC' by
## 'txdbmaker::makeTxDbFromUCSC' when loading 'EpiTxDb'
## Warning: replacing previous import 'GenomicFeatures::makeTxDbPackageFromUCSC'
## by 'txdbmaker::makeTxDbPackageFromUCSC' when loading 'EpiTxDb'
## Warning: replacing previous import 'GenomicFeatures::makePackageName' by
## 'txdbmaker::makePackageName' when loading 'EpiTxDb'
## Warning: replacing previous import 'GenomicFeatures::makeFeatureDbFromUCSC' by
## 'txdbmaker::makeFeatureDbFromUCSC' when loading 'EpiTxDb'
makeEpiTxDb
and
makeEpiTxDbFromGRanges
The creation of an etdb object is quite easy starting with a
GRanges
object.
gr <- GRanges(seqnames = "test",
ranges = IRanges::IRanges(1,1),
strand = "+",
DataFrame(mod_id = 1L,
mod_type = "Am",
mod_name = "Am_1"))
etdb <- makeEpiTxDbFromGRanges(gr, metadata = data.frame(name = "test",
value = "Yes"))
## Creating EpiTxDb object ... done
etdb
## EpiTxDb object:
## # Db type: EpiTxDb
## # Supporting package: EpiTxDb
## # test: Yes
## # Nb of modifications: 1
## # Db created by: EpiTxDb package from Bioconductor
## # Creation time: 2024-03-26 22:07:01 +0000 (Tue, 26 Mar 2024)
## # EpiTxDb version at creation time: 1.15.3
## # RSQLite version at creation time: 2.3.5
## # DBSCHEMAVERSION: 1.0
metadata(etdb)
## name value
## 1 Db type EpiTxDb
## 2 Supporting package EpiTxDb
## 3 test Yes
## 4 Nb of modifications 1
## 5 Db created by EpiTxDb package from Bioconductor
## 6 Creation time 2024-03-26 22:07:01 +0000 (Tue, 26 Mar 2024)
## 7 EpiTxDb version at creation time 1.15.3
## 8 RSQLite version at creation time 2.3.5
## 9 DBSCHEMAVERSION 1.0
Additional data can be provided via the metadata columns of the
GRanges
object. For supported columns have a look at
?makeEpiTxDb
or ?makeEpiTxDbFromGRanges
.
makeEpiTxDbFromtRNAdb
The information of the tRNAdb can be accessed via the
tRNAdbImport
package using the RNA database. As a result a
ModRNAStringSet
object is returned from which the
modifications can be extracted using separate()
.
The only input require is a valid organism name returned by
listAvailableOrganismsFromtRNAdb()
.
# Currently not run since the server is not available
etdb <- makeEpiTxDbFromtRNAdb("Saccharomyces cerevisiae")
etdb
For additional information have a look at
?makeEpiTxDbFromtRNAdb
. The result returned from the tRNAdb
is also available as GRanges
object, if
gettRNAdbDataAsGRanges()
is used.
makeEpiTxDbFromRMBase
Analogous to the example above makeEpiTxDbFromRMBase()
will download the data from the RMBase v2.0. Three inputs are required,
organism
, genome
and modtype
,
which have to valid bia the functions
listAvailableOrganismsFromRMBase()
,
.listAvailableGenomesFromRMBase()
and
listAvailableModFromRMBase
.
etdb <- makeEpiTxDbFromRMBase(organism = "yeast",
genome = "sacCer3",
modtype = "m1A")
Internally, the files are cached using the BiocFileCache
package and passed to makeEpiTxDbFromRMBaseFiles()
, which
can also be used with locally stored files. The resuls for creating the
EpiTxDb
class are processed from these files via the
getRMBaseDataAsGRanges()
function.
## R Under development (unstable) (2024-03-24 r86185)
## Platform: x86_64-pc-linux-gnu
## Running under: Ubuntu 22.04.4 LTS
##
## Matrix products: default
## BLAS: /usr/lib/x86_64-linux-gnu/openblas-pthread/libblas.so.3
## LAPACK: /usr/lib/x86_64-linux-gnu/openblas-pthread/libopenblasp-r0.3.20.so; LAPACK version 3.10.0
##
## locale:
## [1] LC_CTYPE=en_US.UTF-8 LC_NUMERIC=C
## [3] LC_TIME=en_US.UTF-8 LC_COLLATE=en_US.UTF-8
## [5] LC_MONETARY=en_US.UTF-8 LC_MESSAGES=en_US.UTF-8
## [7] LC_PAPER=en_US.UTF-8 LC_NAME=C
## [9] LC_ADDRESS=C LC_TELEPHONE=C
## [11] LC_MEASUREMENT=en_US.UTF-8 LC_IDENTIFICATION=C
##
## time zone: UTC
## tzcode source: system (glibc)
##
## attached base packages:
## [1] stats4 stats graphics grDevices utils datasets methods
## [8] base
##
## other attached packages:
## [1] EpiTxDb_1.15.3 Modstrings_1.19.0 Biostrings_2.71.5
## [4] XVector_0.43.1 AnnotationDbi_1.65.2 Biobase_2.63.0
## [7] GenomicRanges_1.55.4 GenomeInfoDb_1.39.9 IRanges_2.37.1
## [10] S4Vectors_0.41.5 BiocGenerics_0.49.1 BiocStyle_2.31.0
##
## loaded via a namespace (and not attached):
## [1] DBI_1.2.2 bitops_1.0-7
## [3] httr2_1.0.0 tRNAdbImport_1.21.1
## [5] biomaRt_2.59.1 rlang_1.1.3
## [7] magrittr_2.0.3 matrixStats_1.2.0
## [9] compiler_4.4.0 RSQLite_2.3.5
## [11] GenomicFeatures_1.55.4 png_0.1-8
## [13] systemfonts_1.0.6 vctrs_0.6.5
## [15] txdbmaker_0.99.7 stringr_1.5.1
## [17] pkgconfig_2.0.3 crayon_1.5.2
## [19] fastmap_1.1.1 dbplyr_2.5.0
## [21] utf8_1.2.4 Rsamtools_2.19.4
## [23] rmarkdown_2.26 ragg_1.3.0
## [25] purrr_1.0.2 bit_4.0.5
## [27] xfun_0.43 zlibbioc_1.49.3
## [29] cachem_1.0.8 Structstrings_1.19.1
## [31] jsonlite_1.8.8 progress_1.2.3
## [33] blob_1.2.4 DelayedArray_0.29.9
## [35] BiocParallel_1.37.1 parallel_4.4.0
## [37] prettyunits_1.2.0 R6_2.5.1
## [39] bslib_0.6.2 stringi_1.8.3
## [41] rtracklayer_1.63.1 jquerylib_0.1.4
## [43] bookdown_0.38 SummarizedExperiment_1.33.3
## [45] knitr_1.45 Matrix_1.7-0
## [47] tidyselect_1.2.1 abind_1.4-5
## [49] yaml_2.3.8 codetools_0.2-19
## [51] curl_5.2.1 tRNA_1.21.2
## [53] lattice_0.22-6 tibble_3.2.1
## [55] KEGGREST_1.43.0 evaluate_0.23
## [57] desc_1.4.3 BiocFileCache_2.11.1
## [59] xml2_1.3.6 pillar_1.9.0
## [61] BiocManager_1.30.22 filelock_1.0.3
## [63] MatrixGenerics_1.15.0 rex_1.2.1
## [65] generics_0.1.3 RCurl_1.98-1.14
## [67] hms_1.1.3 ggplot2_3.5.0
## [69] munsell_0.5.0 scales_1.3.0
## [71] glue_1.7.0 tools_4.4.0
## [73] BiocIO_1.13.0 GenomicAlignments_1.39.4
## [75] fs_1.6.3 XML_3.99-0.16.1
## [77] grid_4.4.0 colorspace_2.1-0
## [79] GenomeInfoDbData_1.2.11 restfulr_0.0.15
## [81] cli_3.6.2 rappdirs_0.3.3
## [83] textshaping_0.3.7 fansi_1.0.6
## [85] S4Arrays_1.3.6 dplyr_1.1.4
## [87] gtable_0.3.4 sass_0.4.9
## [89] digest_0.6.35 SparseArray_1.3.4
## [91] rjson_0.2.21 memoise_2.0.1
## [93] htmltools_0.5.8 pkgdown_2.0.7
## [95] lifecycle_1.0.4 httr_1.4.7
## [97] bit64_4.0.5