OpenGDC DATA REPOSITORY

We are maintaining an open-access ftp repository with all the public genomic, clinical, and biospecimen data of DNA-Seq, RNA-Seq, DNA-Methylation, miRNA-Seq and CNV from The Genomic Data Commons (GDC) converted in BED format.
Each data set will be automatically updated when a new version of the same data set will be available on the GDC database.
Currently GDC contains genomic, clinical, and biospecimen data about a large-scale NCI program: TCGA .
The following table shows some statistics about our public data sets.





OpenGDC is a Java software tool that allows searching and retrieving all public genomic, clinical, and biospecimen data of DNA-Seq, RNA-Seq, DNA-Methylation, miRNA-Seq and CNV from one of the largest public repositories of cancer genomic data, The Genomic Data Commons (GDC), and transforming them in BED format, which also allows comprehensively querying them with the GenoMetric Query Language (GMQL). A user-friendly interface is available to search, download, and convert all the public GDC cancer related data sets.


Data Downloader


Data Converter