This repository contains a single, end-to-end Python notebook that integrates The Cancer Genome Atlas (TCGA) Breast Cancer (BRCA) multi-omics profiles—gene expression (RNA-seq), DNA methylation (450K), and copy-number variation (CNV) — to discover unsupervised patient groups and interpret them biologically.
Projects goals:
Biological interpretation via:
| Figure |
|---|
![]() Figure 1: CNV of key genes by cluster |
![]() Figure 2: Integrated multi-omics embedding (colored by cluster) |
![]() Figure 3: Marker gene expression by multi-omics cluster |
![]() Figure 4: Overall survival by multi-omics cluster (TCGA BRCA) |
git clone https://github.com/Anwesha19-prog/TCGA-BRCA-Multi-Omics-Intergration
cd TCGA-BRCA-Multi-Omics-Intergration
python -m requirements.txt
jupyter notebook Project-3.ipynb