Course No. 16 "TCGA Data Analysis"
The Cancer Genome Atlas Data Analysis
This course gives the
background information on how to access The Cancer Genome Atlas (TCGA) data and
provides an introduction to use of various online publicly available tools,
Topics to be covered:
§ Data types
§ Data levels
§ Access Tiers
§ Data download options
§ Mutation details and analysis
§ Obtaining significantly mutated genes and their functional/pathway analysis
§ OncoPrint compact visualization of genomic alterations
§ Information about gene copy number alternations from GISTIC
Visualization of network of mutated genes
TCGA Data Overview:
Click on the Projects tab at the top of the page. Note the primary sites, cancer abbreviations, and various data categories. Return to the Genomics Data Commons Data Portal, and then click on the Data tab link at the top of the page. Note different projects, access levels, data formats, data types and experiment types. Return to the Genomics Data Commons Data Portal again. Click on the Head and Neck link and then on the Cases link.
Click on Cases or Files. You can filter the results by options such as Gender, Age and Vital Status on the left side of the page. Some of this data is “open” and some is “controlled” access.
TCGA Data Analysis Tools:
Please note that some of the features have chaged since then.
Access the GDC home page. Click on the Analysis tab. Note links to the cBioPortal for Cancer Genomics and the Broad Firehose.
Click on the Browse link under the Analyses column for the Head and Neck squamous cell carcinoma. Click on the Mutation Analyses link. Click on the Mutation Analyses. Access the report by clicking on MutSig2.0. Note different kind of data analysis available. We will take a look at the significantly mutated genes.
Click on the + sign next to significanly mutated genes. List of all significanly mutated genes can be obtained by clicking on the Get Full Table link. To see details of the variations on the PIK3CA gene, click on the PIK3CA gene name. Note the muation hot spots around 540.
Go back to the main results page. Click on Geneset Analysis link. Access the full report by clicking on the Get Full Table link.
Select Head and Neck Squamous Cell Carcinoma (TCGA, Nature 2015) as the Cancer Study.
Scroll down and type in genes TP53, CDKN2A, PIK3CA and TRAF3 on separate lines in the “Enter Gene Set” block. Click on the Submit button. Mouse over the area above the picture on the right hand side and use the scroll bar to zoom out.
Note different Sort, Mutation Color, View and Download options.
Note the differences in the type of mutations by HPV status. Explore other tabs such as Mutual exclusivity, Plots, Mutations, Protein Changes, Network and finally the Download.
Plots: Select Clinical Attribute HPV Status on the horizonatl axis and Clinical Attribute Overall Survival (Months) on the Vertical Axis.
On the Mutations tab click on the PIK3CA gene. Note the mutation hot spots at 542 and 545.
Click on the Network tab to visualize network of genes. This page could be used further for protein and drug interactions.Finally, the Download tab can be used to download the data.
TCGA Program Brochure: http://cancergenome.nih.gov/pdfs/TCGA_Program_Brochure_2014
TCGA Data Portal Brochure: http://cancergenome.nih.gov/pdfs/TCGA_DataPortal_Brochure_2014
TCGA Data Flow: https://wiki.nci.nih.gov/display/TCGA/Introduction+to+TCGA
TCGA Data Classification: https://wiki.nci.nih.gov/display/TCGA/Data+Classification
Questions, Comments: Medha Bhagwat, PhD