Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Step-by-step tutorials, videos, and other materials to get you started.
Recording of tutorial given for NCI in May 2021. Includes closed captioning.
https://cbiit.webex.com/cbiit/lsr.php?RCID=acf4ea46dd9e41338662f0ba1ac59754
How do I ... guides are useful if you have a specific question.
Workshops are a great way to teach a group of people how to use Xena. They can be 1-hour, 1/2-day, or 1-day in length. Currently we are only giving workshops remotely via Zoom or a similar technology. We give workshops both within the USA and internationally. Please contact us for more information: genome-cancer@soe.ucsc.edu
Learn how to view whole chromosomes and view advanced datasets such as exon expression
This tutorial is made for those who have basic knowledge of how to use Xena. We will cover how to view whole chromosome and how to use the advanced dataset menu to access datasets such as exon expression.
This tutorial assumes basic knowledge of how to build and read a Visual Spreadsheet. To get this, go through Basic Tutorial: Section 1.
10 min
Create a visual spreadsheet that with a chromosome-wide column and data from the advanced dataset menu.
We will look at the ERG-TMPRSS2 gene fusion in patients from the TCGA Prostate Cancer study.
ERG is an oncogene that expressed at low levels in normal prostate tissue. Some patient's prostate cancer samples have higher expression of ERG. These samples tend to have an intra-chromosomal deletion that fuses ERG to TMPRSS2. TMPRSS2 is expressed at high levels in normal prostate tissue. This allows ERG to use the TMPRSS2 promoter to increase ERG expression.
Note that column D may look slightly different, depending on how you resize and zoom the column.
We can now see that there are many patient's samples with relatively high expression of ERG (column B). This relatively high expression is not uniform across the exons of ERG, but instead is in the exons closer to the 3' end of the gene (column C). Looking at column D, we can see that these samples also have an intra-chromosomal deletion of part of chromosome 21. If we hover over the genes at either end of the deletion, we can see that the end points fall within ERG and TMPRSS2.
Start at https://xenabrowser.net/
Type 'TCGA Prostate Cancer (PRAD)', select this study from the drop down menu, and click 'To first variable'.
Type 'ERG', select the checkbox for Gene Expression and click 'To second variable'.
Type 'ERG', click 'Show Advanced', select the checkbox for 'IlluminaHiSeq' under 'exon expression RNAseq', and click 'Done'.
Click the text 'Click to insert a column' after column C. Type 'chr21', select the checkbox for Copy Number and click 'Done'.
Click on the filter menu and select 'Remove samples with nulls'
Click on the handle in the lower right corner of column E, copy number for chromosome 21. Move it to the right to make the column bigger.
Click and drag within column E, copy number for chromosome 21 to zoom into the intra-chromosomal deletion.
More information:
Add copy number data for chromosome 1.
Add DNA Methylation data for ERG.
Learn to create your first views in Xena
This tutorial is made for those who have never used Xena. We will cover how to create a Visual Spreadsheet with gene expression, mutation, and copy number variation data.
This tutorial assumes basic knowledge of
gene expression, copy number variation, and mutational genomic sequencing data
how a change in copy number variation or mutations can lead to a change in gene expression
The Cancer Genome Atlas (TCGA)
These resources can help you gain basic knowledge of these concepts:
Part A: 5 min
Part B: 10 min
Part A
Create a Visual Spreadsheet
Compare data across columns
Part B
Move columns
Resize columns
Zoom in and out
We are going to look at EGFR aberrations in patients with lung adenocarcinomas using TCGA data. We will be looking at mutations and copy number aberrations and how they change gene expression.
Our goal is to build a Visual Spreadsheet and understand the relationship between the columns of data.
Type 'GDC TCGA Lung Adenocarcinoma (LUAD)', select this study from the drop down menu, and click 'To first variable'.
Type 'EGFR', select the checkboxes for Gene Expression, Copy Number, and Somatic Mutation, and click 'To second variable'.
How to read a Visual Spreadsheet
Samples are on the y-axis and your columns of data are on the x-axis. We line up columns so that each row is the same sample, allowing you to easily see trends in the data. Data is always sorted left to right and sub-sorted on columns thereafter.
We can see that samples from TCGA patients that have high expression of EGFR (red, column B) tend to either have amplifications of EGFR (red, column C) or mutations in EGFR (blue tick marks, column D).
More information
Making your own Visual Spreadsheet: Which TCGA study to choose
We will now move the columns to change the sort order and resize columns. We will zoom in to the whole Visual Spreadsheet and also within a column.
Move columns. Click column C, copy number variation, and drag it to the left so that it becomes the first column after the samples column (i.e. column B). Note that the samples are now sorted by the values in this column.
Resize columns. Click the handle in the lower right corner of column D, mutation. Move it to the right to make the column bigger.
Zoom in on a column. Click and drag within column D. Release to zoom.
Zoom out on a column. Click the red zoom out text at the top of column D.
Zoom in on samples. Click and drag vertically in any column in the Visual Spreadsheet to zoom in on these samples.
Zoom out on samples. To zoom out click either 'Zoom out' or 'Clear zoom' at the top of the Visual Spreadsheet.
More information
Create a Visual Spreadsheet looking at TP53 gene expression and mutation in samples from patients in the GDC TCGA Lower Grade Glioma study.
Change the Visual Spreadsheet from Question 1 so that the patient's samples are sorted by mutations rather than gene expression.
Learn how to remove samples with no data, subgroup samples, and make Kaplan Meier plots
This tutorial is made for those who have never used Xena but who have completed Section 1 of the Basic Tutorial. We will cover how to filter to just the samples you are interested in, how to create subgroups, and how to run a Kaplan Meier survival analysis.
This tutorial assumes completion of the . This tutorial begins where the Basic Tutorial: Section 1 ends.
Part A: 7 min
Part B: 15 min
Part C: 5 min
Part A
Search for samples of interest
Remove samples with no data
Part B
Make subgroups
Rename subgroups
Part C
Run a Kaplan Meier survival analysis
Use a custom time endpoint
In the Basic Tutorial Section 1 we found that we found that samples from patients that have aberrations in EGFR have relatively higher expression. These aberrations could be mutations or copy number amplifications.
Now we are going to look at whether those patient with aberrations in their samples also have a worse survival prognosis.
Our goal is to remove patient's samples with no data (i.e. null) from the view. This will make the view look cleaner and remove irrelevant samples from our Kaplan Meier survival analysis.
Type 'null' into the samples search bar. This will highlight samples that have 'null' values in any column on the screen. Null means that there is no data for that sample for that column.
Click the filter menu and select 'Remove samples'.
Delete the search term.
More information
Instead of typing 'null' and removing those samples from the view, you can also use the 'Remove samples with nulls' shortcut in the filter menu.
Our goal is to create two subgroups, those patient's with samples with aberrations in EGFR and those patient's samples without aberrations in EGFR. We will then name the subgroups.
Type '(mis OR inframe) OR B:>0.5' into the samples search bar. This will select samples that either have a missense or inframe deletion '(mis OR inframe)', or where copy number variation (column B) is greater than 0.5. Note that I arbitrarily choose a cutoff of 0.5.
You must have the copy number variation column as column B for the search term '(mis OR inframe) OR B:>0.5' to work. The 'B' in 'B:>0.5' is instructing Xena to search in column B for values that are greater than 0.5.
Click the filter menu and select 'New subgroup column'. This will create a new column that has samples that met our search term marked as 'true' (ie. those that have an EGFR aberration) and those that did not meet our search term as 'false' (ie. those that do not have an EGFR aberration).
Click the column menu for the column we just created (column B) and chose 'Display'.
Rename the display so that samples that are 'true' are instead labeled as 'EGFR Aberrations' and the samples that are 'false' are instead labeled as 'No EGFR Aberrations'. Click 'Done'
Delete the search term. This will remove the black tick marks for matching samples.