# Accessing Sequence Data via NCI's Cancer Data Service (CDS)

The CDS Portal, within NCI's Cancer Research Data Commons (CRDC), provides an interface to filter and select data from a variety of NCI programs, including controlled-access, primary sequence data from the Human Tumor Atlas Network (HTAN).

# DRS Manifest Files

To access data via CDS, first generate a CDS Data Repository Service (DRS) manifest listing the files you would like to obtain. A DRS manifest is a CSV file that must include, at minimum, the name and drs_uri of each file of interest. For HTAN data, DRS manifests can be generated in one of three ways:

  1. CDS Portal
  2. HTAN Data Portal
  3. Google BigQuery (Coming Soon!)
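Whichever route produces the manifest, a quick sanity check can catch a malformed file before it is used. The sketch below assumes the manifest has a header row in which the two required fields appear exactly as `name` and `drs_uri`, and that each DRS URI begins with the `drs://` scheme; the function name `check_manifest` and the sample rows are hypothetical.

```python
import csv
import io

# The two fields named above as the minimum a DRS manifest must carry.
REQUIRED_COLUMNS = ("name", "drs_uri")


def check_manifest(handle):
    """Return a list of problems found in a DRS manifest read from an open text handle."""
    reader = csv.DictReader(handle)
    header = reader.fieldnames or []
    missing = [col for col in REQUIRED_COLUMNS if col not in header]
    if missing:
        return [f"missing required column(s): {', '.join(missing)}"]

    problems = []
    # Number data rows from 1; the header row is not counted.
    for row_no, row in enumerate(reader, start=1):
        name = (row["name"] or "").strip()
        uri = (row["drs_uri"] or "").strip()
        if not name:
            problems.append(f"row {row_no}: empty name")
        if not uri.startswith("drs://"):
            problems.append(f"row {row_no}: drs_uri does not start with drs://")
    return problems


if __name__ == "__main__":
    # Hypothetical manifest content; the second row is missing its drs_uri.
    sample = io.StringIO(
        "name,drs_uri\n"
        "example_R1.fastq.gz,drs://example.org/0000\n"
        "example_R2.fastq.gz,\n"
    )
    for problem in check_manifest(sample):
        print(problem)
```

To check a downloaded file instead of the in-memory sample, pass a handle opened with `open(path, newline="")`.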

# 1. Generating a Manifest File from the CDS Portal

To access HTAN sequence data within the CDS Portal, navigate to the portal in a web browser and click the Explore CDS Data button on the landing page.

Figure 1

On the Data Explorer page, expand the STUDY section on the left sidebar, scroll down, and check the box next to Human Tumor Atlas (HTAN) primary sequence data.

Figure 2

 

The summary panel then updates to reflect only the HTAN data.

Figure 3

 

To see the tabulated view of all participants, samples, or files in HTAN, scroll down or click the Collapse View tab at the upper right, just below the query summary line.

Figure 4

 

Click the Add All Files button. To add only a subset, check the boxes next to the Participants, Samples, or Files you want and then click the Add Selected button. Either action updates the cart icon in the upper right corner.

Figure 5

 

Clicking the cart icon brings up a list of the selected files. Click the Download Manifest button in the upper right to download this file list as a CSV-formatted (Excel-compatible) file.

Figure 6

 

# 2. Generating a Manifest File from the HTAN Data Portal

From the HTAN Data Portal, click CDS/SB-CGC (dbGaP) under the Data Access filter.

HTAN Portal: Accessing Genomic Data in CDS

Navigate to the Files tab, check the box next to Filename in the upper left, and then click Download selected files.

HTAN Portal: Selecting Genomic Files

Click Download Manifest, which will download a local file called cds_manifest.csv.

HTAN Portal: Download DRS Manifest

# 3. Generating a Manifest File from Google BigQuery (Coming Soon!)

# Accessing Data

Once you have your manifest, follow the instructions on SB-CGC's Import from a DRS server documentation page to import data from a manifest file.
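Before importing, it may help to tidy the manifest. The sketch below copies only the two required columns and skips rows whose drs_uri has already been seen. Nothing here says the import step needs this, so treat it as optional. It assumes the header names are exactly `name` and `drs_uri`; the function name `minimal_manifest` and the sample rows are hypothetical.

```python
import csv
import io


def minimal_manifest(in_handle, out_handle):
    """Write name and drs_uri to out_handle, skipping blank or repeated drs_uri values.

    Returns the number of data rows written.
    """
    reader = csv.DictReader(in_handle)
    writer = csv.DictWriter(
        out_handle, fieldnames=["name", "drs_uri"], lineterminator="\n"
    )
    writer.writeheader()

    seen = set()
    written = 0
    for row in reader:
        uri = (row.get("drs_uri") or "").strip()
        if not uri or uri in seen:
            continue  # nothing to import, or already listed
        seen.add(uri)
        writer.writerow({"name": (row.get("name") or "").strip(), "drs_uri": uri})
        written += 1
    return written


if __name__ == "__main__":
    # Hypothetical manifest with an extra column and one repeated drs_uri.
    source = io.StringIO(
        "name,drs_uri,size\n"
        "example_R1.fastq.gz,drs://example.org/0000,10\n"
        "example_R1.fastq.gz,drs://example.org/0000,10\n"
        "example_R2.fastq.gz,drs://example.org/0001,12\n"
    )
    trimmed = io.StringIO()
    kept = minimal_manifest(source, trimmed)
    print(f"{kept} rows kept")
    print(trimmed.getvalue(), end="")
```

For a real file such as cds_manifest.csv, open the input and output with `newline=""` and pass the two handles in place of the in-memory buffers.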