Neandertal Cntgs Track Settings
 
Neandertal Sequence Contigs Generated by Genotype Caller   (All Neandertal Assembly and Analysis tracks)

Display mode:       Reset to defaults   

Display read names
Minimum alignment quality:

Color track by bases: Help on base coloring

Alignment Gap/Insertion Display Options Help on display options
Draw double horizontal lines when both genome and query have an insertion
Draw a vertical purple line for an insertion at the beginning or end of the
query, orange for insertion in the middle of the query
Draw a vertical green line where query has a polyA tail insertion

Additional coloring modes:
Color by strand (blue for +, red for -)
Use gray for
No additional coloring

Display data as a density graph:

BAM configuration help

List subtracks: only selected/visible    all    ()
  Sample↓1   Track Name↓2  
 
hide
 All  Contigs Generated from All Data   schema 
 
hide
 Feld1  Feld1 Contigs   schema 
 
hide
 Mez1  Mez1 Contigs   schema 
 
hide
 Sid1253  Sid1253 Contigs   schema 
 
hide
 Vi33.16  Vi33.16 Contigs   schema 
 
hide
 Vi33.25  Vi33.25 Contigs   schema 
 
hide
 Vi33.26  Vi33.26 Contigs   schema 
    

Description

The Neandertal Sequence Contigs track shows consensus contigs called (after duplicate reads from each library were merged) from overlapping, non-redundant reads that passed mapping and base quality criteria.

Display Conventions and Configuration

The contigs (query sequences) from each of the six samples are contained in separate subtracks. Use the checkboxes to select which samples will be displayed in the browser. Click and drag the sample name to reorder the subtracks. The order in which the subtracks appear in the subtrack list will be the order in which they display in the browser.

The query sequences in the SAM/BAM alignment representation are normalized to the + strand of the reference genome (see the SAM Format Specification for more information on the SAM/BAM file format). If a query sequence was originally the reverse of what has been stored and aligned, it will have the following flag:

(0x10) Read is on '-' strand.

BAM/SAM alignment representations also have tags. Some tags are predefined and others (those beginning with X, Y or Z) are defined by the aligner or data submitter. The following tag is associated with this track:

  • AS: Alignment score generated by aligner

The item labels and display colors of features within this track can be configured through the controls at the top of the track description page.

  • Display Read Names: By default, read names are not displayed. To display the read names, select the check box next to "Display read names".
  • Minimum alignment quality: Excludes alignments with quality less than the given number. The default is 0.
  • Color track by bases: By default, mismatching bases are highlighted in the display. Change the selection to "item bases" to see all base values from the query sequence, or "OFF" to ignore query sequence. Click here for additional information.
  • Alignment Gap/Insertion Display Options: Click here for help with these options.
  • Additional coloring modes: Other aspects of the alignments can be displayed in color or grayscale.
    • Color by strand: Alignments on the reverse strand are colored dark red, alignments on the forward strand are colored dark blue.
    • Grayscale: Items are shaded according to the chosen method: alignment quality or base qualities. The alignment qualities of individual items are shaded on a scale of 0 (lightest) to 99 (darkest). Base qualities are shaded on a scale of 0 (lightest) to 40 (darkest). Alignment quality is the default.

Methods

All Neandertal sequence reads from each of the six samples were aligned to the human (hg18) genome using the short read aligner/mapper ANFO.

To reduce the effects of sequencing error, the alignments of Neandertal reads to the human and chimpanzee reference genomes were used to construct human-based and chimpanzee-based consensus "minicontigs". To generate the consensus, uniquely placed, overlapping alignments were selected (ANFO MAPQ ≥ 90) and these were merged into a single multi-sequence alignment using the common reference genome sequence.

At each position in the resulting alignment, for each observed base, and for each possible original base: i) The likelihood of the observation was calculated, ii) the likely length of single-stranded overhangs was estimated, and iii) the potential for ancient DNA damage using the Briggs-Johnson model was considered (Briggs et al. 2007). If most observations in a given position showed a gap, the consensus became a gap; otherwise the base with the highest quality score (calculated by dividing each likelihood by the total likelihood) was used as the consensus.

At the current coverage, heterozygous sites will appear as low quality bases with the second base (not shown) having a similar likelihood to the consensus base. Likewise, heterozygous indels are included only by chance or may show up as stretches of low quality bases.

Credits

This track was produced at UCSC using data generated by Ed Green.

Reference

Briggs AW, Stenzel U, Johnson PL, Green RE, Kelso J, Prüfer K, Meyer M, Krause J, Ronan MT, Lachmann M et al. Patterns of damage in genomic DNA sequences from a Neandertal. Proc Natl Acad Sci U S A. 2007 Sep 11;104(37):14616-21. PMID: 17715061; PMC: PMC1976210

Green RE, Krause J, Briggs AW, Maricic T, Stenzel U, Kircher M, Patterson N, Li H, Zhai W, Fritz MH et al. A draft sequence of the Neandertal genome. Science. 2010 May 7;328(5979):710-22. PMID: 20448178