Description
This track displays transcriptome data from tiling GeneChips produced by
Affymetrix. For the
complete non-repetitive portion of the human genome, more than 256 million
probe pairs were tiled every 5 bp in non-repeat-masked areas and hybridized to
cytosolic polyA+ long RNA (>200 nucleotides) from 8 different cell lines.
Note that the female cell lines HeLa, SK-N-AS, and U87MG do not contain data
for chrY. For HeLa and HepG2,
samples were also produced for nuclear polyA+ RNA in addition to cytosolic
polyA+ RNA. For experimental details and results, see Kapranov et al.
in the References section below.
This track contains the signal level estimated
for each mapped probe position after hybridization with long RNA samples.
A separate track -- "Affy Tx lRNA Reg " -- contains transcribed
fragments (transfrags) representing long RNAs (see Kapranov et al. in
the References section below). While the raw data are based on perfect match
minus mismatch (PM - MM) probe values and may contain negative values, the track
has a minimum value of zero for visualization purposes. Likewise, the probes
with high signal values are cut off at the intensity of 150 by default and will
appear to have similar magnitude.
Display Conventions and Configuration
This annotation follows the display conventions for composite tracks. The
subtracks within this annotation may be configured in a variety of ways to
highlight different aspects of the displayed data. The graphical configuration
options are shown at the top of the track description page, followed by a list
of subtracks. For more information about the graphical configuration options,
click the Graph
configuration help link. To display only selected subtracks, uncheck the
boxes next to the tracks you wish to hide.
Methods
For each data point, probes within 30 bp on either side were used to improve
the estimate of expression level for a particular probe. This helped to smooth
the data and produce a more robust estimate of the transcription level at a
particular genomic location. The following analysis steps were performed:
-
Replicate arrays were quantile-normalized and the median intensity (using both
PM and MM intensities) of each array was scaled to a target value of 50.
-
The expression level was estimated for each mapped probe position by
collecting all the probe pairs that fell within a window of ± 30
bp, calculating all non-redundant pairwise averages of PM - MM values
of all probe pairs in the window, and taking the median of all resulting
pairwise averages.
-
The resulting signal value is the Hodges-Lehmann estimator associated with the
Wilcoxon signed-rank statistic of the PM - MM values that lie within ± 30 bp of
the sliding window centered at every genomic coordinate.
-
Transfrags in the track labeled "Affy Tx lRNA Reg" were
determined by connecting adjacent positive probes with a maximum allowable gap
of 11 nucleotides and a minimum run of 49 nucleotides (at least 11 probes with
at most one negative probe between two positive probes). Transfrags were
discarded if they overlapped pseudogenes or harbored low-complexity or
repetitive sequences.
Credits
Data generation and analysis were performed by the transcriptome group at
Affymetrix with assistance from colleagues at the University of
Leipzig, Fraunhofer Institute, and University of Vienna: P. Kapranov, J. Cheng,
S. Dike, D.A. Nix, R. Duttagupta, A.T. Willingham, P.F. Stadler, J. Hertel,
J. Hackermüller, I.L. Hofacker, I. Bell, E. Cheung, J. Drenkow, E. Dumais,
S. Patel, G. Helt, M. Ganesh, S. Ghosh, A. Piccolboni, V. Sementchenko,
H. Tammana, T.R. Gingeras.
Questions or comments about this annotation? Email
genome@soe.ucsc.edu.
References
Kapranov P, Cheng J, Dike S, Nix DA, Duttagupta R, Willingham AT, Stadler PF, Hertel J, Hackermüller J, Hofacker IL, et al.
RNA maps reveal new RNA classes and a possible function for
pervasive transcription.
Science. 2007 Jun 8;316(5830):1484-8.
|
|