Project description:Small RNA sequencing has been performed using eight commonly used breast cancer cell lines by Illumina as part of the Illumina iDEA Challange 2011. Total RNA of the cell lines was isolated and the sequencing library was prepared using Illuminas Small RNA v1.5 library prep protocol. This data was analyzed with Flicker, an add-on to the Illumina Genome Analyzer Pipeline software that allows for the preliminary analysis of small RNA runs on the GA. Flicker was used in this case for: * Trimming: Attempts to trim the adaptor sequence from small RNA reads Trimming is accomplished by aligning the adaptor with each read, both internally (the adaptor is fully contained within the read) and as a suffix (the adaptor overlaps the 3' end of the read). A similarity matrix is used to generate the matching score, taking into account both substitutions and indels. The alignment with the best matching score is returned, and that best alignment is trimmed from the read. A trimming threshold may be specified, and the adaptor is not trimmed unless the best score exceeds the threshold. * Aligning with Iterated ELAND: Attempts to aligns trimmed reads to targets (squashed genomes or databases such as miRBase) using the ELAND short read aligner. Each read is aligned by ELAND with specified alignment targets. ELAND was designed to align reads of a uniform length, and the trimmed reads from a small RNA run are several different lengths. We currently address this by running ELAND for reads of each length: reads are binned by length, and each bin of a given length is aligned against a specified target. Once all bins are aligned against all targets (in this case, gDNA and miRBase), the results are recombined into a single file with all the annotation information. ===== Key to Experiments HCT ID, Lane, Sample ID, Cell Line, ER +/- Status, Reads HCT20061, Lane 1, HS378, MCF7, +, 14,555,390 HCT20062, Lane 2, HS379, MDA-MB-231, -, 15,147,037 HCT20063, Lane 3, HS381, T47D, +, 14,701,254 HCT20064, Lane 4, HS377, BT-20, -, 16,529,823 HCT20065, Lane 5, HS375, BT-474, +, 16,443,361 HCT20066, Lane 6, HS380, MDA-MB-468, -, 16,746,981 HCT20067, Lane 7, HS382, ZR-75-1, +, 17,631,005 HCT20068, Lane 8, HS376, MCF10A, normal, 19,155,607
2016-03-14 | E-MTAB-4539 | biostudies-arrayexpress