Basic Statistics
Measure | Value |
---|---|
Filename | SRR2031741_1.fastq |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 976727 |
Sequences flagged as poor quality | 0 |
Sequence length | 101 |
%GC | 38 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
CTTATACACATCTCCGAGCCCACGAGACGTAGAGGAATCTCGTATGCCGT | 3917 | 0.4010332467516512 | No Hit |
GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 1926 | 0.19718918387635437 | No Hit |
GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 1801 | 0.18439133964761903 | No Hit |
ATACACATCTCCGAGCCCACGAGACGTAGAGGAATCTCGTATGCCGTCTT | 1550 | 0.15869326843631845 | TruSeq Adapter, Index 3 (95% over 21bp) |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
GGTATCA | 1310 | 0.0 | 54.029133 | 1 |
AATACCG | 65 | 2.5900026E-6 | 43.843624 | 5 |
CGGAGAT | 80 | 2.2613312E-7 | 41.564358 | 1 |
AGCGTAA | 50 | 0.001616155 | 37.997807 | 8 |
GTATCAA | 2355 | 0.0 | 37.315895 | 1 |
ATCAACG | 2585 | 0.0 | 33.993977 | 3 |
TCAACGC | 2595 | 0.0 | 33.67994 | 4 |
CAACGCA | 2730 | 0.0 | 32.012806 | 5 |
AACGCAG | 2750 | 0.0 | 31.952703 | 6 |
ATACCGT | 90 | 2.4135465E-5 | 31.66484 | 6 |
TATCAAC | 2830 | 0.0 | 30.883192 | 2 |
ACGCAGA | 2985 | 0.0 | 29.278046 | 7 |
CGCAGAG | 3000 | 0.0 | 29.131653 | 8 |
GTGGTAT | 660 | 0.0 | 27.349707 | 1 |
CGTATAC | 90 | 8.946921E-4 | 26.388716 | 3 |
GCAGAGT | 3445 | 0.0 | 24.955019 | 9 |
TTTACGC | 50 | 0.0016546139 | 23.74863 | 38-39 |
GAGTACT | 2445 | 0.0 | 22.243092 | 12-13 |
CAGAGTA | 3505 | 0.0 | 21.614304 | 10-11 |
TTAGGCG | 165 | 6.324277E-5 | 20.150354 | 9 |