Basic Statistics
| Measure | Value |
|---|---|
| Filename | SRR2031507_1.fastq |
| File type | Conventional base calls |
| Encoding | Sanger / Illumina 1.9 |
| Total Sequences | 1792678 |
| Sequences flagged as poor quality | 0 |
| Sequence length | 101 |
| %GC | 44 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
| Sequence | Count | Percentage | Possible Source |
|---|---|---|---|
| CTTATACACATCTCCGAGCCCACGAGACCAGAGAGGATCTCGTATGCCGT | 6310 | 0.35198736192445046 | No Hit |
| GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 2779 | 0.15501947365896163 | No Hit |
| GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 2143 | 0.11954182513535616 | No Hit |
| ATACACATCTCCGAGCCCACGAGACCAGAGAGGATCTCGTATGCCGTCTT | 2048 | 0.11424249084330818 | RNA PCR Primer, Index 23 (95% over 23bp) |
Adapter Content
Kmer Content
| Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
|---|---|---|---|---|
| GGTATCA | 1950 | 0.0 | 54.411526 | 1 |
| GTATCAA | 3530 | 0.0 | 39.4924 | 1 |
| TCAACGC | 3925 | 0.0 | 34.610466 | 4 |
| CAACGCA | 4015 | 0.0 | 33.952946 | 5 |
| ATCAACG | 4130 | 0.0 | 33.007523 | 3 |
| AACGCAG | 4220 | 0.0 | 32.64124 | 6 |
| TATCAAC | 4350 | 0.0 | 31.665756 | 2 |
| ACGCAGA | 4675 | 0.0 | 29.15959 | 7 |
| GTACATG | 4565 | 0.0 | 28.34972 | 1 |
| CGCAGAG | 4805 | 0.0 | 27.975262 | 8 |
| TACATGG | 4795 | 0.0 | 26.547726 | 2 |
| ACATGGG | 4820 | 0.0 | 25.030401 | 3 |
| GACCGTC | 240 | 2.2919266E-10 | 23.749317 | 7 |
| GCAGAGT | 5925 | 0.0 | 22.526777 | 9 |
| CATGGGG | 2890 | 0.0 | 22.516653 | 4 |
| CAGAGTA | 5420 | 0.0 | 20.813515 | 10-11 |
| GTATTAC | 500 | 0.0 | 19.983425 | 1 |
| GAGTACT | 3485 | 0.0 | 19.62641 | 12-13 |
| AGAGTAC | 5155 | 0.0 | 19.303518 | 10-11 |
| TATCTCG | 125 | 0.006033433 | 18.999453 | 5 |