Basic Statistics
| Measure | Value |
|---|---|
| Filename | SRR616820_1.fastq.gz |
| File type | Conventional base calls |
| Encoding | Sanger / Illumina 1.9 |
| Total Sequences | 41248610 |
| Sequences flagged as poor quality | 0 |
| Sequence length | 51 |
| %GC | 45 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
| Sequence | Count | Percentage | Possible Source |
|---|---|---|---|
| AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA | 154592 | 0.3747811138363208 | No Hit |
| TTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 127300 | 0.30861646004556276 | No Hit |
| TATAGAATTCGCGGCCGCTCGCGATTTTTTTTTTTTTTTTTTTTTTTTTTT | 105321 | 0.25533224028639995 | No Hit |
| GATCGGAAGAGCACACGTCTGAACTCCAGTCACCTTGTAATCTCGTATGCC | 64073 | 0.1553337191241111 | TruSeq Adapter, Index 12 (100% over 51bp) |
Adapter Content
Kmer Content
| Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
|---|---|---|---|---|
| ACACGTC | 11840 | 0.0 | 29.872858 | 13 |
| AATCTCG | 11150 | 0.0 | 29.15868 | 39 |
| CTCGTAT | 11275 | 0.0 | 28.336494 | 42 |
| TCTCGTA | 11290 | 0.0 | 28.21913 | 41 |
| ACGTCTG | 12795 | 0.0 | 27.67836 | 15 |
| TCGTATG | 12005 | 0.0 | 27.606726 | 43 |
| CACGTCT | 12985 | 0.0 | 27.29069 | 14 |
| CACACGT | 13160 | 0.0 | 27.081654 | 12 |
| CGTCTGA | 13295 | 0.0 | 26.552746 | 16 |
| CGTATGC | 12655 | 0.0 | 26.36655 | 44 |
| ATCTCGT | 12705 | 0.0 | 25.058561 | 40 |
| CGCGATT | 40690 | 0.0 | 24.556553 | 20 |
| GCACACG | 15045 | 0.0 | 24.122234 | 11 |
| GCGATTT | 42855 | 0.0 | 23.368477 | 21 |
| CGATTTT | 43260 | 0.0 | 23.128899 | 22 |
| GTATGCC | 14620 | 0.0 | 23.022821 | 45 |
| ATAGAAT | 49965 | 0.0 | 22.704466 | 2 |
| GTAATCT | 14500 | 0.0 | 22.422018 | 37 |
| TATAGAA | 52420 | 0.0 | 21.766228 | 1 |
| TAATCTC | 16140 | 0.0 | 20.213396 | 38 |