Basic Statistics
| Measure | Value |
|---|---|
| Filename | SRR2049368_1.fastq.gz |
| File type | Conventional base calls |
| Encoding | Sanger / Illumina 1.9 |
| Total Sequences | 6487389 |
| Sequences flagged as poor quality | 0 |
| Sequence length | 100 |
| %GC | 43 |
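
The Encoding row (Sanger / Illumina 1.9) indicates that quality characters in the FASTQ file use the Phred+33 offset. As a minimal sketch of what that means, the following decodes a quality string into Phred scores and converts a score to an error probability. The function names are hypothetical and are not part of FastQC.

```python
def phred33_scores(quality):
    """Decode a Phred+33 quality string into a list of integer scores."""
    # Each character's ASCII code minus 33 gives the Phred score.
    return [ord(ch) - 33 for ch in quality]


def error_probability(q):
    """Convert a Phred score Q to the base-call error probability 10^(-Q/10)."""
    return 10 ** (-q / 10)


if __name__ == "__main__":
    # Illustrative quality string, not taken from SRR2049368_1.fastq.gz.
    scores = phred33_scores("!5I")
    print(scores)
    print([error_probability(q) for q in scores])
```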
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
| Sequence | Count | Percentage | Possible Source |
|---|---|---|---|
| CTTATACACATCTCCGAGCCCACGAGACCGTACTAGATCTCGTATGCCGT | 17514 | 0.26996993705788264 | No Hit |
| GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 9974 | 0.15374444171607407 | No Hit |
| GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 9794 | 0.15096982776892212 | No Hit |
| ATACACATCTCCGAGCCCACGAGACCGTACTAGATCTCGTATGCCGTCTT | 9140 | 0.14088873042760347 | TruSeq Adapter, Index 11 (95% over 21bp) |
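
The Percentage column above is consistent with each Count divided by the Total Sequences value from Basic Statistics (6487389), multiplied by 100. A short sketch that reproduces the reported figures under that assumption follows; the helper name is hypothetical.

```python
# Total Sequences as reported in the Basic Statistics table.
TOTAL_SEQUENCES = 6487389


def percent_of_total(count, total=TOTAL_SEQUENCES):
    """Return count as a percentage of the total number of sequences."""
    return 100.0 * count / total


if __name__ == "__main__":
    # Counts copied from the Overrepresented sequences table.
    for count in (17514, 9974, 9794, 9140):
        print(count, percent_of_total(count))
```

Summing the four counts the same way gives the share of all reads taken up by the listed overrepresented sequences.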
Adapter Content
Kmer Content
| Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
|---|---|---|---|---|
| GGTATCA | 9225 | 0.0 | 51.622963 | 1 |
| GTATCAA | 16095 | 0.0 | 38.64281 | 1 |
| ATCAACG | 19250 | 0.0 | 31.762096 | 3 |
| TCAACGC | 19345 | 0.0 | 31.581823 | 4 |
| CAACGCA | 19950 | 0.0 | 30.600521 | 5 |
| TATCAAC | 20650 | 0.0 | 29.972866 | 2 |
| AACGCAG | 20725 | 0.0 | 29.412685 | 6 |
| GTGGTAT | 4970 | 0.0 | 28.849829 | 1 |
| TGGTATC | 4640 | 0.0 | 27.954624 | 2 |
| ACGCAGA | 23825 | 0.0 | 25.427704 | 7 |
| CGCAGAG | 24115 | 0.0 | 24.985388 | 8 |
| GTACATG | 15315 | 0.0 | 22.500221 | 1 |
| GCAGAGT | 26795 | 0.0 | 22.029984 | 9 |
| TACATGG | 15175 | 0.0 | 21.214092 | 2 |
| GAGTACT | 18665 | 0.0 | 19.67281 | 12-13 |
| AGAGTAC | 24665 | 0.0 | 19.622686 | 10-11 |
| ACATGGG | 15685 | 0.0 | 19.53555 | 3 |
| CAGAGTA | 26095 | 0.0 | 19.5207 | 10-11 |
| AGTACTT | 19650 | 0.0 | 18.112429 | 12-13 |
| GTACTTT | 20685 | 0.0 | 17.55115 | 14-15 |