Basic Statistics
Measure | Value |
---|---|
Filename | SRR1033095_1.fastq.gz |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 1013500 |
Sequences flagged as poor quality | 0 |
Sequence length | 100 |
%GC | 48 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 3354 | 0.33093241243216576 | No Hit |
GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 3350 | 0.33053774050320667 | No Hit |
CTTATACACATCTCCGAGCCCACGAGACTAGGCATGATCTCGTATGCCGT | 1664 | 0.16418352244696596 | No Hit |
TATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT | 1573 | 0.15520473606314752 | No Hit |
GAGTAAGCAGTGGTATCAACGCAGAGTAAGCAGTGGTATCAACGCAGAGT | 1167 | 0.11514553527380365 | No Hit |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
GGTATCA | 3675 | 0.0 | 38.617847 | 1 |
GTATCAA | 5565 | 0.0 | 28.891369 | 1 |
CCGTTAA | 155 | 4.9278242E-8 | 27.377243 | 1 |
CGTTAAT | 155 | 4.9538357E-8 | 27.362349 | 2 |
GTCGTAA | 90 | 9.341863E-4 | 26.194277 | 1 |
TCAACGC | 6285 | 0.0 | 24.97613 | 4 |
GTATATA | 95 | 0.0012805243 | 24.81563 | 1 |
TATCAAC | 6610 | 0.0 | 24.310587 | 2 |
ATCAACG | 6490 | 0.0 | 24.187208 | 3 |
AACGCAG | 6585 | 0.0 | 23.837088 | 6 |
GTACATG | 3725 | 0.0 | 23.796358 | 1 |
CAACGCA | 6625 | 0.0 | 23.764105 | 5 |
CGTAAAC | 100 | 0.0017588641 | 23.499247 | 3 |
CGACCGT | 70 | 1.9331073E-5 | 23.498667 | 18-19 |
TACATGG | 3785 | 0.0 | 23.157389 | 2 |
ATTAACG | 205 | 2.903107E-8 | 22.987337 | 2 |
CTATACA | 150 | 3.2916905E-5 | 21.93263 | 4 |
ACATGGG | 4020 | 0.0 | 21.161015 | 3 |
CCGGTGC | 135 | 3.990007E-4 | 20.955421 | 1 |
ACGCAGA | 7500 | 0.0 | 20.928963 | 7 |