Basic Statistics
Measure | Value |
---|---|
Filename | SRR4062075_1.fastq |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 2031537 |
Sequences flagged as poor quality | 0 |
Sequence length | 50 |
%GC | 51 |
Per base sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
GCGCAAGACGGACCAGAGCGAAAGCATTTGCCAAGAATGTTTTCATTAAT | 2210 | 0.10878462956864679 | No Hit |
TCGTAGTTCCGACCATAAACGATGCCGACTGGCGATGCGGCGGCGTTATT | 2098 | 0.10327156236878776 | No Hit |
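
The Percentage column above appears to be each sequence's Count divided by Total Sequences (2031537, from Basic Statistics), times 100. A minimal sketch of that arithmetic follows, assuming this is how the value is derived; the helper name `percentage_of_total` is hypothetical and is not part of FastQC.

```python
# Total Sequences, copied from the Basic Statistics table of this report.
TOTAL_SEQUENCES = 2031537


def percentage_of_total(count: int, total: int = TOTAL_SEQUENCES) -> float:
    """Return count as a percentage of total (hypothetical helper)."""
    return count / total * 100


# Counts copied from the Overrepresented sequences table.
for count in (2210, 2098):
    print(count, percentage_of_total(count))
```

Both sequences come out just above 0.1% of all reads, which is consistent with them being listed in this table.
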
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
GTATCAA | 2885 | 0.0 | 29.519653 | 1 |
GGTATCA | 1025 | 0.0 | 28.125067 | 1 |
TCAACGC | 4385 | 0.0 | 19.314684 | 4 |
GGCGTTA | 660 | 0.0 | 19.002127 | 42 |
TAGATCG | 70 | 8.129224E-4 | 18.856005 | 5 |
ATCAACG | 4545 | 0.0 | 18.68314 | 3 |
CAACGCA | 4535 | 0.0 | 18.675829 | 5 |
AACGCAG | 4575 | 0.0 | 18.560627 | 6 |
TATCAAC | 4570 | 0.0 | 18.534166 | 2 |
CAACGCG | 145 | 6.31735E-9 | 18.206245 | 21 |
AGCGCTA | 170 | 1.2732926E-10 | 18.11789 | 14 |
ATTACTC | 100 | 2.485652E-5 | 17.598936 | 3 |
TCGATCG | 190 | 3.274181E-11 | 17.369509 | 41 |
CGTTATT | 725 | 0.0 | 17.297634 | 44 |
AATCGGT | 80 | 0.001989127 | 16.499815 | 19 |
ATACCGT | 670 | 0.0 | 16.41692 | 6 |
TACCGTC | 660 | 0.0 | 16.332346 | 7 |
CCTATTC | 515 | 0.0 | 16.23203 | 3 |
GGACCGT | 150 | 1.8406536E-7 | 16.132359 | 6 |
CCGCCTA | 165 | 3.1113814E-8 | 16.001003 | 41 |