FastQC Report
Wed 25 May 2016
SRR1033072_1.fastq.gz

Summary

[OK] Basic Statistics

Measure                            Value
Filename                           SRR1033072_1.fastq.gz
File type                          Conventional base calls
Encoding                           Sanger / Illumina 1.9
Total Sequences                    4599382
Sequences flagged as poor quality  0
Sequence length                    100
%GC                                42

[OK] Per base sequence quality

Per base quality graph

[OK] Per tile sequence quality

Per tile quality graph

[OK] Per sequence quality scores

Per Sequence quality graph

[FAIL] Per base sequence content

Per base sequence content

[WARN] Per sequence GC content

Per sequence GC content graph

[OK] Per base N content

N content graph

[OK] Sequence Length Distribution

Sequence length distribution

[WARN] Sequence Duplication Levels

Duplication level graph

[WARN] Overrepresented sequences

Sequence                                            Count  Percentage           Possible Source
GTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT  5618   0.12214684494569053  No Hit
GGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT  4939   0.10738399202327618  No Hit
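
The Percentage values above can be reproduced from the counts and the Total Sequences figure in Basic Statistics. The sketch below assumes the percentage is simply the count divided by the total number of reads, which matches the numbers in this report; the function name `percentage_of_total` is made up for illustration and is not part of FastQC.

```python
# "Total Sequences" as reported under Basic Statistics.
TOTAL_SEQUENCES = 4_599_382


def percentage_of_total(count: int, total: int = TOTAL_SEQUENCES) -> float:
    """Return `count` as a percentage of all reads in the file.

    Assumption: Percentage = 100 * Count / Total Sequences.
    """
    return 100.0 * count / total


# Counts copied from the Overrepresented sequences table.
for count in (5618, 4939):
    print(f"{count:>6}  {percentage_of_total(count):.6f}%")
```

Both sequences come out at roughly 0.1% of reads each, which is the scale at which this module is marked [WARN] in this report.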

[OK] Adapter Content

Adapter graph

[FAIL] Kmer Content

Kmer graph

Sequence  Count  PValue        Obs/Exp Max  Max Obs/Exp Position
GGTATCA   5350   0.0           47.582264    1
GTATCAA   10170  0.0           43.155273    1
TCAACGC   11515  0.0           37.386314    4
ATCAACG   11615  0.0           37.106106    3
CAACGCA   11835  0.0           36.414364    5
AACGCAG   12290  0.0           35.142715    6
TATCAAC   13010  0.0           33.284325    2
ACGCAGA   14140  0.0           30.345413    7
CGCAGAG   14180  0.0           30.160381    8
GCAGAGT   16080  0.0           26.362839    9
GTACATG   8900   0.0           25.636604    1
TACATGG   9245   0.0           23.445118    2
AGAGTAC   15265  0.0           21.628124    10-11
ACATGGG   9585   0.0           21.428146    3
GAGTACT   11970  0.0           20.691204    12-13
GTGGTAT   3015   0.0           20.326433    1
CAGAGTA   16110  0.0           20.187376    10-11
AGTACTT   13145  0.0           19.234947    12-13
TATACCG   270    2.6024281E-8  19.146994    5
TAGGGCG   240    3.424846E-6   17.623938    5
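
Many of the enriched k-mers above look like consecutive 7-base windows of the first overrepresented sequence (GTATCAACGCAGAGTACTT...). The sketch below is one way to cross-check that reading: it looks up each k-mer in the first 19 bases of that sequence and compares the 1-based offset with the reported Max Obs/Exp Position. The helper names are hypothetical, and the check only shows that the two tables are consistent with each other; it does not identify where the sequence comes from.

```python
# First 19 bases of the top entry in the Overrepresented sequences table.
OLIGO_PREFIX = "GTATCAACGCAGAGTACTT"

# K-mer -> reported "Max Obs/Exp Position", copied from the Kmer Content table.
REPORTED = {
    "GTATCAA": "1", "TATCAAC": "2", "ATCAACG": "3", "TCAACGC": "4",
    "CAACGCA": "5", "AACGCAG": "6", "ACGCAGA": "7", "CGCAGAG": "8",
    "GCAGAGT": "9", "CAGAGTA": "10-11", "AGAGTAC": "10-11",
    "GAGTACT": "12-13", "AGTACTT": "12-13",
}


def first_position(kmer, sequence):
    """Return the 1-based offset of `kmer` in `sequence`, or None if absent."""
    index = sequence.find(kmer)
    return index + 1 if index >= 0 else None


def in_reported_bin(position, reported):
    """True if `position` falls inside a reported position such as '4' or '10-11'."""
    low, _, high = reported.partition("-")
    return int(low) <= position <= int(high or low)


for kmer, reported in REPORTED.items():
    position = first_position(kmer, OLIGO_PREFIX)
    print(kmer, position, reported, in_reported_bin(position, reported))

# K-mers such as GTACATG do not occur in this prefix at all.
print("GTACATG", first_position("GTACATG", OLIGO_PREFIX))
```

The remaining k-mers in the table (for example GGTATCA, GTACATG, TACATGG) are not windows of this prefix and would need to be looked at separately.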