FastQC Report
Thu 26 May 2016
SRR1512820_1.fastq.gz

[OK] Basic Statistics

Measure                              Value
Filename                             SRR1512820_1.fastq.gz
File type                            Conventional base calls
Encoding                             Sanger / Illumina 1.9
Total Sequences                      2050010
Sequences flagged as poor quality    0
Sequence length                      25
%GC                                  44
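
The Encoding row (Sanger / Illumina 1.9) corresponds to Phred+33 quality characters, where each quality score is the ASCII code of the character minus 33. The following is a minimal sketch of that decoding, not FastQC's own code; the function name `decode_quality` and the example quality string are made up for illustration.

```python
# Phred+33: quality score = ASCII code of the character minus 33.
PHRED_OFFSET = 33

def decode_quality(quality_string, offset=PHRED_OFFSET):
    """Convert a FASTQ quality string into a list of integer Phred scores."""
    return [ord(ch) - offset for ch in quality_string]

# Hypothetical quality string, not taken from SRR1512820_1.fastq.gz.
example = "IIIIHHGG#"
print(decode_quality(example))
```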

[OK] Per base sequence quality

Per base quality graph

[WARN] Per tile sequence quality

Per tile quality graph

[OK] Per sequence quality scores

Per Sequence quality graph

[FAIL] Per base sequence content

Per base sequence content

[OK] Per sequence GC content

Per sequence GC content graph

[OK] Per base N content

N content graph

[OK] Sequence Length Distribution

Sequence length distribution

[OK] Sequence Duplication Levels

Duplication level graph

[WARN] Overrepresented sequences

Sequence                     Count    Percentage             Possible Source
GTATCAACGCAGAGTACTTTTTTTT    5954     0.29043760762142623    No Hit
GGTATCAACGCAGAGTACTTTTTTT    3833     0.18697469768440153    No Hit
TATCAACGCAGAGTACTTTTTTTTT    3616     0.1763893834664221     No Hit
GTCCTACAGTGGACATTTCTAAATT    3327     0.162291891259067      No Hit
CTGTAGGACGTGGAATATGGCAAGA    3124     0.15238950053902175    No Hit
GTCCTAAAGTGTGTATTTCTCATTT    2730     0.1331700820971605     No Hit
CTTTAGGACGTGAAATATGGCGAGG    2691     0.13126765235291535    No Hit
GTCCTACAGTGTGCATTTCTCATTT    2221     0.10834093492226868    No Hit
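
The Percentage column can be checked against Basic Statistics: each value is the sequence count divided by Total Sequences (2050010), times 100. A minimal sketch of that check, using only the counts listed above; the names `overrepresented` and `percentage` are illustrative and not part of FastQC.

```python
# "Total Sequences" from the Basic Statistics section.
TOTAL_SEQUENCES = 2050010

# Sequence -> count, copied from the Overrepresented sequences table.
overrepresented = {
    "GTATCAACGCAGAGTACTTTTTTTT": 5954,
    "GGTATCAACGCAGAGTACTTTTTTT": 3833,
    "TATCAACGCAGAGTACTTTTTTTTT": 3616,
    "GTCCTACAGTGGACATTTCTAAATT": 3327,
    "CTGTAGGACGTGGAATATGGCAAGA": 3124,
    "GTCCTAAAGTGTGTATTTCTCATTT": 2730,
    "CTTTAGGACGTGAAATATGGCGAGG": 2691,
    "GTCCTACAGTGTGCATTTCTCATTT": 2221,
}

def percentage(count, total=TOTAL_SEQUENCES):
    """Share of all reads, in percent, accounted for by one sequence."""
    return 100.0 * count / total

for seq, count in overrepresented.items():
    print(f"{seq}\t{count}\t{percentage(count):.5f}")

# Combined share of the reads taken up by all listed sequences.
combined = sum(percentage(c) for c in overrepresented.values())
print(f"combined\t{combined:.3f}")
```

Every listed sequence accounts for more than 0.1% of the reads, and none reaches 1%.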

[OK] Adapter Content

Adapter graph

[FAIL] Kmer Content

Kmer graph

Sequence    Count    PValue           Obs/Exp Max    Max Obs/Exp Position
TAGGACC     755      0.0              12.970762      4
TCCAACG     215      0.0              12.806814      18
ACCGTAT     60       4.1004733E-4     12.663933      8
CTCGAAC     175      0.0              12.478757      18
CCAACGA     135      3.7471182E-10    11.956297      19
TCGAACT     180      0.0              11.604641      19
GGCGAGG     675      0.0              11.53431       19
TGTAGGA     2050     0.0              11.457851      2
GGACCGT     75       2.0781206E-4     11.397819      6
ACAGCGC     75       2.0785916E-4     11.39754       8
GGTATCA     1495     0.0              11.202309      1
AGGACCG     85       5.2808122E-5     11.185523      5
CGGACAT     60       0.0058861896     11.080941      10
GCCTCGA     180      3.6379788E-12    11.077428      16
CTGTAGG     2040     0.0              11.054885      1
GTAGGAC     2040     0.0              10.999098      3
GACGTGG     910      0.0              10.959173      7
CTAGGAC     300      0.0              10.775387      3
GGACGTG     1670     0.0              10.692564      6
TTGGACA     295      0.0              10.635725      4
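
Several of the enriched k-mers are substrings of the overrepresented sequences reported earlier, and their Max Obs/Exp Position matches where the k-mer starts within those reads. The sketch below cross-checks a few rows, assuming positions are 1-based start offsets (an assumption that is consistent with the rows checked here). The helper `kmer_start` and the variable names are hypothetical, not FastQC functions.

```python
# Sequences copied from the Overrepresented sequences section.
overrepresented = [
    "GTATCAACGCAGAGTACTTTTTTTT",
    "GGTATCAACGCAGAGTACTTTTTTT",
    "TATCAACGCAGAGTACTTTTTTTTT",
    "GTCCTACAGTGGACATTTCTAAATT",
    "CTGTAGGACGTGGAATATGGCAAGA",
    "GTCCTAAAGTGTGTATTTCTCATTT",
    "CTTTAGGACGTGAAATATGGCGAGG",
    "GTCCTACAGTGTGCATTTCTCATTT",
]

# (k-mer, reported Max Obs/Exp Position) pairs copied from the Kmer Content table.
kmers = [
    ("GGTATCA", 1),
    ("CTGTAGG", 1),
    ("TGTAGGA", 2),
    ("GTAGGAC", 3),
    ("GGACGTG", 6),
    ("GACGTGG", 7),
]

def kmer_start(read, kmer):
    """Return the 1-based start of the first occurrence of kmer in read, or None."""
    idx = read.find(kmer)
    return idx + 1 if idx >= 0 else None

# For each k-mer, collect the distinct start positions seen across the reads.
results = {}
for kmer, reported in kmers:
    starts = {kmer_start(read, kmer) for read in overrepresented}
    results[kmer] = sorted(starts - {None})
    print(kmer, "reported:", reported, "found at:", results[kmer])
```

Agreement between the two columns suggests that these k-mer enrichments come from the same reads that trigger the Overrepresented sequences warning, rather than from an independent positional bias.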