bioinformatics-toolkit-0.9.0: A collection of bioinformatics tools

Safe Haskell: None
Language: Haskell2010

Bio.Data.Fastq

Documentation

data Fastq

A FASTQ file normally uses four lines per sequence.

  • Line 1 begins with a '@' character and is followed by a sequence identifier and an optional description (like a FASTA title line).
  • Line 2 is the raw sequence letters.
  • Line 3 begins with a '+' character and is optionally followed by the same sequence identifier (and any description) again.
  • Line 4 encodes the quality values for the sequence in Line 2, and must contain the same number of symbols as letters in the sequence.
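
The four-line layout above can be checked with a few lines of plain Haskell. This is only a sketch: `Record` and `parseRecord` are hypothetical names introduced for illustration, not part of this module, and the fields of the library's own `Fastq` type are not shown on this page.

```haskell
-- Hypothetical record type for illustration only; it is NOT the library's Fastq.
data Record = Record
  { recHeader :: String  -- text after the leading '@' on line 1
  , recSeq    :: String  -- raw sequence letters from line 2
  , recQual   :: String  -- quality symbols from line 4
  } deriving (Eq, Show)

-- Validate and parse exactly four lines into a Record.
parseRecord :: [String] -> Either String Record
parseRecord [l1, l2, l3, l4] =
  case (l1, l3) of
    ('@' : hdr, '+' : _)
      | length l2 == length l4 -> Right (Record hdr l2 l4)
      | otherwise              -> Left "sequence and quality lengths differ"
    ('@' : _, _) -> Left "line 3 must begin with '+'"
    _            -> Left "line 1 must begin with '@'"
parseRecord _ = Left "a record needs exactly four lines"
```

For example, `parseRecord ["@r1 desc", "ACGT", "+", "IIII"]` yields a `Right` value, while a quality line of a different length than the sequence is rejected.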

Instances

Eq Fastq

Defined in Bio.Data.Fastq

Methods

(==) :: Fastq -> Fastq -> Bool

(/=) :: Fastq -> Fastq -> Bool

Show Fastq

Defined in Bio.Data.Fastq

Methods

showsPrec :: Int -> Fastq -> ShowS

show :: Fastq -> String

showList :: [Fastq] -> ShowS

streamFastqGzip :: (PrimMonad m, MonadThrow m, MonadResource m) => FilePath -> ConduitT i Fastq m ()

Read a gzipped FASTQ file and stream its records as Fastq values.
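
A possible way to run this source, going only by the type signatures on this page: compose it with a consumer such as qualitySummary and run the pipeline in `ResourceT IO`. The file name `reads.fastq.gz` is a placeholder, and the sketch assumes the `conduit` and `bioinformatics-toolkit` packages are installed.

```haskell
import Bio.Data.Fastq (qualitySummary, streamFastqGzip)
import Conduit (runConduitRes, (.|))

main :: IO ()
main = do
  -- "reads.fastq.gz" is a placeholder path, not a file shipped with the package.
  stats <- runConduitRes $ streamFastqGzip "reads.fastq.gz" .| qualitySummary
  -- Print one (mean, variance) pair per read position.
  mapM_ print stats
```

`runConduitRes` supplies the `MonadResource` context, so the file handle is released when the pipeline finishes.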

qualitySummary :: Monad m => ConduitT Fastq o m [(Double, Double)]

Consume a stream of records and return the (mean, variance) of the quality scores at every read position.
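
To make the per-position statistic concrete, here is a list-based sketch that does not use conduit. It is not the library's implementation. It assumes Phred+33 quality encoding and the population variance (dividing by the number of reads covering a position); the page states neither. All names (`Acc`, `phred`, `positionStats`) are hypothetical.

```haskell
import Data.Char (ord)
import Data.List (foldl')

-- Running totals for one position: count, sum, sum of squares.
type Acc = (Int, Double, Double)

-- ASSUMPTION: Phred+33 encoding; the library's encoding is not stated here.
phred :: Char -> Double
phred c = fromIntegral (ord c - 33)

-- Add one quality symbol to a position's running totals.
addQual :: Acc -> Char -> Acc
addQual (n, s, ss) c = (n + 1, s + x, ss + x * x)
  where x = phred c

-- Merge one quality string into the accumulators, extending the list
-- when this read is longer than any seen so far.
step :: [Acc] -> String -> [Acc]
step accs       []       = accs
step []         (q : qs) = addQual (0, 0, 0) q : step [] qs
step (a : rest) (q : qs) = addQual a q : step rest qs

-- Per-position (mean, population variance) over a list of quality strings.
positionStats :: [String] -> [(Double, Double)]
positionStats = map finish . foldl' step []
  where
    finish (n, s, ss) =
      let k = fromIntegral n
          m = s / k
      in (m, ss / k - m * m)
```

The sum-of-squares form keeps a single pass over the data, which matches the streaming shape of the conduit version, at the cost of some floating-point cancellation on very large inputs.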

trimPolyA :: Int -> Fastq -> Fastq

Remove trailing 'A' bases (a poly-A tail) from a read.
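
The idea can be sketched on a plain (sequence, quality) pair. This is an illustration, not the library's code: `trimTailA` is a hypothetical name, and the meaning of the `Int` argument is an assumption (here, the minimum length a trailing run of 'A' must have before it is removed), since this page does not document it.

```haskell
import Data.List (dropWhileEnd)

-- ASSUMPTION: the Int is a minimum tail length; shorter runs are left alone.
-- The quality string is truncated to stay the same length as the sequence.
trimTailA :: Int -> (String, String) -> (String, String)
trimTailA minLen (sq, ql)
  | tailLen >= minLen = (kept, take (length kept) ql)
  | otherwise         = (sq, ql)
  where
    kept    = dropWhileEnd (== 'A') sq     -- sequence without trailing 'A's
    tailLen = length sq - length kept      -- length of the removed run
```

With a threshold of 3, `("ACGTAAAA", "IIIIHHHH")` becomes `("ACGT", "IIII")`, while `("ACGTAA", "IIIIHH")` is returned unchanged.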