cobot-io-0.1.3.20: Biological data file formats and IO
Safe HaskellNone
LanguageHaskell2010

Bio.ABI.Clean

Synopsis

Documentation

class Cleanable a where Source #

Ability to clean initial data. Returns cleaned variant or Nothing if it can not be cleaned.

Minimal complete definition

cleanWith

Methods

clean :: a -> Maybe a Source #

cleanWith :: Thresholds -> a -> Maybe a Source #

data Thresholds Source #

Thresholds to clean the data.

ABI file contains sequence with quality. By design of sanger sequencing method start and end of the sequence have bad quality. Moreover, internal part of the sequence can also has bad quality. To clean the data we make 2 steps.

Step 1. Clean edges:

  • take frame with frameSize and go through the sequence;
  • on each step evaluate mean value;
  • if mean value less than edgeThreshold, go further;
  • if mean value more than edgeThreshold stop and cut the sequence from END of this frame;
  • repeat this algorithm for the right edge.

Step 2. Evaluate quality:

  • for cropped sequence evaluate mean value;
  • if mean value less then innerThreshold, sequence is bad;
  • if mean value more then innerThreshold, sequence is acceptable.

Logic of this algorithm and defaultThresholds were obtained by taking experiments with read ABI files.

Instances

Instances details
Eq Thresholds Source # 
Instance details

Defined in Bio.ABI.Clean

Show Thresholds Source # 
Instance details

Defined in Bio.ABI.Clean

defaultThresholds :: Thresholds Source #

These thresholds were selected by many experiments on ab1-files.