bio-0.3.3.4: A bioinformatics library
Bio.Sequence.GeneOntology
Contents
Basic data types
Reading the OBO format
Reading 'terms and ids'
Reading UniProt associations
Utility stuff
Description

GeneOntology: parse and index Gene Ontology annotations, in particular the file 'gene_association.goa_uniprot', which links GO terms to UniProt accessions.

Synopsis
newtype GoTerm = GO Int
data GoDef = GoDef !GoTerm !ByteString !GoClass
type GoHierarchy = [(GoDef, [GoTerm])]
readObo :: FilePath -> IO GoHierarchy
readTerms :: FilePath -> IO [GoDef]
data Annotation = Ann !UniProtAcc !GoTerm !EvidenceCode
type UniProtAcc = ByteString
data GoClass
= Func
| Proc
| Comp
data EvidenceCode
= IC
| IDA
| IEA
| IEP
| IGC
| IGI
| IMP
| IPI
| ISS
| NAS
| ND
| RCA
| TAS
| NR
readGOA :: FilePath -> IO [Annotation]
isCurated :: EvidenceCode -> Bool
decomment :: ByteString -> [ByteString]
Basic data types
newtype GoTerm
A GO term is a positive integer.
Constructors
GO Int
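Because a GoTerm only wraps an Int, converting to and from the familiar zero-padded accession string has to be done by hand. The sketch below is an illustration, not part of the library: it declares a local stand-in for GoTerm, and the helpers formatGo and parseGo are hypothetical names. It assumes the usual seven-digit "GO:" accession format.

```haskell
import Data.Char (isDigit)

-- Local stand-in mirroring the documented newtype (not imported from bio).
newtype GoTerm = GO Int deriving (Eq, Ord, Show)

-- Hypothetical helper: render a term as a zero-padded accession,
-- assuming the conventional seven-digit form.
formatGo :: GoTerm -> String
formatGo (GO n) = "GO:" ++ replicate (7 - length digits) '0' ++ digits
  where digits = show n

-- Hypothetical helper: parse an accession string back into a GoTerm.
-- Returns Nothing unless the input is "GO:" followed by digits only.
parseGo :: String -> Maybe GoTerm
parseGo ('G':'O':':':ds)
  | not (null ds), all isDigit ds = Just (GO (read ds))
parseGo _ = Nothing
```

Loaded into GHCi, formatGo (GO 8150) gives "GO:0008150", and parseGo "GO:0008150" gives Just (GO 8150).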
data GoDef
A GoDef maps a GoTerm to a description and a GoClass.
Constructors
GoDef !GoTerm !ByteString !GoClass
Reading the OBO format
type GoHierarchy = [(GoDef, [GoTerm])]
A list of GO definitions, each with pointers to its parent nodes, as read from the .obo file. The user may construct the explicit hierarchy by storing these in a Map or similar.
readObo :: FilePath -> IO GoHierarchy
Read the GO hierarchy from the obo file. Note that this is not quite a tree structure.
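One way to build the explicit hierarchy is to index the parent lists in a Data.Map and walk upwards from a term. The sketch below is an assumption-laden illustration rather than library code: GoTerm is a local stand-in, the pairs carry only the term instead of a full GoDef, and parentMap and ancestors are hypothetical helpers. Since a term can have several parents, the walk tracks visited terms in a Set so that shared ancestors are collected once.

```haskell
import qualified Data.Map as M
import qualified Data.Set as S

-- Local stand-in mirroring the documented newtype (not imported from bio).
newtype GoTerm = GO Int deriving (Eq, Ord, Show)

-- Simplification: the documented GoHierarchy pairs a full GoDef with its
-- parents; only the term itself is kept here.
type ParentMap = M.Map GoTerm [GoTerm]

-- Hypothetical helper: index (term, parents) pairs by term.
parentMap :: [(GoTerm, [GoTerm])] -> ParentMap
parentMap = M.fromListWith (++)

-- Hypothetical helper: every term reachable by following parent links.
-- The visited set keeps the walk finite on a graph with shared parents.
ancestors :: ParentMap -> GoTerm -> S.Set GoTerm
ancestors pm t = go S.empty (M.findWithDefault [] t pm)
  where
    go seen [] = seen
    go seen (x:xs)
      | x `S.member` seen = go seen xs
      | otherwise = go (S.insert x seen) (M.findWithDefault [] x pm ++ xs)
```

With the real types, the input pairs could be obtained from a GoHierarchy value h as [ (t, ps) | (GoDef t _ _, ps) <- h ].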
Reading 'terms and ids'
readTerms :: FilePath -> IO [GoDef]
Read GO term definitions from the GO.terms_and_ids file.
Reading UniProt associations
data Annotation
A GOA annotation, containing a UniProt identifier, a GoTerm and an evidence code.
Constructors
Ann !UniProtAcc !GoTerm !EvidenceCode
type UniProtAcc = ByteString
A UniProt identifier (short string of capitals and numbers).
data GoClass
Constructors
Func
Proc
Comp
data EvidenceCode
Evidence codes describe the type of support for an annotation. See http://www.geneontology.org/GO.evidence.shtml
Constructors
IC    Inferred by Curator
IDA   Inferred from Direct Assay
IEA   Inferred from Electronic Annotation
IEP   Inferred from Expression Pattern
IGC   Inferred from Genomic Context
IGI   Inferred from Genetic Interaction
IMP   Inferred from Mutant Phenotype
IPI   Inferred from Physical Interaction
ISS   Inferred from Sequence or Structural Similarity
NAS   Non-traceable Author Statement
ND    No biological Data available
RCA   Inferred from Reviewed Computational Analysis
TAS   Traceable Author Statement
NR    Not Recorded
readGOA :: FilePath -> IO [Annotation]
Read the goa_uniprot file (warning: this one is huge!)
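To give a feel for what reading such a file involves, the sketch below extracts the three fields an Annotation needs from a single tab-separated line. It is not the implementation of readGOA: gafFields is a hypothetical helper, and it assumes the standard GAF column layout (accession in column 2, GO id in column 5, evidence code in column 7) with '!' marking comment lines.

```haskell
import qualified Data.ByteString.Char8 as B

-- Hypothetical helper: return (accession, GO id, evidence code) from one
-- association line, or Nothing for comment lines and short lines.
-- ASSUMPTION: GAF column layout, with the fields in columns 2, 5 and 7.
gafFields :: B.ByteString -> Maybe (B.ByteString, B.ByteString, B.ByteString)
gafFields line
  | B.pack "!" `B.isPrefixOf` line = Nothing
  | otherwise = case B.split '\t' line of
      (_db : acc : _symbol : _qualifier : goId : _ref : ev : _) ->
        Just (acc, goId, ev)
      _ -> Nothing
```

Given the size of the file, a caller would probably want to apply a function like this line by line over lazily read input rather than load everything into memory first.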
isCurated :: EvidenceCode -> Bool
The vast majority of GOA data is IEA, while the most reliable information is manually curated. Filtering on this is useful to keep data set sizes manageable, too.
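The filtering described here might look like the following sketch. The types are local stand-ins (with a plain String in place of the ByteString accession), and because this page does not say exactly which codes isCurated accepts, the predicate notElectronic is a hypothetical substitute that simply rejects IEA.

```haskell
-- Local stand-ins mirroring the documented types (not imported from bio).
data EvidenceCode
  = IC | IDA | IEA | IEP | IGC | IGI | IMP | IPI | ISS | NAS | ND | RCA | TAS | NR
  deriving (Eq, Show)

newtype GoTerm = GO Int deriving (Eq, Show)

-- Simplification: String instead of the documented ByteString accession.
data Annotation = Ann String GoTerm EvidenceCode deriving (Eq, Show)

-- ASSUMPTION: stand-in for isCurated; treats every code except IEA as
-- curated. The library's own predicate may draw the line differently.
notElectronic :: EvidenceCode -> Bool
notElectronic = (/= IEA)

-- Keep only the annotations whose evidence code passes the predicate.
curatedOnly :: [Annotation] -> [Annotation]
curatedOnly anns = [a | a@(Ann _ _ ev) <- anns, notElectronic ev]
```

With the real library, the equivalent would presumably be a filter over the result of readGOA using isCurated on the third field of each Ann.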
Utility stuff
decomment :: ByteString -> [ByteString]
Produced by Haddock version 2.4.2