# B      !"#$%&'()*+,-./0123456789:;<=>?@ANoneLNoneBRepresents training data obtained from a corpus required by Punkt.Occurrences of each textual type, case-insensitive. Used during Punkt's type-based stage. Also contains occurrences of trailing periods.6Dictionary of orthographic data for each textual type.FCarries various orthographic statistics for a particular textual type.number of lowercase occurrencesuppercase occurrencesCnumber of lowercase occurrences in the first position of a sentence ?number of uppercase occurrences strictly internal to a sentence!*number of occurences in the first position%.Dunning log likelihood modified by Kiss/Strunk&!Dunning's original log likelihood,COccurrences of a textual type, strictly ignoring trailing period. c(w, ~.). Case-insensitive.-4Occurrences of a textual type with trailing period. c(w, .). Case-insensitive..c(w) == c(w, .) + c(w, ~.). Case-insensitive.0$Returns the log likelihood that (w_ snoc B() is an abbreviation. Case-insensitive.1 Decides if wD is a sentence ender based on its capitalization. Case-insensitive.2SSpecial orthographic heuristic for post-possible-initial tokens. Case-insensitive.3Log likelihood that w2 is a frequent sentence starter. Case-insensitive.4)Computes the collocational likelihood of w and x. Case-insensitive.5HBuilds a dictionary of textual type frequencies from a stream of tokens.@SMain export of the entire package. Splits a corpus into its constituent sentences.ArunPunkt data computation runs  computation using data collected from a corpus using 9.<  !"#$%&'()*+,-./0123456789:;<=>?@A<  !"#$%&'()*+,-./0123456789:;<=>?@A< ! "#$%&'()*+,-./0123456789:;<=>?@A%  !"#$%&'()*+,-./0123456789:;<=>?@AC       !"#$%&'()*+,-./0123456789:;<=>?@ABCDE punkt-0.1.1NLP.Punkt.Match NLP.Punkt re_split_impl re_split_posre_split re_compile word_sepsintrasepPunktTokenoffsettoklenentitysentendabbrevEntityDashEllipsis ParaStartPunctWord PunktData type_count ortho_count collocations total_enders total_toks OrthoFreq freq_lower freq_upperfreq_first_lowerfreq_internal_upperfreq_after_endernorm is_initialis_word strunk_log dunning_logask_type_countask_total_toksask_total_enders ask_ortho ask_collocfreq freq_snoc_dot freq_typedlen prob_abbr decide_orthodecide_initial_ortho prob_starter prob_collocbuild_type_countbuild_ortho_count build_collocs to_tokensbuild_punkt_dataclassify_by_typeclassify_by_nextclassify_punkt find_breaks substring match_spacessplit_sentencesrunPunktbaseGHC.Base.