rake-0.0.1


NLP.RAKE.Resources
Safe Haskell: Safe-Inferred

NoSplit
    List of characters at which words are not split. This list is language dependent.

defaultNosplit
    The default list is for English and considers only ASCII characters, the digits 0..9 and some other symbols. There are resources for other languages, but they need review and contribution!

enNosplit
    ASCII characters.

numNosplit
    Digits.

othNosplit
    Some more symbols ("+-/").

latin1Nosplit
    Latin1.

latinExAnosplit
    Latin1 extended-A.

latinExBnosplit
    Latin1 extended-B.

greekNosplit
    Greek and Coptic (needs revision).

cyrillicNosplit
    Cyrillic (needs revision).


NLP.RAKE.Stopwords
Safe Haskell: Safe-Inferred

NoList
    The nolist: symbols in this list count as stop words independently of the chosen stop word list. This list can be used to exclude very specific "words" that may occur in a given domain, for instance mathematical formulas and symbols.

StopwordsMap
    Search tree for stop words.

mkStopwords
    Make a StopwordsMap from a list of stop words encoded as Text.

mkStopwordsStr
    Make a StopwordsMap from a list of stop words encoded as String.

stopword
    Search for a chunk of Text in the StopwordsMap. Note that a word or symbol that does not appear in the stop word list may still be on the nolist, in which case it still counts as a stop word (e.g. "-").

loadStopWords
    Load a stop word list from a file.

defaultStoplist
    The default stop word list (smartStoplist).

defaultNolist
    Currently, the default nolist contains only the symbol "-".

smartStoplist
    The "smart" stop word list.

foxStoplist
    The "Fox" stop word list.


NLP.RAKE.Text
Copyright: (c) Tobias Schoofs
License: LGPL
Stability: experimental
Portability: portable
Safe Haskell: Safe-Inferred

WordScore
    The result is a keyword candidate: a keyword consisting of one or more words, and a score associated with this keyword.

candidates
    This interface provides the most flexibility. It expects a StopwordsMap of stop words, a nosplit list used by the word splitter, an additional list of words or symbols you want to exclude for a specific document, and a text split into phrases. Users may pass in their own stop word list (e.g. by loading it from a file, see loadStopWords) or one of the predefined lists (smartStoplist, foxStoplist).

keywords
    The keywords function is a convenience interface that makes a couple of decisions internally: it uses the defaultStoplist, the English language nosplit list and the default nolist, and it splits the text into phrases using the pSplitter. The function is equivalent to

        candidates defaultStoplist defaultNosplit defaultNolist . pSplitter

sortByScore
    Sort the WordScore list by scores (descending!).

sortByWord
    Sort the WordScore list by words (ascending!).

pSplitter
    Default phrase splitter. It splits phrases at characters in the punctuation category (those for which isPunctuation is True), with the exception of '-'.
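
The splitting rule of the default phrase splitter (break at every character for which isPunctuation is True, except '-') can be illustrated with a small stand-alone sketch. This is not the library's code: splitPhrases is a hypothetical name, it works on String rather than Text, and trimming whitespace and dropping empty phrases are assumptions made here for readability.

```haskell
import Data.Char (isPunctuation, isSpace)
import Data.List (dropWhileEnd)

-- Hypothetical stand-in for the rule documented for pSplitter.
-- Assumptions: String instead of Text; phrases are trimmed and
-- empty phrases are dropped. The real pSplitter may differ.
splitPhrases :: String -> [String]
splitPhrases = filter (not . null) . map trim . go
  where
    -- A phrase boundary is any punctuation character except '-'.
    isBreak c = isPunctuation c && c /= '-'

    -- Cut the input at every boundary character, discarding the boundary.
    go s = case break isBreak s of
      (phrase, [])       -> [phrase]
      (phrase, _ : rest) -> phrase : go rest

    -- Remove leading and trailing whitespace.
    trim = dropWhileEnd isSpace . dropWhile isSpace
```

Loaded into GHCi, splitPhrases "Keyword extraction is well-known; it is simple, fast." should yield three phrases, with "well-known" left intact because '-' is not treated as a boundary.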