aftovolio-0.4.0.0: An AFTOVolio implementation for creating texts with special phonetic / prosodic properties.
Copyright(c) OleksandrZhabenko 2020-2024
LicenseMIT
Maintaineroleksandr.zhabenko@yahoo.com
StabilityExperimental
Safe HaskellSafe-Inferred
LanguageHaskell2010

Aftovolio.Ukrainian.PrepareText

Description

Earlier it has been a module DobutokO.Poetry.Ukrainian.PrepareText from the dobutokO-poetry package. In particular, this module can be used to prepare the Ukrainian text by applying the most needed grammar for AFTOVolio to avoid misunderstanding for the produced text. The attention is paid to the prepositions, pronouns, conjunctions and particles that are most commonly connected (or not) in a significant way with the next text. Uses the information from: https://uk.wikipedia.org/wiki/%D0%A1%D0%BF%D0%BE%D0%BB%D1%83%D1%87%D0%BD%D0%B8%D0%BA and https://uk.wikipedia.org/wiki/%D0%A7%D0%B0%D1%81%D1%82%D0%BA%D0%B0_(%D0%BC%D0%BE%D0%B2%D0%BE%D0%B7%D0%BD%D0%B0%D0%B2%D1%81%D1%82%D0%B2%D0%BE)

Uses arrays instead of vectors.

Synopsis

Basic functions

complexWords :: [String] -> [String] Source #

Concatenates complex words in Ukrainian so that they are not separated further by possible words order rearrangements (because they are treated as a single word). This is needed to preserve basic grammar in phonetic languages.

splitLines :: [String] -> [String] Source #

Since 0.2.1.0 version the function is recursive and is applied so that all returned elements (String) are no longer than 7 words in them.

splitLinesN :: Int -> [String] -> [String] Source #

A generalized variant of the splitLines with the arbitrary maximum number of the words in the lines given as the first argument.

isConcatenated :: String -> Bool Source #

Since the dobutokO-poetry version 0.16.3.0 the (||) operator has been changed to the (&&). The idea is that these words are the ones that are pronouns and they "should" be treated (by the author's understanding) as independent words.

isUkrainianL :: Char -> Bool Source #

Is taken from the mmsyn6ukr package version 0.8.1.0 so that the amount of dependencies are reduced (and was slightly modified).

The end-user functions

prepareText :: String -> [String] Source #

Is used to convert a Ukrainian text into list of String each of which is ready to be used by the functions of the modules for the phonetic languages approach. It applies minimal grammar links and connections between the most commonly used Ukrainian words that "should" be paired and not dealt with separately to avoid the misinterpretation and preserve maximum of the semantics for the "phonetic" language on the Ukrainian basis.

prepareTextN :: Int -> String -> [String] Source #

A generalized variant of the prepareText with the arbitrary maximum number of the words in the lines given as the first argument.

prepareTextN2 :: Int -> String -> [String] Source #

A generalized variant of the prepareText with the arbitrary maximum number of the words in the lines given as the first argument. The '_' is not filtered out.

prepareTextN3 :: Int -> String -> [String] Source #

A generalized variant of the prepareText with the arbitrary maximum number of the words in the lines given as the first argument. Both '_' and '=' are not filtered out.

prepareTextNG Source #

Arguments

:: (Char -> Bool)

A predicate to filter the symbols during preparation.

-> Int 
-> String 
-> [String] 

An even more generalized variant of the prepareTextN with the arbitrary maximum number of the words in the lines given as the second argument and the possibility to provide custom function for filtering.

growLinesN :: Int -> [String] -> [String] Source #

@ since 0.2.0.0 Given a positive number and a list tries to rearrange the list's Strings by concatenation of the several elements of the list so that the number of words in every new String in the resulting list is not greater than the Int argument. If some of the Strings have more than that number quantity of the words then these Strings are preserved.

prepareGrowTextMN Source #

Arguments

:: Int

A maximum number of the words or their concatenations in the resulting list of Strings.

-> Int

A number of words in every String that the function firstly forms. To have some sense of usage, must be less than the first argument.

-> String 
-> [String] 

@ since 0.2.0.0 The function combines the prepareTextN and growLinesN function. Applies needed phonetic language preparations to the Ukrainian text and tries to 'grow' the resulting Strings in the list so that the number of the words in every of them is no greater than the given first Int number.

prepareGrowTextMNG Source #

Arguments

:: (Char -> Bool)

A predicate to filter the symbols during preparation.

-> Int

A maximum number of the words or their concatenations in the resulting list of Strings.

-> Int

A number of words in every String that the function firstly forms. To have some sense of usage, must be less than the first argument.

-> String 
-> [String] 

@ since 0.11.0.0 The generalized version of the prepareGrowTextMN with additional possibility to provide custom function for symbols filtering inside.

tuneLinesN :: Int -> [String] -> [String] Source #

@ since 0.6.0.0 Recursively splits the concatenated list of lines of words so that in every resulting String in the list except the last one there is just Int -- the first argument -- words.

prepareTuneTextMN Source #

Arguments

:: Int

A number of the words or their concatenations in the resulting list of Strings (except probably the last one).

-> Int

A number of words in every String that the function firstly forms. To have some sense of usage, must be less than the first argument.

-> String 
-> [String] 

@ since 0.6.0.0 The function combines the prepareTextN and tuneLinesN functions. Applies needed phonetic language preparations to the Ukrainian text and splits the list of Strings so that the number of the words in each of them (except the last one) is equal the given first Int number.

prepareTuneTextMNG Source #

Arguments

:: (Char -> Bool)

A predicate to filter the symbols during preparation.

-> Int

A number of the words or their concatenations in the resulting list of Strings (except probably the last one).

-> Int

A number of words in every String that the function firstly forms. To have some sense of usage, must be less than the first argument.

-> String 
-> [String] 

@ since 0.11.0.0 The generalized version of the prepareTuneTextMN with additional possibility to provide custom function for symbols filtering inside.

Used to transform after convertToProperUkrainian from mmsyn6ukr package

aux4 :: String -> Char Source #

Can be used to prepare the text after convertToProperUkrainian from Ukrainian module from mmsyn6ukr package so that all the String can be represented as unique Char.