### Composable Command Line Parsing with `optparse-applicative`
There are many solutions for parsing command line arguments in Haskell. Personally I like [`optparse-applicative`](https://hackage.haskell.org/package/optparse-applicative-0.12.1.0/), because, like the title suggests, you can compose parsers out of smaller pieces.
I have written command line parsers for the database connection info for [`postgresql-simple`](https://hackage.haskell.org/package/postgresql-simple-0.5.2.1/) many times and faced with the prospect of doing it again I opted to make this library, which is also this literate Haskell file. This way I could reuse it in web servers, db migrators, db job runners ... those are all the examples I could think of ... just trust me, it's worth it.
### Outline
- [The "Partial" Option Types](#partial)
- [The Composable Parser](#parser)
- [The Complete Option](#option)
- [Option "completion"](#completion)
- [The Option Parser](#option-parser)
- [The Runner](#runner)
- [The Tests](#tests)
### Standard Intro Statements to Ignore
```haskell
{-| A resuable optparse-applicative parser for creating a postgresql-simple
'Connection'
|-}
{-# LANGUAGE RecordWildCards, LambdaCase, DeriveGeneric, DeriveDataTypeable #-}
{-# LANGUAGE GeneralizedNewtypeDeriving, CPP #-}
module Database.PostgreSQL.Simple.Options where
import Database.PostgreSQL.Simple
import Options.Applicative
import Text.Read
import Data.ByteString (ByteString)
import qualified Data.ByteString.Char8 as BSC
import GHC.Generics
import Options.Generic
import Data.Typeable
import Data.String
#if !MIN_VERSION_base(4,8,0)
import Data.Monoid
#endif
```
### The "Partial" Option Types
In general, options types are built from many optional fields. Additionally, multiple options sets can be combined (i.e. command line options, config file, environment vars, defaults, etc). The easiest way to handle this is to create a "partial" option family that can be monoidally composed and is "completed" with a default option value.
```haskell
-- | An optional version of 'ConnectInfo'. This includes an instance of
-- | 'ParseRecord' which provides the optparse-applicative Parser.
data PartialConnectInfo = PartialConnectInfo
{ host :: Last String
, port :: Last Int
, user :: Last String
, password :: Last String
, database :: Last String
} deriving (Show, Eq, Read, Ord, Generic, Typeable)
```
We will utilize a boilerplate prevention library by Gaberiel Gonzales called [`optparse-generic`](https://hackage.haskell.org/package/optparse-generic-1.1.3) to generate the parser for use from the records field names.
To create the parser we have to merely declare an instance of `ParseRecord`.
```haskell
instance ParseRecord PartialConnectInfo
```
Now we make `PartialConnectInfo` an instance of `Monoid` so we can combine multiple options together.
```haskell
instance Monoid PartialConnectInfo where
mempty = PartialConnectInfo (Last Nothing) (Last Nothing)
(Last Nothing) (Last Nothing)
(Last Nothing)
mappend x y = PartialConnectInfo
{ host = host x <> host y
, port = port x <> port y
, user = user x <> user y
, password = password x <> password y
, database = database x <> database y
}
```
As it so happens there are two ways to create a db connection with `postgresql-simple`: `ConnectInfo` and a `ByteString` connection string. We have a partial version of `ConnectInfo` but we need something for the connection string.
```haskell
newtype ConnectString = ConnectString
{ connectString :: ByteString
} deriving ( Show, Eq, Read, Ord, Generic, Typeable, IsString )
```
I don't like the default option parsing for `String` in `optparse-applicative`. I want something that will escape double quotes, remove single quotes or just use the string unaltered. The function `parseString` does this.
```haskell
unSingleQuote :: String -> Maybe String
unSingleQuote (x : xs@(_ : _))
| x == '\'' && last xs == '\'' = Just $ init xs
| otherwise = Nothing
unSingleQuote _ = Nothing
parseString :: String -> Maybe String
parseString x = readMaybe x <|> unSingleQuote x <|> Just x
```
We use `parseString` to make a custom instance of `ParseRecord`.
```haskell
instance ParseRecord ConnectString where
parseRecord = fmap (ConnectString . BSC.pack)
$ option ( eitherReader
$ maybe (Left "Impossible!") Right
. parseString
)
(long "connectString")
```
Thus, my `PartialOptions` type is either the `ConnectString` or the `PartialConnectInfo` type.
```haskell
data PartialOptions
= POConnectString ConnectString
| POPartialConnectInfo PartialConnectInfo
deriving (Show, Eq, Read, Generic, Typeable)
```
There is one wrinkle. `optparse-generic` treats sum types as "commands". This makes sense as a default, but it is not what we want. We want to choose one record or another based on the non-overlapping flags. This is easy enough to do by hand.
```haskell
instance ParseRecord PartialOptions where
parseRecord
= fmap POConnectString parseRecord
<|> fmap POPartialConnectInfo parseRecord
```
### The Composable Parser
We can use `PartialOptions` as the type of a field in a larger options record defined elsewhere. When defining this more complicated parser, we reuse the work we did here by calling `parseRecord`. To make it even clearer we create an alias called `parser` so clients will know what to use.
```haskell
-- | The main parser to reuse.
parser :: Parser PartialOptions
parser = parseRecord
```
### The Complete Option
The connection option for `postgresql-simple` is either the record `ConnectInfo` or a connection string
```haskell
data Options
= OConnectString ByteString
| OConnectInfo ConnectInfo
deriving (Show, Eq, Read, Generic, Typeable)
```
### Option "completion"
`postgresql-simple` provides sensible defaults for `ConnectInfo` via `defaultConnectInfo`. We use these as the defaults when parsing. We create a `PartialConnectInfo` with these defaults.
```haskell
mkLast :: a -> Last a
mkLast = Last . Just
defaultPartialConnectInfo :: PartialConnectInfo
defaultPartialConnectInfo = PartialConnectInfo
{ host = mkLast $ connectHost defaultConnectInfo
, port = mkLast $ fromIntegral $ connectPort defaultConnectInfo
, user = mkLast $ connectUser defaultConnectInfo
, password = mkLast $ connectPassword defaultConnectInfo
, database = mkLast $ connectDatabase defaultConnectInfo
}
```
We can now complete the `PartialConnectInfo` to get a `ConnectInfo`.
```haskell
completeConnectInfo :: PartialConnectInfo -> ConnectInfo
completeConnectInfo x = case defaultPartialConnectInfo <> x of
PartialConnectInfo
{ host = Last (Just connectHost )
, port = Last (Just connectPortInt )
, user = Last (Just connectUser )
, password = Last (Just connectPassword)
, database = Last (Just connectDatabase)
} -> let connectPort = fromIntegral connectPortInt in ConnectInfo {..}
_ -> error "Impossible! No options should be required!"
```
Completing a `PartialOptions` to get an `Options` follows straightforwardly ... if you've done this a bunch I suppose.
```haskell
-- | mappend with 'defaultPartialConnectInfo' if necessary to create all
-- options
completeOptions :: PartialOptions -> Options
completeOptions = \case
POConnectString (ConnectString x) -> OConnectString x
POPartialConnectInfo x -> OConnectInfo $ completeConnectInfo x
```
### The Option Parser
Parse a `PartialOptions` and then complete it. This is **not** composable but is convient for testing and if you only need a `Option` type
```haskell
-- | Useful for testing or if only Options are needed.
completeParser :: Parser Options
completeParser = fmap completeOptions parseRecord
```
### The Runner
As a convenience, we export the primary use of parsing connection options ... making a connection.
```haskell
-- | Create a connection with an 'Option'
run :: Options -> IO Connection
run = \case
OConnectString connString -> connectPostgreSQL connString
OConnectInfo connInfo -> connect connInfo
```
### The tests
Testing is pretty straightforward using `System.Environment.withArgs`. See the [Spec.hs](/test/Spec.hs) for examples of how to test the parsers.