pcre2: Regular expressions via the PCRE2 C library (included)

[ apache, library, text ] [ Propose Tags ]

Please see the README on GitHub at https://github.com/sjshuck/hs-pcre2


[Skip to Readme]

Downloads

Maintainer's Corner

Package maintainers

For package maintainers and hackage trustees

Candidates

  • No Candidates
Versions [RSS] 1.0.0, 1.0.1, 1.0.1.1, 1.0.2, 1.1.0, 1.1.1, 1.1.2, 1.1.3, 1.1.3.1, 1.1.4, 1.1.5, 2.0.0, 2.0.1, 2.0.2, 2.0.3, 2.0.4, 2.0.5, 2.1.0, 2.1.0.1, 2.1.1, 2.1.1.1, 2.2.0, 2.2.1
Change log ChangeLog.md
Dependencies base (>=4.7 && <5), containers, mtl, template-haskell, text [details]
License Apache-2.0
Copyright 2020 Shlomo Shuck
Author Shlomo Shuck
Maintainer stevenjshuck@gmail.com
Category Text
Home page https://github.com/sjshuck/hs-pcre2#readme
Bug tracker https://github.com/sjshuck/hs-pcre2/issues
Source repo head: git clone https://github.com/sjshuck/hs-pcre2
Uploaded by SShuck at 2020-12-09T13:36:53Z
Distributions LTSHaskell:2.2.1, NixOS:2.2.1, Stackage:2.2.1
Reverse Dependencies 1 direct, 0 indirect [details]
Downloads 3661 total (85 in the last 30 days)
Rating (no votes yet) [estimated by Bayesian average]
Your Rating
  • λ
  • λ
  • λ
Status Docs available [build log]
Last success reported on 2020-12-09 [all 1 reports]

Readme for pcre2-1.0.1.1

[back to package description]

pcre2

Regular expressions for Haskell.
https://hackage.haskell.org/package/pcre2

Teasers

embeddedLicensePlate :: Text -> Maybe Text
embeddedLicensePlate = match "[A-Z]{3}[0-9]{3,4}"
case "The quick brown fox" of
    [regex|brown\s+(?<animal>\S+)|] -> Text.putStrLn animal
    _                               -> error "nothing brown"
let kv'd = lined . packed . [_regex|^\s*(.*?)\s*[=:]\s*(.*)|]

forMOf kv'd file $ execStateT $ do
    k <- gets $ capture @1
    v <- gets $ capture @2
    liftIO $ Text.putStrLn $ "found " <> k <> " set to " <> v

    case myMap ^. at k of
        Just v' | v /= v' -> do
            liftIO $ Text.putStrLn $ "setting " <> k <> " to " <> v'
            _capture @2 .= v'
        _ -> liftIO $ Text.putStrLn "no change"

Features

  • Low-surface API covering most use cases.
  • Quiet functions with simple types—for the most part it's
    Text (pattern) -> Text (subject) -> result.
  • Use partial application to create performant, compile-once-match-many code.
  • Low cognitive overhead—there's just one custom datatype for both compile and match options, the Option monoid.
  • Text everywhere for interop with both C and the broader Haskell ecosystem.
  • Match failures expressed via Alternative or pattern match failures.
  • Opt-in Template Haskell facilities for compile-time verification of patterns, indexing captures, and memoizing inline regexes.
  • Opt-in lens support.
  • No failure monads to express compile errors, preferring pure functions and throwing imprecise exceptions with pretty Show instances. Write simple code and debug it. Or, don't, and use the Template Haskell features instead. Both are first-class.
  • Vast presentation of PCRE2 functionality. We can even register Haskell callbacks to run during matching!
  • No dependencies that aren't distributed with GHC.
  • Bundled, statically-linked UTF-16 build of up-to-date PCRE2 (version 10.35), with a complete, exposed Haskell binding.

TODO

  • Global matching. (We already have global substitution.)
  • Many performance optimizations. Currently we are 2–3× slower than other libraries doing everything (a few μs).
  • Make use of DFA and JIT compilation.
  • Establish automated builds.
  • Improve PCRE2 C compile time.
  • PCRE2 10.36 is out already.

License

Apache 2.0.
PCRE2 is distributed under the 3-clause BSD license.

Author

©2020 Shlomo Shuck