# Maintainers' Guide ## PR Merge Checklist * All the CI tests pass * New tests are added where applicable * Benchmarks are added where applicable * Run benchmarks locally if you suspect any regressions * Documentation is added for any new combinators * New combinators have time and space complexity annotations * New combinators have `since` annotation * Changelog entry is added for exposed combinators. * Identify breaking changes, or changes that may require depndency version bound changes. See https://pvp.haskell.org/. Also see the "Breaking changes section below". * Optionally, look at hlint output if anything in that is worth fixing. * Merge the PR by rebasing. Note that github always creates new commits when merged with rebase, it records the committer as well as the author in the commit. This makes the local branch diverge from master. You can rebase and merge the commit manually in your local git tree and push the resulting master branch to avoid new commits and divergence from the original tree. ## Release Checklist * Check if any critical pending bugs or issues are to be included * If this is a major release check if any previously deprecated features are to be removed in this release. * Check pre-release APIs to be exposed, especially from streamly-examples * PVP is adhered to, additionally do not change pre-release APIs in a minor release * _Documentation_: * Documentation is updated. * Check if all the links in the docs are stable. * External links might point to the `master` branch of a few repositories. Pin them to a specific version that works with the current release. * You can run `rg '/blob/'` and `rg '/tree/'` to search for github links. * For example, links that point to `streamly-examples` point to its `master` branch. Instead, while making the release we need to pin the links to a specific revision that works with the latest commit of the current release. * Ideally, the distribution process should automatically replace these with current stable references. * The markdown documentation files have no broken links. See the section about [checking for broken links](#checking-for-broken-links-in-the-markdown-files) for more info. * Haddock docs are consistent with the changes in the release * Tutorial has been updated for new changes * Documents in the `docs` directory are consistent with new changes * All combinators have time and space complexity annotations * All combinators have `since` annotation * Modules that were renamed or APIs that moved to other modules should be marked as "since" the current release. * Check all released modules to ensure that they do not have any leftover `Internal` or `Pre-release` annotations. * Unreleased combinators should be marked with `Pre-release` or `Internal` annotations * Streamly homepage is updated with the release docs * FileSystem.Event.Windows/Darwin modules require special handling * _Build and Test_: * All CIs are green * Run tests using bin/test.sh to test with memory restrictions * Manually build and test for all flags (esp. `dev` flag) not covered by CIs * Test prime GHC version with -O0 and -O1 * _Benchmarks_: * Check regressions from previous release * Run benchmarks with large stream size (`bench-runner --long --quick`) to check for space leaks and to ensure constant memory usage for streaming operations. * Run benchmarks with `dev` flag on. Some `fileio` benchmarks are disabled in regular runs. * Check comparative benchmarks using streaming-benchmarks * _Dependencies_: * Make sure all dependency bounds can use latest versions * Update `stack.yaml` to latest stable resolver, cleanup extra-deps * Make sure fusion-plugin is up to date and uploaded * Check the dependency footprint of the library. See [Generating Dependency Graph](#generating-dependency-graph) section. * _Dependent Packages_: * Update [streamly-examples](https://github.com/composewell/streamly-examples) to make sure it runs with the latest release. * Check the performance of examples where applicable * Update: * streamly-bytestring * streamly-process * streamly-lz4 * _Update Package Metadata:_ * Make sure the description in cabal file is in sync with the docs * Make sure CI configs include last three major releases of GHC in CI testing. * Update GHC `tested-with` field * Update `docs/Compiling.md` with the distributions tested with * Any docs linked inside haddock/cabal/changelog or any other such file should go in the extra-doc-files section instead of extra-source-files. Otherwise hackage shows the links as broken. * Make sure any additional files are added to `extra-source-files` in cabal file. Artifacts required for build, test, benchmarks, docs, licenses should be packaged. Build environment customization may or may not be packaged. * _Copyrights and Contibutors_ * Make sure contributors to the release are listed in `docs/CONTRIBUTORS.md` under the current release title. * Make sure any third party code included in the release has been listed in `docs/Credits.md` and the license is added to the repo. * _Update changelog & Version_: * Find API changes using `./packdiff streamly ver1 streamly ver2` and record them in `docs/User/ProjectRelated/ApiChangelogs/ver1-ver2.txt`. * Make sure all the bug fixes being included in this release are marked with a target release on github. So that users can search by release if they want. * Make sure the package version in configure.ac has the correct release version, else set the version correctly and run autoreconf. This might need to be done for a minor release. * Make sure the `Unreleased` section at the top of changelog file has the new release version number and the month/year of publishing. * Make sure the package version in the cabal file and package.yaml has the correct release version, else set the version correctly. This might need to be done on a minor release. * Bump the package version in any docs/links, use something like `rg '0\.8\.'|grep -v -i since` to find any remaining occurrences of the old release. Check `rg -i unreleased` for any remaining todos. * _Upload_: * Wait for final CI tests to pass: * Manually run packcheck for ghcjs, `bin/run-ci.sh --targets ghcjs`. * Mask out the build status lines from the [README](/docs/User/ProjectRelated/README.md) * Upload to hackage * Use a clean workspace to create source distribution by freshly cloning the git repository. The reason for doing this is that we use wild-cards in cabal file for `extra-source-files`, these wild-cards may match additional files lying around in the workspace and unintentionally ship them as well. * Upload and verify the candidate * `cabal v2-sdist; cabal upload ` * `cabal v2-haddock --haddock-for-hackage --enable-doc; cabal upload -d ` * Publish the package * `cabal v2-sdist`; `cabal upload --publish ` * Run packcheck on the uploaded package. To get the latest uploaded version from hackage, run `cabal unpack `. You might have to run `cabal update`. * Create a git tag corresponding to the release where X.Y.Z is the new package version (`git tag vX.Y.Z && git push -f origin vX.Y.Z`). * Add to stackage (`build-constraints.yaml` in Stackage repo) if needed * Optionally upload `package-X.Y.Z-sdist.tar.gz` to github release page * `gh release create vX.Y.Z --title "vX.Y.Z"` * `gh release upload vX.Y.Z package-X.Y.Z-sdist.tar.gz` * Update release contributors on github release page (`git shortlog -s prev_tag..new_tag | sed $'s/^[0-9 \t]*/* /' | sort -f`) * Update and if needed release streaming-benchmarks package * Check https://matrix.hackage.haskell.org/package/streamly * Check haddocks on Hackage, upload if not built * Announce to haskell-cafe@haskell.org ### Post Release Tasks * _Bump package version_: * After a major release, bump the package version in cabal file or package.yaml to the next major release target. * Bump the package version in configure.ac to the next major release target and run autoreconf. * Sync the package version of `streamly` and `streamly-docs`. * Add an `Unreleased` section at the top of changelog file with the next major release target. ## Breaking Changes This section lists what constitutes breaking changes. See https://pvp.haskell.org/ breaking changes that can be determined by the API alone. We specify some additional recommendations here: Behavior changes, this kind of changes are nasty and should be avoided. If such changes are made they should be emphasized adequately in the changelog: * Silent changes in the behavior of an API without any changes to the signature. * Even fixing a bug could break things as users may have employed workarounds for the bug. * Changes in the behavior of a dependency may also cause a change in behavior. Be cautious about the following: * Change in version bounds of dependencies may cause compilation failure for some programs because they may not be able to find a build plan. * Ideally, new warnings should not be considered breaking, dependencies should never be compiled with -Werror. But packages may not be following it perfectly. * Deprecating an API may issue new warnings, however the code can still be compiled if warnings are not treated as errors. Internal APIs: * Internal APIs can change without having to change the major version. However, we should try to align the Internal API changes with a major release as long as possible. ## Managing Issues We label the issues in different dimensions based on the lifecyle and different management aspects of the issue. The folowing sections discuss the labels used to manage the issues. If a new label is required, it should be discussed before creating it. ### Disposition The level-1 triaging of the issue determines the current disposition of the issue. If no change is required the issue must have one of the following labels: * disposition:invalid * disposition:question * disposition:discussion * disposition:duplicate * disposition:wontfix If a change is required we need to do level-2 triage of the issue, see the sections below. ### Change type When a change is required we need to put one of the __change type__ labels as part of level-2 triaging: * type:performance: User visible impact on performance. * type:testing: Improves testing or related to testing. * type:usability: It is not convenient to use the library. * type:documentation: documentation is not correct or sufficient. * type:bug: A functionality issue, not working as expected. * type:enhancement: includes a new feature or enhancement * type:maintenance: A refactor or any other change with no user visible impact. ### Aspect In addition we can put a __product aspect__ label describing a feature name or any other product specific classification bucket. * aspect: ### Severity Optionally we can add a severity label to indicate how severe is the impact of the bug/issue, which may determine how important it is to fix it. By default the severity is normal, if it is high we put a label: * severity:high ### API impact For a user visible issue whether it has a release/changelog impact: * api:new: includes a new feature or enhancement * api:breaking: has a breaking impact on existing deployments * api:deprecating: deprecates an existing functionality __RULE__: Any commit that may affect the end user in some way MUST have either a changelog entry OR MUST have an issue marked with one of the above labels OR both. ### Priority By default the issues are normal priority. We use a label for high priority issues: * priority:high ### Scheduling If the issue is assigned to someone then it is considered scheduled. Otherwise it is unscheduled. Unassigned issues may have the following labels: * sched:deferred: deliberately deferred for some reason. * sched:blocked: blocked on any other fix or a decision to be mad ### For Contributors * help-wanted: anyone can take the issue and contribute * good-first-issue: good for new contributors ## Correlating Changes, Issues and Releases For planning purposes, open issues may be marked with milestones or target releases. However, it may not always be known which release a fix will finally land in. For example, we may decide to make a minor release instead of a major one if there are no breaking changes yet, so we may not always know what would be the next release version. Trackability means that we should be able to find which issues got fixed in which release. Or what all issues got fixed in a particular release. We track significant changes using the changelog. However, there may be more changes that can only be tracked via issues, PRs or commits. When we make a release we can mark all the issues fixed in that release with a correct release target for future trackability. For better trackability of which issue got fixed in which release we need the following: * Before you close an issue make sure a commit or a PR fixing the issue is attached with it. In the commit message you can reference an issue like "fixes #50", you can do the same in a PR as well. * Before we make a new release EVERY issue with a commit included in that release that affects the end user, especially bugs and breaking changes MUST have the target release correctly set. ## Changelog Management Keep the unreleased changes in the `Unreleased` section at the top of changelog file. Using `Unreleased` instead of the next release number for unreleased changes is important. First, we do not know the next release number in advance, it could be a major or minor release. Second, if we use the next release number instead, then when adding a new change to the changelog, at one look we cannot know whether the release number on top is unreleased or released. ## TODO * GPG signed releases and tags ## Generating Dependency Graph Ways to get the dependencies. In the streamly package top dir: ``` # Keep only streamly in the packages section of stack.yaml $ stack --system-ghc dot . --external|dot -Tpdf > streamly.pdf ``` Using nix, remove the test and benchmark components from `shellFor/packages` in `default.nix` to get library only dependencies. May show additional ghc boot libraries. ``` $ nix-shell --run "ghc-pkg dot | dot -Tpdf > streamly.pdf" ``` ## Checking for broken symlinks The following command - when run from the repository root - finds all the broken symlinks in the project. ``` find . -xtype l \ -not -path "./dist*" \ -not -path "./.stack*" \ -not -path "./benchmark/bench-report/dist*" ``` ## Checking for broken links in the markdown files You can use [markdown-link-check](https://github.com/tcort/markdown-link-check) for finding broken links. You can run it using `npx` like so: ``` npx markdown-link-check ``` The following command - when run from the repository root - finds all the markdown files that we need to consider: ``` find . -type f -name "*.md" \ -not -path "./dist*" \ -not -path "./.stack*" \ -not -path "./benchmark/bench-report/dist*" ``` The following executes the markdown link checking with proper configuration on the results. This also checks all the links are absolute to the root of the repo. ``` find . -type f -name "*.md" \ -not -path "./dist*" \ -not -path "./.stack*" \ -not -path "./benchmark/bench-report/dist*" \ -exec npx markdown-link-check -c /dev/stdin --quiet {} \; <&1 | grep -e "http" -e "ftp" -e "mailto" ```