Find broken links in text documents
Similar idea to awesome_bot, but with different output options.
Currently only supports http:// and https:// prefixed URLs
Binaries for Mac and Linux are available. Add the binary to a directory in your path (such as /usr/local/bin).
To build the latest version:
docker build -t brok https://github.com/smallhadroncollider/brok.gitTo run brök:
docker run brokIf you have cabal installed:
cabal install brokMake sure you run cabal update if you haven't run it recently.
Requirements: Stack
The following command will build brök and then install it in ~/.local/bin:
stack build && stack installCheck all links in a single text file:
brok test.mdOr in multiple files:
brok test.md links.texIf you're using this as part of a test suite, you probably only need the errors:
brok text.md links.tex > /dev/null
By default brök will cache successes for a day in a .brokdb file. It will always recheck errors.
If you want to adjust the cache length, you can enter the number of seconds after which the cache invalidates:
# cache for a week
brok --cache 604800 test.md links.texIf you want to avoid creating the .brokdb file or ignore the cache entirely you can use the --no-cache option:
# do not cache results
# and don't use previously generated cache
brok --no-cache test.md links.texMost browsers will display a website even if it has certificate issues (such as an incomplete certificate chain). By default Brök will not check certificates, so replicate this behaviour.
If you would like to enforce certificate checking, you can turn this on:
brok --check-certs test.mdAny sites with certificate issues will then return a Could not connect error.
You can tell brök to ignore URLs with specified prefixes:
# ignore facebook and amazon URLS
brok --ignore "http://facebook.com" "http://amazon.com" test.md links.texBy default brök waits for 100ms between URL checks. You can change the delay:
# wait for 1 second between checks
brok --interval 1000 test.md links.texIf you want to see what's going on, but you're not interested in successes, then you can use the --only-failures option:
# see what's going on, but only show failures
brok --only-failures test.md links.texIf you're using brök as part of a script then you should redirect stdout.
By default the output uses bash colour codes. You can turn this off using the --no-color setting.
If you want to check all the links in your Git repo are valid before being able to commit then add something like the following to .git/hooks/pre-commit.
#! /bin/bash
# cache for 1 week
# use find to check all *.md files
# only show errors (if there are any)
brok --cache 604800 $(find . -type f -name "*.md") > /dev/null#! /bin/zsh
# cache for 1 week
# using a zsh glob to check all *.md files
# only show errors (if there are any)
brok --cache 604800 */**/*.md > /dev/null