Wutches

Wutches are entities devoted to visiting web pages and checking specific values on them. Each Wutch represents a single tracker for a particular website, designed to extract a specific value from it.

Wutch Configuration

To operate, a Wutch needs to know a couple of things:

  • The full address (URL) of the page, including all the complex and often unreadable parameters added by developers.
  • The unique identifier (CSS Selector) of the element on the page to be checked.

Both of these can be specified either manually or with the help of our browser extension. You'll need these values only for the initial configuration. After that, Wutch will continue to visit the page and check for changes automatically.

Limitations

Wutch can access almost any site that your browser can. However, there are certain limitations:

  • Wutch cannot log into websites on your behalf.

    However, this feature is currently under consideration and may be available in the future.

  • Wutch is a robot, so it won't pass "Not a Robot" tests.

    Challenges like ReCaptcha are, for now, beyond our capabilities.

  • Some websites implement bot detection mechanisms to prevent automated content extraction.

    This may include detecting User-Agent and browser features to prevent unauthorized use. This is a highly dynamic field, with new methods and mechanics emerging regularly. We strive to stay ahead in this game.

  • Some websites have a highly dynamic internal structure that changes when new content is added to the site.

    This can prevent Wutch from "recognizing" the element it needs to extract content from.

Next up:

Learn how to create a new Wutch