Skip to main content

Introduction

AI Observability with whylogs and WhyLabs#

WhyLabs is an observability platform designed to monitor data pipelines and ML applications for data quality regressions, data drift and model performance degradation. Built on top of an open-source package called whylogs, the platform enables AI builders to:

  • Set up in minutes: Begin generating statistical profiles of any dataset using whylogs, the lightweight open-source library.

  • Upload dataset profiles to the WhyLabs platform for centralized and customizable monitoring/alerting of model inputs, outputs, and performance.

  • Integrate seamlessly: interoperable with any ML infrastructure and framework. Generate real-time insights in your existing data flow. See more about our integrations here.

  • Scale to terabytes: handle your large-scale data, keeping compute requirements low. Integrate with either batch or streaming data pipelines.

  • Maintain data privacy: WhyLabs relies statistical profiles created via whylogs so your actual data never leaves your environment!

Quick Start#

The whylogs logging agent is the easiest way to enable logging, testing, and monitoring in an ML/AI application. The lightweight agent profiles data in real-time, collecting thousands of metrics from structured data, unstructured data, and ML model predictions with zero configuration.

First, install whylogs:

pip install whylogs

Then, start logging statistical properties of features, model inputs, and model outputs to enable explorative analysis, data unit testing, and monitoring. Getting whylogs up-and-running is easy:

from whylogs import get_or_create_session
import pandas as pd
session = get_or_create_session()
df = pd.read_csv("path/to/file.csv")
with session.logger(dataset_name="my_dataset") as logger:
#dataframe
logger.log_dataframe(df)
#dict
logger.log({"name": 1})
#images
logger.log_images("path/to/image.png")

Onboarding to WhyLabs#

With whylogs integrated into your workflow, the next step is to onboard with the WhyLabs SaaS Platform to monitor model inputs, outputs, and performance. New users can get started for free! Onboarding only takes a few minutes, so please contact us to request an account.

For more details on how to quickly get value out of whylogs/WhyLabs, visit our onboarding page!

Learn More About whylogs - Open Source Logging Agent#

  • whylogs provides lightweight data collection, enterprise scalability, and flexibility designed for data science
  • workflows. It has built-in data tagging and aggregation capabilities. Furthermore, the installation to take minutes and
  • seamlessly integrate with existing tools. You can read an in-depth overview about whylogs.

Wondering if the whylogs is a good fit for your use case? Check out our use cases section or join our Slack channel.

Contribute to whylogs?#

Check out the whylogs contribution process and Code of Conduct to get started.

Choose between the whylogs Python GitHub repo or the whylogs Java GitHub repo for your contributions.

Brainstorm ideas and share feedback with the whylogs community members on Slack!

Resources#

Prefooter Illustration Mobile
Run AI With Certainty
Get started for free
Prefooter Illustration