Giskard-AI
diff --git a/‎README.md‎
Lines changed: 132 additions & 79 deletions b/‎README.md‎
Lines changed: 132 additions & 79 deletions
diff --git a/‎readme/architechture_giskard.png‎
1.59 MB b/‎readme/architechture_giskard.png‎
1.59 MB
diff --git a/‎readme/scan_example.gif‎
2.05 MB b/‎readme/scan_example.gif‎
2.05 MB
diff --git a/‎readme/suite_example.png‎
83.9 KB b/‎readme/suite_example.png‎
83.9 KB
@@ -11,8 +11,8 @@
  <a href="https://github.com/Giskard-AI/giskard/blob/main/LICENSE">
      <img alt="GitHub" src="https://img.shields.io/badge/License-Apache_2.0-blue.svg">
  </a>
-  <a href="https://github.com/Giskard-AI/giskard/actions/workflows/build.yml?query=branch%3Amain">
-    <img alt="build" src="https://github.com/Giskard-AI/giskard/actions/workflows/build.yml/badge.svg?branch=main"/>
+  <a href="https://github.com/Giskard-AI/giskard/actions/workflows/build_backend.yml?query=branch%3Amain">
+    <img alt="build" src="https://github.com/Giskard-AI/giskard/actions/workflows/build_backend.yml/badge.svg?branch=main"/>
  </a>
   <a href="https://sonarcloud.io/summary/new_code?id=giskard">
     <img alt="build" src="https://sonarcloud.io/api/project_badges/measure?project=giskard&metric=alert_status"/>
@@ -31,72 +31,59 @@
  </h3>
 <br />
 
-## Table of contents
-* 🐢 **[Why Giskard?](#why-giskard)**
-* 📗 **[Getting started](#getting-started)**
-  - [Install our Python library and testing server](#installation)
-  - [Scan your model to detect vulnerabilities](#scan-your-model-to-detect-vulnerabilities)
-  - [Automatically generate a test suite](#automatically-generate-a-test-suite-based-on-the-scan-results)
-  - [Upload your test suite to the Giskard server](#upload-your-test-suite-to-the-giskard-server)
-* 👋 **[How to contribute](#how-to-contribute)**
-* 💖 **[Like what we're doing?](#like-what-were-doing)**
-
-
-## Why Giskard?
-**Giskard is an open-source testing framework dedicated to ML models, from tabular models to LLMs.**
-
-Testing Machine Learning applications can be tedious. Since ML models depend on data, testing scenarios depend on the domain specificities and are often infinite. 
+## Install Giskard 🐢
+You can install the latest version of Giskard from PyPi using pip :
+```sh
+pip install "giskard[server]>=2.0.0b" -U
+```
+We officially support Python 3.8, 3.9 and 3.10.
+## Try in Colab 📙
+[Open Colab notebook](https://colab.research.google.com/github/giskard-ai/giskard/blob/main/python-client/docs/getting-started/quickstart.ipynb)
 
-<p align="center">
-<strong>Where to start testing? Which tests to implement? What issues to cover? How to implement the tests?</strong>
-</p>
+______________________________________________________________________
 
 <p align="center">
-  <img src="https://giskard.readthedocs.io/en/latest/_images/hey.png" alt="hey" width="20%">
+  <img src="readme/architechture_giskard.png" alt="Giskard Architechture" width="800">
 </p>
 
-At Giskard, we believe that Machine Learning needs its own testing framework. Created by ML engineers for ML engineers, Giskard enables you to:
+Giskard is a Python library that uses a variety of techniques to **detect vulnerabilities**, including:
 
-- **Scan your model to find dozens of hidden vulnerabilities**: The Giskard scan automatically detects vulnerability issues such as performance bias, data leakage, unrobustness, spurious correlation, overconfidence, underconfidence, unethical issue, etc.
+- **Data slicing and transformation**: Giskard can automatically generate different data slices and transformations to test the robustness of your model.
+- **Statistical analysis**: Giskard can use statistical analysis to identify patterns and relationships in your data that could indicate a vulnerability.
 
+ It's a powerful tool that helps data scientists **save time and effort** drilling down on model issues, and produce more **reliable and trustworthy models**.
+ 
 <p align="center">
-  <img src="readme/scan_example.png" alt="Scan Example" width="700px">
+  <img src="readme/scan_example.gif" alt="Scan Example" width="800">
 </p>
 
-- **Instantaneously generate domain-specific tests**: Giskard automatically generates relevant tests based on the vulnerabilities detected by the scan. You can easily customize the tests depending on your use case by defining domain-specific data slicers and transformers as fixtures of your test suites.
+Instantaneously generate test suites for your models ⤵️
 
 <p align="center">
-  <img src="readme/test_suite_example.png" alt="Scan Example" width="700px">
+  <img src="readme/suite_example.png" alt="Test Suite Example" width="800">
 </p>
 
-- **Leverage the Quality Assurance best practices of the open-source community**: The Giskard catalog enables you to easily contribute and load data slicing & transformation functions such as AI-based detectors (toxicity, hate, etc.), generators (typos, paraphraser, etc.), or evaluators. Inspired by the Hugging Face philosophy, the aim of Giskard is to become the open-source hub of ML Quality Assurance.
 
-<p align="center">
-  <img src="readme/catalog_example.png" alt="Scan Example" width="700px">
-</p>
+Giskard works with any model, any environment and integrates seamlessly with your favorite tools ⤵️ <br/>
 
-And of course, Giskard works with any model, any environment and integrates seamlessly with your favorite tools ⤵️ <br/>
 <p align="center">
   <img width='600' src="readme/tools.png">
 </p>
 <br/>
 
 
+# Contents
 
-## Getting started
-
-### Installation
-```sh
-pip install "giskard[server]>=2.0.0b" -U
-
-giskard server start
-```
+1. 🤸‍♀️ **[Quickstart](#%EF%B8%8F-quickstart)**
+2. ⭐️ **[Premium features](#%EF%B8%8F-premium-features)**
+3. ❓ **[FAQ](#-faq)**
+4. 👋 **[Community](#-community)**
 
-That's it. Access at http://localhost:19000
 
-### Scan your model to detect vulnerabilities
+# 🤸‍♀️ Quickstart
 
-After having wrapped your [model](https://docs.giskard.ai/en/latest/guides/wrap_model/index.html) & [dataset](https://docs.giskard.ai/en/latest/guides/wrap_dataset/index.html), you can scan your model for vulnerabilities using:
+## 1. 🔎 Scan your model
+Here's an example of Giskard scan on the famous titanic survival prediction dataset:
 
 ```python
 import giskard
@@ -105,7 +92,7 @@ import giskard
 df = giskard.demo.titanic_df()
 demo_demo_data_processing_function, demo_sklearn_model = giskard.demo.titanic_pipeline()
 
-# Wrap your Pandas DataFrame with Giskard.Dataset (test set, a golden dataset, etc.). Check the dedicated doc page: https://docs.giskard.ai/en/latest/guides/wrap_dataset/index.html
+# Wrap your Pandas DataFrame with Giskard.Dataset (test set, a golden dataset, etc.).
 giskard_dataset = giskard.Dataset(
     df=df,  # A pandas.DataFrame that contains the raw data (before all the pre-processing steps) and the actual ground truth variable (target).
     target="Survived",  # Ground truth variable
@@ -122,74 +109,140 @@ def prediction_function(df):
     return demo_sklearn_model.predict_proba(preprocessed_df)
 
 giskard_model = giskard.Model(
-    model=prediction_function,  # A prediction function that encapsulates all the data pre-processing steps and that could be executed with the dataset used by the scan.
+    model=demo_model,  # A prediction function that encapsulates all the data pre-processing steps and that could be executed with the dataset used by the scan.
     model_type="classification",  # Either regression, classification or text_generation.
     name="Titanic model",  # Optional
     classification_labels=demo_sklearn_model.classes_,  # Their order MUST be identical to the prediction_function's output order
     feature_names=['PassengerId', 'Pclass', 'Name', 'Sex', 'Age', 'SibSp', 'Parch', 'Fare', 'Embarked'],  # Default: all columns of your dataset
-    # classification_threshold=0.5,  # Default: 0.5
 )
 
-# Then apply the scan
-results = giskard.scan(giskard_model, giskard_dataset)
 ```
 
+✨✨✨Then run giskard's magical scan✨✨✨
+```python
+results = giskard.scan(giskard_model, giskard_dataset)
+```
 Once the scan completes, you can display the results directly in your notebook:
 
 ```python
-display(scan_results)  # in your notebook
+display(scan_results)
 ```
+*If you're facing issues, check out our wrapping [model](https://docs.giskard.ai/en/latest/guides/wrap_model/index.html) & [dataset](https://docs.giskard.ai/en/latest/guides/wrap_dataset/index.html) docs for more information.*
+## 2. 🪄 Automatically generate a test suite
 
-### Automatically generate a test suite based on the scan results
-
-If the scan found potential issues in your model, you can automatically generate a test suite.
-
-Generating a test suite from your scan results will enable you to:
-- Turn the issues you found into actionable tests that you can directly integrate in your CI/CD pipeline
-- Diagnose your vulnerabilities and debug the issues you found in the scan
+If the scan found potential issues in your model, you can automatically generate a **test suite** based on the vulnerabilities found:
 
 ```python
 test_suite = scan_results.generate_test_suite("My first test suite")
-
-# You can run the test suite locally to verify that it reproduces the issues
+```
+You can then run the test suite locally to verify that it reproduces the issues:
+```python
 test_suite.run()
 ```
 
-### Upload your test suite to the Giskard server
+Test suites are reusable objects that provide a way to apply consistent checks on your models. To drill down on failing tests and get even more out of the giskard library, we recommend heading over to the Giskard server ⤵️
 
-You can then upload the test suite to the local Giskard server. This will enable you to:
-- Compare the quality of different models to decide which one to promote
-- Debug your tests to diagnose the identified issues
-- Create more domain-specific tests relevant to your use case
-- Share results, and collaborate with your team to integrate business feedback
+# ⭐️ Premium Features
 
-First, install the Giskard server by following [this documentation](https://docs.giskard.ai/en/latest/guides/installation_app/index.html)
+The Giskard server is Giskard's premium offering. It provides a number of additional capabilities that are not available in the open-source version of Giskard, including:
 
-```python
-# Create a Giskard client after having installed the Giskard server (see documentation)
-token = "API_TOKEN"  # Find it in Settings in the Giskard server
-client = GiskardClient(
-    url="http://localhost:19000", token=token  # URL of your Giskard instance
-)
+- **Advanced test generation**: This includes the ability to to diagnose failing tests, debug your models and create more domain-specific tests.
+- **Model comparison**: This includes the ability to compare models in order to decide which one to promote,
+- **Test hub**: This includes a place to gather all of your team's tests in one place to collaborate more efficiently.
+- **Business feedback**: This includes the ability to share your results and collect business feedback from your team.
 
-my_project = client.create_project("my_project", "PROJECT_NAME", "DESCRIPTION")
+If you are interested in learning more about Giskard's premium offering, please [contact us](https://www.giskard.ai/contact).
 
-# Upload to the current project
-test_suite.upload(client, "my_project")
+<p align="center">
+  <img src="readme/catalog_example.png" alt="Scan Example" width="700px">
+</p>
+
+## 1. Start the Giskard server
 
+To start the **Giskard server**, run the following command: 
+```sh
+giskard server start
 ```
+
+🚀 That's it! Access it at http://localhost:19000
+
+## 2. Upload your test suite to the Giskard server
+
+You can then **upload the test suite** created using the `giskard` Python library to the Giskard server. This will enable you to:
+- Compare the quality of different models to decide which one to promote
+- Debug your tests to diagnose identified vulnerabilities
+- Create more domain-specific tests relevant to your use-case
+- Share results, and collaborate with your team to integrate business feedback
+
+1. First, make sure Giskard server is installed 
+    <details>
+      <summary>How to check if the Giskard server is running</summary>
+      
+      - check if http://localhost:19000 is running
+      - or use `giskard server status`
+    </details>
+
+2. Then execute the ML worker in your notebook:
+    ```python
+       !giskard worker start -d -k YOUR_TOKEN
+    ```
+
+3. Finally upload your test suite to the giskard server using the following code:
+    ```python
+    token = "API_TOKEN"  # Find it in Settings in the Giskard server
+    client = GiskardClient(
+        url="http://localhost:19000", token=token  # URL of your Giskard instance
+    )
 
-For more information on uploading to your local Giskard server, go to the [Upload an object to the Giskard server](https://docs.giskard.ai/en/latest/guides/upload/index.html) page.
+    my_project = client.create_project("my_project", "PROJECT_NAME", "DESCRIPTION")
+    
+    # Upload to the current project
+    test_suite.upload(client, "my_project")
+    ```
 
-## How to contribute
-We welcome contributions from the Machine Learning community!
+# ❓ Where can I get more help?
 
-Read this [guide](CONTRIBUTING.md) to get started.
 
-<br />
+<details>
+  <summary>What is a ML worker?</summary>
+
+  Giskard executes your model using a worker that runs the model directly in your Python environment containing all the dependencies required by your model. You can either execute the ML worker from a local notebook, a Colab notebook or a terminal. 
+  </details>
+
+<details>
+  <summary>How to get the API key</summary>
+  
+  Access the API key in the Settings tab of the Giskard server.
+</details>
+
+<details>
+  <summary>If Giskard server/ML worker is not installed</summary>
+
+  Go to the [Run the Giskard Server](https://docs.giskard.ai/en/latest/guides/installation_app/index.html) page.
+</details>
+
+<details>
+  <summary>If Giskard server is installed on an external server</summary>
 
-## Like what we're doing?
+  ```python
+    !giskard worker start -d -k YOUR_TOKEN -u http://ec2-13-50-XXXX.compute.amazonaws.com:19000/
+  ```
+</details>
+
+<details>
+  <summary>For more information on uploading to your local Giskard server</summary>
+
+  Go to the [Upload an object to the Giskard server](https://docs.giskard.ai/en/latest/guides/upload/index.html) page.
+</details>
+
+For any other questions and doubts, head over to our [Discord](https://gisk.ar/discord).
+
+# 👋 Community
+We welcome contributions from the Machine Learning community! Read this [guide](CONTRIBUTING.md) to get started.
+
+Join our thriving community on our Discord server : [join Discord server](https://gisk.ar/discord)
 
 🌟 [Leave us a star](https://github.com/Giskard-AI/giskard), it helps the project to get discovered by others and keeps us motivated to build awesome open-source tools! 🌟
 
 ❤️ You can also [sponsor us](https://github.com/sponsors/Giskard-AI) on GitHub. With a monthly sponsor subscription, you can get a sponsor badge and get your bug reports prioritized. We also offer one-time sponsoring if you want us to get involved in a consulting project, run a workshop, or give a talk at your company.
+