Skip to content

DanielAvdar/Tab-right

Repository files navigation

tab-right

PyPI - Python Version version License OS OS OS Tests Code Checks codecov Ruff Last Commit

Overview

tab-right is a Python package designed to simplify the analysis of tabular data for inference models—both machine learning and non-ML. The core philosophy is that most analyses, such as segmentation strength, drift analysis, and feature predictive value, can be performed using model predictions alone, without direct access to the model itself. This approach enables powerful, model-agnostic diagnostics and interpretability, making the package easy to implement and use.

Key Features

  • Segmentation Analysis: Analyze prediction strength across different data segments to uncover model biases and subgroup performance
  • Feature Analysis: Assess feature predictive power and value to inference, using techniques like feature importance, partial dependence, and more
  • Drift Detection: Perform drift analysis and monitor changes in data or prediction distributions over time
  • Rich Visualizations: Generate comprehensive visualization reports for all analyses, supporting both interactive and static outputs
  • Model-Agnostic: Focus on data and predictions, not model internals, for maximum flexibility and simplicity

Installation

# Install from PyPI
pip install tab-right

# For development version
pip install git+https://github.com/DanielAvdar/tab-right.git

Quick Start

Here's a simple example to get you started with tab-right:

import pandas as pd
import numpy as np
from tab_right.segmentations import calc_seg

# Load your data
data = pd.DataFrame({
    'feature_1': np.random.normal(0, 1, 1000),
    'feature_2': np.random.normal(0, 1, 1000),
    'predictions': np.random.uniform(0, 1, 1000)
})

# Perform segmentation analysis
segments = calc_seg(
    df=data,
    target_col='predictions',
    max_depth=3
)

# Print segmentation results
print(segments)

Documentation

For detailed documentation and examples, visit our documentation site.

The documentation includes:

  • Comprehensive API reference
  • In-depth tutorials
  • Example notebooks
  • Best practices guide

Use Cases

  • Model Evaluation: Compare model performance across different data segments
  • Model Monitoring: Track model drift and data distribution changes over time
  • Feature Engineering: Identify which features contribute most to predictions
  • Bias Detection: Uncover potential biases in model predictions across subgroups

Contributing

Contributions are welcome! Please feel free to submit a Pull Request. For major changes, please open an issue first to discuss what you would like to change.

See CONTRIBUTING.md for contribution guidelines.

License

This project is licensed under the MIT License - see the LICENSE file for details.

Citation

If you use tab-right in a research paper, please cite it as:

@software{tab-right,
  author = {Avdar, Daniel},
  title = {tab-right: Model-Agnostic Analysis for Tabular Data},
  year = {2023},
  url = {https://github.com/DanielAvdar/tab-right}
}

Support

For questions, issues, or feature requests, please use the GitHub issue tracker.

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Contributors 4

  •  
  •  
  •  
  •