LIME (Local Interpretable Model-agnostic Explanations)
Overview
LIME (Local Interpretable Model-agnostic Explanations) is a technique designed to explain the predictions of any machine learning model by approximating it locally with an interpretable model. This implementation integrates seamlessly with the DataFrame library and can explain predictions from any model that accepts DataFrame inputs.
While complex machine learning models like Random Forests or Neural Networks can achieve high accuracy, they often function as "black boxes" that don't reveal why they made specific predictions. LIME addresses this problem by creating simple, interpretable explanations for individual predictions.
How LIME Works
LIME works on the principle that while a model may be globally complex, its behavior can be approximated locally with a simpler model. For a specific prediction, LIME follows these steps:
- Generates perturbed samples around the instance being explained
- Gets predictions from the original "black box" model for these samples
- Weights samples based on their proximity to the original instance
- Fits a simple, interpretable model (like linear regression) to this local dataset
- Extracts feature weights from this interpretable model as the explanation
// Conceptual overview of the LIME algorithm:
1. Given an instance x to explain and a black box model f:
- Generate perturbed samples z around x
- Get model predictions f(z) for these samples
- Weight samples based on proximity to x: w = exp(-d(x,z)²/width²)
- Fit an interpretable model g to minimize Σw·(f(z)-g(z))²
- Extract feature weights from g as the explanation
// This gives us a local explanation of how the model behaves around x
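To make these steps concrete, here is a minimal, self-contained C++ sketch of the same idea applied to a toy black-box function. It is illustrative only and does not use the ml::Lime class; the function f, the sample count, and all constants are invented for the example.
// Minimal, self-contained sketch of the LIME idea on a toy black-box function.
// Illustrative only; this is not the ml::Lime implementation.
#include <array>
#include <cmath>
#include <iostream>
#include <random>

int main() {
    // "Black-box" model: a nonlinear function of two features.
    auto f = [](double a, double b) { return std::sin(a) + a * b * b; };

    const std::array<double, 2> x0 = {1.0, 0.5};  // instance to explain
    const double kernel_width = 0.75;
    const int num_samples = 5000;

    std::mt19937 rng(42);
    std::normal_distribution<double> noise(0.0, 0.3);

    // Accumulate the weighted normal equations A * beta = rhs,
    // where beta = (weight_1, weight_2, intercept).
    double A[3][3] = {{0.0}};
    double rhs[3] = {0.0};
    for (int s = 0; s < num_samples; ++s) {
        const double z0 = x0[0] + noise(rng);
        const double z1 = x0[1] + noise(rng);
        const double d2 = (z0 - x0[0]) * (z0 - x0[0]) + (z1 - x0[1]) * (z1 - x0[1]);
        const double w = std::exp(-d2 / (kernel_width * kernel_width));  // proximity weight
        const double y = f(z0, z1);                                      // black-box prediction
        const double phi[3] = {z0, z1, 1.0};
        for (int i = 0; i < 3; ++i) {
            rhs[i] += w * phi[i] * y;
            for (int j = 0; j < 3; ++j) A[i][j] += w * phi[i] * phi[j];
        }
    }

    // Solve the 3x3 system by Gaussian elimination (A is symmetric positive definite).
    double beta[3];
    for (int i = 0; i < 3; ++i) {
        for (int k = i + 1; k < 3; ++k) {
            const double m = A[k][i] / A[i][i];
            for (int j = i; j < 3; ++j) A[k][j] -= m * A[i][j];
            rhs[k] -= m * rhs[i];
        }
    }
    for (int i = 2; i >= 0; --i) {
        double sum = rhs[i];
        for (int j = i + 1; j < 3; ++j) sum -= A[i][j] * beta[j];
        beta[i] = sum / A[i][i];
    }

    // beta[0] and beta[1] are the local feature weights: the explanation.
    std::cout << "feature_1 weight: " << beta[0] << std::endl;
    std::cout << "feature_2 weight: " << beta[1] << std::endl;
    return 0;
}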
Class Definition
namespace ml {
/**
* @brief LIME (Local Interpretable Model-agnostic Explanations) implementation
* for explaining individual predictions of any black-box model.
*/
class Lime {
public:
/**
* @brief Construct a Lime explainer
*
* @param training_data The training data used for generating explanations
* @param target_column The name of the target column in the training data
* @param categorical_features Optional set of feature names that should be treated as categorical
* @param kernel_width Width of the exponential kernel (default: 0.75)
* @param verbose Whether to print progress information (default: false)
*/
Lime(const df::Dataframe& training_data,
const std::string& target_column,
const std::set<std::string>& categorical_features = {},
double kernel_width = 0.75,
bool verbose = false);
/**
* @brief Generate an explanation for a specific instance
*
* @param instance The instance to explain
* @param predict_fn A function that takes a Dataframe of samples and returns predictions
* @param num_features Number of features to include in the explanation (default: 5)
* @param num_samples Number of samples to generate for local approximation (default: 5000)
* @return std::vector<std::pair<std::string, double>> Feature names and their weights in the explanation
*/
std::vector<std::pair<std::string, double>> explain(
const df::Dataframe& instance,
std::function<df::Serie<double>(const df::Dataframe&)> predict_fn,
size_t num_features = 5,
size_t num_samples = 5000);
// Additional methods for advanced usage...
};
// Utility function to create a LIME explainer
inline Lime create_lime_explainer(
const df::Dataframe& training_data,
const std::string& target_column,
const std::set<std::string>& categorical_features = {}) {
return Lime(training_data, target_column, categorical_features);
}
} // namespace ml
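As a quick illustration, assuming training_data is an already-loaded df::Dataframe with a "target" column, the convenience helper can be used like this (a sketch, not a complete program; the feature names are illustrative):
// Sketch: constructing an explainer with the convenience helper.
std::set<std::string> categorical_features = {"region", "color"};  // illustrative feature names
ml::Lime explainer = ml::create_lime_explainer(training_data, "target", categorical_features);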
Parameters
Constructor
| Parameter | Type | Description |
|---|---|---|
| training_data | const df::Dataframe& | The training data used to understand feature distributions. This helps in generating meaningful perturbations. |
| target_column | const std::string& | The name of the target column in the training data. |
| categorical_features | const std::set<std::string>& | Set of feature names that should be treated as categorical. Optional (default: empty set). |
| kernel_width | double | Width of the exponential kernel used for weighting perturbed samples. A smaller value makes the explanation more local, while a larger value considers a broader area. Optional (default: 0.75). |
| verbose | bool | Whether to print progress information during explanation generation. Optional (default: false). |
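Since kernel_width and verbose are positional parameters, explainers with different degrees of locality can be constructed as sketched below (training_data, the "species" target, and categorical_features are taken from the example later on this page; the width values are illustrative):
// Sketch: kernel_width controls how local the explanation is.
ml::Lime local_explainer(training_data, "species", categorical_features, 0.25);       // tighter neighborhood
ml::Lime broad_explainer(training_data, "species", categorical_features, 2.0, true);  // broader neighborhood, verbose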
explain() Method
| Parameter | Type | Description |
|---|---|---|
| instance | const df::Dataframe& | The specific instance (single row) to explain. |
| predict_fn | std::function<df::Serie<double>(const df::Dataframe&)> | A prediction function that takes a Dataframe of samples and returns predictions as a Serie. |
| num_features | size_t | Number of features to include in the explanation. Optional (default: 5). |
| num_samples | size_t | Number of perturbed samples to generate for the local approximation. More samples may give more stable explanations but take longer to process. Optional (default: 5000). |
Return Value
The explain() method returns a std::vector<std::pair<std::string, double>> containing feature names and their corresponding weights in the explanation. Features are sorted by importance (magnitude of weight), but the actual values indicate the direction and magnitude of influence:
- Positive weights: The feature increases the predicted value (or likelihood of the predicted class)
- Negative weights: The feature decreases the predicted value (or likelihood of the predicted class)
- Magnitude: The absolute value indicates how strongly the feature influences the prediction
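As one way to consume this return value, the sketch below splits the explanation into supporting and opposing features. It assumes an explanation vector obtained as in the Example Usage section, plus the usual <iostream> and <vector> includes.
// Sketch: split the returned (feature, weight) pairs by the sign of their weight.
std::vector<std::pair<std::string, double>> supporting, opposing;
for (const auto& [feature, weight] : explanation) {
    (weight >= 0.0 ? supporting : opposing).emplace_back(feature, weight);
}
std::cout << "Evidence for the prediction:" << std::endl;
for (const auto& [feature, weight] : supporting) {
    std::cout << "  " << feature << ": " << weight << std::endl;
}
std::cout << "Evidence against the prediction:" << std::endl;
for (const auto& [feature, weight] : opposing) {
    std::cout << "  " << feature << ": " << weight << std::endl;
}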
Understanding the Output
When you analyze the results of a LIME explanation, you're seeing a window into why your model made a specific prediction. A typical LIME result contains:
- Features: The input variables that influenced the prediction
- Weights/Coefficients: Numerical values indicating the strength and direction of each feature's influence
For example, you might see something like:
Feature Weight
----------------------------------------
sepal_length 0.3542
petal_length 0.2874
categorical=setosa 0.1953
petal_width -0.1421
sepal_width -0.0895
Key Interpretation Points
Direction of Influence
- Positive weights indicate that the feature pushed the prediction toward the predicted class
- Negative weights indicate that the feature pushed the prediction away from the predicted class
Magnitude of Influence
- Larger absolute values (whether positive or negative) indicate stronger influence
- Features are typically sorted by their importance (absolute weight)
Feature Types
- Original numerical features appear as-is (sepal_length)
- Categorical features appear with specific values (categorical=setosa)
Context Matters
- The explanation is only valid for this specific prediction and nearby instances
- Different instances may have different explanations even for the same predicted class
Practical Example
Let's say you're predicting iris flower species and LIME gives you these results for a "virginica" prediction:
petal_length 2.3541 (Long petals → supports virginica)
petal_width 1.8724 (Wide petals → supports virginica)
sepal_length 0.4532 (Long sepals → slightly supports virginica)
sepal_width -0.3217 (Wide sepals → slightly against virginica)
This tells you:
- Petal dimensions are the most important factors for this prediction
- Long, wide petals strongly indicate virginica
- Sepal dimensions play a smaller role
- Wide sepals actually provide evidence against this being virginica
Common Interpretation Scenarios
For Classification Tasks
- Dominant Features: Look for features with significantly larger weights than others—these are driving the prediction
- Feature Conflicts: Features with opposite signs show competing evidence that the model is weighing
- Surprising Influences: Features you wouldn't expect to be important might reveal model biases or data issues
For Regression Tasks
- Direction of Impact: Positive weights increase the predicted value; negative weights decrease it
- Relative Importance: Compare magnitudes to understand which features have the biggest impact on the prediction
Best Practices
- Compare Multiple Instances: Look at explanations across several examples to identify patterns (see the sketch after this list)
- Sanity Check: Verify if the explanations align with domain knowledge
- Look for Biases: If unexpected features have high importance, investigate potential biases in your training data
- Consider Feature Interactions: LIME examines features independently, so be aware that it might miss complex interactions
- Use Alongside Other Methods: Combine with global explanations (like feature importance or partial dependence plots) for a more complete picture
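One rough way to compare explanations across several instances is to average the absolute weight each feature receives. The sketch below assumes the test_data, get_row, predict_fn, and lime_explainer names from the Example Usage section, plus <map> and <cmath>; the number of instances is illustrative.
// Sketch: average the absolute weight each feature receives over several instances.
std::map<std::string, double> total_abs_weight;
std::map<std::string, std::size_t> appearances;
const std::size_t num_instances = 10;  // number of test rows to explain (illustrative)
for (std::size_t i = 0; i < num_instances; ++i) {
    df::Dataframe row = get_row(test_data, i);
    auto row_explanation = lime_explainer.explain(row, predict_fn, 5, 1000);
    for (const auto& [feature, weight] : row_explanation) {
        total_abs_weight[feature] += std::abs(weight);
        appearances[feature] += 1;
    }
}
for (const auto& [feature, total] : total_abs_weight) {
    std::cout << feature << " mean |weight|: " << total / appearances[feature] << std::endl;
}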
What LIME Doesn't Tell You
- Global Behavior: LIME only explains individual predictions, not the model's overall behavior
- Feature Interactions: Complex interactions between features may not be fully captured
- Causality: LIME shows correlation, not causation—high importance doesn't mean changing that feature will necessarily change the prediction
- Confidence: The explanation itself doesn't indicate how confident the model is in its prediction
Taking Action on LIME Results
- Model Improvement: If important features don't make sense, consider retraining with different features
- Feature Engineering: Insights about important features can guide creation of new, more informative features
- Detecting Biases: Consistently high importance for sensitive attributes may indicate bias
- Domain Validation: Use explanations to verify that the model's reasoning aligns with domain knowledge
By carefully analyzing LIME results, you can gain valuable insights into your model's decision-making process and use this understanding to improve both your model and your trust in its predictions.
Example Usage
#include <iostream>
#include <set>
#include <string>
#include <functional>
// Plus the DataFrame/ML library headers that provide df::Dataframe, df::io::read_csv,
// ml::RandomForest, and ml::Lime (exact header paths depend on your installation).
int main() {
// Load data and train a model
df::Dataframe data = df::io::read_csv("iris.csv");
// Define categorical features
std::set<std::string> categorical_features;
categorical_features.insert("species");
// Split data into training and test sets (80/20 split)
// ... [code to split data] ...
// Create and train a random forest classifier
ml::RandomForest rf = ml::create_random_forest_classifier(
100, // num_trees
3, // n_classes (3 species for iris)
0, // max_features (auto)
10, // max_depth
2 // min_samples_split
);
rf.fit(train_data, "species");
// Create a LIME explainer
ml::Lime lime_explainer(train_data, "species", categorical_features);
// Choose an instance to explain (e.g., first test instance)
df::Dataframe instance = get_row(test_data, 0);
// Create a prediction function for the random forest
auto predict_fn = [&rf](const df::Dataframe& samples) {
return rf.predict(samples);
};
// Get the explanation
auto explanation = lime_explainer.explain(
instance,
predict_fn,
4, // Show top 4 features
1000 // Use 1000 samples
);
// Display the explanation
std::cout << "Instance predicted as: " << rf.predict(instance)[0] << std::endl;
std::cout << "LIME Explanation:" << std::endl;
for (const auto& [feature, weight] : explanation) {
std::cout << feature << ": " << weight << std::endl;
}
return 0;
}
// Example using LIME with a custom model (not just RandomForest)
// This could be any model that can produce predictions for DataFrame inputs
// Define a custom model's prediction function
auto custom_model_predict = [&your_model](const df::Dataframe& samples) {
// Process the samples with your custom model
// ...
// Return predictions as a Serie
return your_model.predict(samples);
};
// Create a LIME explainer
ml::Lime explainer(training_data, "target", categorical_features);
// Get explanation for an instance
auto explanation = explainer.explain(
instance_to_explain,
custom_model_predict,
5, // num_features
2000 // num_samples
);
// Print explanation
for (const auto& [feature, weight] : explanation) {
std::cout << feature << ": " << weight << std::endl;
}
Handling Different Data Types
LIME treats numerical and categorical features differently to generate meaningful perturbations:
Numerical Features
- Perturbation: Adds Gaussian noise based on the feature's distribution in the training data
- Distance Calculation: Uses normalized squared difference, accounting for the feature's variance
- Interpretable Model: Used directly as numeric values in the linear model
Categorical Features
- Perturbation: Randomly samples from the set of possible values seen in the training data
- Distance Calculation: Uses a simple 0/1 distance metric (same/different)
- Interpretable Model: Converted to binary features (one-hot encoding minus one category to avoid collinearity)
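For intuition only, a single perturbed value for each feature type could be drawn roughly as follows; this is an illustrative sketch, not the library's internal code:
// Sketch: drawing one perturbed value per feature type (illustration only).
#include <random>
#include <string>
#include <vector>

std::mt19937 rng(0);

// Numerical feature: add Gaussian noise scaled by the feature's standard deviation,
// which would be estimated from the training data.
double perturb_numeric(double value, double feature_stddev) {
    std::normal_distribution<double> noise(0.0, feature_stddev);
    return value + noise(rng);
}

// Categorical feature: resample from the set of values seen in the training data.
std::string perturb_categorical(const std::vector<std::string>& seen_values) {
    std::uniform_int_distribution<std::size_t> pick(0, seen_values.size() - 1);
    return seen_values[pick(rng)];
}
The ml::Lime class handles all of this internally; you only need to declare which features are categorical, as shown below.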
// When creating a LIME explainer, specify which features are categorical
std::set<std::string> categorical_features;
categorical_features.insert("species");
categorical_features.insert("region");
categorical_features.insert("color");
// The explainer will handle these features appropriately
ml::Lime explainer(data, "target", categorical_features);
// In the explanation, categorical features will appear as "feature=value"
// For example: "color=red" with a positive or negative weight
Implementation Notes
- Memory Efficiency: The implementation generates perturbed samples on demand rather than storing all training data in memory.
- Performance: The most computationally intensive part is generating and predicting perturbed samples, so the num_samples parameter controls the trade-off between explanation quality and computation time.
- Regularization: The interpretable model uses a small regularization term to handle potential collinearity in the feature matrix (see the sketch after this list).
- Sampling: For numerical features, the standard deviation is estimated from the training data, making perturbations adaptive to each feature's scale and distribution.
- Missing Values: The current implementation does not explicitly handle missing values. Ensure your data is properly pre-processed before using LIME.
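In terms of the toy surrogate fit sketched in the overview above, such a regularization term amounts to adding a small constant to the diagonal of the weighted normal equations before solving; the value of lambda here is purely illustrative.
// Sketch: ridge-style regularization of the weighted normal equations
// (A and rhs as in the toy LIME sketch near the top of this page).
const double lambda = 1e-3;                      // small, illustrative regularization strength
for (int i = 0; i < 3; ++i) A[i][i] += lambda;   // stabilizes the solve when features are collinear
// ... then solve A * beta = rhs as before.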
Applications
LIME is particularly valuable in scenarios where understanding model decisions is crucial:
Model Debugging
Explanations can reveal when models are making predictions based on spurious correlations or biases in the training data. If the features with high importance don't make sense for the problem domain, it may indicate issues with the model.
Trust and Transparency
In domains like healthcare, finance, or legal applications, stakeholders need to understand why a model made a specific prediction. LIME provides simple, interpretable explanations that non-technical users can understand.
Feature Engineering
By examining what features are most important across multiple explanations, you can gain insights for feature engineering or feature selection in subsequent modeling iterations.
Regulatory Compliance
In regulated industries, there may be requirements to explain automated decisions. LIME can help satisfy these requirements by providing transparent explanations for individual predictions.