ReHLine-Python: Efficient Solver for ERM with PLQ Loss and Linear Constraints

Fast, scalable, and scikit-learn compatible optimization for machine learning

ReHLine-Python is the official Python implementation of ReHLine, a solver for large-scale empirical risk minimization (ERM) problems with convex piecewise linear-quadratic (PLQ) loss functions and linear constraints. Built on a high-performance C++ core with seamless Python integration, ReHLine delivers exceptional speed while remaining easy to use.

See more details in the ReHLine documentation.
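
Concretely, ReHLine solves problems of the following form, in which any convex PLQ loss has been decomposed into ReLU and ReHU pieces (the composite ReLU-ReHU formulation from the NeurIPS 2023 paper):

\min_{\beta \in \mathbb{R}^d} \sum_{i=1}^{n} \sum_{l=1}^{L} \mathrm{ReLU}\left( u_{li} x_i^\top \beta + v_{li} \right) + \sum_{i=1}^{n} \sum_{h=1}^{H} \mathrm{ReHU}_{\tau_{hi}}\left( s_{hi} x_i^\top \beta + t_{hi} \right) + \frac{1}{2} \|\beta\|_2^2, \quad \text{s.t. } A\beta + b \geq 0,

where ReLU(z) = max(z, 0) and ReHU_tau(z) is its smoothed counterpart: zero for z <= 0, quadratic (z^2/2) on (0, tau], and linear (tau(z - tau/2)) beyond tau.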

✨ Key Features

  • 🚀 Blazing Fast: Linear computational complexity per iteration, scales to millions of samples
  • 🎯 Versatile: Supports any convex PLQ loss (hinge, check, Huber, and more)
  • 🔒 Constrained Optimization: Handles linear equality and inequality constraints
  • 📊 Scikit-Learn Compatible: Drop-in estimators that work with GridSearchCV and Pipeline
  • 🐍 Pythonic API: Both low-level and high-level interfaces for flexibility

📦 Installation

Quick Install

pip install rehline

🚀 Quick Start

Scikit-Learn Style API (Recommended)

ReHLine provides plq_Ridge_Classifier and plq_Ridge_Regressor that work seamlessly with scikit-learn:

from rehline import plq_Ridge_Classifier
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split, GridSearchCV
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler

# Generate dataset
X, y = make_classification(n_samples=1000, n_features=20, random_state=42)
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2)

# Simple usage
clf = plq_Ridge_Classifier(loss={'name': 'svm'}, C=1.0)
clf.fit(X_train, y_train)
print(f"Accuracy: {clf.score(X_test, y_test):.3f}")

# Use in Pipeline
pipeline = Pipeline([
    ('scaler', StandardScaler()),
    ('classifier', plq_Ridge_Classifier(loss={'name': 'svm'}))
])
pipeline.fit(X_train, y_train)

# Hyperparameter tuning with GridSearchCV
param_grid = {
    'C': [0.1, 1.0, 10.0],
    'loss': [{'name': 'svm'}, {'name': 'sSVM'}]
}
grid_search = GridSearchCV(plq_Ridge_Classifier(loss={"name": "svm"}), param_grid, cv=5)
grid_search.fit(X_train, y_train)
print(f"Best params: {grid_search.best_params_}")

See more details in ReHLine with Scikit-Learn.

Low-Level API for Custom Problems

from rehline import ReHLine
import numpy as np

# Toy data: n samples, d features, labels y in {-1, +1}, cost parameter C
n, d, C = 1000, 10, 0.5
rng = np.random.default_rng(42)
X = rng.standard_normal((n, d))
y = np.sign(rng.standard_normal(n))

clf = ReHLine()

# Custom U, V matrices encode the ReLU components of the loss
# (S, T, Tau would likewise encode ReHU components).
# These U, V reproduce the SVM hinge loss C * max(0, 1 - y_i * x_i' beta):
clf.U = -(C * y).reshape(1, -1)
clf.V = (C * np.ones(n)).reshape(1, -1)

# Custom linear constraints A @ beta + b >= 0: here, a fairness-style
# constraint |X_sen @ X @ beta| / n <= tol_sen on a sensitive feature,
# written as two inequalities
X_sen = X[:, 0]
tol_sen = 0.1
clf.A = np.repeat([X_sen @ X], repeats=[2], axis=0) / n
clf.A[1] = -clf.A[1]
clf.b = np.array([tol_sen, tol_sen])

clf.fit(X)
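
To sanity-check that these U and V encode the hinge loss, here is a standalone numeric comparison (a sketch; the data are illustrative):

import numpy as np

# Verify: sum_i ReLU(U[0, i] * (x_i @ beta) + V[0, i])
#      == C * sum_i max(0, 1 - y_i * (x_i @ beta))
rng = np.random.default_rng(0)
n, d, C = 50, 5, 0.5
X = rng.standard_normal((n, d))
y = np.sign(rng.standard_normal(n))
beta = rng.standard_normal(d)

U = -(C * y).reshape(1, -1)
V = (C * np.ones(n)).reshape(1, -1)

lhs = np.maximum(U[0] * (X @ beta) + V[0], 0.0).sum()
rhs = (C * np.maximum(1 - y * (X @ beta), 0.0)).sum()
assert np.isclose(lhs, rhs)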

See more details in Manual ReHLine Formulation.

🎯 Use Cases

ReHLine excels at solving a wide range of machine learning problems:

| Problem | Description | Key Benefits |
|---|---|---|
| Support Vector Machines | Binary and multi-class classification | 100-400× faster than CVXPY solvers |
| Fair Machine Learning | Classification with fairness constraints | Handles demographic parity efficiently |
| Quantile Regression | Robust conditional quantile estimation | 2800× faster than general solvers |
| Huber Regression | Outlier-resistant regression | Superior to specialized solvers |
| Sparse Learning | Feature selection with L1 regularization | Scales to high dimensions |
| Custom Optimization | Any PLQ loss with linear constraints | Flexible framework for research |

⚡ Performance Benchmarks

ReHLine delivers exceptional speed compared to state-of-the-art solvers. Here are speed-up factors on real-world datasets:

Speed Comparison vs. Popular Solvers

| Task | vs. ECOS | vs. MOSEK | vs. SCS | vs. Specialized Solvers |
|---|---|---|---|---|
| SVM | 415× faster | ∞ (failed) | 340× faster | 4.5× vs. LIBLINEAR |
| Fair SVM | 273× faster | 100× faster | 252× faster | ∞ vs. DCCP (failed) |
| Quantile Regression | 2843× faster | ∞ (failed) | ∞ (failed) | — |
| Huber Regression | ∞ (failed) | 452× faster | ∞ (failed) | 2.4× vs. hqreg |
| Smoothed SVM | — | — | — | 1.6-2.3× vs. SAGA/SAG/SDCA/SVRG |

Note: "∞" indicates the competing solver failed to produce a valid solution or exceeded time limits. Results from NeurIPS 2023 paper.

Reproducible Benchmarks (powered by benchopt)

All benchmarks are reproducible via benchopt at our ReHLine-benchmark repository.
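
To reproduce a benchmark locally, the standard benchopt workflow should apply (a sketch; the benchmark_SVM directory name is taken from the ReHLine-benchmark repository layout):

# Install benchopt, fetch the suite, and run one problem
pip install benchopt
git clone https://github.com/softmin/ReHLine-benchmark.git
cd ReHLine-benchmark
benchopt install ./benchmark_SVM
benchopt run ./benchmark_SVM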

| Problem | Benchmark Code | Interactive Results |
|---|---|---|
| SVM | Code | 📊 View |
| Smoothed SVM | Code | 📊 View |
| Fair SVM | Code | 📊 View |
| Quantile Regression | Code | 📊 View |
| Huber Regression | Code | 📊 View |

🤝 Contributing

We welcome contributions! Bug reports, feature requests, and code contributions are all appreciated.

📚 Citation

If you use ReHLine in your research, please cite our NeurIPS 2023 paper:

@inproceedings{dai2023rehline,
  title={ReHLine: Regularized Composite ReLU-ReHU Loss Minimization with Linear Computation and Linear Convergence},
  author={Dai, Ben and Qiu, Yixuan},
  booktitle={Thirty-seventh Conference on Neural Information Processing Systems},
  year={2023}
}

🔗 ReHLine Ecosystem

🏠 Core Projects

  • ReHLine-python: this repository, the official Python implementation
  • ReHLine-benchmark: reproducible benchopt benchmarks (see above)

📊 Resources

  • ReHLine documentation
  • ReHLine NeurIPS 2023 paper (citation above)