Classification #13

GernotMaier · 2025-12-26T16:25:11Z

No description provided.

Use native xgboost output.

Copilot

Pull request overview

This PR introduces gamma/hadron classification functionality and significantly refactors the codebase to reduce duplication and improve maintainability. The changes consolidate common patterns for both stereo reconstruction and classification tasks.

Key Changes:

Adds new classification training and application scripts for gamma/hadron separation
Introduces energy and zenith binning for classification models
Refactors data processing, model loading, and evaluation code into shared modules
Adds pre-training quality cuts for data filtering
Removes all existing unit tests without replacement

Reviewed changes

Copilot reviewed 23 out of 23 changed files in this pull request and generated 11 comments.

Show a summary per file

File	Description
tests/unit_tests/test_*.py	All unit tests deleted - significant test coverage reduction
tests/resources/classify-parameter.json	New configuration file defining energy and zenith bins for classification
tests/conftest.py	Updated with shared fixture for telescope data
src/eventdisplay_ml/features.py	New module centralizing feature definitions for both analysis types
src/eventdisplay_ml/hyper_parameters.py	New module defining hyperparameters and pre-cuts
src/eventdisplay_ml/config.py	New configuration module handling CLI arguments for both training and application
src/eventdisplay_ml/models.py	New module consolidating model loading and application logic
src/eventdisplay_ml/scripts/train_xgb_classify.py	New classification training script
src/eventdisplay_ml/scripts/apply_xgb_classify.py	New classification application script
src/eventdisplay_ml/scripts/train_xgb_stereo.py	Refactored to use new config and model modules
src/eventdisplay_ml/scripts/apply_xgb_stereo.py	Refactored to use new model application infrastructure
src/eventdisplay_ml/evaluate.py	Enhanced with classification evaluation and improved feature importance handling
src/eventdisplay_ml/data_processing.py	Significantly refactored with support for both analysis types and energy/zenith binning
src/eventdisplay_ml/utils.py	Added model parameter loading and output filename generation utilities
src/eventdisplay_ml/training_variables.py	Deleted - functionality moved to features.py
pyproject.toml	Added new CLI entry points for classification scripts
docs/changes/*.md	Added changelog entries for new features and refactoring

Comments suppressed due to low confidence (1)

tests/unit_tests/test_utils.py:1

This PR deletes all existing unit tests (test_utils.py, test_training_variables.py, test_evaluate.py, test_data_processing.py, test_train_xgb_stereo.py, test_apply_xgb_stereo.py) without providing replacement tests. This represents a significant reduction in test coverage for the codebase. The new functionality introduced (classification routines, refactored data processing, new model loading, etc.) also lacks test coverage. Consider adding comprehensive unit tests to maintain code quality and prevent regressions.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

src/eventdisplay_ml/evaluate.py

Copilot · 2026-01-01T16:10:39Z

src/eventdisplay_ml/models.py

+        )
+        flatten_data = flatten_data.reindex(columns=models[n_tel]["features"])
+        model = models[n_tel]["model"]
+        preds[group_df.index] = model.predict(flatten_data)


The apply_regression_models function uses group_df.index to index into the preds array on line 211. However, if the DataFrame has been reset with a non-default index or has gaps in indices, this could lead to IndexError or incorrect assignment. The function should use positional indices or ensure the index is properly aligned with the predictions array.

src/eventdisplay_ml/features.py

src/eventdisplay_ml/utils.py

src/eventdisplay_ml/scripts/train_xgb_classify.py

src/eventdisplay_ml/data_processing.py

src/eventdisplay_ml/features.py

src/eventdisplay_ml/utils.py

tests/conftest.py

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

Classification

4250814

GernotMaier self-assigned this Dec 26, 2025

GernotMaier added 25 commits December 26, 2025 18:00

no energies in classification

d390191

write signal/background efficiency

78a8107

apply classification

69ac608

cleanup

3652bff

unit tests

a74f38b

zenith bins

e75735b

cleanup

ab4c762

simplification

b50b931

write more to joblib file

1c964b9

unification

5bd6109

simplification

165c9de

using native XGB

034caed

remove size from training

5e3d08b

config

b06b2a4

consistent naming

1980c41

ignore docstrings in tests

bf10257

tests

1b9d66c

unit tests

7a42e37

apply cuts

4695f2f

remove tests

45871e4

log message

4e8a760

Merge pull request #15 from Eventdisplay/xgboost-multi-output

72bc845

Use native xgboost output.

simplified configuration

c964219

notable simplifications

9df8c52

config module

7b75433

GernotMaier marked this pull request as ready for review January 1, 2026 16:06

Copilot AI review requested due to automatic review settings January 1, 2026 16:06

Copilot started reviewing on behalf of GernotMaier January 1, 2026 16:06 View session

Copilot AI reviewed Jan 1, 2026

View reviewed changes

GernotMaier and others added 10 commits January 1, 2026 17:12

cleanup

91dc97f

Update src/eventdisplay_ml/evaluate.py

a7499ce

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

Update src/eventdisplay_ml/features.py

daa5027

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

Update src/eventdisplay_ml/utils.py

18d3a7e

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

Update src/eventdisplay_ml/data_processing.py

8d07e69

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

Update src/eventdisplay_ml/features.py

97a8912

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

Update src/eventdisplay_ml/utils.py

c70dc53

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

disable unit tests

a88994c

pre-commit

77561b8

cleanup

9667d1d

GernotMaier merged commit a978a6a into main Jan 1, 2026
2 checks passed

GernotMaier deleted the gh branch January 1, 2026 16:38

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Classification #13

Classification #13

Uh oh!

GernotMaier commented Dec 26, 2025

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Copilot AI Jan 1, 2026

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Classification #13

Classification #13

Uh oh!

Conversation

GernotMaier commented Dec 26, 2025

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Copilot AI Jan 1, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants