Sam2 refactor #10

preshanth · 2025-12-30T18:50:47Z

This is a complete overhaul of SAM-RFI to accommodate SAM2 models. We also explored SAM3 but have decided against it for now, because that model's native training image size is 1008 rather than 1024.

Major Changes:
- Migrated from manual SAM2 library to HuggingFace transformers
- Moved legacy code to legacy/ directory
- Implemented clean class-based architecture with CLI and YAML configs

New Modules:
- data/: MSLoader, Preprocessor, SAMDataset (clean data pipeline)
- training/: SAM2Trainer with validation loss tracking
- inference/: RFIPredictor with iterative flagging support
- config/: YAML configuration loader
- data_generation/: MS and synthetic data generators

Features:
- Training & validation loss plots (dual curves)
- Iterative flagging: N-pass RFI detection with cumulative masking
- GPU profiling: validate_gpu.py with memory/utilization monitoring
- Batch size optimization for V100/A100
- Real unit tests (removed mock-heavy tests)

CLI Commands:
- generate-data: Create datasets from MS or synthetic
- train: Train on pre-generated HuggingFace datasets
- predict: Single-pass or iterative flagging
- create-config/validate-config: Config management

Package:
- pyproject.toml with proper dependencies (numpy>=1.26, pandas>=2.2)
- pytest configuration
- Example configs for training and validation

Fixes:
- Resolved pandas/numpy version conflicts
- Separated data generation from training
- Clean imports, no legacy dependencies
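The "iterative flagging" feature above (N passes with cumulative masking) can be illustrated with a small sketch. This is not the RFIPredictor implementation: the detector here is a plain median/MAD threshold standing in for the SAM2 model, and the names `detect_outliers` and `iterative_flag` are hypothetical. It only shows the control flow in which each pass sees the data with earlier flags excluded and the flags accumulate.

```python
import numpy as np


def detect_outliers(data, mask, n_sigma=3.0):
    """Stand-in detector (NOT SAM2): flag unmasked samples far from the median."""
    good = data[~mask]
    if good.size == 0:
        return np.zeros_like(mask)
    med = np.median(good)
    # Robust scale estimate from the median absolute deviation.
    sigma = 1.4826 * np.median(np.abs(good - med))
    if sigma == 0:
        return np.zeros_like(mask)
    # Only report samples that are not already flagged.
    return (np.abs(data - med) > n_sigma * sigma) & ~mask


def iterative_flag(data, n_passes=3, n_sigma=3.0):
    """Run the detector n_passes times, OR-ing new flags into one mask."""
    mask = np.zeros(data.shape, dtype=bool)
    for _ in range(n_passes):
        new_flags = detect_outliers(data, mask, n_sigma)
        if not new_flags.any():
            break  # converged: nothing new was found
        mask |= new_flags  # cumulative masking
    return mask
```

Because the mask is only ever OR-ed, the flags from an earlier pass are always a subset of the final mask, which is the property that makes the passes safe to stack.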

…on tools

Per-file changes:

preprocessor.py:
- Add automatic padding in _patchify_single_waterfall for arrays smaller than patch_size
- Pad to multiples of patch_size for patchify compatibility
- Store original_shapes in metadata for reconstruction cropping

predictor.py:
- Add save_probabilities parameter to save raw probability maps
- Implement adaptive thresholding (threshold=None uses mean of probabilities)
- Add upscaling of SAM2 256x256 outputs to patch_size using scipy.ndimage.zoom
- Calculate padded shape for reconstruction, crop result to original dimensions

evaluation/statistics.py (new):
- Add compute_statistics for before/after flagging analysis
- Add compute_ffi for Flagging Fidelity Index metric
- Add print_statistics_comparison for formatted output

evaluation/__init__.py:
- Export compute_statistics, compute_ffi, print_statistics_comparison

scripts/validate_single_array.py (new):
- Standalone validation for synthetic or real single arrays
- Probability heatmaps and histograms
- Adaptive threshold testing
- 2x4 grid (synthetic with GT) or 2x3 grid (real with FFI)

ms_loader.py:
- Add load_single_baseline method for extracting single baseline/pol

sam_dataset.py:
- Fix empty mask bbox: use full image [0,0,W,H] instead of center box

sam2_trainer.py:
- Fix logging check: use hasHandlers() instead of checking root logger

configs/validation.yaml:
- Fix stretch: sqrt → null for synthetic data

pyproject.toml:
- Add viz extras for holoviews/datashader visualization tools
- Add samrfi.visualization package

docs/batched_dataset_training.md:
- Fix file extension examples: .npz → .pt

This commit message and the doc updates are all made using Claude Code.
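A minimal sketch of three of the changes listed above: padding to a multiple of patch_size, cropping back to the stored original shape, and adaptive thresholding when `threshold=None`. The function names are hypothetical and zero padding is an assumption; the commit message does not say which padding mode preprocessor.py uses.

```python
import numpy as np


def pad_to_multiple(arr, patch_size):
    """Pad a 2D array on the bottom/right so both dims divide by patch_size.

    Zero padding is an assumption made for this sketch.
    """
    h, w = arr.shape
    pad_h = (-h) % patch_size  # 0 when h is already a multiple
    pad_w = (-w) % patch_size
    padded = np.pad(arr, ((0, pad_h), (0, pad_w)), mode="constant")
    return padded, (h, w)  # keep the original shape for later cropping


def crop_to_original(arr, original_shape):
    """Undo pad_to_multiple using the stored original shape."""
    h, w = original_shape
    return arr[:h, :w]


def threshold_probabilities(probs, threshold=None):
    """Binarize a probability map; threshold=None uses the mean probability."""
    if threshold is None:
        threshold = float(probs.mean())
    return probs > threshold
```

For example, a 100x300 waterfall with `patch_size=128` pads to 128x384, and cropping with the stored `(100, 300)` shape recovers the original extent after reconstruction.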

preshanth added 30 commits September 30, 2025 02:00

Creating a script to generate and validate and then report the loss w…

a61f3cf

…ith plots

Updated to add pynvml

f2cc22a

Updating generate data in the cli

8230585

fixing the config inputs

725ca61

Black formatting

b4effae

Doing batched operations

99fdc91

Updating data generation to not have stretch

10f4241

Adding more RFI in the synthetic configs

3a46698

Updating to allow patching only when less than image size

f1bbb6f

Moving mask to python int from numpy

74bbbcc

validate gpu

c4cf1a6

Changing profiling to be not greedy

e5ad2c7

Update to clean up more memory leaks

05e1fca

Updating to ensure that synthetic RFI training uses the actual data

772cea5

Turning off always profile

8bca72b

Missed flipping the dict flag

721bda0

Update to fix memory leaks in validation

28e2207

Updated to catch all byte conversions before writing json

9a31740
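As an illustration of the kind of conversion this commit and the earlier "Moving mask to python int from numpy" commit describe, here is a hedged sketch of a `default` hook for `json.dumps`. The names `to_jsonable` and `dump_metadata` are hypothetical and are not taken from the repository. Note that the hook only handles values; dictionary keys that are bytes would still need converting beforehand.

```python
import json

import numpy as np


def to_jsonable(obj):
    """Fallback for json.dumps: convert bytes and NumPy types to plain Python."""
    if isinstance(obj, bytes):
        return obj.decode("utf-8", errors="replace")
    if isinstance(obj, np.generic):  # NumPy scalars such as np.int64
        return obj.item()
    if isinstance(obj, np.ndarray):
        return obj.tolist()
    raise TypeError(f"Object of type {type(obj).__name__} is not JSON serializable")


def dump_metadata(metadata):
    """Serialize a metadata dict, routing unknown values through to_jsonable."""
    return json.dumps(metadata, default=to_jsonable, sort_keys=True)
```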

Updating to reduce batch size to avoid apache overflow

689087f

wrong keyword removed

0f10fd0

going to numpy for training and HF for upload

43cd827

Updating for training using numpy datasets

df13e45

Fixing the training bottlenecks arising from memory

c35b6ca

Updating to flush after synthetic generator

c0e6c1b

Turning off mad flagging

cdbc401

Starting parallel training

77b7c31

Moving training params to config

4a4b0af

Updating the log normalization such that the scales are now preserved.…

6b18bc5

… The dataset generation now directly saves torch tensors, which allows direct GPU loading. Dataset generation and preprocessing are therefore done together, which avoids compute at load time.

Updating to move out legacy code and older docs

7a7d3ad

preshanth added 24 commits December 24, 2025 02:49

Changing compute order

73a867e

Removing a squeeze

c3c3bea

Debug statements

768f11a

Fixing mask sizes

0b9d552

Updating predictor

6445ef0

Cuda fix with spawn rather than fork

d7da606
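For context on the spawn-versus-fork fix: a CUDA context generally cannot be reused in a forked child process, so worker processes that touch the GPU are usually started with the "spawn" method. The sketch below uses only the standard library and hypothetical function names; it shows how to request a spawn context without changing the global default, and is not the code from this commit.

```python
import multiprocessing as mp


def make_spawn_context():
    """Return a context whose workers start as fresh interpreters (no fork)."""
    return mp.get_context("spawn")


def square(x):
    # Worker functions must live at module top level so that spawned
    # processes can import them by name.
    return x * x


def run_in_workers(values, n_workers=2):
    """Map square over values using a spawn-based process pool."""
    ctx = make_spawn_context()
    with ctx.Pool(processes=n_workers) as pool:
        return pool.map(square, values)
```

With spawn, the main module is re-imported in every worker, so `run_in_workers` should only be called from a script under an `if __name__ == "__main__":` guard.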

Updating configs to increase amount of RFI in the data

a3ed154

Fixing flag template errors and adding unit tests

6307d1f

Updating to add unit tests and integration tests

5494a65

Updating to check in some useful scripts and the updated metric.py

ce01f1c

where I have incorporated the calcquality metric. Introducing a test for the metrics module.

Add CI and fix code quality issues

9597015

Pin linter versions to ensure consistency between pre-commit and CI

58bb022

Updating to make casa optional and to skip CI without CASA.

0317b5e

Removing casa import via msloader

d3eeb69

Trying to fix the isort issue

7ecb778

Updating to remove isort check upstream. Leaving it in pre-commit.

6d8e1f5

Lazy loading casa and making a ci setup for pip install without heavy deps
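A minimal sketch of the lazy-loading pattern, assuming the goal is that importing the package never requires CASA and that the import only happens when Measurement Set access is first needed. `lazy_import` and `MSLoaderSketch` are hypothetical names for illustration, not the repository's MSLoader.

```python
import importlib


def lazy_import(module_name, install_hint=""):
    """Import a module on demand, with a clearer error if it is missing."""
    try:
        return importlib.import_module(module_name)
    except ImportError as exc:
        message = f"'{module_name}' is required for this feature. {install_hint}"
        raise ImportError(message.strip()) from exc


class MSLoaderSketch:
    """Hypothetical loader that defers the casatools import until first use."""

    def __init__(self, ms_path):
        self.ms_path = ms_path
        self._casatools = None  # nothing heavy is imported at construction

    @property
    def casatools(self):
        if self._casatools is None:
            self._casatools = lazy_import(
                "casatools", "Install CASA to read Measurement Sets."
            )
        return self._casatools
```

With this shape, CI jobs that install the package without CASA can still import it and run every test that does not touch the `casatools` property.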

Removing package not used and moving gpu only packages to [gpu]

58eb522

Making sure torch cpu is there for tests

3230364

Fixing other ModelCache test locations for transformer dependence

0fb0679

Including transformers in CI

a003aa3

Fixing casatools implicit deps

1bff774

Updating readme with current state

ec3b41a

HF integration for the SAM2 models for push and pull.

811f336

preshanth linked an issue Dec 30, 2025 that may be closed by this pull request

SAM2 Refactoring + Speedup Attempt #2

Closed

preshanth self-assigned this Dec 30, 2025

preshanth requested a review from Kitchi December 30, 2025 18:51

preshanth added 2 commits December 30, 2025 12:14

Updates to author list for SAM2

4b19c13

Cleanup

00f6a28

preshanth merged commit 9b7cccc into main Dec 30, 2025
8 checks passed
