Add `discard_spikes` to curation, and update to `v3` by chrishalcrow · Pull Request #4287 · SpikeInterface/spikeinterface

chrishalcrow · 2025-12-31T13:31:45Z

Would close #4261

Docs: https://spikeinterface--4287.org.readthedocs.build/en/4287/modules/curation.html

(Note: the PR looks big, but it is mostly tests and docs. The main changes are updating the id strategy in sorting_tools.py (~100 lines) and updating the apply_merges function (~50 lines).)

This PR allows you to remove spikes from a unit during curation by specifying the following in your curation json file:

"discard_spikes": [
    {
        "unit_id": "u10",
        "indices": [
            56,
            57,
            59,
            60
        ]
    },
    {
        "unit_id": "u14",
        "indices": [
            123,
            321
        ]
    }
]
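
As a rough illustration of what one entry means (a minimal sketch, not the code in this PR): assuming `indices` are positions within the unit's own spike train, discarding amounts to dropping those positions. The helper `discard_from_spike_train` and the toy spike times are hypothetical.

```python
def discard_from_spike_train(spike_train, discard_indices):
    """Return a copy of `spike_train` without the spikes at `discard_indices`.

    Assumption: indices are positions within this unit's spike train.
    """
    to_drop = set(discard_indices)
    return [t for i, t in enumerate(spike_train) if i not in to_drop]


# Toy curation dict using the "discard_spikes" layout shown above.
curation = {"discard_spikes": [{"unit_id": "u10", "indices": [1, 3]}]}

# Hypothetical spike times (in samples) for one unit.
spike_trains = {"u10": [100, 250, 400, 900, 1200]}

for entry in curation["discard_spikes"]:
    unit_id = entry["unit_id"]
    spike_trains[unit_id] = discard_from_spike_train(
        spike_trains[unit_id], entry["indices"]
    )
```

Here the spikes at positions 1 and 3 of `u10` are dropped and the unit keeps its id.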

This new feature means that the curation format gets bumped to v3!

You can discard at the same time as merging and splitting.

Tricky bit 1

How to apply it to an analyzer. I decided to discard spikes at the same time as splitting. During the splitting step, we re-wrangle the discarded spikes into another split unit (call these "discard units") and keep track of the discard unit ids. We then remove the discard units in full after the splitting. This allows us to use the existing splitting machinery (including splitting extensions etc.) for discards - nice!
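
The idea can be sketched in plain Python (a simplified illustration under stated assumptions, not the analyzer code): a discard is expressed as a two-way split into a kept unit and a temporary discard unit, and the discard unit is removed afterwards. The function name `discard_via_split` and the id `"u10_discard"` are hypothetical.

```python
def discard_via_split(n_spikes, discard_indices, unit_id, discard_unit_id):
    """Express a discard as a split: kept spikes vs. a temporary discard unit.

    Returns a dict mapping each resulting unit id to its spike indices
    (positions within the original unit's spike train).
    """
    to_drop = set(discard_indices)
    kept = [i for i in range(n_spikes) if i not in to_drop]
    return {unit_id: kept, discard_unit_id: sorted(to_drop)}


# Step 1: "split" the unit into a kept part and a discard unit.
units = discard_via_split(6, [1, 4], "u10", "u10_discard")

# Step 2: remove the tracked discard units once splitting is done.
discard_unit_ids = ["u10_discard"]
curated = {uid: idx for uid, idx in units.items() if uid not in discard_unit_ids}
```

Because the discard is just another split, anything that already knows how to handle a split applies unchanged, and the only extra bookkeeping is the list of discard unit ids.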

Tricky bit 2

Much of the complexity is now related to the id strategy (ughh - why do we give the user THREE new id strategies!!!). It's difficult because we want a cleaned unit to retain its id. This is true even if there are other units which are just split, and the user is using the "append" or "split" new_unit_id_strategy.

So suppose the user does the following:

Unit 1: clean
Unit 2: split
Unit 3: clean and split

With strategy append, we want

Unit 1 -> Unit 1 + Unit 1 dirty
Unit 2 -> Unit 4 + Unit 5
Unit 3 -> Unit 6 + Unit 7 + Unit 3 dirty

We do this by slotting the dirty units into places we know to split later. For the "append" strategy, I chose to assign "Unit 1 dirty" the last possible unit id, and "Unit 3 dirty" the id "Unit 3".
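
The target ids in the example above can be reproduced with a small sketch. It only models which ids survive under an "append"-like scheme, assuming integer unit ids; it does not model where the dirty units are temporarily slotted. The function `final_ids_append` and the operation labels are hypothetical, not names from sorting_tools.py.

```python
def final_ids_append(ops, n_parts):
    """Compute surviving unit ids for an "append"-like strategy.

    ops: {unit_id: "clean" | "split" | "clean_and_split"}
    n_parts: {unit_id: number of kept parts} for units that are split
    A cleaned-only unit keeps its id; split units get fresh ids appended
    after the current maximum id. Dirty units are dropped and not listed.
    """
    next_id = max(ops) + 1
    result = {}
    for unit_id in sorted(ops):
        if ops[unit_id] == "clean":
            result[unit_id] = [unit_id]
        else:
            k = n_parts[unit_id]
            result[unit_id] = list(range(next_id, next_id + k))
            next_id += k
    return result


ops = {1: "clean", 2: "split", 3: "clean_and_split"}
final_ids = final_ids_append(ops, n_parts={2: 2, 3: 2})
```

With these inputs Unit 1 keeps id 1, Unit 2 becomes units 4 and 5, and Unit 3 becomes units 6 and 7, matching the mapping listed above once the dirty units are removed.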

Other stuff

We have to do merges after splitting+discarding. This is because the spike indices change after merging. To avoid wrangling spike indices (gross!) we just do discarding first.
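
A toy example of why the order matters (a sketch with made-up spike times, assuming discard indices refer to positions in the pre-merge unit): after merging, the same index points at a different spike.

```python
# Hypothetical spike times (in samples) for two units that will be merged.
unit_a = [10, 30, 50]
unit_b = [20, 40]

# The curation asks to discard the spike at index 2 of unit_a (t=50).
discard_in_a = [2]

# Merging first interleaves the spikes, so index 2 no longer means t=50.
merged_first = sorted(unit_a + unit_b)
spike_at_index_after_merge = merged_first[2]

# Discarding first, while indices are still valid, then merging.
to_drop = set(discard_in_a)
a_clean = [t for i, t in enumerate(unit_a) if i not in to_drop]
merged_after_discard = sorted(a_clean + unit_b)
```

Applying the discard before the merge avoids having to remap indices into the merged spike train.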

Tests to do:

test curation format
test v2 -> v3 update
test actual discarding when also splitting / merging


alejoe91 · 2026-01-06T08:21:54Z

Awesome Chris! Quick comment before diving into the PR:

So suppose the user does the following:

Unit 1: clean
Unit 2: split
Unit 3: clean and split

I don't think that clean and split should be allowed together. We have rules so that a unit can only be removed, merged, or split, and I would add discarded to that list. Does that simplify the new unit id logic?

alejoe91 · 2026-01-06T08:26:23Z

src/spikeinterface/curation/curation_format.py

+        if len(discard_spikes_unit_ids) > 0:
+            ids_to_remove = []
+            for new_id_set in new_ids:
+                if new_id_set[0] in discard_spikes_unit_ids or new_id_set[1] in discard_spikes_unit_ids:
+                    ids_to_remove.append(new_id_set[0])
+
+            curated_sorting_or_analyzer = curated_sorting_or_analyzer.remove_units(ids_to_remove)


alejoe91 · 2026-01-06T08:27:15Z

src/spikeinterface/core/sorting_tools.py

+
+        # decide if unit is a simple discard, a simple split or a discard and split
+        just_discard = False
+        discard_and_split = False


Yeah I think this should not be allowed. A unit can be either split or cleaned, not both! This should simplify the logic a lot!

add discard_spikes

a129839

chrishalcrow added the curation Related to curation module label Dec 31, 2025

chrishalcrow and others added 8 commits December 31, 2025 13:32

update curation docs

a5a5b0a

Merge branch 'main' into add-discard-spikes

47f6316

add tests for one split and adjust code so that tests work

0a16794

Merge branch 'add-discard-spikes' of https://github.com/chrishalcrow/…

e9c3542

…spikeinterface into add-discard-spikes

properly compute next_max_unit_id

19e8ba8

test discard/split at same time

becd632

fixes to metric to make splitting extensions work

fc295e5

more metrics fixes

60d8bb1

alejoe91 reviewed Jan 6, 2026

chrishalcrow wants to merge 9 commits into SpikeInterface:main from chrishalcrow:add-discard-spikes
