Add binning functions by mmccrackan · Pull Request #210 · simonsobs/so3g

mmccrackan · 2025-02-28T00:29:14Z

Adds binning functions that replicate most of the functionality of sotodlib's bin_signal (https://github.com/simonsobs/sotodlib/blob/master/sotodlib/tod_ops/binning.py). There are two functions, bin_signal and bin_flagged_signal since binning with flags is much slower. Setting the bin edges is left up to python since it has algorithms to find the optimum bin numbers, but might be added later. The weight and flag arrays may be either 1D (nsamps) or 2D (ndets, nsamps) as in the sotodlib version.

mhasself

Prelim review -- thanks!

mhasself · 2025-06-04T17:40:36Z

test/test_array_ops.py

+        np.testing.assert_allclose(binned_signal_sigma_so3g, binned_signal_sigma, atol=tolerance)
+        np.testing.assert_allclose(bin_counts_so3g, bin_counts_dets, atol=tolerance)
+
+    def test_02_binning_flags_float64(self):


Please reduce duplicate code in the test by using a helper function and calling it from test_01 and test_02.

Moved most of the duplicated code in tests to standalone functions.

src/array_ops.cxx

mhasself · 2025-06-04T17:48:43Z

src/array_ops.cxx

+void _histogram(const T* data, const T* weights, int* histogram,
+                const T* bin_edges, const int nsamps, const int nbins,
+                const T lower, const T upper)
+{


This implementation is very brute-force -- looping over ~every bin for every sample. A really basic optimization would be to assume correlation in data -- i.e. if sample i was in bin j, start your search for i+1 at bin j (and probably you'll find it in j, j-1, or j+1).

But really this function is used in one place ... where one already has "bin_indices" ... so perhaps this function isn't needed at all?

I removed this function and replaced with code that uses bin_indices and confirmed it is equivalent. While that added step can also be removed and just use the existing code in the samps loop, it is faster to do it up front like this then for check for each sample if possible.

src/array_ops.cxx

mhasself · 2025-06-04T17:50:04Z

src/array_ops.cxx

+            }
+        }
+        // Edge case to match np.histogram
+        if (data[i] == bin_edges[nbins] && data[i] <= upper) {


Is that == right here? Wow.

Yeah, technically it should have been >= since we would want to add the values between bin_edges[nbins] and upper though in some cases this would just limit to only ==. Function has been removed though, so now irrelevant.

src/array_ops.cxx

mhasself · 2025-06-04T18:00:40Z

src/array_ops.cxx

+            "  lower: lower bin range (float64)\n"
+            "  upper: upper bin range (float64)\n");


Point of these two args is unclear. And they're only propagated through to _histogram, which as discussed above I think should be perhaps be canned...

These are to allow for excluding values outside of a different (perhaps more restricted) range that is not strictly defined by the bin edges. I added these here to reproduce the range arguments in numpy's bin_edges and np.histogram. While I've removed the histogram function, I've kept these in the new code that uses bin_indices in order to make it consistent with the sotodlib version that has range as an input. I have also made the docstring a bit more explanative. I tested limiting the range in the tests to less than the full x array range and find that they both agree.

The sotodlib version doesn't use np.histogram when there are flags so that doesn't actually limit based on the range argument (other than in np.histogram_bin_edges) in the way that it does when there are no flags. I've matched that difference here, but perhaps we might want to make these two cases consistent across both versions.

Michael McCrackan and others added 8 commits February 27, 2025 14:53

add binning

f83db85

add strides, more dim checking, more tests

3adbc81

fix bug with range

8908f58

undo debug changes

3cd9fff

undo more debug changes

6d6f927

some doc changes, rename bins

cea537f

undo accidental changes

7fe9585

fix typo

a57a92e

mmccrackan marked this pull request as ready for review February 28, 2025 04:56

mmccrackan requested a review from mhasself February 28, 2025 04:56

Michael McCrackan and others added 3 commits February 27, 2025 21:19

fix docstring format

955ddcb

fix bin index type, stride cleanup

874ae70

clarify allowing different bin widths in docstring

e022ced

mhasself requested changes Jun 4, 2025

View reviewed changes

tskisner mentioned this pull request Jul 3, 2025

Update wheel dependencies and manylinux build container #214

Merged

Michael McCrackan and others added 4 commits July 9, 2025 07:54

updates and fixes

112c5e1

Merge branch 'master' into 20250227_binning

f2e8518

fix docstring

a74675d

add comment on up-front bin count calc

b8dd35d

mmccrackan requested a review from mhasself July 9, 2025 17:04

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add binning functions#210

Add binning functions#210
mmccrackan wants to merge 15 commits intomasterfrom
20250227_binning

mmccrackan commented Feb 28, 2025 •

edited

Loading

Uh oh!

mhasself left a comment

Uh oh!

mhasself Jun 4, 2025

Uh oh!

mmccrackan Jul 9, 2025

Uh oh!

Uh oh!

mhasself Jun 4, 2025

Uh oh!

mmccrackan Jul 9, 2025

Uh oh!

Uh oh!

mhasself Jun 4, 2025

Uh oh!

mmccrackan Jul 9, 2025

Uh oh!

Uh oh!

mhasself Jun 4, 2025

Uh oh!

mmccrackan Jul 9, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

		" lower: lower bin range (float64)\n"
		" upper: upper bin range (float64)\n");

Conversation

mmccrackan commented Feb 28, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

mhasself left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

mmccrackan commented Feb 28, 2025 •

edited

Loading