
Reworking nonnegative algorithm and solvers #542

Draft: cohenjer wants to merge 62 commits into main

Conversation

cohenjer (Contributor) commented Feb 1, 2024

First of all, I am sorry for the massive PR with so many small commits (to be squashed upon merge hopefully). I have been working on this for quite some time, and cutting this work into pieces seemed artificial and challenging.

Anyway, I hope this makes the nonnegative code in TensorLy a little cleaner and more efficient!

Improvements for NNLS

  • Fixed bugs in the Nonnegative Least Squares (NNLS) solvers (HALS, FISTA).
  • Added ridge and sparse regularization support to all NNLS solvers and HALS algorithms.
  • Replaced return_errors with callback in all nonnegative solvers (including the Multiplicative Updates solvers), and added the previously missing documentation for callback.
  • Added the option to have unconstrained modes in non_negative_tucker_hals (its API is now similar to non_negative_parafac_hals).
  • The error computation (both the math and the code) differed across the nonnegative decomposition algorithms. I unified them to compute the loss function. This is not what the CP functions do (they compute the square root of the loss); this is open to discussion, but reporting the loss makes the most sense to me. I also updated the CP functions so that at least the math for tensor_norm is the same everywhere.
  • Updated the HALS function API to allow some control over the inner HALS NNLS solver. I also simplified the NNLS HALS API, so these new inputs should be self-explanatory.
  • Reworked the documentation (a lot) to add missing arguments and improve clarity.
  • Updated the nonnegative decomposition examples to make them simpler and easier to follow; they now also explain random tensor generation and parameter tuning in the algorithms.
  • Modified tests where needed, of course, and added new ones for solvers/ (see below).
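For reference, a column-wise HALS update with both penalties can be sketched as below. This is an illustrative NumPy version under a 0.5·||M - UV||² + 0.5·ridge·||U||² + sparsity·||U||₁ loss; the function name and signature are mine, not TensorLy's actual solver API.

```python
import numpy as np

def hals_nnls(M, V, U, n_iter=100, ridge=0.0, sparsity=0.0, eps=1e-12):
    """Illustrative HALS for min_{U >= 0} 0.5||M - UV||^2
    + 0.5*ridge*||U||^2 + sparsity*||U||_1 (not TensorLy's exact code)."""
    MV = M @ V.T      # precompute cross products once
    VVt = V @ V.T
    for _ in range(n_iter):
        for k in range(U.shape[1]):
            # closed-form update of column k, holding the others fixed:
            # the sparsity penalty shifts the numerator, the ridge
            # penalty inflates the denominator
            num = MV[:, k] - U @ VVt[:, k] + U[:, k] * VVt[k, k] - sparsity
            U[:, k] = np.maximum(0.0, num / (VVt[k, k] + ridge + eps))
    return U
```

With both penalties set to zero this reduces to plain HALS NNLS; the regularized update only changes one scalar shift and one scalar scale per column, which is why adding the penalties to the existing solvers is cheap.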

Overall improvements

  • Reverted MTTKRP to the faster matrix-matrix product. I kept the current memory-efficient version too, with a snippet of code in the documentation showing how to switch between the two using the backend system. This makes all CP algorithms significantly faster, as we discussed in "The unfolding_dot_khatri_rao function is unnecessarily slow" #442.
  • Created a solvers/ folder in TensorLy to move all the optimization methods from tenalg/ to a dedicated place. solvers/ has four modules (nnls.py, admm.py, proximal.py and penalizations.py; the last is mostly empty, but I plan to populate it in a coming PR).
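To illustrate the trade-off in the first bullet, here is a hedged NumPy sketch of the matrix-matrix-product MTTKRP (the function names are mine, not TensorLy's): unfold the tensor along the target mode, then do one large matmul with the Khatri-Rao product of the remaining factors. This is fast but materializes the full Khatri-Rao matrix, which is exactly the memory cost the element-wise version avoids.

```python
import numpy as np

def khatri_rao(A, B):
    # column-wise Kronecker product of A (I x R) and B (J x R) -> (I*J x R)
    r = A.shape[1]
    return (A[:, None, :] * B[None, :, :]).reshape(-1, r)

def mttkrp_matmul(T, factors, mode):
    # unfold T along `mode` (row-major order), then one big matmul
    # with the Khatri-Rao product of all factors except `mode`
    Tn = np.moveaxis(T, mode, 0).reshape(T.shape[mode], -1)
    others = [f for i, f in enumerate(factors) if i != mode]
    K = others[0]
    for f in others[1:]:
        K = khatri_rao(K, f)   # grows to prod(other dims) x R rows
    return Tn @ K
```

The intermediate K has as many rows as the product of all non-target dimensions, so for large tensors the memory-efficient variant can still be the right choice, which is why the PR documents how to switch between the two via the backend system.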

Others

  • Added the maximum() function to the backend, useful for nonnegative projections.
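As a minimal sketch of what such an elementwise maximum enables (NumPy here, and the helper name is illustrative, not part of the PR): the projection onto the nonnegative orthant is just a maximum with zero, optionally with a small epsilon to keep iterates strictly positive.

```python
import numpy as np

def project_nonnegative(x, eps=0.0):
    # elementwise projection onto the nonnegative orthant;
    # eps > 0 keeps entries strictly positive, which avoids
    # zero-locking in multiplicative updates
    return np.maximum(x, eps)
```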

Pending

  • The FISTA algorithm used within HALS for nonnegative Tucker needs a step size; right now I use a hand-crafted schedule (compute the Lipschitz constant during the first 20 outer iterations, which are critical, then update it every 20 iterations). Is this satisfactory? If so, I can add some documentation for it.
  • Some tests are not passing, but I am unsure whether this is due to bugs introduced by this PR (probably not). One test that did fail after reverting MTTKRP to the matrix-matrix product version is test_entropy.py; I had to comment out the test with normalization activated, and I have no explanation for that failure...
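The step-size schedule in the first Pending bullet can be sketched as follows. This is a simplified NumPy illustration on a plain 0.5·||M - UV||² NNLS subproblem, not the actual TensorLy code; in this fixed-V sketch the Lipschitz constant never changes, but in the alternating algorithm the other factors are updated between outer iterations, which is why it needs refreshing at all.

```python
import numpy as np

def fista_nnls(M, V, U, n_iter=400, refresh_every=20):
    """FISTA for min_{U >= 0} 0.5 * ||M - U V||_F^2, recomputing the
    gradient's Lipschitz constant only every `refresh_every` iterations."""
    VVt = V @ V.T
    MV = M @ V.T
    Y, t, step = U.copy(), 1.0, None
    for it in range(n_iter):
        if it % refresh_every == 0:
            # Lipschitz constant of U -> U @ VVt - MV is the
            # largest eigenvalue of VVt
            step = 1.0 / np.linalg.eigvalsh(VVt)[-1]
        grad = Y @ VVt - MV
        U_new = np.maximum(0.0, Y - step * grad)   # projected gradient step
        t_new = (1.0 + np.sqrt(1.0 + 4.0 * t * t)) / 2.0
        Y = U_new + ((t - 1.0) / t_new) * (U_new - U)  # Nesterov momentum
        U, t = U_new, t_new
    return U
```

Recomputing the eigenvalue every iteration would be safest but wasteful when the factors barely move; the every-20-iterations refresh trades a slightly stale step size for much less eigen-decomposition work.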

codecov bot commented Feb 1, 2024

Codecov Report

Attention: 104 lines in your changes are missing coverage. Please review.

Comparison is base (2a8ff56) 87.15% compared to head (3240db7) 86.85%.

Files Patch % Lines
tensorly/solvers/nnls.py 65.87% 43 Missing ⚠️
tensorly/decomposition/_tucker.py 59.80% 41 Missing ⚠️
tensorly/decomposition/_nn_cp.py 82.60% 12 Missing ⚠️
tensorly/solvers/penalizations.py 92.00% 6 Missing ⚠️
tensorly/backend/core.py 66.66% 1 Missing ⚠️
tensorly/tucker_tensor.py 50.00% 1 Missing ⚠️
Additional details and impacted files
@@            Coverage Diff             @@
##             main     #542      +/-   ##
==========================================
- Coverage   87.15%   86.85%   -0.30%     
==========================================
  Files         121      127       +6     
  Lines        7693     7937     +244     
==========================================
+ Hits         6705     6894     +189     
- Misses        988     1043      +55     


JeanKossaifi (Member) commented:
@cohenjer The changes sound amazing; I went through it quickly, and it looks good at a high level.

I know you mentioned that it feels artificial to break this up, but splitting it into smaller, more manageable PRs might make reviewing easier.

Docs, examples, etc. can be reviewed easily enough as is, I think, so maybe we can split out the novel additions first.
Otherwise, we can try to review as is. Maybe @aarmey and others would be able to have a look too?

aarmey (Contributor) commented Mar 12, 2024

@cohenjer this is outstanding! Many thanks.

I agree about splitting this up. For instance, perhaps much of the (1) documentation, (2) return_errors interface, (3) MTTKRP, and (4) tl.maximum changes can be separate PRs? I'm happy to quickly review each if so. It's so easy to miss bugs otherwise...

I am strongly in favor of using a consistent loss throughout (and having it be the loss, not the square root)! This inconsistency has tripped me up several times.

cohenjer (Contributor, Author) commented:
Hi @JeanKossaifi @aarmey. Thank you for the kind comments; I am very happy to hear you think the changes are positive! I agree the PR is too large to be reviewed efficiently; I will cut it down into several parts (possibly more than four). However, I am unsure whether there is a more efficient process than manually copying and pasting the new content. My commits are all over the place :/

aarmey (Contributor) commented Mar 12, 2024

One method that has helped me is to make a new branch, then run git checkout old_branch -- file_path. This way you can copy a file and its changes over to the new branch without having to copy and paste manually.

JeanKossaifi (Member) commented:
Great suggestion @aarmey - we should have a section in the contribution doc to collate all these tips!

cohenjer (Contributor, Author) commented:
Working on this right now; I should be creating a few PRs soon!
