Correctly handling the case λmax = 0. #53

barankarakus · 2020-09-06T20:40:37Z

Fixes #51.

Two changes:

Change to computeλ to ensure λmax = 0 leads to an output of [0] and
not [NaN, ..., NaN].
Change to fit! to ensure the case where autoλ = true and λmax = 0 is
handled correctly (rather than throwing an error).

Two changes: 1) Change to computeλ to ensure λmax = 0 leads to an output of [0] and not [NaN, ..., NaN]. 2) Change to fit! to ensure the case where autoλ = true and λmax = 0 is handled correctly (rather than throwing an error).

coveralls · 2020-09-06T20:58:59Z

Pull Request Test Coverage Report for Build 202

19 of 19 (100.0%) changed or added relevant lines in 2 files are covered.
7 unchanged lines in 3 files lost coverage.
Overall coverage increased (+6.6%) to 91.049%

Files with Coverage Reduction	New Missed Lines	%
src/segselect.jl	1	89.23%
src/coordinate_descent.jl	3	93.63%
src/Lasso.jl	3	88.05%

Totals
Change from base Build 198:	6.6%
Covered Lines:	885
Relevant Lines:	972

💛 - Coveralls

AsafManela

Thanks for this change.
It seems like the test case is basically one where there is no variation in y.
Do you think you could add a test for this case?

src/Lasso.jl

Changing spelling of 'regularisation'.

barankarakus · 2020-09-20T22:06:13Z

Added tests (and some more minor changes). Let me know if anything else needs done!

AsafManela · 2020-09-22T22:18:23Z

src/Lasso.jl

 # Compute automatic λ values based on λmax and λminratio
 function computeλ(λmax, λminratio, α, nλ)
    λmax /= α
+    if isapprox(λmax, 0; atol=1e-10)  # then assuming λmax = 0


This is tricky because I think lambda is not unitless, so if it is small or not depends on the data given.
How does glmnet in R (or GLMNet.jl) handle this case?

The reason I've changed the equality check to an isapprox() check is due to floating point arithmetic leading to a lambdamax that should actually be zero being very close to zero but non-zero instead. Simple example when this happens is a design matrix X with entries sampled from U[0, 1] and y a non-zero vector with identical entries.

I agree, the data could be such that lambdamax is genuinely very small but non-zero.

That said, I think it would be very rare to encounter such data in practice... especially since lambdamax (for the linear model) scales linearly with X and y, and we tend to standardise these.

I see two approaches going forward:

Keep this check as is - the case where it would fail to produce correct output basically never occurs anyway.

Revert back to the equality check. The real case in which the package failed was the case where lambdamax was exactly zero, anyway. Moreover, even if lambdamax should be zero but instead is a very small number, there is no major problem: the solver works very fast and it is clear from the output that every value of lambda yields zero active coefficients.

I'll leave it to you to decide 😃.

Additionally: I'm not sure how glmnet in R or Julia handles this.

AsafManela · 2020-09-22T22:19:21Z

test/lasso.jl

+    return true
+end
+
+@test zero_variation_test() == true


Maybe use @test_log instead?

Also, any idea why the tests stopped working in julia v1.0?

Maybe use @test_log instead?

I agree. Will implement tomorrow.

Also, any idea why the tests stopped working in julia v1.0?

Unfortunately nope!

Correctly handling the case λmax = 0.

3ccd5f0

Two changes: 1) Change to computeλ to ensure λmax = 0 leads to an output of [0] and not [NaN, ..., NaN]. 2) Change to fit! to ensure the case where autoλ = true and λmax = 0 is handled correctly (rather than throwing an error).

barankarakus changed the title ~~Correctly handling the case λmax = 0.~~ Correctly handling the case λmax = 0; fixes #51 Sep 6, 2020

barankarakus changed the title ~~Correctly handling the case λmax = 0; fixes #51~~ Correctly handling the case λmax = 0. Sep 6, 2020

AsafManela requested changes Sep 20, 2020

View reviewed changes

src/Lasso.jl Outdated Show resolved Hide resolved

barankarakus and others added 3 commits September 20, 2020 21:44

Update src/Lasso.jl

f4a3923

Changing spelling of 'regularisation'.

Replacing equality with approximate equality.

5636fed

Added test for case: zero variation in y.

5ac04f7

AsafManela requested changes Sep 22, 2020

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Correctly handling the case λmax = 0. #53

Correctly handling the case λmax = 0. #53

Uh oh!

barankarakus commented Sep 6, 2020 •

edited

Loading

Uh oh!

coveralls commented Sep 6, 2020 •

edited

Loading

Uh oh!

AsafManela left a comment

Uh oh!

Uh oh!

barankarakus commented Sep 20, 2020

Uh oh!

AsafManela Sep 22, 2020

Uh oh!

barankarakus Sep 22, 2020

Uh oh!

barankarakus Sep 22, 2020

Uh oh!

AsafManela Sep 22, 2020

Uh oh!

AsafManela Sep 22, 2020

Uh oh!

barankarakus Sep 22, 2020

Uh oh!

barankarakus Sep 22, 2020

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Correctly handling the case λmax = 0. #53

Are you sure you want to change the base?

Correctly handling the case λmax = 0. #53

Uh oh!

Conversation

barankarakus commented Sep 6, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

coveralls commented Sep 6, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Pull Request Test Coverage Report for Build 202

💛 - Coveralls

Uh oh!

AsafManela left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

barankarakus commented Sep 20, 2020

Uh oh!

AsafManela Sep 22, 2020

Choose a reason for hiding this comment

Uh oh!

barankarakus Sep 22, 2020

Choose a reason for hiding this comment

Uh oh!

barankarakus Sep 22, 2020

Choose a reason for hiding this comment

Uh oh!

AsafManela Sep 22, 2020

Choose a reason for hiding this comment

Uh oh!

AsafManela Sep 22, 2020

Choose a reason for hiding this comment

Uh oh!

barankarakus Sep 22, 2020

Choose a reason for hiding this comment

Uh oh!

barankarakus Sep 22, 2020

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

barankarakus commented Sep 6, 2020 •

edited

Loading

coveralls commented Sep 6, 2020 •

edited

Loading