-
Notifications
You must be signed in to change notification settings - Fork 3.9k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
WIP: [R-package] Add support for specifying training indices in lgb.cv() #3989
Conversation
As seen in issue microsoft#3924
No idea why Azure pipelines are unhappy. It looks like it was an issue with the instances while building the containers... Shall we force rebuild? |
Thanks, done! |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks for taking the time to contribute! This is a really nice addition.
Could you please add a unit test in the lgb.cv()
section, checking that this behavior works as expected? It would be great if you can set up a small time-series cross validation example as the test. https://github.com/microsoft/LightGBM/blob/master/R-package/tests/testthat/test_basic.R#L262
I changed the PR's description to say "fixes" instead of "as seen in". That way when we merge this, the issue will be closed automatically. |
Sure! I will try to come up with a meaningful test. |
@julioasotodv are you still interested in contributing this? I'm available to help if you have any questions. |
@jameslamb Indeed! Sorry had a couple of busy weeks... Yes, I was thinking on how to make a test for this, as I don't see it straightforward... Since But perhaps there is a better way I did not think of... |
No problem, and no rush at all! Let me know if it's too simplistic, but what about this?
|
gently pinging @julioasotodv , is there anything I can do to help you with this? |
For now, I'm going to close this PR due to lack of response. @julioasotodv, thank you again for your interest in LightGBM and a great issue write-up in #3924! If you have the time and interest to contribute this in the future, we'd welcome a new pull request. I think providing better support for customization of the cross-validation process (like for cases mentioned in #3924 where you want to do time-based splits) is a valuable contribution to the R package, but not one that will be done by maintainers before the next release (#4310). I'm going to lock discussion on this PR for now to focus all discussion of this feature back on #3924. Anyone who is interested in contributing this feature (or wants to ask maintainers to give it higher priority and implement it) is welcome to comment on #3924. |
fixes #3924