test: Add test for ref_input parameter in fused linear preference #468

xingyaoww · 2024-12-11T21:37:27Z

This PR adds a test for the ref_input parameter that was introduced in #467.

Changes

- Add test_ref_input.py to verify that the ref_input parameter works correctly in LigerFusedLinearPreferenceBase.
- The test ensures that:
  - Policy model outputs (chosen_logps, rejected_logps) are identical whether or not ref_input is used.
  - The final loss and aux outputs differ when the reference model uses ref_input instead of input_chunk.
- Uses the same parametrization as the other tests for consistency.

Testing

The test verifies that:

- When ref_input is not provided, the reference model uses input_chunk.
- When ref_input is provided, the reference model uses it instead of input_chunk.
- The policy model outputs remain unchanged regardless of ref_input.
- The final loss differs when using different inputs for the reference model.
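
The four properties above can be illustrated with a small pure-Python sketch. This is not the contents of test_ref_input.py and it does not call Liger Kernel: `toy_preference_forward`, its arguments, the toy weights, and the DPO-style loss are all hypothetical stand-ins, chosen only to show the fallback from ref_input to input_chunk and the assertions that follow from it.

```python
import math


def log_softmax(row):
    """Numerically stable log-softmax over a list of logits."""
    m = max(row)
    lse = m + math.log(sum(math.exp(v - m) for v in row))
    return [v - lse for v in row]


def target_logp(x, weight, target):
    """Log-probability of `target` under a linear head (weight is vocab x hidden)."""
    logits = [sum(w * xi for w, xi in zip(w_row, x)) for w_row in weight]
    return log_softmax(logits)[target]


def toy_preference_forward(input_chunk, weight, targets, ref_weight,
                           ref_input=None, beta=0.1):
    """Hypothetical stand-in for a fused linear preference forward pass.

    input_chunk and ref_input are [chosen_features, rejected_features].
    """
    # Assumed fallback: the reference model reads input_chunk unless
    # ref_input is given.
    ref_source = input_chunk if ref_input is None else ref_input

    # The policy model always reads input_chunk.
    chosen_logps = target_logp(input_chunk[0], weight, targets[0])
    rejected_logps = target_logp(input_chunk[1], weight, targets[1])

    ref_chosen = target_logp(ref_source[0], ref_weight, targets[0])
    ref_rejected = target_logp(ref_source[1], ref_weight, targets[1])

    # DPO-style loss, used here purely for illustration: -log(sigmoid(margin)).
    margin = beta * ((chosen_logps - ref_chosen) - (rejected_logps - ref_rejected))
    loss = math.log1p(math.exp(-margin))
    return loss, (chosen_logps, rejected_logps)


# Toy data: 3-token vocabulary, hidden size 2.
weight = [[0.2, -0.1], [0.4, 0.3], [-0.5, 0.1]]
ref_weight = [[0.1, 0.0], [0.3, 0.2], [-0.2, 0.4]]
input_chunk = [[1.0, 2.0], [0.5, -1.0]]
ref_input = [[2.0, 0.5], [-1.0, 1.5]]
targets = [1, 2]

loss_default, logps_default = toy_preference_forward(
    input_chunk, weight, targets, ref_weight)
loss_ref, logps_ref = toy_preference_forward(
    input_chunk, weight, targets, ref_weight, ref_input=ref_input)

# Policy outputs do not depend on ref_input; the loss does.
assert logps_default == logps_ref
assert not math.isclose(loss_default, loss_ref)
```

Passing `ref_input=input_chunk` explicitly reproduces the default loss in this sketch, which is one way to confirm that omitting ref_input really does fall back to input_chunk.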

Follows up on #467, which added ref_input parameter support.

- Add test to verify ref_input parameter works correctly in LigerFusedLinearPreferenceBase
- Test ensures policy outputs are identical but losses differ when using ref_input
- Follows PR linkedin#467 which added ref_input parameter support

openhands-agent and others added 3 commits December 11, 2024 21:37

Fix style issues in test_ref_input.py

77a35c0

Merge branch 'main' into add-ref-input-test

15ce9c4

3 participants
