Refactor `LigerFusedLinearPreferenceBase` #381

pramodith · 2024-11-14T12:32:12Z

Summary

This PR refactors the LigerFusedLinearPreferenceBase class to contain an abstractmethod corresponding to the calculation of the loss that needs to be implemented by all sub-classes.

It also adds a new function to the class called _compute_loss which is mostly the same as the _compute_orpo_loss function introduced in #362 but makes it generic to calculate the NLL/Cross Entropy Loss plus accepts a custom loss function that implements a new alignment loss function.

Most RLHF/RLAIF/Alignment algorithms state their final loss as NLL + Beta * (Alignment_Loss) so adding the NLL logic inside the base class reduces repeated code.

The _compute_loss function accepts

Testing Done

On A100-80G-SXM

Hardware Type:
run make test to ensure correctness
run make checkstyle to ensure code style
run make test-convergence to ensure convergence

shivam15s · 2024-11-14T15:50:42Z

This was quick. Great refactor!

shivam15s

lgtm

shivam15s · 2024-11-14T15:49:03Z

src/liger_kernel/chunked_loss/orpo_loss.py

+            _input,
+            weight,
+            target,
+            bias,


nit: I wrongly assumed that forward just takes positional arguments.
Could you also make input, weight target, bias keyword args?

shivam15s

lgtm

pramodith and others added 4 commits November 14, 2024 11:15

Initial commit

53f2005

Run all makes

047636b

Refactor

3cadc12

More refactoring

573b292

shivam15s previously approved these changes Nov 14, 2024

View reviewed changes

Fix nit

d6aad35

pramodith dismissed shivam15s’s stale review via d6aad35 November 14, 2024 15:54

pramodith enabled auto-merge (squash) November 14, 2024 15:55

shivam15s approved these changes Nov 14, 2024

View reviewed changes

pramodith merged commit 2281b7e into linkedin:main Nov 14, 2024
1 of 3 checks passed

pramodith deleted the pramodith/preference_loss_interface branch November 15, 2024 14:46

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Refactor `LigerFusedLinearPreferenceBase` #381

Refactor `LigerFusedLinearPreferenceBase` #381

pramodith commented Nov 14, 2024

shivam15s commented Nov 14, 2024

shivam15s left a comment

shivam15s Nov 14, 2024 •

edited

Loading

pramodith Nov 14, 2024

shivam15s left a comment

Refactor LigerFusedLinearPreferenceBase #381

Refactor LigerFusedLinearPreferenceBase #381

Conversation

pramodith commented Nov 14, 2024

Summary

Testing Done

shivam15s commented Nov 14, 2024

shivam15s left a comment

Choose a reason for hiding this comment

shivam15s Nov 14, 2024 • edited Loading

Choose a reason for hiding this comment

pramodith Nov 14, 2024

Choose a reason for hiding this comment

shivam15s left a comment

Choose a reason for hiding this comment

Refactor `LigerFusedLinearPreferenceBase` #381

Refactor `LigerFusedLinearPreferenceBase` #381

shivam15s Nov 14, 2024 •

edited

Loading