uniformize processor Mllama #33876

yonigozlan · 2024-10-01T22:53:32Z

What does this PR do?

Adds uniformized processors following #31911 for Mllama.

Makes very small changes to the Mllama processor so that it is consistent with other VLM processors
Adds ProcessorTesterMixin to the Mllama processor tests

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you read the contributor guideline, Pull Request section?
Was this discussed/approved via a GitHub issue or the forum? Please add a link to it if that's the case.
Did you make sure to update the documentation with your changes? Here are the documentation guidelines, and here are tips on formatting docstrings.
Did you write any new necessary tests?

Who can review?

@qubvel @molbap

HuggingFaceDocBuilderDev · 2024-10-01T22:53:44Z

Hey! 🤗 Thanks for your contribution to the transformers library!

Before merging this pull request, slow tests CI should be triggered. To enable this:

Add the run-slow label to the PR
When your PR is ready for merge and all reviewers' comments have been addressed, push an empty commit with the command [run-slow] followed by a comma separated list of all the models to be tested, i.e. [run_slow] model_to_test_1, model_to_test_2
- If the pull request affects a lot of models, put at most 10 models in the commit message
A transformers maintainer will then approve the workflow to start the tests

(For maintainers) The documentation for slow tests CI on PRs is here.

yonigozlan · 2024-10-01T22:55:06Z

tests/models/mllama/test_processor_mllama.py

+        self.bos_token = processor.bos_token
+        self.bos_token_id = processor.tokenizer.bos_token_id
+        self.tmpdirname = tempfile.mkdtemp()
+        processor.save_pretrained(self.tmpdirname)


Changed the use of self.processor to this so that each test starts with a "clean" processor.

That's a good idea, could be replaced by a fixture with a function scope then?

Actually, my bad: it looks like unittest's setUp and tearDown functions already act as function-scoped fixtures, so it's not necessary to use tempfile for that.
I don't know, then, why so many test_processor files save processors to a temporary directory in setUp, or why tempfile is used in so many unittest setUp functions in Transformers in general.
Unless there is a good reason to use temporary files, maybe we should think about removing them from the setUp functions in all tests?

self.tmpdirname is used in ProcessorTesterMixin, so in fact I can't remove it here, but maybe we could consider removing it from ProcessorTesterMixin in another PR.

No strong opinion on it. It's just convenient to use, it makes sure that saving to and loading from files works, and it can't create race conditions either. Since so many of the transformers utils are designed to work in tandem with the Hub, I suppose it makes sense to use tempfile in that regard.

Oh I see, I have no strong opinion on it either, so maybe no need to change it :)
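For readers following the thread above, here is a minimal sketch of the behaviour being discussed: unittest calls setUp before every test method and tearDown after it, so a directory created there is effectively a function-scoped fixture. `DummyProcessor`, its `"<bos>"` default, and the `processor_config.json` file name are hypothetical stand-ins invented for this illustration; they are not the transformers API.

```python
import json
import os
import shutil
import tempfile
import unittest


class DummyProcessor:
    """Hypothetical stand-in for a processor (not the transformers API)."""

    def __init__(self, bos_token="<bos>"):
        self.bos_token = bos_token

    def save_pretrained(self, directory):
        # Write the configuration to a JSON file inside the directory.
        with open(os.path.join(directory, "processor_config.json"), "w") as f:
            json.dump({"bos_token": self.bos_token}, f)

    @classmethod
    def from_pretrained(cls, directory):
        # Rebuild the object from the JSON file written by save_pretrained.
        with open(os.path.join(directory, "processor_config.json")) as f:
            return cls(**json.load(f))


class DummyProcessorTest(unittest.TestCase):
    def setUp(self):
        # Runs before each test method: every test gets its own directory.
        self.tmpdirname = tempfile.mkdtemp()
        DummyProcessor().save_pretrained(self.tmpdirname)

    def tearDown(self):
        # Runs after each test method: remove that test's directory.
        shutil.rmtree(self.tmpdirname, ignore_errors=True)

    def test_overwrite_saved_config(self):
        # Overwrite the saved files; this only affects this test's directory.
        DummyProcessor(bos_token="<changed>").save_pretrained(self.tmpdirname)
        reloaded = DummyProcessor.from_pretrained(self.tmpdirname)
        self.assertEqual(reloaded.bos_token, "<changed>")

    def test_reload_sees_defaults(self):
        # setUp created a new directory, so the overwrite above is not visible.
        reloaded = DummyProcessor.from_pretrained(self.tmpdirname)
        self.assertEqual(reloaded.bos_token, "<bos>")
```

Saving and reloading in setUp also exercises the save/load round trip on every test, which is the convenience mentioned in the reply above.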

HuggingFaceDocBuilderDev · 2024-10-01T23:19:48Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

molbap

LGTM! passing to @qubvel as well

molbap · 2024-10-02T08:33:09Z

src/transformers/models/mllama/processing_mllama.py

try:
    from typing import Unpack
except ImportError:
    from typing_extensions import Unpack

This should be replaced by importing Unpack from processing utils.

molbap · 2024-10-02T08:35:08Z

tests/models/mllama/test_processor_mllama.py

+        self.bos_token = processor.bos_token
+        self.bos_token_id = processor.tokenizer.bos_token_id
+        self.tmpdirname = tempfile.mkdtemp()
+        processor.save_pretrained(self.tmpdirname)


That's a good idea, could be replaced by a fixture with a function scope then?

molbap · 2024-10-02T08:39:37Z

tests/models/mllama/test_processor_mllama.py

-class MllamaProcessorTest(unittest.TestCase):
+class MllamaProcessorTest(ProcessorTesterMixin, unittest.TestCase):
+    processor_class = MllamaProcessor
+
    def setUp(self):
        self.checkpoint = "hf-internal-testing/mllama-11b"  # TODO: change


Is this TODO still needed?

No, it's not, can be removed
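The class change in the diff above follows the usual unittest mixin pattern. The following is a rough sketch of that pattern under stated assumptions: `SharedProcessorTestsMixin`, `ToyProcessor`, and the `toy_config.json` file name are hypothetical names made up for this example, and the real ProcessorTesterMixin may be organised differently. The only details taken from the thread are the `processor_class` attribute and the fact that the mixin relies on `self.tmpdirname`.

```python
import json
import os
import shutil
import tempfile
import unittest


class ToyProcessor:
    """Hypothetical processor used only for this illustration."""

    def __init__(self, max_length=8):
        self.max_length = max_length

    def save_pretrained(self, directory):
        with open(os.path.join(directory, "toy_config.json"), "w") as f:
            json.dump({"max_length": self.max_length}, f)

    @classmethod
    def from_pretrained(cls, directory):
        with open(os.path.join(directory, "toy_config.json")) as f:
            return cls(**json.load(f))


class SharedProcessorTestsMixin:
    # Concrete test classes set this to the class under test.
    processor_class = None

    def get_processor(self):
        # Relies on the concrete class having populated self.tmpdirname in setUp.
        return self.processor_class.from_pretrained(self.tmpdirname)

    def test_save_load_roundtrip(self):
        processor = self.get_processor()
        with tempfile.TemporaryDirectory() as other_dir:
            processor.save_pretrained(other_dir)
            reloaded = self.processor_class.from_pretrained(other_dir)
        self.assertEqual(reloaded.max_length, processor.max_length)


class ToyProcessorTest(SharedProcessorTestsMixin, unittest.TestCase):
    processor_class = ToyProcessor

    def setUp(self):
        # The mixin's tests read from this directory.
        self.tmpdirname = tempfile.mkdtemp()
        ToyProcessor(max_length=16).save_pretrained(self.tmpdirname)

    def tearDown(self):
        shutil.rmtree(self.tmpdirname, ignore_errors=True)
```

The mixin deliberately does not inherit from unittest.TestCase, so its shared tests are collected only through concrete subclasses such as `ToyProcessorTest`.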

qubvel

Agreed with the comments above; other than that, looks great to me! Thanks for updating it.

ArthurZucker

LGTM Thanks for fixing

* uniformize processor Mllama * nit syntax * nit

yonigozlan requested review from qubvel and molbap October 1, 2024 22:53

molbap approved these changes Oct 2, 2024

qubvel approved these changes Oct 2, 2024

yonigozlan requested a review from ArthurZucker October 2, 2024 13:55

ArthurZucker approved these changes Oct 2, 2024

yonigozlan added 3 commits October 2, 2024 14:44

uniformize processor Mllama

f149715

nit syntax

41b1b2f

nit

a657967

yonigozlan force-pushed the uniformize-processor-mllama branch from f23d64f to a657967 Compare October 2, 2024 14:45

yonigozlan merged commit d7950bf into huggingface:main Oct 2, 2024
11 checks passed

BernardZach pushed a commit to BernardZach/transformers that referenced this pull request Dec 5, 2024

uniformize processor Mllama (huggingface#33876)

591fa88

* uniformize processor Mllama * nit syntax * nit
