Refactor HuggingFace model initialization to include base model name … #190

jardinetsouffleton · 2024-12-19T06:34:03Z

…and update tokenizer logic

recursix · 2024-12-19T12:57:40Z

src/agentlab/llm/huggingface_utils.py

+        if base_model_name is None:
+            self.tokenizer = AutoTokenizer.from_pretrained(model_name)
+        else:
+            self.tokenizer = AutoTokenizer.from_pretrained(base_model_name)


I don't get what's the point of that...

Yes, sorry, that was a little cryptic. I made this change to support LoRA checkpoints, which are stored in the model_name directory and applied to a base model stored in the base_model_name directory. The adapter directory contains only the adapters, while the base model's safetensors, tokenizer files, etc. are located in base_model_name, so the tokenizer has to be loaded from there.
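For readers following the thread, here is a minimal sketch of the loading pattern described above. It is not the code from this PR: the helper names `resolve_tokenizer_source` and `load_lora_model` are hypothetical, and applying the adapters with `peft.PeftModel` is an assumption, since the diff only shows the tokenizer branch.

```python
from typing import Optional


def resolve_tokenizer_source(model_name: str, base_model_name: Optional[str] = None) -> str:
    """Return the path or hub ID the tokenizer should be loaded from (hypothetical helper)."""
    # A LoRA checkpoint directory holds only adapter weights, so when a base
    # model is given, its directory is the one that contains the tokenizer files.
    return model_name if base_model_name is None else base_model_name


def load_lora_model(model_name: str, base_model_name: Optional[str] = None):
    """Load a plain model, or a base model with LoRA adapters applied (hypothetical helper)."""
    # Imports are local so the resolver above works without these packages installed.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(
        resolve_tokenizer_source(model_name, base_model_name)
    )
    if base_model_name is None:
        # Regular checkpoint: weights and tokenizer live in the same place.
        model = AutoModelForCausalLM.from_pretrained(model_name)
    else:
        # Assumption: the adapters are applied with peft on top of the base weights.
        from peft import PeftModel

        base_model = AutoModelForCausalLM.from_pretrained(base_model_name)
        model = PeftModel.from_pretrained(base_model, model_name)
    return model, tokenizer
```

With this layout, model_name alone behaves as before, and passing base_model_name switches both the weights and the tokenizer lookup to the base model directory.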

Refactor HuggingFace model initialization to include base model name …

2195a02

…and update tokenizer logic

jardinetsouffleton requested review from recursix and ThibaultLSDC December 19, 2024 06:34

recursix reviewed Dec 19, 2024


recursix approved these changes Dec 20, 2024

