Adding PhiConfig #1568
Conversation
Using the same test script as elsewhere. To reproduce, run the converter and then the script below:
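(The converter invocation itself is not shown in the comment; a minimal sketch using CTranslate2's Python converter API, assuming the output path matches the one loaded by the test script:)

from ctranslate2.converters import TransformersConverter

# Sketch: convert the Hugging Face checkpoint into the directory
# that the test script loads below.
converter = TransformersConverter("microsoft/phi-1_5", trust_remote_code=True)
converter.convert("/home/michi/ct2_convert_phi")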
import ctranslate2
from transformers import AutoTokenizer, AutoModelForCausalLM
from timeit import default_timer as timer

model_ct2 = ctranslate2.Generator(
    "/home/michi/ct2_convert_phi",
    device="cpu",
    compute_type="float32",
)

name_hf = "microsoft/phi-1_5"
model_hf = AutoModelForCausalLM.from_pretrained(name_hf, trust_remote_code=True)
tokenizer = AutoTokenizer.from_pretrained(name_hf, device="cpu", trust_remote_code=True)

text = "# this code print('hello', name) in python3 \n\ndef hello_name(name: "

def inference_ct2(text: str):
    # CTranslate2 expects string tokens, not token ids.
    tokens_in = tokenizer.convert_ids_to_tokens(tokenizer.encode(text))
    s = timer()
    tokens_out = model_ct2.generate_batch([tokens_in], max_length=32, min_length=32)
    total_ct2 = timer() - s
    text_out = tokenizer.decode(tokens_out[0].sequences_ids[0])
    return total_ct2, text_out

def inference_hf(text: str):
    inputs = tokenizer.encode(text, return_tensors="pt")
    s = timer()
    outputs = model_hf.generate(inputs, max_length=32, min_length=32)
    total_hf = timer() - s
    text_out2 = tokenizer.batch_decode(outputs)[0]
    return total_hf, text_out2

# Warm up both backends so the timed runs are comparable.
_, _ = inference_ct2("warm up")
_, _ = inference_hf("warm up")

total_ct2, text_out_ct2 = inference_ct2(text)
total_hf, text_out_hf = inference_hf(text)

print("hf\n", text_out_hf, "\nct2\n", text_out_ct2)
print(f"time for ct2={total_ct2} time for hf={total_hf}")
assert text_out_ct2[:len(text_out_hf)] == text_out_hf  # same output, ct2 generates +1 token compared to HF
print("done")

I get:

>>> inference_ct2("warm up")
(6.334434970000075, 'warm up before exercising.\n\n3. Why did John feel energized and ready to tackle the day after his workout?\nAnswer: John felt energized and')
>>> inference_hf("warm up")
(7.759652636999817, 'warm up before exercising.\n\n3. Why did John feel energized and ready to tackle the day after his workout?\nAnswer: John felt energized')
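As a quick check of the "+1 token" note in the assert, one could compare raw token counts instead of decoded strings (a sketch reusing the objects defined in the script above; the expected off-by-one is an assumption based on that comment, not verified here):

# Sketch: compare generated token counts directly, reusing model_ct2,
# model_hf, tokenizer, and text from the script above.
tokens_in = tokenizer.convert_ids_to_tokens(tokenizer.encode(text))
ids_ct2 = model_ct2.generate_batch([tokens_in], max_length=32, min_length=32)[0].sequences_ids[0]
ids_hf = model_hf.generate(tokenizer.encode(text, return_tensors="pt"),
                           max_length=32, min_length=32)[0].tolist()
print(len(ids_ct2), len(ids_hf))  # expected: ct2 returns one more token than HF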
Can you fix flake8?
Done!
Looking at the transformers library, I don't see MixFormerSequentialLoader used anywhere, so I'm wondering whether we could just get rid of it or if there is a real reason to keep this class.
Valid point, I forgot that. It was my starting point.
Sorry to be annoying, but shouldn't we remove the other one completely? Unless we want to support older versions of the transformers library for now.
No idea which models from Hugging Face use "MixFormerSequentialLoader"; I assume there are some. This PR is for supporting "PhiConfig". I initially thought the structure was very similar to MixFormerSequentialLoader, but that was not the case. I think it makes sense to keep it and open a separate class, right?
Let's keep both for now, but I really think this is the same model and they only renamed it. I see your changes vs. the older one, but I think the specs are equivalent. Obviously you tested it, so I'll merge it.
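(For context: loaders in CTranslate2's transformers converter are registered by the Hugging Face config class name, which is why the rename to PhiConfig needs its own entry even if the specs are equivalent. An illustrative skeleton of that registration pattern, not the PR's actual code; the method body is a placeholder:)

from ctranslate2.converters.transformers import ModelLoader, register_loader

@register_loader("PhiConfig")
class PhiLoader(ModelLoader):
    @property
    def architecture_name(self):
        # Phi checkpoints use remote code, so they load via AutoModelForCausalLM.
        return "AutoModelForCausalLM"

    def get_model_spec(self, model):
        # Placeholder: build a decoder spec and copy the HF weights into it.
        ...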
Work in progress to add one more model (microsoft/phi).
I am really not sure if my conversions are right, maybe give it a check. Heavily based on MixFormerSequentialLoader.
Closes #1559