
Add model warmup and jax compilation cache flags #187


Merged

Conversation

vivianrwu (Collaborator)

Adds the flags for model warmup support in jetstream-pytorch, and adds JAX compilation cache flags.

Pass --enable_model_warmup=True in the server args:

        args:
        - --size=8b
        - --model_name=llama-3
        - --batch_size=128
        - --max_cache_length=2048
        - --quantize_weights=False
        - --quantize_kv_cache=False
        - --tokenizer_path=/models/llama3-8b/final/bf16/tokenizer.model
        - --checkpoint_path=/models/llama3-8b/final/bf16/model.safetensors
        - --enable_model_warmup=True

Logs that indicate model warmup is occurring:

2024-09-27 19:13:55,447 - root - INFO - ---------Prefill engine 0 compiled for prefill length 256.---------
I0927 19:13:55.447131 134077799315008 warmup_utils.py:108] ---------Prefill engine 0 compiled for prefill length 256.---------
I0927 19:13:55.618359 134077990557248 warmup_utils.py:108] ---------Prefill engine 0 compiled for prefill length 64.---------
2024-09-27 19:13:55,618 - root - INFO - ---------Prefill engine 0 compiled for prefill length 64.---------
2024-09-27 19:13:56,050 - root - INFO - ---------Prefill engine 0 compiled for prefill length 128.---------
I0927 19:13:56.050476 134077955900992 warmup_utils.py:108] ---------Prefill engine 0 compiled for prefill length 128.---------
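The per-length log lines above suggest the warmup step precompiles the prefill function once per bucketed prefill length at server start, so the first real request does not pay compile latency. A minimal illustrative sketch of that pattern (not jetstream-pytorch's actual warmup_utils.py; the bucket sizes 64/128/256 are taken from the logs, and lru_cache stands in for JAX's jit compile cache):

```python
import functools

PREFILL_BUCKETS = [64, 128, 256]  # assumed bucket sizes, per the logs above

@functools.lru_cache(maxsize=None)
def compile_prefill(prefill_length: int) -> str:
    # Stands in for an expensive, shape-specialized jit trace/compile.
    return f"compiled-prefill-{prefill_length}"

def warmup() -> list:
    # Compile every bucket up front, mirroring the per-length log lines.
    return [compile_prefill(n) for n in PREFILL_BUCKETS]

print(warmup())
```

After warmup, a request of any bucketed length hits the already-compiled entry instead of triggering a fresh compile.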

After SSH-ing into the container, the JAX compilation cache directory is populated:

root@jetstream-pytorch-server-db6f74545-m8gxd:~/jax_cache# ls
jit_generate_impl-HASH
jit_insert-HASH
jit_insert-HASH
...
jit_prefill-HASH
jit_prefill-HASH
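The listing shows one on-disk cache entry per jitted function and compiled variant, named <jitted_fn>-<hash>. A hypothetical sketch of that layout (the key scheme here is an assumption for illustration; JAX derives its real cache keys from the full computation, compile options, and backend, not just the function name and shape):

```python
import hashlib
import pathlib
import tempfile

def cache_key(fn_name, shape):
    # Hypothetical key: hash the function name plus input shape, so each
    # shape-specialized compilation gets its own file, as in the listing.
    digest = hashlib.sha256(repr((fn_name, shape)).encode()).hexdigest()[:16]
    return f"jit_{fn_name}-{digest}"

cache_dir = pathlib.Path(tempfile.mkdtemp())
for fn, shape in [("prefill", (1, 64)), ("prefill", (1, 128)), ("insert", (1,))]:
    # Each entry would hold the serialized compiled executable.
    (cache_dir / cache_key(fn, shape)).write_bytes(b"compiled-executable-bytes")

print(sorted(p.name for p in cache_dir.iterdir()))
```

Because entries are keyed files on disk, a restarted server pointed at the same cache directory can reload compilations instead of recompiling.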

@FanhaiLu1 (Collaborator) left a comment


Thanks for adding it! Looks good to me.

@qihqi qihqi merged commit f2e5181 into AI-Hypercomputer:main Oct 2, 2024
4 of 5 checks passed
3 participants