This repo represents a tiny, reforged version of the original MeDistil-d2n framework and the related paper studies for the BioASQ workshop on multilingual clinical text summarization. The original project has a major limitation: its dependence on `Seq2SeqTrainer`. The goal of this project is to bridge that gap by fine-tuning small language models (`AutoModelForCausalLM`) on long input contexts, relying heavily on decoder-based models together with the input Formatting Concepts described below.
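As a rough illustration of the input formatting idea (the roles and instruction text below are assumptions, not the repo's actual template), a decoder-only Qwen2.5 model receives the clinical report rendered into a single chat-style sequence:

```python
from transformers import AutoTokenizer

# Hypothetical illustration: wrap a clinical report into a chat-style prompt
# for a decoder-only model. The instruction wording is an assumption, not the
# repo's actual template (see the Formatting Concepts referenced below).
tokenizer = AutoTokenizer.from_pretrained("Qwen/Qwen2.5-0.5B-Instruct")

report = "Patient admitted with chest pain. ECG showed ST elevation. ..."
messages = [
    {"role": "system", "content": "You are a clinical summarization assistant."},
    {"role": "user", "content": f"Summarize the following clinical case report:\n{report}"},
]

# apply_chat_template renders the messages into the single text sequence
# that the causal LM is fine-tuned and prompted on.
prompt = tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)
print(prompt)
```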
- ✅ Replacement of `Seq2SeqTrainer` with `AutoModelForCausalLM` models (Qwen series in particular)
  - Support for instruction tuning
- ✅ Refactoring and narrowing of the scope, dropping dependencies
- ✅ Switched dependencies to Python 3.10+
- Narrowed the scope of the framework: DeepSpeed is not supported by default
- Reforged the data preparation concept (Qwen2.5 support) (see Formatting Concepts)
- Refactored evaluation
  - Fixed the `Trainer` limitation of not exploiting the `.generate` call for predictions
  - Fixed dataset cropping
- Support for rationale annotation using third-party API hosting (OpenRouter) (see the sketch after this list)
- Reforged the `TaskPrefixTrainer` prefix
  - Reforged the list of parameters
- ‼️ Memory leakage on evaluation
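For the OpenRouter-based rationale annotation mentioned above, here is a loose sketch of a single annotation call issued directly against OpenRouter's OpenAI-compatible endpoint. The repo itself drives this step through bulk-chain and its own scripts; the model id, environment variable, and prompt below are placeholders, not the repository's actual configuration.

```python
import os
from openai import OpenAI

# Loose illustration of rationale annotation via OpenRouter's
# OpenAI-compatible API. The repo drives this through bulk-chain;
# the model id, env var, and prompt here are placeholders.
client = OpenAI(
    base_url="https://openrouter.ai/api/v1",
    api_key=os.environ["OPENROUTER_API_KEY"],  # assumed environment variable
)

report = "Patient admitted with chest pain. ECG showed ST elevation. ..."
response = client.chat.completions.create(
    model="qwen/qwen-2.5-72b-instruct",  # placeholder OpenRouter model id
    messages=[{
        "role": "user",
        "content": f"Explain step by step which findings matter for summarizing this case:\n{report}",
    }],
)
print(response.choices[0].message.content)
```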
- Install the complete list of dependencies:
  ```bash
  pip install -r requirements.txt
  ```
- Download `punkt_tab` for `nltk`:
  ```python
  import nltk
  nltk.download('punkt_tab')
  ```
Manual training:

```bash
./distill_ft_qwen25_test.sh --from_pretrained "AutoModelForCausalLM-from-HF" --dataset "multiclinsum" --model_type "distill"
```
NOTE: We use the following post-processing script for dataset preparation.
List of the parameters:

- `--from_pretrained`: a Hugging Face model that nests `AutoModelForCausalLM`
- `--dataset`: `multiclinsum` (see the downloading script and post-processing)
- `--alpha`: task weight for multi-task training, $Loss = \alpha \cdot pred_l + (1 - \alpha) \cdot rationale_l$ (see the sketch after this list)
- `--model_type`:
  - `standard`: standard fine-tuning (baseline)
  - `distill`: distilling step-by-step
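As referenced in the `--alpha` description, here is a minimal sketch of the weighting; the tensor values are illustrative, and the actual combination happens inside the trainer:

```python
import torch

def multi_task_loss(pred_loss: torch.Tensor,
                    rationale_loss: torch.Tensor,
                    alpha: float) -> torch.Tensor:
    """Weighted sum of the answer-prediction loss and the rationale loss,
    mirroring Loss = alpha * pred_l + (1 - alpha) * rationale_l."""
    return alpha * pred_loss + (1.0 - alpha) * rationale_loss

# Example: equal weighting of both objectives.
loss = multi_task_loss(torch.tensor(2.1), torch.tensor(3.4), alpha=0.5)
print(loss)  # tensor(2.7500)
```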
The pretrained models are publicly available:
| Model 🤗 | Link |
|---|---|
| nicolay-r/qwen25-05b-multiclinsum-distil | model-card |
| nicolay-r/qwen25-05b-multiclinsum-standard | model-card |
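A minimal sketch of loading one of these checkpoints with `transformers`; the prompt wording and generation settings are assumptions rather than the exact layout used during fine-tuning (see the data formatting notes below):

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

# Load one of the published checkpoints from the Hugging Face Hub.
model_id = "nicolay-r/qwen25-05b-multiclinsum-distil"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id)

# The prompt wording is an assumption; follow the repo's Formatting Concepts
# for the layout actually used during fine-tuning.
report = "Patient admitted with chest pain. ECG showed ST elevation. ..."
inputs = tokenizer(f"Summarize the following clinical case report:\n{report}",
                   return_tensors="pt")
output_ids = model.generate(**inputs, max_new_tokens=256)

# Strip the prompt tokens so only the generated summary remains.
print(tokenizer.decode(output_ids[0, inputs["input_ids"].shape[1]:],
                       skip_special_tokens=True))
```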
We use the bulk-chain project to infer:
- `rationale` prompts, necessary for distill-based fine-tuning [using this script]
- Test data for competition submissions [using this script]
- MultiClinSum
  - We use the following script for downloading datasets.
  - Web: https://temu.bsc.es/multiclinsum
  - Data: https://zenodo.org/records/15463353 (see also the Zenodo API sketch at the end of this section)
  - BioASQ: http://bioasq.org/
- Data formatting for Qwen
- Fine-tuning setup
- bulk-chain: https://github.com/nicolay-r/bulk-chain
- Annotation and test-set inference.
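The repository's downloading script (linked above) is the intended way to fetch the data; purely as an illustration, the MultiClinSum Zenodo record referenced earlier can also be inspected programmatically through Zenodo's public REST API:

```python
import requests

# Query Zenodo's public REST API for the MultiClinSum record listed above
# and print the downloadable files. This is only an illustration; use the
# repository's downloading script for the actual dataset preparation.
record = requests.get("https://zenodo.org/api/records/15463353", timeout=30).json()
for f in record.get("files", []):
    print(f["key"], f["links"]["self"])
```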