Chocolatine Admin 3 B S F T v0.3b

jpacifico

Introduction

Chocolatine-Admin-3B-SFT-v0.3b is a French language model specialized in administrative language, developed by Jonathan Pacifico in collaboration with Microsoft. It is fine-tuned from the Chocolatine-3B-Instruct-DPO-v1.2 model, based on Microsoft's Phi-3.5-mini-instruct.

Architecture

The model is a version of the Chocolatine series, specifically fine-tuned for French administrative language tasks. It leverages the architecture of the Phi-3.5-mini-instruct model by Microsoft and incorporates modifications for handling administrative terminology.

Training

The model was trained using a dataset derived from the French DITP's official administrative lexicon, consisting of 2362 terms. The data preparation included extracting lexicon pages, reformulating definitions for readability, and generating prompt-answer pairs. The fine-tuning process involved 11 epochs on an A100 GPU instance via Azure Machine Learning.

Guide: Running Locally

To run Chocolatine-Admin-3B-SFT-v0.3b locally, follow these steps:

  1. Install Transformers: Ensure that the transformers library is installed in your environment.
  2. Import Libraries:
    import transformers
    from transformers import AutoTokenizer
    
  3. Prepare the Prompt: Format your message as a prompt.
    message = [
        {"role": "system", "content": "You are a helpful assistant chatbot."},
        {"role": "user", "content": "What is a Large Language Model?"}
    ]
    tokenizer = AutoTokenizer.from_pretrained(new_model)
    prompt = tokenizer.apply_chat_template(message, add_generation_prompt=True, tokenize=False)
    
  4. Create a Pipeline: Set up a text generation pipeline.
    pipeline = transformers.pipeline(
        "text-generation",
        model=new_model,
        tokenizer=tokenizer
    )
    
  5. Generate Text: Use the pipeline to generate text sequences.
    sequences = pipeline(
        prompt,
        do_sample=True,
        temperature=0.7,
        top_p=0.9,
        num_return_sequences=1,
        max_length=200,
    )
    print(sequences[0]['generated_text'])
    

For optimal performance, consider using cloud GPUs such as those provided by Azure.

License

Chocolatine-Admin-3B-SFT-v0.3b is licensed under the MIT License, allowing for flexibility in usage and distribution.

More Related APIs in Text Generation