M N 12 B Mag Mell R1 G G U F I Q A R M Imatrix
LewdiculousIntroduction
The MN-12B-Mag-Mell-R1-GGUF-IQ-ARM-Imatrix is a model focused on roleplay and conversational applications. It is designed to be used with the ChatML prompt format, providing flexibility for various interactive scenarios.
Architecture
This model is a quantized version of the inflatebot/MN-12B-Mag-Mell-R1, optimized for roleplaying and conversational use. It utilizes the GGUF and IMATRIX libraries, supporting features like NeMo integration and English language processing.
Training
The model was tested under conditions of Temperature 1.25 and MinP 0.2, maintaining stability up to 10,000 steps. Adjustments to Temperature or MinP may be necessary to enhance coherence based on specific use cases.
Guide: Running Locally
- Setup Environment: Install Python and necessary libraries such as
transformers
andtorch
. - Download Model: Obtain the model files from the Hugging Face model page.
- Load Model: Use the
transformers
library to load the model into your application. - Adjust Parameters: Tweak the Temperature and MinP settings as needed for your use case.
- Run Inference: Initiate the model with the desired input prompts using the ChatML format.
For optimal performance, consider utilizing cloud GPUs from providers such as AWS, Google Cloud, or Azure.
License
This model is distributed under the Unlicense, allowing for unrestricted use, modification, and distribution.