SmallThinker-3B-Preview-abliterated-GGUF
QuantFactory
Introduction
SmallThinker-3B-Preview-abliterated-GGUF is a quantized model for conversational tasks, published by QuantFactory. It builds on SmallThinker-3B-Preview-abliterated, which is itself an uncensored version of PowerInfer/SmallThinker-3B-Preview. That model was created with abliteration, a technique that removes refusals from a language model, applied here without using TransformerLens.
Architecture
This model is based on the original SmallThinker-3B architecture. It has been quantized using llama.cpp, which reduces the model size and computational requirements and makes the model suitable for a broader range of deployment environments.
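To make the size reduction concrete, the following is a back-of-the-envelope sketch rather than a measurement of the actual files. It assumes roughly 3 billion parameters (read from the "3B" in the model name) and uniform bit widths of 16, 8, and 4 bits per weight. Real GGUF quantization types mix precisions and add metadata, so actual file sizes will differ. The function name `estimate_size_gb` is hypothetical.

```python
def estimate_size_gb(n_params: float, bits_per_weight: float) -> float:
    """Approximate size of the weights alone, in GB (1 GB = 1e9 bytes)."""
    return n_params * bits_per_weight / 8 / 1e9


# Assumption: "3B" is taken as roughly 3 billion parameters.
N_PARAMS = 3e9

# Assumption: uniform bit widths; real quantization formats are not this simple.
for label, bits in [("16-bit", 16), ("8-bit", 8), ("4-bit", 4)]:
    print(f"{label}: ~{estimate_size_gb(N_PARAMS, bits):.1f} GB")
```

Under these assumptions the weights shrink from about 6 GB at 16 bits to about 1.5 GB at 4 bits, which is why a quantized build fits on hardware that the full-precision model would not.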
Training
The SmallThinker-3B-Preview-abliterated-GGUF model was not trained from scratch. It was produced by modifying the original SmallThinker-3B-Preview with a technique called abliteration, which is described in the remove-refusals-with-transformers repository and is used to eliminate refusal responses from the language model. The process does not use TransformerLens, so it offers an alternative to TransformerLens-based approaches.
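The source does not spell out how abliteration works. As the technique is commonly described, a "refusal direction" is estimated as the normalized difference between mean activations on prompts that trigger refusals and prompts that do not, and that direction is then projected out of the hidden states. The sketch below illustrates only that idea on toy 2-D vectors in plain Python. It is not the code from the remove-refusals-with-transformers repository, and all function names and data are hypothetical.

```python
import math


def mean_vector(vectors):
    """Element-wise mean of a list of equal-length vectors."""
    n = len(vectors)
    return [sum(column) / n for column in zip(*vectors)]


def refusal_direction(refusing_acts, complying_acts):
    """Unit vector pointing from the complying mean to the refusing mean."""
    diff = [a - b for a, b in zip(mean_vector(refusing_acts),
                                  mean_vector(complying_acts))]
    norm = math.sqrt(sum(x * x for x in diff))
    return [x / norm for x in diff]


def ablate(hidden, direction):
    """Remove the component of `hidden` that lies along `direction`."""
    dot = sum(h * d for h, d in zip(hidden, direction))
    return [h - dot * d for h, d in zip(hidden, direction)]


# Toy activations (made up for illustration): the two groups differ
# only along the first axis.
refusing = [[2.0, 1.0], [4.0, 1.0]]
complying = [[0.0, 1.0], [0.0, 1.0]]

direction = refusal_direction(refusing, complying)
ablated = ablate([5.0, 2.0], direction)
print(direction, ablated)
```

In this toy case the estimated direction is the first axis, so ablation zeroes the first component and leaves the second untouched. In a real model the same projection would be applied to high-dimensional hidden states or folded into the weights.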
Guide: Running Locally
To run the SmallThinker-3B-Preview-abliterated-GGUF model locally, follow these steps:
- Environment Setup: Ensure that your local environment has the necessary dependencies installed. This typically includes Python and any additional libraries required for model execution.
- Model Download: Obtain the model files from Hugging Face's model repository. Use the provided download links to get the quantized model version.
- Execution: Use a compatible framework to run the model. The model can be executed using Ollama by running the command:
  ollama run huihui_ai/smallthinker-abliterated
- Hardware Recommendations: For optimal performance, it is recommended to use a cloud GPU. Providers like AWS, Google Cloud, or Azure offer suitable infrastructure for running deep learning models efficiently.
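The Ollama command from the execution step can also be driven from a script. The following is a minimal sketch, assuming the `ollama` CLI is installed and on the PATH and that the model tag from the guide is available to it. The helper names `build_ollama_command` and `run_model` are hypothetical, and passing a prompt as a trailing argument relies on `ollama run` accepting an optional prompt.

```python
import shutil
import subprocess
from typing import List, Optional

# Model tag taken from the guide above.
MODEL = "huihui_ai/smallthinker-abliterated"


def build_ollama_command(model: str, prompt: Optional[str] = None) -> List[str]:
    """Build the argument list for `ollama run`, optionally with a one-shot prompt."""
    command = ["ollama", "run", model]
    if prompt is not None:
        command.append(prompt)
    return command


def run_model(model: str, prompt: str) -> Optional[str]:
    """Run a single prompt and return the output, or None if ollama is missing."""
    if shutil.which("ollama") is None:
        return None
    result = subprocess.run(
        build_ollama_command(model, prompt),
        capture_output=True,
        text=True,
        check=True,  # raise CalledProcessError if ollama exits with an error
    )
    return result.stdout


# Example usage (downloads the model on first run):
# print(run_model(MODEL, "Explain quantization in one sentence."))
```

Building the command as a list rather than a shell string avoids quoting problems when the prompt contains spaces or special characters.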
License
The licensing details for the SmallThinker-3B-Preview-abliterated-GGUF model are not explicitly provided in the documentation. Users should refer to the original model's repository or contact the developers for specific licensing terms and conditions.