Verus_ Vision_2.0b
SG161222Introduction
Verus Vision is a fine-tuned model aimed at generating high-quality, realistic images. It is based on the Flux Dev model and consists of 12 billion parameters. This model is designed for non-commercial use under the Flux-1-dev-non-commercial-license.
Architecture
Verus Vision is built upon the Flux Dev (de-distill) model architecture. The model's large parameter size of 12 billion enables the production of realistic images with high fidelity and detail.
Training
The training of Verus Vision involves multiple stages:
-
Stage 1: Data Preparation
- Completed tasks include dataset collection, processing, and image captioning.
-
Stage 2: Model Training
- Currently in progress, with part 4 out of an unspecified total completed, involving 800 images so far.
-
Stage 3: Testing and Comparison
- Model testing and comparison with the base model have yet to commence.
-
Stage 4: Release
- Upcoming plans include releasing the Transformer version of the model.
Guide: Running Locally
To run Verus Vision locally, follow these steps:
-
Environment Setup
- Ensure the necessary dependencies are installed, such as Python and required libraries.
-
Model Download
- Obtain the model files from the Hugging Face repository.
-
Execution
- Load the model using a suitable framework like PyTorch or TensorFlow.
- Run the model inference with your image data.
For optimal performance, it is recommended to use cloud GPUs like those provided by AWS, Google Cloud, or Azure to handle the large parameter size efficiently.
License
The Verus Vision model is distributed under the Flux-1-dev-non-commercial-license, which limits its use to non-commercial purposes. Further details can be found in the license document.