Flux Cute 3 D Kawaii Lo R A LLM Model

Introduction

The Flux-Cute-3D-Kawaii-LoRA model is a text-to-image diffusion model designed to generate 3D kawaii-style images based on text prompts. It utilizes the Diffusers framework and is developed as a LoRA (Low-Rank Adaptation) model.

Architecture

Base Model: black-forest-labs/FLUX.1-dev
Training Parameters:
- LR Scheduler: Constant
- Optimizer: AdamW
- Network Dimensions: 64
- Network Alpha: 32
- Epochs: 18
Image Processing:
- Noise Offset: 0.03
- Multires Noise: Discount 0.1, Iterations 10
- Repeat & Steps: 25 & 3270
Image Dimensions: Best at 768 x 1024, default 1024 x 1024

Training

The model was trained with:

Total Images: 24
Labeling: florence2-en (natural language & English)
The model is still under training and may contain artifacts or perform poorly in some scenarios.

Guide: Running Locally

Setup Environment: Ensure you have Python and PyTorch installed.

Import Required Libraries:

import torch
from pipelines import DiffusionPipeline

Initialize Model:

base_model = "black-forest-labs/FLUX.1-dev"
pipe = DiffusionPipeline.from_pretrained(base_model, torch_dtype=torch.bfloat16)

Load LoRA Weights:

lora_repo = "strangerzonehf/Flux-Cute-3D-Kawaii-LoRA"
pipe.load_lora_weights(lora_repo)

Setup Device:

device = torch.device("cuda")
pipe.to(device)

Generate Images: Use the trigger word "Cute 3d Kawaii" to generate images.

For optimal performance, running on cloud GPUs such as those provided by AWS, GCP, or Azure is recommended.

License

The model is licensed under the creativeml-openrail-m license, which allows for certain uses, adaptations, and redistributions under specified conditions.

More Related APIs in Text To Image