Dark Thing Flux Lo R A LLM Model

Introduction

Dark-Thing-Flux-LoRA is a text-to-image model under development by prithivMLmods. It uses LoRA (Low-Rank Adaptation) for generating images with specified prompts, focusing on themes like "Dark Creature."

Architecture

The model operates as an extension of the black-forest-labs/FLUX.1-dev base model, integrating LoRA technology to enhance image generation. It employs a diffusion pipeline with parameters such as AdamW optimizer and constant LR scheduler to refine image outputs.

Training

Parameters:
- LR Scheduler: Constant
- Optimizer: AdamW
- Network Dimensions: 64
- Network Alpha: 32
- Epochs: 10
- Image Dimensions: 1024 x 1024 (default)
Noise Parameters:
- Noise Offset: 0.03
- Multires Noise Discount: 0.1
- Multires Noise Iterations: 10
Training Data:
- Total Images: 15

The model remains in training and not finalized, potentially containing artifacts.

Guide: Running Locally

Set Up Environment:
- Install necessary libraries, including torch and pipelines.

Load the Model:

import torch
from pipelines import DiffusionPipeline

base_model = "black-forest-labs/FLUX.1-dev"
pipe = DiffusionPipeline.from_pretrained(base_model, torch_dtype=torch.bfloat16)

Load LoRA Weights:

lora_repo = "prithivMLmods/Dark-Thing-Flux-LoRA"
pipe.load_lora_weights(lora_repo)

Set Device:

device = torch.device("cuda")
pipe.to(device)

Use Trigger Words:
- Trigger image generation using the term "Dark Creature."

Recommendation: Utilize cloud GPUs for optimal performance, such as AWS EC2 or Google Cloud's AI Platform.

License

The model is licensed under the CreativeML OpenRAIL-M license.

More Related APIs in Text To Image