noobai X L V Pred cyberfixv2 perpendicularcyberfixv2

Panchovix

NOOBAI-XL-VPRED-CYBERFIXV2-PERPENDICULARCYBERFIXV2

Introduction

The NOOBAI-XL-VPRED-CYBERFIXV2-PERPENDICULARCYBERFIXV2 is an advanced text-to-image generative model based on Laxhar's NOOBAI-XL. It utilizes comprehensive datasets such as Danbooru and e621, integrating native tags with natural language captioning. This model operates distinctly as a v-prediction model and requires specific configurations for optimal performance.

Architecture

  • Developed by: Laxhar Lab
  • Model Type: Diffusion-based text-to-image generative model
  • Fine-tuned from: Laxhar/noobai-XL_v1.0
  • Sponsored by: Lanyun Cloud

Training

The model is trained using the latest Danbooru and e621 datasets, ensuring high-quality image generation. It employs data normalization, time-based decay coefficients, and ranks images to enhance quality tag accuracy. The model is fine-tuned for specific quality and aesthetic tags, considering user preferences and image aesthetics.

Guide: Running Locally

Method I: ReForge

  1. Install ReForge by following the instructions here.
  2. Launch WebUI and use the model.

Method II: ComfyUI

  1. Install ComfyUI by following the instructions here.
  2. Use the model with sample nodes.

Method III: WebUI

  1. Install WebUI by following the instructions here.
  2. Switch to the development branch using git switch dev.
  3. Pull the latest updates using git pull.
  4. Launch WebUI and use the model.

Method IV: Diffusers

import torch
from diffusers import StableDiffusionXLPipeline, EulerDiscreteScheduler

ckpt_path = "/path/to/model.safetensors"
pipe = StableDiffusionXLPipeline.from_single_file(
    ckpt_path,
    use_safetensors=True,
    torch_dtype=torch.float16,
)
scheduler_args = {"prediction_type": "v_prediction", "rescale_betas_zero_snr": True}
pipe.scheduler = EulerDiscreteScheduler.from_config(pipe.scheduler.config, **scheduler_args)
pipe.enable_xformers_memory_efficient_attention()
pipe = pipe.to("cuda")

prompt = "masterpiece, best quality, ..."
negative_prompt = "nsfw, worst quality, ..."
image = pipe(
    prompt=prompt,
    negative_prompt=negative_prompt,
    width=832,
    height=1216,
    num_inference_steps=28,
    guidance_scale=5,
    generator=torch.Generator().manual_seed(42),
).images[0]
image.save("output.png")

Recommended Settings

  • CFG: 4 ~ 5
  • Steps: 28 ~ 35
  • Sampling Method: Euler
  • Resolution: Around 1024x1024, with specific options like 768x1344, 832x1216, etc.

Cloud GPUs

For better performance, consider using cloud GPUs from providers like AWS, Google Cloud, or Azure.

License

The model is licensed under the fair-ai-public-license-1.0-sd, with specific usage restrictions:

  1. Usage Restrictions: Prohibits harmful, malicious, or illegal activities and the generation of unethical content.
  2. Commercial Prohibition: Commercial use, monetization, or derivative commercial products are prohibited.
  3. Open Source Community: Requires open source derivative works and shared synthesis details.
  4. Disclaimer: Users must assume risks and potential consequences of usage.

More Related APIs in Text To Image