Waifu Diffusion V1.4
hakurei
Introduction
Waifu Diffusion V1.4 is a latent text-to-image diffusion model fine-tuned on high-quality anime images. It builds on Stable Diffusion to generate anime-style artwork from text prompts.
Architecture
The model uses the Stable Diffusion latent diffusion approach to turn text prompts into detailed images, with a focus on anime aesthetics. It was fine-tuned on a dataset of anime images to improve output quality in that style.
Training
Waifu Diffusion V1.4 was developed by fine-tuning on anime images to improve generation quality. The training includes a first epoch (Anime Epoch 1) intended to validate that the training setup works. An inference configuration file is provided for use with Automatic's WebUI and the original Stable Diffusion codebase.
Guide: Running Locally
- Set Up Environment: Prepare a Python environment and install the dependencies required by the interface you plan to use.
- Download Model: Obtain the model weights and the inference configuration file from the Hugging Face repository.
- Run Inference: Load the weights together with the inference configuration in a compatible interface such as Automatic's WebUI.
- Hardware Recommendation: A GPU is strongly recommended for inference. If you do not have one locally, consider a cloud GPU service such as AWS or Google Cloud.
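The download and run steps above can be sketched in Python. This is a minimal illustration, not an official installer, and it rests on several assumptions that the text above does not confirm: the repository id `hakurei/waifu-diffusion-v1-4`, the file names `wd-1-4-anime_e1.ckpt` and `wd-1-4-anime_e1.yaml`, the third-party `huggingface_hub` package, and the WebUI layout in which checkpoints live under `models/Stable-diffusion` and a config is matched to a checkpoint by sharing its base name. The helper names `download_model` and `install_for_webui` are hypothetical.

```python
from __future__ import annotations

import shutil
from pathlib import Path

# Assumptions (verify against the actual repository before use).
REPO_ID = "hakurei/waifu-diffusion-v1-4"
CHECKPOINT_NAME = "wd-1-4-anime_e1.ckpt"
CONFIG_NAME = "wd-1-4-anime_e1.yaml"


def download_model(target_dir: Path) -> tuple[Path, Path]:
    """Fetch the checkpoint and its inference config into target_dir."""
    # Imported lazily so the rest of this sketch works without the package.
    from huggingface_hub import hf_hub_download

    target_dir.mkdir(parents=True, exist_ok=True)
    ckpt = hf_hub_download(
        repo_id=REPO_ID, filename=CHECKPOINT_NAME, local_dir=target_dir
    )
    cfg = hf_hub_download(
        repo_id=REPO_ID, filename=CONFIG_NAME, local_dir=target_dir
    )
    return Path(ckpt), Path(cfg)


def install_for_webui(ckpt: Path, cfg: Path, webui_root: Path) -> tuple[Path, Path]:
    """Copy both files into the WebUI model folder under one shared base name."""
    # Assumed layout: <webui_root>/models/Stable-diffusion/
    model_dir = webui_root / "models" / "Stable-diffusion"
    model_dir.mkdir(parents=True, exist_ok=True)

    ckpt_dest = model_dir / ckpt.name
    # The config is renamed to the checkpoint's base name plus ".yaml".
    cfg_dest = model_dir / (ckpt.stem + ".yaml")

    shutil.copy2(ckpt, ckpt_dest)
    shutil.copy2(cfg, cfg_dest)
    return ckpt_dest, cfg_dest
```

A typical call sequence would be `ckpt, cfg = download_model(Path("downloads"))` followed by `install_for_webui(ckpt, cfg, Path("stable-diffusion-webui"))`, where the second path is a placeholder for wherever the WebUI is checked out. After that, the checkpoint should be selectable from the WebUI's model list.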
License
The model is released under the CreativeML OpenRAIL-M license, which permits open access and commercial use. Key points include:
- You may not use the model to deliberately produce or share illegal or harmful content.
- The authors claim no rights over the outputs you generate. You are accountable for their use, which must comply with the license terms.
- You may redistribute the weights and use the model commercially, provided you apply the same license terms and share a copy of the license with your end users.
For full details, refer to the CreativeML OpenRAIL-M license.