[Base Model XL] Human Preference LoRA: improves lighting and detail

LORA
Original · Creator Incentive Program


Human Preference Lora Alpha

This is an alpha version of my Human Preference LoRA.

Effect

This LoRA tends to produce more plausible appearance and shadows.

It adds or changes details.

It aligns images with human aesthetic preferences.

Dataset

The source dataset is Pick-a-Pic v2:

https://huggingface.co/datasets/yuvalkirstain/pickapic_v2

I filtered 2,500 high-quality pairs for training.

PS: This is only an alpha release as a proof of concept. The training set will grow once more pairs have been filtered.
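
The filter criteria are not stated above, so the following is only a minimal sketch of what selecting preference pairs could look like. It assumes each record carries `caption`, `label_0`, `image_0_uid` and `image_1_uid` fields (field names are assumptions here, as is the rule of dropping ties and empty captions); `select_pairs` and `PreferencePair` are hypothetical names.

```python
from typing import Iterable, List, NamedTuple


class PreferencePair(NamedTuple):
    caption: str
    preferred: str  # identifier of the image the annotator chose
    rejected: str   # identifier of the image the annotator passed over


def select_pairs(records: Iterable[dict], limit: int) -> List[PreferencePair]:
    """Keep up to `limit` records that have a caption and a decisive label.

    Assumed convention: label_0 == 1 means the first image won,
    label_0 == 0 means the second image won, anything else is a tie.
    """
    pairs: List[PreferencePair] = []
    for rec in records:
        if len(pairs) >= limit:
            break
        caption = (rec.get("caption") or "").strip()
        label_0 = rec.get("label_0")
        # Skip ties, missing labels, and empty prompts.
        if not caption or label_0 not in (0, 1):
            continue
        first, second = rec["image_0_uid"], rec["image_1_uid"]
        if label_0 == 1:
            pairs.append(PreferencePair(caption, first, second))
        else:
            pairs.append(PreferencePair(caption, second, first))
    return pairs
```

A real quality filter would likely add further checks (for example a score threshold), which this sketch leaves out.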

Training method

The training code is modified from the slider codebase: I changed it to iterate over image pairs with captions.
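
As a rough illustration of iterating over captioned image pairs, the sketch below shuffles the pairs each epoch and visits both images of every pair. Attaching a scale of `+1.0` to the preferred image and `-1.0` to the rejected one is an assumption about how a slider-style trainer might consume a pair, not something stated above; `iterate_pairs` and the tuple layout are hypothetical.

```python
import random
from typing import Iterator, List, Sequence, Tuple

# (caption, preferred_image_path, rejected_image_path)
Pair = Tuple[str, str, str]
# (caption, image_path, lora_scale)
Sample = Tuple[str, str, float]


def iterate_pairs(pairs: Sequence[Pair], epochs: int, seed: int = 0) -> Iterator[Sample]:
    """Yield one training sample per image, two per pair, in shuffled order."""
    rng = random.Random(seed)
    order: List[int] = list(range(len(pairs)))
    for _ in range(epochs):
        rng.shuffle(order)  # new pair order every epoch
        for idx in order:
            caption, preferred, rejected = pairs[idx]
            # Assumed convention: positive scale pulls toward the preferred
            # image, negative scale toward the rejected one.
            yield caption, preferred, 1.0
            yield caption, rejected, -1.0
```

Both images of a pair share one caption, so the only difference the LoRA sees between the two samples is which image was preferred.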

If necessary, I might change the loss function to the one from

Diffusion Model Alignment Using Direct Preference Optimization

https://arxiv.org/pdf/2311.12908.pdf
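
For orientation, here is a minimal sketch of the preference loss from that paper, written on plain floats. It assumes the per-sample denoising errors (squared error between true and predicted noise) have already been computed for the trained model and for a frozen reference model, on the preferred (`w`) and rejected (`l`) image of each pair. The paper's timestep weighting and constants are folded into a single `beta` here, and the function names are hypothetical.

```python
import math
from typing import Sequence


def log_sigmoid(x: float) -> float:
    """Numerically stable log(1 / (1 + exp(-x)))."""
    return min(x, 0.0) - math.log1p(math.exp(-abs(x)))


def diffusion_dpo_loss(
    model_err_w: Sequence[float],
    model_err_l: Sequence[float],
    ref_err_w: Sequence[float],
    ref_err_l: Sequence[float],
    beta: float,
) -> float:
    """Mean of -log sigmoid(-beta * ((model_w - model_l) - (ref_w - ref_l)))."""
    if not model_err_w:
        raise ValueError("need at least one pair")
    total = 0.0
    for mw, ml, rw, rl in zip(model_err_w, model_err_l, ref_err_w, ref_err_l):
        model_diff = mw - ml  # how much better the model denoises the winner
        ref_diff = rw - rl    # same quantity for the frozen reference
        total += -log_sigmoid(-beta * (model_diff - ref_diff))
    return total / len(model_err_w)
```

The loss falls below `log(2)` when the trained model improves on the preferred image relative to the reference more than it does on the rejected one, and rises above it in the opposite case.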

Buy me a coffee to support my work.

Contact:

Discord: .xiaozhi

QQ Group: 866612947 (answer: 小志Jason)

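Version details

Base model: XL

Project permissions

Reposting is strictly prohibited

    Usage permissions

  • Use online on 吐司 (Tusi)

  • Use as a base model for online training on 吐司 (Tusi)

  • Use without attribution

  • Use for model merging

  • Share merged models under a different license

    Commercial license

  • Use generated content commercially

  • Use commercially as a generation service

  • Resell the model or sell merged models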
