SubMaroon commited on
Commit
468fcfa
·
verified ·
1 Parent(s): 4e9f255

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +69 -1
README.md CHANGED
@@ -4,4 +4,72 @@ datasets:
4
  - SubMaroon/danbooru-lineart
5
  base_model:
6
  - cagliostrolab/animagine-xl-3.0
7
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
4
  - SubMaroon/danbooru-lineart
5
  base_model:
6
  - cagliostrolab/animagine-xl-3.0
7
+ ---
8
+
9
+ # Experimental ControlNet (Low Quality / Research Prototype)
10
+
11
+ > **Experimental model. Low quality. Not intended for production use.**
12
+ > This ControlNet was trained as a research experiment to explore line-based conditioning and colorization behavior in SDXL anime models.
13
+
14
+ ---
15
+
16
+ ## Model Summary
17
+
18
+ This repository contains an **experimental ControlNet for SDXL**, trained on anime-style images.
19
+ The model is **not stable**, shows **inconsistent color behavior**, and should be treated as a **research prototype** rather than a finished or polished solution.
20
+
21
+ The goal of this experiment was to understand:
22
+ - How SDXL ControlNet learns **colorization from line-based conditioning**
23
+ - How different conditioning types (Canny vs Lineart) affect **color consistency**
24
+
25
+ ---
26
+
27
+ ## Base Model
28
+
29
+ - **Base model:** `cagliostrolab/animagine-xl-3.0`
30
+ - **Architecture:** ControlNet SDXL
31
+ - **Training framework:** 🤗 Diffusers
32
+ - **Precision:** `bf16`
33
+
34
+ ---
35
+
36
+ ## Conditioning Type
37
+
38
+ - Primary conditioning: **Lineart / Canny-like edges**
39
+ - Backgrounds are mostly white
40
+ - Line quality varies (mostly clean, some noisy samples)
41
+
42
+ > Important limitation:
43
+ > Lineart / Canny **does not contain color information**, which leads to unstable and drifting color predictions.
44
+
45
+ ---
46
+
47
+ ## Dataset
48
+
49
+ - Size: ~**14,000 image pairs**
50
+ - Format:
51
+ - Original image (color)
52
+ - Conditioning image (lineart / canny)
53
+ - Prompt (caption)
54
+
55
+ ### Known dataset issues
56
+ - Some lineart images are **noisy or inconsistent**
57
+ - Images are resized to square resolution (possible cropping artifacts)
58
+ - No explicit color supervision
59
+ - No palette or region-level color constraints
60
+
61
+ ---
62
+
63
+ ## Training Configuration
64
+
65
+ Typical training setup:
66
+
67
+ ```bash
68
+ resolution: 768
69
+ train_batch_size: 2
70
+ gradient_accumulation_steps: 2
71
+ effective_batch_size: 4
72
+ learning_rate: 2e-5
73
+ lr_scheduler: cosine
74
+ max_train_steps: 6000–8000
75
+ mixed_precision: bf16