1
26-Nov-20
ICCIT 2020
The 23rd International Conference on Computer and Information Technology
Title of The Paper
Jamdani Motif Generation Using Conditional GAN
Authors
Presented by
Humaira Ferdous Shifa
2
26-Nov-20
JAMDANI MOTIF
THE MASTERLY CRAFTMANSHIP
Jamdani, the only surviving cotton variant of Dhakai Muslin, declared as the “Intangible Cultural Heritage of Humanity” by UNESCO in 2013.
Jamdani Motifs are:
3
26-Nov-20
Naming The Motifs
Beelpata Phool Paar
Indur Paar
Projapoti Phool
Beelpata Paar
Beelpata Paar
Source: Images collected from the book Traditional Jamdani Designs
4
26-Nov-20
Objective | Simulation of The Proposed System
5
26-Nov-20
Motivation
6
26-Nov-20
Not properly rewarded compared to the contributed effort & Dedication
Shifted from this profession
Master Artisans passed away
The Jamdani industry at the verge of extinction
Motifs lost in the mist of time
The Butterfly Effect Through Ages
In the Past:
7
26-Nov-20
Current Scenario
Proposed Solution
Resuscitating the industry through a dedicated tool for Jamdani Motif generation can:
Downbeat VS Upbeat
8
26-Nov-20
Current Scenario
Proposed Solution
Downbeat VS Upbeat
Motifs are:
9
26-Nov-20
Background Study
10
26-Nov-20
Deep Dive Into Pix2Pix GAN�(Image-to-Image Translation with Conditional Adversarial Networks by Phillip Isola and others)
So we opted to go for a Pix2Pix inspired model!
11
26-Nov-20
Data Collection
Internet
Two authentic sources:
12
26-Nov-20
Jamdani Festival 2019
06 September - 12 October 2019
Bengal Shilpalay, Dhaka
The Exhibition
Our Team
Source: Images taken at Bengal Shilpalay at the Jamdani Festival 2019
13
26-Nov-20
Sample Data Collected From The Archive
Old Jamdani Motifs
Probably from 19th century
Contemporary Jamdani Motifs
Woven Design & Line Drawing Sample Pair
The Book: Traditional Jandani Design
Source: Images taken from the book Traditional Jamdani Designs
14
26-Nov-20
A Day at Jamdani Polli
Tarabo, Shonargaon
OUR ACTIVITIES
Location: Shahina Jamdani Weaving Factory
Photo: Hand looms used for weaving Jamdani Saree
15
26-Nov-20
Weavers on the loom
Data Collection directly from the loom
Our Team
The Process of Weaving & Data Collection
Preparing a Yarn
Location: Shahina Jamdani Weaving Factory
16
26-Nov-20
Samples of Data Collected From Direct Observation
A saree with modern Jamdani design
getting weaved on the loom
A Saree with traditional
Jamdani motifs
Panjabi with Jamdani
Motifs
Source: Photos of the motifs taken at Shahina Jamdani Weaving Factory
17
26-Nov-20
Pix2Pix Data Format
Building The Dataset: Jamdani Noksha
Only available dataset of Jamdani motifs in digital format for computer vision research
18
26-Nov-20
Motif Extraction
Jamdani Boarder Design
Extracted Motifs
Source: Image taken from the book Traditional Jamdani Designs
19
26-Nov-20
Pix2Pix Data Format
This type of data consists of input and desired output side by side :
(Boundary to Shoes type)
(Boundary to Handbags type)
20
26-Nov-20
Real Image
Binary Image
Processed Data
Multiplication
skeletonize
Skeleton
Opening/Closing/Dilation
/Erosion
Steps of Data Pre-processing
21
26-Nov-20
Different Versions of Dataset
SL No | Version of Dataset | Size |
1 | Enhanced Resolution | 1983 |
2 | Reduced Branch | 913 |
3 | Sketch | 910 |
4 | Skeleton | 7932 |
5 | Boundary | 1116 |
Five versions of the dataset Jamdani Noksha—Skeleton, Reduced Branch, Sketch, Boundary, and Enhanced Resolution.
22
26-Nov-20
Reduced Branch
Sketch
Skeleton
Boundary
Enhanced Resolution
Samples of different variance of Dataset
23
26-Nov-20
Methodology
24
26-Nov-20
where G tries to minimize this objective against an adversarial D that tries to maximize it. The generator is tasked to not only fool the discriminator but also to be near the ground truth output in an L2 sense. We also explore this option, using L1 distance rather than L2 as L1 encourages less blurring,
The objective of a conditional GAN can be expressed as,
Our final objective is,
25
26-Nov-20
HOW DOES GAN WORKS?
26
26-Nov-20
Experiment & Result Analysis
27
26-Nov-20
Sample output (middle column of each group) for model trained on Jamdani Noksha’s Boundary, compared to ground truth (right column). Left column shows input strokes from user.
Fig: Loss graph for training on Boundary version
Boundary Version
L1 indicates absolute pixel to pixel translation. As the number of iteration increases the L1 tends to get lower, i.e.: generator become more competent in producing images that matches the real image distribution.
28
26-Nov-20
Sample output (middle column of each group) for model trained on Jamdani Noksha’s Enhanced Resolution, compared to ground truth (right column). Left column shows input strokes from user.
Fig: Loss graph for training on Enhanced Resolution version
Enhanced Resolution Version
L1 indicates absolute pixel to pixel translation. As the number of iteration increases the L1 tends to get lower, i.e.: generator become more competent in producing images that matches the real image distribution.
29
26-Nov-20
Sample output (middle column of each group) for model trained on Jamdani Noksha’s, (Reduced Branch compared to ground truth (right column). Left column shows input strokes from user.
Fig: Loss graph for training on Reduced Branch version
Reduced Branch Version
L1 indicates absolute pixel to pixel translation. As the number of iteration increases the L1 tends to get lower, i.e.: generator become more competent in producing images that matches the real image distribution.
30
26-Nov-20
Sample output (middle column of each group) for model trained on Jamdani Skeleton, compared to ground truth (right column). Left column shows input strokes from user.
Loss graph for training on Skeleton version
Skeleton Version
L1 indicates absolute pixel to pixel translation. As the number of iteration increases the L1 tends to get lower, i.e.: generator become more competent in producing images that matches the real image distribution.
31
26-Nov-20
Sample output (middle column) for model trained on Jamdani Noksha’s hand-drawn sketch version, compared to ground truth (right column). Left column shows input strokes from user.
Loss graph for training on Sketch version
Sketch Version
L1 indicates absolute pixel to pixel translation. As the number of iteration increases the L1 tends to get lower, i.e.: generator become more competent in producing images that matches the real image distribution.
32
26-Nov-20
Output Analysis
Outputs of 5 different versions of dataset
33
26-Nov-20
Loss Graph Analysis
Loss graph for training on (a) Boundary, (b) Enhanced Resolution, (c) Reduced Branch, (d) Skeleton, and (e) Sketch version
(a) (b) (c) (d) (e)
34
26-Nov-20
Limitation & Future Work
Constraints:
Future Work:
35
26-Nov-20
ANY
QUESTIONS?
36
26-Nov-20
THANK YOU!