1 of 18

Replicating Interactive 3D Scene �using Shape & Structural Priors

Jeonghyun Kim, M.Sc

_contact: mildjay16@gmail.com

KISTI OXR 융합연구단 기술세미나, 2023.02.17

CAD-Represented Digital Twin

2 of 18

Replicating 3D Indoor Scene

A. Rosinol et al., 3D Dynamic Scene Graphs, �RSS 2020

Facebook, The Replica Dataset: A Digital Replica of Indoor Spaces, arXiv 2020

F. Xiang et al., SAPIEN: A SimulAted Part-based Interactive ENvironment, CVPR 2020

Realistic Digital Replication

3D Simulation

Building-level Management

3 of 18

Replicating 3D Indoor Scene: Digital Twin

@NVIDIA Omniverse Replicator

synthetic data generation

physics-based simulation

@MatterPort 3D tour

Large-scale RGB-D Scanning�Virtual tour in 3D & 360-view

4 of 18

Replicating 3D Indoor Scene via RGB-D Scanning

Noisy, Occlusion, Non-interactable

RGB-D Sensor

Point Cloud (+ vertex color)

5 of 18

Replicating 3D Indoor Scene + Shape Prior

H. Muzhi et al., Interactive Scene Reconstruction, ICRA 2021

Noisy, Occlusion, �Non-interactable

Clean, Fine-grained, �Pre-defined Interaction

6 of 18

Deep learning-based Approaches!

S. Hampali et al., Monte Carlo Scene Search for 3D Scene Understanding, CVPR 2021

A. Avetisyan et al., Scan2CAD, CVPR 2019

H. Izadinia et al., IM2CAD, CVPR 2017

7 of 18

Framework Overview

Indoor�Scanning�(e.g., RGB-D, �Point Cloud)

Object Detection �/ Parsing

Shape�Retrieval

Alignment

RGB-D Sensor

Optimization Problem�(i.e. ICP-like method)

Deep-learning!

8 of 18

Parsing 3D Scene into Semantics & Instances

J. Schult et al., MASK3D, ICRA 2023

T. Vu et al., SoftGroup, CVPR 2022 Oral

Encoder-�Decoder

Point Cloud�(N, 3)

3D Semantic Instances �(K, points, semantics)

9 of 18

Parsing 3D Scene into Semantics & Instances

ScanNetv2 Benchmark

10 of 18

Traditional Shape Retrieval from Raw 3D Shape

Shape DB

Geometric �Distance

Time-consuming,�Missing Structural Information

Query shape

Retrieved�Shape

(i.e., Chamfer Distance)

11 of 18

Shape Retrieval via Learned Shape Code

Structural details are still not preserved

Shape Codes�(n-dim vector)

Shape Code

Feature�Distance

Feature�Space�(K, n)

Shape DB�(K)

ReconstructedShape DB

Minimize Geometric Error

Shape DB

Query shape

Retrieved�Shape

12 of 18

Structural Analysis on 3D Shape

chair

seat

back

base

surface

leg

Part Bounding Boxes�+ Part Relations�(i.e. symmetry, adjacency)

Structure Hierarchy

K. Mo et al., StructureNet: Hierarchical Graph Networks for 3D Shape Generation. ACM Transactions on Graphics (ToG), 2019.

K. Mo et al., StructureNet, ToG 2019

A X. Chang et al., ShapeNet, arXiv 2015

13 of 18

Seg&Struct – Structure Prediction on Raw 3D shapes

J. Kim et al., Seg&Struct, WACV 2023

Predicting Part Structure from Raw 3D shapes �using structure & segmentation priors

14 of 18

Seg&Struct: Structure-aware Shape Retrieval

Shape DB�w/ Structure�Info.

Structural�Difference

Intermediate�Representation�(i.e., Part Structure)

Query shape

Retrieved�Shape

Encoder-�Decoder

Using a predicted part structure as an intermediate representation �for fast/accurate shape retrieval

15 of 18

Seg&Struct: Structure-aware Shape Retrieval

Query

Ours

CD-based

Query

Ours

CD-based

Query

Ours

CD-based

16 of 18

Seg&Struct: Structure-aware Shape Retrieval

Query

Ours

CD-based

Query

Ours

CD-based

Query

Ours

CD-based

17 of 18

Next Breakthroughs (Future Work)

  • 3D Surface Scan (full region)
  • chair, table, cabinet
  • Supervised Learning
    • pre-defined structure

Noisy Scans

  • Sparse number of points
  • Missing/Occluded Region
  • ⇒ Shape Completion

Arbitrary�Shape Structures

  • Unseen novel categories
  • More complex structure �in the same category
  • ⇒ Unseen Structure Estimation

M. Sung et al., Data-driven Structural Priors for Shape Completion, ToG 2015

T. Luo et al., Learning-to-Group, ICRA 2020

18 of 18

Source Code / Dataset / etc.

  • Scan2CAD [CVPR 2019]
  • Interactive Scene Reconstruction [ICRA 2021]
  • Monte Carlo Scene Search for 3D Scene Understanding [CVPR 2022]
  • Deformation-aware Shape Retrieval [ECCV 2020]
  • SoftGroup [CVPR 2022 Oral]
    • https://github.com/thangvubk/SoftGroup
    • Almost SOTA in 3D Semantic Instance Benchmark (ScanNet v2)