Skip to main navigation Skip to main content
  • E-Submission

JKSPE : Journal of the Korean Society for Precision Engineering

OPEN ACCESS
ABOUT
BROWSE ARTICLES
EDITORIAL POLICIES
FOR CONTRIBUTORS
Regular

단일 RGB 영상을 이용한 VP 기반 3D 인체 자세 보정 및 디지털 트윈 매핑 프레임워크

조현서1, 홍민주2, 김병수2orcid

A VP-based 3D Human Pose Correction and Digital Twin Mapping Framework Using a Single RGB Image

Hyun Seo Cho1, Minju Hong2, Byeong Soo Kim2orcid
JKSPE 2026;43(6):589-595. Published online: June 1, 2026
1서울과학기술대학교 기계자동차공학과
2서울과학기술대학교 인공지능응용학과

1Department of Mechanical and Automotive Engineering, Seoul National University of Science and Technology
2Department of Applied Artificial Intelligence, Seoul National University of Science and Technology
Corresponding author:  Byeong Soo Kim, Tel: +82-2-970-9779, 
Email: bskim@seoultech.ac.kr
Received: 2 December 2025   • Revised: 28 December 2025   • Accepted: 8 January 2026
  • 8 Views
  • 0 Download
  • 0 Crossref
  • 0 Scopus
prev next

Accurate 3D human pose reconstruction from a single RGB image remains challenging due to scale ambiguity and perspective distortions. Current single-view methods primarily rely on learned priors or kinematic constraints, but they often struggle to maintain geometric consistency with the physical scene. This results in horizon alignment drift and instability when rendered in metric environments. To overcome these limitations, this study introduces a vanishing-point-driven framework that integrates scene geometry into the pose correction process. Under the Manhattan-world assumption, dominant vanishing points are detected to estimate the ground plane and recover the camera orientation with high precision. A lightweight 3D pose estimation network generates initial joint coordinates in camera-centric space. These coordinates are then refined through a VP-based ground-alignment transformation, which resolves scale ambiguity and minimizes geometric drift. The corrected poses are normalized to physical scale and streamed to NVIDIA OmniverseTM for real-time digital-twin visualization. Experiments conducted on indoor scenes from the NYU Depth V2 dataset demonstrate sub-pixel accuracy in vanishing-point localization and significant improvements in geometric alignment between the reconstructed poses and the true scene layout. This confirms the effectiveness of the proposed approach for single-view digital-twin human modeling.

Download Citation

Download a citation file in RIS format that can be imported by all major citation management software, including EndNote, ProCite, RefWorks, and Reference Manager.

Format:

Include:

A VP-based 3D Human Pose Correction and Digital Twin Mapping Framework Using a Single RGB Image
J. Korean Soc. Precis. Eng.. 2026;43(6):589-595.   Published online June 1, 2026
Download Citation

Download a citation file in RIS format that can be imported by all major citation management software, including EndNote, ProCite, RefWorks, and Reference Manager.

Format:
Include:
A VP-based 3D Human Pose Correction and Digital Twin Mapping Framework Using a Single RGB Image
J. Korean Soc. Precis. Eng.. 2026;43(6):589-595.   Published online June 1, 2026
Close