CVPR 2024

TexVocab:Texture Vocabulary-conditioned Human Avatars

1Tsinghua University
Given multi-view RGB videos of one character, we construct a texture vocabulary, and create realistic animatable human avatars.

Abstract

To adequately utilize the available image evidence in multi-view video-based avatar modeling, we propose TexVocab, a novel avatar representation that constructs a texture vocabulary and associates body poses with texture maps for animation.

Given multi-view RGB videos, our method initially back-projects all the available images in the training videos to the posed SMPL surface, producing texture maps in the SMPL UV domain. Then we construct pairs of human poses and texture maps to establish a texture vocabulary for encoding dynamic human appearances under various poses. Unlike the commonly used joint-wise manner, we further design a body-part-wise encoding strategy to learn the structural effects of the kinematic chain.

Given a driving pose, we query the pose feature hierarchically by decomposing the pose vector into several body parts and interpolating the texture features for synthesizing fine-grained human dynamics.

Overall, our method is able to create animatable human avatars with detailed and dynamic appearances from RGB videos, and the experiments show that our method outperforms state-of-the-art approaches.

Method

We first construct TexVocab by decomposing SMPL poses into body parts, sampling key body parts and gathering corresponding texture maps. Then given a query pose and a 3D coordinate, we decompose the pose into body parts, interpolate key body parts and sample texture maps as the pose conditioned feature. We finally utilize NeRF represented as an MLP to decode the dynamic character and render human appearance with detailed pose-dependent dynamics.

Results and Comparisons

Animation Results on Different Sequences

Animation Results on AIST++ Dataset

Comparison Results against ARAH, Tava, AniNeRF and Posevocab

Video

Coming Soon

BibTeX

@article{liu2024texvocab,
  author    = {Yuxiao, Liu and Zhe, Li and Yebin, Liu and Haoqian, Wang},
  title     = {TexVocab:Texture Vocabulary-conditioned Human Avatars},
  journal   = {CVPR},
  year      = {2024},
}

Contact Us