SMPLX-Lite: A Realistic and Drivable Avatar Benchmark with Rich Geometry and Texture Annotations
Abstract
Recovering photorealistic and drivable full-body avatars is crucial for numerous applications, including virtual reality, 3D games, and tele-presence. Most methods, whether reconstruction or generation, require large numbers of human motion sequences and corresponding textured meshes. To easily learn a drivable avatar, a reasonable parametric body model with unified topology is paramount. However, existing human body datasets either have images or textured models and lack parametric models which fit clothes well. We propose a new parametric model SMPLX-Lite-D, which can fit detailed geometry of the scanned mesh while maintaining stable geometry in the face, hand and foot regions. We present SMPLX-Lite dataset, the most comprehensive clothing avatar dataset with multi-view RGB sequences, keypoints annotations, textured scanned meshes, and textured SMPLX-Lite-D models. With the SMPLX-Lite dataset, we train a conditional variational autoencoder model that takes human pose and facial keypoints as input, and generates a photorealistic drivable human avatar.
Index Terms:
Drivable Avatar, Dataset, Reconstruction
I Introduction
![Refer to caption](https://cdn.statically.io/img/arxiv.org/x1.png)
![Refer to caption](https://cdn.statically.io/img/arxiv.org/x2.png)
![Refer to caption](https://cdn.statically.io/img/arxiv.org/x3.png)
![Refer to caption](https://cdn.statically.io/img/arxiv.org/x4.png)
![Refer to caption](https://cdn.statically.io/img/arxiv.org/x5.png)
![Refer to caption](https://cdn.statically.io/img/arxiv.org/x6.png)
![Refer to caption](https://cdn.statically.io/img/arxiv.org/x7.png)
Existing methods generally reconstruct clothed human models from images or videos. One class of methods based on the neural radiance field[1, 2, 3] utilizes an implicit functional representation[4, 5] that allows pixel-level alignment with the image, but lacks an explicit geometric representation.
Another class of methods is based on parametric models (e.g., SMPL[6], SMPL-X[7]), which use low-dimensional parametric models as human body priors and learn to fit the model parameters to align them with the person in the picture by training. These template models[6, 7] derived from large amounts of data can be flexibly controlled by low-dimensional pose and shape parameters, which can capture non-rigid deformation well and reduce artifacts from linear transformation. The popular SMPL model learns better pose and shape blend shapes on top of linear blend skinning, and can fit various deformation through pose and shape parameters.
Most methods for reconstructing 3D humans from images predict the SMPL parameters so that the deformed SMPL model aligns with the 2D images and joints [8, 9]. However, because the parametric model is unclothed, these methods can only obtain minimally dressed human meshes, not clothed ones. Other methods for reconstructing clothed human bodies extend the SMPL model into SMPL-D to represent clothes by vertex displacement[10, 11]. IPNet[10] divides body and clothes into two layers, fitted with SMPL and SMPL-D respectively. Similarly, CAPE[11] employs a CVAE to generate corresponding meshes from pose, clothes type, and clothes shape, thereby producing the clothes vertex displacement. The reconstructed models can also deform the clothes mesh in different poses through the skeleton and skinning weights of the internal human model. However, the results are often poor because wrinkles and clothing deformation are harder to control than the human body, so large amounts of data, together with corresponding scanned models for supervision, are needed for learning. Clothes such as skirts and coats are also difficult to reconstruct due to the limitations of vertex displacement.
To utilize the advantages of the approaches above, recent work has attempted to combine the two representations. ARCH[12] and ARCH++[13] use human prior knowledge to transform a human body in any posture into a canonical space, and then learn implicit representations for reconstruction. These methods produce pixel-aligned models and can theoretically be reposed by changing model parameters. However, since there is no learning to infer pose-dependent clothing deformation, these methods simply apply articulated deformation to the reconstructed model. This results in an unrealistic pose-related distortion that lacks fine details of the garment.
Since the SMPL[6] model has only 24 joints and doesn’t accommodate facial expressions and finger movements, the SMPL-X[7] model, which aggregates body, face, and hand, is increasingly adopted in the pursuit of better character fitting. However, challenges arise when fitting vertices using the SMPL-X model, including eye deformation and lip flipping. To address these concerns, we propose the SMPLX-Lite model, optimized for vertex fitting based on SMPL-X, while retaining the facial expression and hand action representation capabilities of the SMPL-X model.
In order to get an animatable human avatar, previous methods usually required reconstructing a character template for a single person and then modeling pose-dependent dynamic distortions. Recent works suggest that we can learn the deformation of a general character template from scanned data[11, 14] or RGB video data[15, 16] to get a drivable avatar directly. These methods usually require a large amount of data to train an avatar associated with a person, and when the data is insufficient, problems arise with over-fitting and posture generalization capabilities. So we introduce the SMPLX-Lite dataset, which uses 32 4K RGB cameras to capture over 20k frames of action sequences simultaneously, containing 5 characters (3 male and 2 female, wearing various types of clothes) and 15 different action types, and performs a series of data processing operations, i.e., image segmentation, 3D model reconstruction, pose estimation, SMPLX-Lite-D model fitting and texture map fitting. We have packaged all these annotated data into the SMPLX-Lite dataset to advance research in this field, making it possible that just a simple baseline can generate avatars with good results.
To underscore the contribution of the SMPLX-Lite dataset to the community, we develop a conditional variational autoencoder network using this dataset as a foundation following [17, 18]. Our method uses pose parameters, facial keypoints and view direction as conditions to generate a character model with texture based on the corresponding pose. This greatly simplifies the process of driving the character model. Compared with CAPE[11], our recovered avatar has finer geometry and photorealistic texture, making it more lifelike and directly applicable in industrial settings.
Our contributions can be summarized as follows:
- We collect the most comprehensive and photorealistic avatar dataset to date, containing multi-view segmented image sequences, 3D keypoint annotation, textured scanned model and fitted SMPLX-Lite-D model with texture maps.
- We propose the SMPLX-Lite model optimized for vertex fitting based on the SMPL-X model, succeeding as the first SMPLX-based model using vertex displacement to fit clothes.
- We introduce a multi-stage fitting procedure capturing fine geometry details like facial expressions and cloth wrinkles. Compared with the SMPL-X model, it greatly reduces the difficulty of vertex fitting while retaining the details of facial expressions and hand movements.
- We propose a CVAE model that receives driving input by facial keypoints and pose parameters to produce a photorealistic avatar.
II SMPLX-Lite Dataset
We present the SMPLX-Lite dataset, currently the most comprehensive captured human avatar dataset. Please refer to the suppl. for a detailed comparison with other datasets containing human model fits, and for a demo dataset that can be inspected. Our dataset contains multi-view segmented image sequences, 3D keypoint annotations, reconstructed textured scanned meshes, fitted SMPLX-Lite-D models and texture maps. In this section, we describe in detail how the dataset is captured and organized, and the procedure for obtaining these annotations.
II-A Data Capture
We employ 32 calibrated cameras to simultaneously capture 4096x3000 image sequences of 15 different actions, being performed by 5 subjects (3 male, 2 female) in daily clothes. The image sequences include 15 kinds of actions in daily scenes, such as discussion, debate, public speaking, phone conversations and stretching, which significantly enhances the authentic, diverse and generalizable nature of an avatar. For the convenience of statistics and processing, we select over 200 consecutive frames for each action sequence and eventually collect over 20k frames. Each frame has 32 views of the raw image, as well as all annotation results from post-processing.
II-B Data Process
![Refer to caption](https://cdn.statically.io/img/arxiv.org/x8.png)
Textured Mesh Reconstruction. We utilize 32 RGB cameras with 48 additional IR cameras and random pattern projectors for reconstruction. Following [19], we first obtain the initial depth map from the IR images through the stereo matching algorithm[20] and then convert the depth map into a point cloud, which is later turned to the initial mesh by Poisson Surface Reconstruction (PSR) [21]. The obtained mesh has some mismatches w.r.t. the actual shape due to the accumulated error. We employ differentiable rendering [22] to optimize the vertex positions of the mesh geometry while extracting the texture of the mesh surface. Through these processes, we obtain the mesh model with higher accuracy and high-quality texture extremely close to the real picture.
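The depth-map-to-point-cloud conversion mentioned above can be illustrated with a minimal sketch. It assumes a simple pinhole camera without lens distortion; the function name `depth_to_points` and the toy intrinsics are hypothetical and are not taken from the capture pipeline.

```python
import numpy as np

def depth_to_points(depth, K):
    """Back-project a depth map (H, W) into camera-space points (M, 3).

    K is a 3x3 pinhole intrinsic matrix (assumed, no distortion);
    pixels with depth <= 0 are treated as invalid and dropped.
    """
    h, w = depth.shape
    u, v = np.meshgrid(np.arange(w), np.arange(h))  # pixel coordinates
    valid = depth > 0
    z = depth[valid]
    x = (u[valid] - K[0, 2]) * z / K[0, 0]
    y = (v[valid] - K[1, 2]) * z / K[1, 1]
    return np.stack([x, y, z], axis=1)

# Toy example: two valid pixels in a 3x5 depth map.
K = np.array([[500.0, 0.0, 2.0], [0.0, 500.0, 1.0], [0.0, 0.0, 1.0]])
depth = np.zeros((3, 5))
depth[1, 2] = 2.0  # pixel at the principal point
depth[1, 4] = 1.0  # two pixels to the right of it
points = depth_to_points(depth, K)
```

The resulting points would still need to be transformed by the camera extrinsics and merged across views before surface reconstruction.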
3D Human Pose Estimation. Once the 2D keypoints of the person from each camera view are obtained, our accurate camera intrinsic and extrinsic parameters from calibration enable the calculation of 3D keypoints by triangulation. We use OpenPose[23] to estimate the 2D human joints in each view. However, 2D keypoints estimated from some views may be unreliable due to occlusion and limited camera field of view. Consequently, it is crucial to select highly confident views for each keypoint during triangulation. We employ the RANSAC[24] method to select reasonable views. See the suppl. for the detailed process. Subsequently, EasyMocap[25] is utilized to fit the SMPL-X[7] model to every frame under the supervision of 2D and 3D keypoints.
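As an illustration of view selection during triangulation, the sketch below triangulates one keypoint with the linear (DLT) method and keeps the hypothesis supported by the most views. It is a simplified, deterministic variant of the RANSAC idea that enumerates all view pairs instead of sampling them randomly, and it is not the algorithm from the suppl. The function names, the 5-pixel threshold, and the synthetic cameras are assumptions.

```python
import itertools
import numpy as np

def triangulate(projs, pts2d):
    """Linear (DLT) triangulation of one 3D point from two or more views."""
    rows = []
    for P, (u, v) in zip(projs, pts2d):
        rows.append(u * P[2] - P[0])
        rows.append(v * P[2] - P[1])
    _, _, vt = np.linalg.svd(np.asarray(rows))
    X = vt[-1]
    return X[:3] / X[3]

def reprojection_error(P, X, uv):
    x = P @ np.append(X, 1.0)
    return float(np.linalg.norm(x[:2] / x[2] - uv))

def select_views_and_triangulate(projs, pts2d, thresh_px=5.0):
    """Keep the pair hypothesis with the most inlier views, then refit on them."""
    best = []
    for i, j in itertools.combinations(range(len(projs)), 2):
        X = triangulate([projs[i], projs[j]], [pts2d[i], pts2d[j]])
        inliers = [k for k in range(len(projs))
                   if reprojection_error(projs[k], X, pts2d[k]) < thresh_px]
        if len(inliers) > len(best):
            best = inliers
    X = triangulate([projs[k] for k in best], [pts2d[k] for k in best])
    return X, best

# Synthetic check: four cameras along the x axis, view 2 has a wrong detection.
K = np.array([[1000.0, 0.0, 500.0], [0.0, 1000.0, 500.0], [0.0, 0.0, 1.0]])
centers = [np.array([x, 0.0, 0.0]) for x in (-1.0, -0.3, 0.4, 1.0)]
projs = [K @ np.hstack([np.eye(3), -c[:, None]]) for c in centers]
X_true = np.array([0.2, 0.1, 5.0])
pts2d = []
for P in projs:
    x = P @ np.append(X_true, 1.0)
    pts2d.append(x[:2] / x[2])
pts2d[2] = pts2d[2] + np.array([80.0, -60.0])  # simulated occlusion error
X_est, inlier_views = select_views_and_triangulate(projs, pts2d)
```

With 32 cameras, enumerating all pairs per keypoint is still cheap; random sampling becomes preferable only when the number of views or the minimal sample size grows.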
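SMPLX-Lite Model Transfer. SMPL-X has $N = 10{,}475$ vertices and $K = 54$ joints, and is defined as a function $M(\theta, \beta, \psi): \mathbb{R}^{|\theta| \times |\beta| \times |\psi|} \rightarrow \mathbb{R}^{3N}$, where $\theta$, $\beta$, and $\psi$ are pose, shape, and expression parameters respectively. More specifically,

$$M(\beta, \theta, \psi) = W\big(T_P(\beta, \theta, \psi),\, J(\beta),\, \theta,\, \mathcal{W}\big) \tag{1}$$

where $W(\cdot)$ is a standard linear blend skinning function. Several parts of the LBS function are:

$$T_P(\beta, \theta, \psi) = \bar{T} + B_S(\beta; \mathcal{S}) + B_E(\psi; \mathcal{E}) + B_P(\theta; \mathcal{P}) \tag{2}$$

$$J(\beta) = \mathcal{J}\big(\bar{T} + B_S(\beta; \mathcal{S})\big) \tag{3}$$

and blend weights $\mathcal{W}$. Methods using SMPL-X plus vertex displacement $D$ to fit clothes extend Eq.(2) to

$$T_P(\beta, \theta, \psi, D) = \bar{T} + B_S(\beta; \mathcal{S}) + B_E(\psi; \mathcal{E}) + B_P(\theta; \mathcal{P}) + D \tag{4}$$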
The SMPL-X model with vertex displacement, shown in Fig.3a, exhibits face flipping and distortion in the eyes, ears, mouth, nose and feet. In response to these issues, we propose the SMPLX-Lite model, which greatly reduces the difficulty of vertex fitting while preserving the facial expression and hand gesture fitting capabilities of the SMPL-X model. The iterative process entails vertex deletion, face reconstruction, and face flattening, ultimately yielding the SMPLX-Lite model with a reduced vertex count. Refer to the suppl. for details.
As the number of vertices decreases, adjustments to the coefficient matrices appearing in Eq.(1, 2, 3) are vital for ensuring that the transferred model inherits the control parameters of SMPL-X and the linear blend skinning function. Once all coefficient matrices are transferred, the SMPLX-Lite model can be operated in the same way as SMPL-X, and, using Eq.(4), vertex displacement can be added to the model to fit clothes. The subsequent analysis demonstrates the efficacy of this model in vertex fitting.
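To make the role of the vertex displacement in Eq.(4) concrete, the following sketch adds per-vertex offsets to a rest-pose template and then applies linear blend skinning. It assumes that the per-joint 4x4 transforms are already expressed relative to the rest pose, and it omits the shape, expression, and pose blend shapes; the function name and the toy data are hypothetical.

```python
import numpy as np

def lbs_with_displacement(template, displacement, weights, transforms):
    """Pose a clothed template with linear blend skinning.

    template, displacement: (N, 3) rest-pose vertices and per-vertex offsets
    weights: (N, K) blend weights, each row summing to 1
    transforms: (K, 4, 4) per-joint transforms relative to the rest pose
    """
    clothed = template + displacement  # offsets are added before skinning
    homo = np.concatenate([clothed, np.ones((len(clothed), 1))], axis=1)
    blended = np.einsum("nk,kij->nij", weights, transforms)  # per-vertex 4x4
    posed = np.einsum("nij,nj->ni", blended, homo)
    return posed[:, :3]

# Toy example: joint 0 stays fixed, joint 1 translates by one unit along z.
template = np.zeros((3, 3))
displacement = np.array([[0.1, 0.0, 0.0], [0.0, 0.2, 0.0], [0.0, 0.0, 0.3]])
weights = np.array([[1.0, 0.0], [0.0, 1.0], [0.5, 0.5]])
transforms = np.stack([np.eye(4), np.eye(4)])
transforms[1, 2, 3] = 1.0
posed = lbs_with_displacement(template, displacement, weights, transforms)
```

Because the offsets are added in the rest pose, the same displacement field follows the body through any pose, which is what allows a clothed model to be driven by the underlying skeleton.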
SMPLX-Lite-D fit. The purpose of the vertex fit is to fully capture the fine geometry details of the scanned meshes in a unified mesh topology and texture UV layout. After the 3D pose estimation in Section II-B, we obtain a starting mesh close to the scanned mesh but without surface details. We propose to solve for the vertex fit in 2 stages. In the first stage, we adopt the method from [26] and warp the mesh by predefined embedded nodes, then solve for the warp field. In the second stage, we directly solve for the remaining vertex shifts. The detailed procedures and the impact of all the registration steps are illustrated in the suppl.
![Refer to caption](https://cdn.statically.io/img/arxiv.org/x9.png)
![Refer to caption](https://cdn.statically.io/img/arxiv.org/x10.png)
How will this dataset be useful to the community? Significant effort has been dedicated to collecting and processing the most comprehensive 3D moving human avatar dataset with clothes and textures to date. The SMPLX-Lite dataset has significant implications for Drivable Textured Avatar Reconstruction, as it provides multi-view images, reconstructed textured models, and fitted clothed parametric models with texture maps. These diverse data types can be leveraged to reconstruct photorealistic drivable avatars, offering researchers a wider spectrum of supervision methods compared to datasets that offer only raw pictures [27] or solely reconstructed textured models [11]. This capability broadens the range of network structures that can be utilized, potentially enabling multiple stages of network training.
Besides, the SMPLX-Lite dataset is also pertinent to other important areas such as 3D Human Body Reconstruction and Novel View Synthesis. Moreover, researchers are encouraged to explore further applications of this dataset.
II-C Dataset Evaluation
Sub. | Scan PSNR | Scan SSIM | SMPLX-Lite-D PSNR | SMPLX-Lite-D SSIM | SMPLX-Lite-D CD |
---|---|---|---|---|---|
WZL | 28.92 | 0.9714 | 28.61 | 0.9706 | 6.7372 |
LDF | 28.33 | 0.9706 | 27.80 | 0.9675 | 8.2897 |
ZX | 28.95 | 0.9760 | 28.52 | 0.9749 | 6.8234 |
LW | 27.21 | 0.9754 | 26.67 | 0.9744 | 6.4386 |
ZC | 27.51 | 0.9623 | 27.05 | 0.9602 | 6.9238 |
We present the evaluation results in Tab.I, including peak signal-to-noise ratio (PSNR), structural similarity index measure (SSIM), and chamfer distance (CD).
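For reference, two of the metrics above can be sketched in a few lines. The exact chamfer distance variant and its units are not stated here, so the symmetric mean nearest-neighbor form below is an assumption, as is the brute-force distance matrix, which is only practical for small point sets.

```python
import numpy as np

def psnr(img_a, img_b, max_val=1.0):
    """Peak signal-to-noise ratio in dB for images scaled to [0, max_val]."""
    mse = np.mean((img_a - img_b) ** 2)
    return float("inf") if mse == 0 else float(10.0 * np.log10(max_val ** 2 / mse))

def chamfer_distance(a, b):
    """Symmetric chamfer distance between point sets a (N, 3) and b (M, 3).

    Sum of the mean nearest-neighbor distances in both directions
    (one common convention; others use squared distances or averages).
    """
    d = np.linalg.norm(a[:, None, :] - b[None, :, :], axis=2)
    return float(d.min(axis=1).mean() + d.min(axis=0).mean())

img = np.zeros((4, 4, 3))
value_db = psnr(img, img + 0.1)  # uniform error of 0.1 gives an MSE of 0.01

a = np.array([[0.0, 0.0, 0.0], [1.0, 0.0, 0.0]])
b = np.array([[0.0, 0.0, 0.0], [1.0, 0.0, 1.0]])
cd = chamfer_distance(a, b)
```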
III Method
To demonstrate the effectiveness of SMPLX-Lite dataset, we utilize a basic baseline model to generate a drivable avatar and show the effect that all avatars in the dataset are driven by the same sequence of actions. Our approach is grounded on the network structure in [17, 18], involving comprehensive simplifications and adaptations.
III-A Network Structure
![Refer to caption](https://cdn.statically.io/img/arxiv.org/x11.png)
The model employed is a conditional variational autoencoder (CVAE), consisting of an encoder and a decoder, both implemented using convolutional neural networks. See Fig.4 for an overview of our method.
The encoder takes as input the mean texture map of each individual in the dataset and, for each frame, the T-pose mesh obtained by applying inverse LBS to the SMPLX-Lite-D model. Rendering the T-pose mesh onto a position map in UV space yields a feature map of the same size as the average texture map, which is subsequently concatenated with the average texture map along the channel dimension and fed into the encoder.
The encoder outputs the mean and standard deviation of a Gaussian distribution, which is trained to align as closely as possible with the standard normal distribution and is then sampled to obtain the latent code.
Following [17], we utilize readily available pose parameters and face keypoints as driving signals. These are rendered to a position map in UV space and merged into feature maps, while the T-pose vertex coordinate is leveraged to generate view information feature maps. These driving signals and view feature maps serve as conditions and are combined with the latent code before being fed into the decoder to predict the T-pose mesh offsets and view-dependent texture map :
where means conv1x1. We use decoder to predict offsets because the network fits the residuals better than directly fitting the vertex locations. By adding these offsets to T-pose template and transforming them using pose parameters through LBS, the final reconstructed pose mesh is obtained:
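$$\hat{M} = W\big(\bar{T} + \Delta\hat{T},\, J,\, \theta,\, \mathcal{W}\big) \tag{5}$$

where $\bar{T}$ is the T-pose template, $\Delta\hat{T}$ denotes the predicted offsets, $J$ the joint locations, $\theta$ the pose parameters, and $W(\cdot)$ the linear blend skinning function with blend weights $\mathcal{W}$.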
The dataset provided allows for the supervision of the predicted geometry and texture through the model geometry of SMPLX-Lite-D and the associated texture map, as well as the supervision of the images rendered through differentiable rendering against the captured images. This multi-faceted supervision facilitates the creation of high-quality drivable reconstructed human models.
During inference, the latent code is sampled from a standard normal distribution without the need for an encoder. The decoder takes this latent code along with the driving signal and view information as input to generate the geometry and texture of the person under the corresponding pose.
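The difference between training-time and inference-time sampling of the latent code can be sketched as follows. The sketch assumes that the encoder predicts a mean and a log-variance (a common parameterization of the standard deviation); the function names and latent shapes are hypothetical.

```python
import numpy as np

def sample_latent(mu, log_var, rng):
    """Training-time sample z = mu + sigma * eps (reparameterization trick)."""
    eps = rng.standard_normal(mu.shape)
    return mu + np.exp(0.5 * log_var) * eps

def sample_prior(shape, rng):
    """Inference-time sample from the standard normal prior; no encoder is used."""
    return rng.standard_normal(shape)

rng = np.random.default_rng(0)
mu = np.array([1.0, -2.0])
z_train = sample_latent(mu, np.zeros(2), rng)  # unit variance around mu
z_infer = sample_prior((4, 8), rng)            # hypothetical latent shape
```

Writing the sample as a deterministic function of the mean, the standard deviation, and external noise keeps the sampling step differentiable with respect to the encoder outputs.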
III-B Loss
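The loss function we use is:

$$\mathcal{L} = \lambda_{geo}\mathcal{L}_{geo} + \lambda_{tex}\mathcal{L}_{tex} + \lambda_{lap}\mathcal{L}_{lap} + \lambda_{kl}\mathcal{L}_{kl} \tag{6}$$

where $\mathcal{L}_{geo}$ is the L2 distance between the vertices of the reconstructed model and the gt SMPLX-Lite-D model, $\mathcal{L}_{tex}$ is the L2 loss between the texture map and the gt texture map in the valid UV area with mask $M_{uv}$, $\mathcal{L}_{lap}$ is the Laplacian term used to ensure the smoothness of the model, $\mathcal{L}_{kl}$ is the KL term of the standard VAE model[28], and the $\lambda$ are scalar weights. If the gt image is used for supervision, $\mathcal{L}_{tex}$ can be replaced with an image loss $\mathcal{L}_{img}$ plus an image mask loss $\mathcal{L}_{mask}$.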
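A minimal sketch of how such loss terms could be combined is given below. The per-term weights, the function names, and the normalization of the masked texture loss are assumptions, and the Laplacian term is passed in as a precomputed scalar for brevity.

```python
import numpy as np

def kl_to_standard_normal(mu, log_var):
    """KL divergence between N(mu, diag(exp(log_var))) and N(0, I)."""
    return float(0.5 * np.sum(np.exp(log_var) + mu ** 2 - 1.0 - log_var))

def masked_l2(pred, target, mask):
    """Squared error summed over channels, averaged over texels where mask is 1."""
    diff = (pred - target) ** 2 * mask
    return float(diff.sum() / np.maximum(mask.sum(), 1))

def total_loss(v_pred, v_gt, tex_pred, tex_gt, mask, lap, mu, log_var,
               w=(1.0, 1.0, 1.0, 1.0)):
    """Weighted sum of geometry, texture, Laplacian, and KL terms.

    The weights w are hypothetical placeholders, not values from the paper.
    """
    geo = float(np.mean(np.sum((v_pred - v_gt) ** 2, axis=1)))
    tex = masked_l2(tex_pred, tex_gt, mask)
    kl = kl_to_standard_normal(mu, log_var)
    return w[0] * geo + w[1] * tex + w[2] * lap + w[3] * kl

verts = np.zeros((4, 3))
tex = np.ones((2, 2, 3))
uv_mask = np.ones((2, 2, 1))  # 1 inside the valid UV area
loss = total_loss(verts, verts, tex, tex, uv_mask, lap=0.5,
                  mu=np.zeros(3), log_var=np.zeros(3))
```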
IV Experiments
In this section, we present the results of our photorealistic human model driving algorithm on the SMPLX-Lite dataset. Subsequently, we compare our method with two baselines in both novel view and novel pose synthesis experiments to evaluate its geometry and texture generation.
IV-A Reconstruction & Driving
We utilize 9 actions in the dataset for training and the others for testing. Subsequent experiments involve training individual character models on the training set and evaluating their reconstruction and driving effects on the test set.
To begin, we assess the method’s ability to reconstruct mesh and texture for new actions of the same person on the test set, which involves utilizing the encoder to generate a latent code with the same distribution as the training data. Additionally, we evaluate the driving effect of the model by using the driving signal of the test set to drive the characters. Unlike reconstruction, driving requires randomly sampling the latent code from the normal distribution without the encoder.
The photorealistic reconstructed and driving results, along with quantitative evaluations for all subjects, are presented in Fig.5 and Tab.II, respectively. It is worth noting that driving is marginally less effective than reconstructing due to the absence of hidden space information associated with the character. Furthermore, we test the effect of using the same new sequence of actions to drive five trained character models and present the full results in suppl.
![Refer to caption](https://cdn.statically.io/img/arxiv.org/x12.png)
![Refer to caption](https://cdn.statically.io/img/arxiv.org/x13.png)
![Refer to caption](https://cdn.statically.io/img/arxiv.org/x14.png)
Sub. | Reconstruction PSNR | Reconstruction SSIM | Reconstruction CD | Driving PSNR | Driving SSIM | Driving CD |
---|---|---|---|---|---|---|
WZL | 26.60 | 0.9454 | 4.1098 | 26.54 | 0.9443 | 4.2264 |
LDF | 25.69 | 0.9394 | 3.9842 | 24.94 | 0.9307 | 4.2419 |
ZX | 25.33 | 0.9382 | 4.7422 | 24.71 | 0.9312 | 4.7719 |
LW | 23.38 | 0.9397 | 5.2762 | 22.42 | 0.9304 | 5.4222 |
ZC | 24.48 | 0.935 | 4.6135 | 23.49 | 0.9254 | 4.6383 |
IV-B Comparison with Baselines
![Refer to caption](https://cdn.statically.io/img/arxiv.org/x15.png)
![Refer to caption](https://cdn.statically.io/img/arxiv.org/x16.png)
Additionally, we conduct comparisons with two baselines, Neural Body (NB)[29] and Ani-NeRF (AN)[16]. Following NB’s setting, our method achieves the best LPIPS and CD among the three methods in both novel view and novel pose synthesis experiments, while NB attains higher PSNR and SSIM, as shown in Fig.6 and Tab.III.
Method | Novel View PSNR | Novel View SSIM | Novel View LPIPS | Novel View CD | Novel Pose PSNR | Novel Pose SSIM | Novel Pose LPIPS | Novel Pose CD |
---|---|---|---|---|---|---|---|---|
NB | 31.29 | 0.9707 | 0.0789 | 11.490 | 29.27 | 0.9616 | 0.0841 | 11.732 |
AN | 28.05 | 0.9500 | 0.0981 | 16.285 | 26.14 | 0.9382 | 0.1119 | 18.170 |
Ours | 30.14 | 0.9607 | 0.0567 | 7.1586 | 27.97 | 0.9485 | 0.0675 | 8.7690 |
The robust and highly generalizable nature of our approach enables it to capture intricate details and high-frequency information, leading to clearer textures and hand movements. In contrast, the baselines produce notably blurry results in both experiments, particularly in the hand area, with AN displaying abnormally twisted arms and fingers in Fig.6b. Besides, the meshes generated by our method appear smoother and retain a higher level of detail, as depicted in Fig.7.
![Refer to caption](https://cdn.statically.io/img/arxiv.org/x17.png)
V Conclusion
We propose the SMPLX-Lite model, which simplifies the methods using vertex displacement to fit clothes, while retaining the advantages of the SMPL-X model. This paves the way for the generation of the proposed SMPLX-Lite dataset, which stands as the most comprehensive and fairly photorealistic textured clothed avatar dataset currently available, supporting the advancement of the research community. Leveraging this dataset, we introduce a CVAE-based textured human model driving algorithm, showcasing the substantial advantage of SMPLX-Lite dataset in label richness and photorealism. Notably, our driving algorithm utilizes solely the captured images and textured SMPLX-Lite-D model in the dataset. Additionally, the SMPLX-Lite dataset includes annotations for 2D/3D keypoints and SMPL-X model, high-precision scanned models, and corresponding texture maps, which are invaluable data contributing to pertinent research endeavors.
Acknowledgment
This work was partly supported by the National Natural Science Foundation of China under U23B2030 and the Special Foundations for the Development of Strategic Emerging Industries of Shenzhen (Nos.JSGG20211108092812020 & CJGJZD20210408092804011).
References
- [1] S. Saito, Z. Huang, R. Natsume, S. Morishima, A. Kanazawa, and H. Li, “Pifu: Pixel-aligned implicit function for high-resolution clothed human digitization,” in ICCV, 2019.
- [2] S. Saito, T. Simon, J. Saragih, and H. Joo, “Pifuhd: Multi-level pixel-aligned implicit function for high-resolution 3d human digitization,” in CVPR, 2020.
- [3] Z. Dong, C. Guo, J. Song, X. Chen, A. Geiger, and O. Hilliges, “Pina: Learning a personalized implicit neural avatar from a single rgb-d video sequence,” in CVPR, 2022.
- [4] Z. Chen and H. Zhang, “Learning implicit fields for generative shape modeling,” in CVPR, 2019.
- [5] J. J. Park, P. Florence, J. Straub, R. Newcombe, and S. Lovegrove, “Deepsdf: Learning continuous signed distance functions for shape representation,” in CVPR, 2019.
- [6] M. Loper, N. Mahmood, J. Romero, and M. J. Black, “SMPL: A skinned multi-person linear model,” ACM SIGGRAPH Asia, 2015.
- [7] G. Pavlakos, V. Choutas, N. Ghorbani, T. Bolkart, M. Black et al., “Expressive body capture: 3D hands, face, and body from a single image,” in CVPR, 2019.
- [8] F. Bogo, A. Kanazawa, C. Lassner, P. Gehler, J. Romero, and M. J. Black, “Keep it smpl: Automatic estimation of 3d human pose and shape from a single image,” ECCV, 2016.
- [9] N. Kolotouros, G. Pavlakos, M. Black, and K. Daniilidis, “Learning to reconstruct 3d human pose and shape via model-fitting in the loop,” in ICCV, 2020.
- [10] B. Bhatnagar, C. Sminchisescu, C. Theobalt, and G. Pons-Moll, “Combining implicit function learning and parametric models for 3d human reconstruction,” in ECCV, 2020.
- [11] Q. Ma, J. Yang, A. Ranjan, S. Pujades, G. Pons-Moll, and M. J. Black, “Learning to Dress 3D People in Generative Clothing,” in CVPR, 2020.
- [12] Z. Huang, Y. Xu, C. Lassner, H. Li, and T. Tung, “Arch: Animatable reconstruction of clothed humans,” in CVPR, 2020.
- [13] T. He, Y. Xu, S. Saito et al., “Arch++: Animation-ready clothed human reconstruction revisited,” in ICCV, 2021.
- [14] S. Saito, J. Yang, Q. Ma, and M. J. Black, “SCANimate: Weakly supervised learning of skinned clothed avatar networks,” in CVPR, 2021.
- [15] L. Liu, M. Habermann, V. Rudnev, K. Sarkar, J. Gu, and C. Theobalt, “Neural actor: Neural free-view synthesis of human actors with pose control,” ACM SIGGRAPH Asia, 2021.
- [16] S. Peng, J. Dong, Q. Wang, S. Zhang, Q. Shuai, X. Zhou, and H. Bao, “Animatable neural radiance fields for modeling dynamic human bodies,” in ICCV, 2021.
- [17] T. M. Bagautdinov, C. Wu, T. Simon, F. Prada et al., “Driving-signal aware full-body avatars,” ACM TOG, 2021.
- [18] D. Xiang, F. Prada, T. Bagautdinov, W. Xu, Y. Dong, H. Wen, J. Hodgins, and C. Wu, “Modeling clothing as a separate layer for an animatable human avatar,” ACM TOG, 2021.
- [19] A. Collet, M. Chuang, P. Sweeney, D. Gillett, D. Evseev, D. Calabrese, H. Hoppe, A. Kirk, and S. Sullivan, “High-quality streamable free-viewpoint video,” ACM TOG, 2015.
- [20] J. Li, P. Wang, P. Xiong, T. Cai, Z. Yan, L. Yang, J. Liu, H. Fan, and S. Liu, “Practical stereo matching via cascaded recurrent network with adaptive correlation,” in CVPR, 2022.
- [21] M. Kazhdan, M. Bolitho, and H. Hoppe, “Poisson Surface Reconstruction,” in Symposium on Geometry Processing, 2006.
- [22] S. Laine, J. Hellsten, T. Karras, Y. Seol, J. Lehtinen, and T. Aila, “Modular primitives for high-performance differentiable rendering,” ACM TOG, 2020.
- [23] Z. Cao, G. Hidalgo Martinez, T. Simon, S. Wei, and Y. A. Sheikh, “Openpose: Realtime multi-person 2d pose estimation using part affinity fields,” IEEE Tran. on PAMI, 2019.
- [24] M. A. Fischler and R. C. Bolles, “Random sample consensus: A paradigm for model fitting with applications to image analysis and automated cartography,” Commun. ACM, 1981.
- [25] Q. Shuai, Q. Fang, J. Dong, S. Peng, D. Huang et al., “Easymocap - make human motion capture easier.” Github, 2021. [Online]. Available: https://github.com/zju3dv/EasyMocap
- [26] H. Li, B. Adams, L. J. Guibas, and M. Pauly, “Robust single-view geometry and motion reconstruction,” ACM TOG, 2009.
- [27] W. Cheng, S. Xu, J. Piao, C. Qian et al., “Generalizable neural performer: Learning robust radiance fields for human novel view synthesis,” arXiv preprint arXiv:2204.11798, 2022.
- [28] D. P. Kingma and M. Welling, “Auto-encoding variational bayes,” arXiv preprint arXiv:1312.6114, 2013.
- [29] S. Peng, Y. Zhang, Y. Xu, Q. Wang et al., “Neural body: Implicit neural representations with structured latent codes for novel view synthesis of dynamic humans,” in CVPR, 2021.
- [30] “Renderpeople,” https://renderpeople.com/, 2018.
- [31] F. Bogo, J. Romero, G. Pons-Moll, and M. J. Black, “Dynamic FAUST: Registering human bodies in motion,” in IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), Jul. 2017.
- [32] C. Zhang, S. Pujades, M. J. Black, and G. Pons-Moll, “Detailed, accurate, human shape estimation from clothed 3d scan sequences,” in The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), July 2017.
- [33] P. Patel, C.-H. P. Huang, J. Tesch, D. T. Hoffmann, S. Tripathi, and M. J. Black, “AGORA: Avatars in geography optimized for regression analysis,” in Proceedings IEEE/CVF Conf. on Computer Vision and Pattern Recognition (CVPR), Jun. 2021.
- [34] Z. Yu, J. S. Yoon, I. K. Lee, P. Venkatesh, J. Park, J. Yu, and H. S. Park, “Humbi: A large multiview dataset of human body expressions,” in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), June 2020.
- [35] T. Yu, Z. Zheng, K. Guo, P. Liu, Q. Dai, and Y. Liu, “Function4d: Real-time human volumetric capture from very sparse consumer rgbd sensors,” in IEEE Conference on Computer Vision and Pattern Recognition (CVPR2021), June 2021.
- [36] Z. Cai, D. Ren, A. Zeng, Z. Lin, T. Yu, W. Wang, X. Fan, Y. Gao, Y. Yu, L. Pan, F. Hong, M. Zhang, C. C. Loy, L. Yang, and Z. Liu, “Humman: Multi-modal 4d human dataset for versatile sensing and modeling,” October 2022.
- [37] G. Tiwari, B. L. Bhatnagar, T. Tung, and G. Pons-Moll, “Sizer: A dataset and model for parsing 3d clothing and learning size sensitive 3d clothing,” in ECCV. Springer, August 2020.
- [38] C. Ionescu, D. Papava, V. Olaru, and C. Sminchisescu, “Human3.6m: Large scale datasets and predictive methods for 3d human sensing in natural environments,” IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 36, no. 7, pp. 1325–1339, jul 2014.
![Refer to caption](https://cdn.statically.io/img/arxiv.org/x18.png)
We provide additional dataset processing details (Appendix A), extended dataset evaluation results (Appendix B), extended diverse dataset visualization (Appendix C) and extended experiment results (Appendix D). Actors and actresses participating in SMPLX-Lite are well-informed and acknowledge that the data will be made public for research purposes.
Appendix A Additional Dataset Processing Details
A-A 3D Human Pose Estimation
We provide a detailed description of the RANSAC[24] algorithm mentioned in Sec. II-B of the main paper in Algorithm 1.
A-B SMPLX-Lite Model Transfer
![Refer to caption](https://cdn.statically.io/img/arxiv.org/x19.png)
![Refer to caption](https://cdn.statically.io/img/arxiv.org/x20.png)
![Refer to caption](https://cdn.statically.io/img/arxiv.org/x21.png)
![Refer to caption](https://cdn.statically.io/img/arxiv.org/x22.png)
![Refer to caption](https://cdn.statically.io/img/arxiv.org/x23.png)
We present the new SMPLX-Lite parametric model, which is derived from SMPL-X. The model aims to capture the intricate geometry of the scanned mesh, while also ensuring stable geometry in critical areas such as the nose, mouth, and feet, as well as preserving the overall facial and finger shapes. The entire process is depicted in Fig.9.
First, we eliminate the vertices within the eyeballs, cochlea, lips, nostrils, and toe seam region from the SMPL-X (9a) model that are either hidden or folded. Subsequently, we connect the edge vertices to create faces, while keeping the remaining vertices and topology unaltered. The resulting model (9b, 9c) still has a large depression area, which could affect vertex fitting. Consequently, we flatten the faces in these particular regions to achieve a smoother surface, ensuring a uniform vertex distribution. Nonetheless, it is observed that the vertex and face distribution remains uneven during the fitting process, resulting in clustering of some vertices and severe distortion of corresponding faces (refer to Fig.9d). To address this issue, we undertake multiple rounds of vertex deletion, face reconstruction, and face flattening to obtain a more suitable model for vertex fitting, which we designate as the SMPLX-Lite model (9e).
Subsequently, the reduction in the number of vertices necessitates adjustments to the coefficient matrices, as described in Sec. II-B of the main paper, to ensure that the transferred model inherits the control parameters of SMPL-X and the linear blend skinning function. Initially, we resize these matrices so that their number of rows matches the vertex count of the SMPLX-Lite model, which keeps the number of rows consistent across all matrices. Then, for the per-vertex matrices other than the joint regressor, we determine the nearest neighbor on the SMPL-X model for each vertex of the SMPLX-Lite model, and use the corresponding row in the original matrix to populate the new matrix. However, for the joint regressor matrix, using the nearest neighbor in this direction would discard the regression coefficients from certain vertices to joints. To circumvent this, we identify the nearest neighbor on the SMPLX-Lite model for each vertex of the SMPL-X model, and subsequently aggregate the rows corresponding to the same point on the SMPLX-Lite model into one row of the new matrix.
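The two transfer directions described above can be sketched as follows. The sketch assumes that every matrix is stored with one row per vertex, including the joint regressor with shape (N, K), and that aggregation means summation; the brute-force nearest-neighbor search and all function names are hypothetical simplifications.

```python
import numpy as np

def nearest_index(src, dst):
    """For every point in src (A, 3), index of the closest point in dst (B, 3)."""
    d = np.linalg.norm(src[:, None, :] - dst[None, :, :], axis=2)
    return d.argmin(axis=1)

def transfer_per_vertex(matrix, lite_verts, full_verts):
    """Copy, for each reduced-model vertex, the row of its nearest full-model vertex."""
    return matrix[nearest_index(lite_verts, full_verts)]

def transfer_joint_regressor(j_reg, lite_verts, full_verts):
    """Sum the rows of all full-model vertices that map to the same reduced vertex."""
    owner = nearest_index(full_verts, lite_verts)
    out = np.zeros((len(lite_verts), j_reg.shape[1]))
    np.add.at(out, owner, j_reg)  # unbuffered add handles repeated indices
    return out

# Toy example: four full-model vertices collapse onto two reduced vertices.
full_verts = np.array([[0.0, 0, 0], [0.1, 0, 0], [1.0, 0, 0], [1.1, 0, 0]])
lite_verts = np.array([[0.0, 0, 0], [1.0, 0, 0]])
skin = np.arange(8.0).reshape(4, 2)   # stand-in for a per-vertex matrix
j_reg = np.full((4, 1), 0.25)         # one joint regressed from four vertices
skin_lite = transfer_per_vertex(skin, lite_verts, full_verts)
j_reg_lite = transfer_joint_regressor(j_reg, lite_verts, full_verts)
```

Summing in the second direction preserves the total regression weight per joint, so regressed joint locations stay close to those of the original model even though vertices were removed.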
A-C SMPLX-Lite-D fit
We describe in detail the 2 stages of the SMPLX-Lite-D fit process introduced in Sec. II-B of the main paper.
Stage 1: Embedded Nodes. The embedded nodes are initialized on the T-pose mesh without clustering vertices by radius as done in [26]. Instead, we cluster vertices by connectivity. The unbalanced distribution of embedded nodes is naturally adapted to the distortion ability of SMPLX-Lite mesh surface.
1) We initialize a candidate set with all the vertices on the mesh. We randomly select a vertex from the candidate set as a new embedded node and remove its neighbor vertices up to a given level from the candidate set, forming the remaining set. Here, the level of a neighbor refers to the number of jumps from the selected vertex to that neighbor. In practice, we use .
2) Repeat step 1) until the candidate set is empty.
3) For an embedded node, we define a base radius as the average radius of its level of neighbors. We define the weight of an embedded node w.r.t. a vertex by their geodesic distance :
where controls how far the influence of an embedded node extends. In practice, we have .
4) The smoothness term between embedded nodes is defined similarly to step 3). Two embedded nodes are considered neighbors if the geodesic distance between them is less than twice the larger of their two base radii.
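Steps 1) and 2) can be sketched as a greedy sampling loop over mesh connectivity. The sketch below rests on assumptions: the neighbor level `k` is a free parameter (the value used in the paper is not reproduced here), "level" is read as the number of edge hops, and all function names are hypothetical. The weights of step 3) and the smoothness term of step 4) are not covered.

```python
import random
from collections import deque

def build_adjacency(num_vertices, faces):
    """Undirected vertex adjacency from triangle faces."""
    adj = [set() for _ in range(num_vertices)]
    for a, b, c in faces:
        adj[a].update((b, c))
        adj[b].update((a, c))
        adj[c].update((a, b))
    return adj

def k_ring(adj, start, k):
    """Vertices reachable from `start` in at most k edge hops (including start)."""
    seen = {start}
    frontier = deque([(start, 0)])
    while frontier:
        v, depth = frontier.popleft()
        if depth == k:
            continue
        for n in adj[v]:
            if n not in seen:
                seen.add(n)
                frontier.append((n, depth + 1))
    return seen

def sample_embedded_nodes(num_vertices, faces, k, seed=0):
    """Pick a random candidate as a node, drop its k-ring, repeat until empty."""
    rng = random.Random(seed)
    adj = build_adjacency(num_vertices, faces)
    candidates = set(range(num_vertices))
    nodes = []
    while candidates:
        v = rng.choice(sorted(candidates))  # sorted for reproducibility
        nodes.append(v)
        candidates -= k_ring(adj, v, k)
    return nodes
```

Because hop counts depend only on connectivity, densely tessellated regions receive more nodes per unit area than coarse ones, which matches the uneven node distribution described in the text.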
After initializing the embedded nodes on the T-pose mesh, we record the selected vertex indices and weights. When applied to a posed SMPLX-Lite model, the positions of the embedded nodes are initialized from the corresponding vertex positions on the posed mesh. Note that although the embedded nodes start at the same positions as the chosen vertices, they are not bound to them during the subsequent solving iterations.
To solve for the rotation and translation of the embedded nodes, our objective is to minimize the distance of warped vertices towards their nearest match on the scanned mesh. We refer the readers to [26] for more details.
Stage 2: Vertex Shifts. After fitting with embedded nodes, we only need to solve for small per-vertex shifts to capture the fine geometric details. The shifts are regularized by a Laplacian matrix initialized from the Stage 1 mesh, and we denote the final fitted mesh by SMPLX-Lite-D.
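To illustrate how a Laplacian term can regularize per-vertex shifts, the sketch below solves a simplified version of the problem. It is an assumption-laden illustration rather than the paper's solver: it uses a uniform graph Laplacian (the paper does not state which Laplacian is used), treats the target offsets toward the scan as fixed, and the names `uniform_laplacian`, `solve_shifts`, and `smooth_weight` are hypothetical.

```python
import numpy as np

def uniform_laplacian(num_vertices, faces):
    """Dense uniform graph Laplacian L = I - D^-1 A built from triangle connectivity."""
    A = np.zeros((num_vertices, num_vertices))
    for a, b, c in faces:
        for i, j in ((a, b), (b, c), (c, a)):
            A[i, j] = A[j, i] = 1.0
    deg = A.sum(axis=1, keepdims=True)
    return np.eye(num_vertices) - A / np.maximum(deg, 1.0)

def solve_shifts(targets, L, smooth_weight):
    """Minimize ||d - targets||^2 + smooth_weight * ||L d||^2 over shifts d.

    `targets` has shape (num_vertices, 3): the offset from each Stage 1 vertex
    to its match on the scan. The normal equations give a linear system.
    """
    n = L.shape[0]
    system = np.eye(n) + smooth_weight * (L.T @ L)
    return np.linalg.solve(system, targets)
```

A larger `smooth_weight` keeps neighboring shifts similar and suppresses isolated spikes, while a value of zero reduces to moving each vertex directly onto its match.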
We present our fitting pipeline in Fig. 10. Stage 1 fits the scanned mesh reasonably well, while Stage 2 recovers finer geometric detail.
![Refer to caption](https://cdn.statically.io/img/arxiv.org/x24.png)
mesh
![Refer to caption](https://cdn.statically.io/img/arxiv.org/x25.png)
nodes
![Refer to caption](https://cdn.statically.io/img/arxiv.org/x26.png)
results
![Refer to caption](https://cdn.statically.io/img/arxiv.org/x27.png)
results
We also compare the results of our two-stage fitting vs. direct vertex fitting (stage 2 only) in Fig. 11. Direct vertex fitting may generate undesirable artifacts in many regions.
![Refer to caption](https://cdn.statically.io/img/arxiv.org/x28.png)
![Refer to caption](https://cdn.statically.io/img/arxiv.org/x29.png)
![Refer to caption](https://cdn.statically.io/img/arxiv.org/x30.png)
![Refer to caption](https://cdn.statically.io/img/arxiv.org/x31.png)
Appendix B Extended Dataset Evaluation Results
![Refer to caption](https://cdn.statically.io/img/arxiv.org/x32.png)
![Refer to caption](https://cdn.statically.io/img/arxiv.org/x33.png)
![Refer to caption](https://cdn.statically.io/img/arxiv.org/x34.png)
![Refer to caption](https://cdn.statically.io/img/arxiv.org/x35.png)
![Refer to caption](https://cdn.statically.io/img/arxiv.org/x36.png)
![Refer to caption](https://cdn.statically.io/img/arxiv.org/x37.png)
![Refer to caption](https://cdn.statically.io/img/arxiv.org/x38.png)
![Refer to caption](https://cdn.statically.io/img/arxiv.org/x39.png)
![Refer to caption](https://cdn.statically.io/img/arxiv.org/x40.png)
![Refer to caption](https://cdn.statically.io/img/arxiv.org/x41.png)
![Refer to caption](https://cdn.statically.io/img/arxiv.org/x42.png)
![Refer to caption](https://cdn.statically.io/img/arxiv.org/x43.png)
Tab. IV compares the SMPLX-Lite dataset with other datasets that contain human model fits. As discussed in the main paper, the SMPLX-Lite dataset offers multi-view images, reconstructed textured models, and fitted clothed parametric models with texture maps. This variety of data types supports the reconstruction of photorealistic drivable avatars and gives researchers a broader range of supervision options than datasets that offer only raw images [27] or only reconstructed textured models [11]. Other datasets that provide both RGB images and scanned textured meshes are either synthetic or lack registered parametric models. Importantly, these datasets cannot provide a parametric model that both allows control over facial expressions and hand movements and achieves vertex alignment. The fitting results of different parametric models are compared in Fig. 12. Notably, our registered SMPLX-Lite-D models enable multiple forms of supervision, such as direct supervision with 3D meshes and textures, as well as supervision with 2D images.
Dataset | RGB | Mesh | Texture | Registered | | | K3D | Act | Sequence |
---|---|---|---|---|---|---|---|---|---|
RenderPeople[30] | | | | | | | | | |
DFAUST[31] | | | | Dyna | | | | | |
BUFF[32] | | | | SMPL | | | | | |
AGORA[33] | | | | SMPL-X&SMPL | | | | | |
HUMBI[34] | | | | SMPL | | | | | |
THuman2.0[35] | | | | SMPL-X | | | | | |
ZJU LightStage[29] | | | | SMPL-X | | | | | |
GeneBody[27] | | | | SMPL-X | | | | | |
HuMMan[36] | | | | SMPL | | | | | |
Sizer[37] | | | | SMPL-G | | | | | |
CAPE[11] | | | | SMPL-D | | | | | |
Facebook†[17, 18] | | | | | | | | | |
Ours | | | | SMPLX-Lite-D | | | | | |
We next provide more detailed dataset evaluation results. We use 8 telephoto cameras and 24 standard cameras to capture the full body and local details simultaneously. The PSNR and SSIM results of the telephoto cameras are lower than those of the standard cameras because the telephoto cameras capture finer detail, as shown in Fig. 15 and in the per-subject evaluation table.
The per-subject evaluation table lists the average results per act for each subject. The names of the 15 actions are “01 discussion”, “02 debating”, “03 presentation”, “04 eating”, “05 directions”, “06 greeting”, “07 purchasing”, “08 posing”, “09 waiting”, “10 walking”, “11 walking dog”, “12 phoning”, “13 taking photo”, “14 turning around”, “15 stretching”. Some of the actions follow [38].
Appendix C Extended Diverse Dataset Visualization
We present multi-view visualization in Fig.15 and reconstructed high-resolution scan models of 5 subjects in Fig.8.
![Refer to caption](https://cdn.statically.io/img/arxiv.org/x44.png)
Appendix D Extended Experiments Results
Our experiment settings are as follows: , . We utilize AdamW as optimizer and ExponentialLR as a scheduler with
D-A Driving Results
We train a drivable model for each subject and use the same driving signal to drive all the models, as shown in Fig.16. Driven by the same signal, all reconstructed human models can present corresponding actions and facial expressions, and the geometry and texture of clothes change reasonably with the change of pose.
![Refer to caption](https://cdn.statically.io/img/arxiv.org/x45.png)
D-B Driving vs. Reconstruction
Fig. 13 further visualizes the qualitative results shown in Fig. 5 of the main paper. We apply a nonlinear transform to the difference between each pair of images to obtain heat maps. The heat maps show that the driving results are very close to the reconstruction results, and that both closely match the captured image, recovering clothing details, finger movements, and facial expressions.
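As an illustration of how such a heat map can be produced, the sketch below applies a power-law (gamma) transform to the per-pixel absolute difference. The choice of transform, the `gamma` value, and the function name are assumptions made for this example; the exact nonlinear transform used for Fig. 13 is not specified here. Images are assumed to be float arrays in [0, 1].

```python
import numpy as np

def difference_heatmap(img_a, img_b, gamma=0.5):
    """Per-pixel absolute difference, averaged over channels, then gamma-compressed.

    With gamma < 1, small differences are amplified so that subtle
    discrepancies remain visible in the resulting map.
    """
    a = np.asarray(img_a, dtype=np.float64)
    b = np.asarray(img_b, dtype=np.float64)
    diff = np.abs(a - b)
    if diff.ndim == 3:  # average over the color channels
        diff = diff.mean(axis=-1)
    return np.clip(diff, 0.0, 1.0) ** gamma
```

The returned single-channel map can then be passed to any color map for display.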
D-C Ablation Study
We perform ablation experiments to compare texture supervision with image supervision. The experiment settings are as follows: Tex: ; Img: ; Both: . As the results in Tab. V show, texture-map supervision works better than image supervision, and using both with simply averaged loss weights performs worst.
Supervise | PSNR | SSIM | CD() |
---|---|---|---|
Texture | 26.17 | 0.9396 | 4.5589 |
Image | 26.01 | 0.9335 | 9.0415 |
Both | 19.32 | 0.5925 | 40.000 |
Appendix E Discussion of Limitations
In this section, we discuss several limitations of the SMPLX-Lite dataset and driving method.
As shown in Fig. 14, the scanned mesh reconstructed from the depth map and the point cloud exhibits adhesion between surfaces that are very close together, such as at the hands and underarms, whereas the fitted parametric model SMPLX-Lite-D does not. Chamfer distance (CD) may therefore not be the most appropriate evaluation metric, since it does not reflect this advantage of our fitted model. A more suitable metric is needed to evaluate the quality of the fitted mesh.
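For reference, one common form of the chamfer distance between two point sets is sketched below. Several variants exist (squared or unsquared distances, summed or averaged directions), and the exact definition and units used in our tables are not restated here, so this function is an assumed illustration with a hypothetical name.

```python
import numpy as np

def chamfer_distance(points_a, points_b):
    """Symmetric chamfer distance: mean nearest-neighbor distance in both directions.

    `points_a` has shape (N, 3) and `points_b` has shape (M, 3).
    """
    d = np.linalg.norm(points_a[:, None, :] - points_b[None, :, :], axis=-1)
    return d.min(axis=1).mean() + d.min(axis=0).mean()
```

Because each point is only matched to its nearest neighbor on the other surface, a fitted mesh that correctly separates the hand from the torso is penalized when the scan has fused the two, which is the failure case described above.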
As for the driving method, our proposed one is only a preliminary baseline, which works well overall, but artifacts can occur when driving out-of-distribution actions. Besides, the current algorithm is still elementary for facial expression control. To get a drivable model with good generalization capabilities, a large amount of data is needed to train the neural network, which our dataset now provides.
In future work, we will further increase the diversity and number of action sequences and improve the SMPLX-Lite-D fitting results. We will improve the baseline driving algorithm to take full advantage of the diverse data in the SMPLX-Lite dataset and achieve better driving results, and we will consider decoupling expressions from whole-body poses to produce more vivid facial expressions. We will also consider training usable models with less data to reduce training time.
Subject | Act | Scan PSNR (Std) | Scan PSNR (Tele) | Scan SSIM (Std) | Scan SSIM (Tele) | SMPLX-Lite-D PSNR (Std) | SMPLX-Lite-D PSNR (Tele) | SMPLX-Lite-D SSIM (Std) | SMPLX-Lite-D SSIM (Tele) | CD () |
---|---|---|---|---|---|---|---|---|---|---|
WZL | 01 | 30.26 | 25.93 | 0.9803 | 0.9425 | 30.00 | 25.59 | 0.9797 | 0.9407 | 6.734 |
WZL | 02 | 30.31 | 26.00 | 0.9805 | 0.9419 | 30.00 | 25.47 | 0.9799 | 0.9400 | 6.756 |
WZL | 03 | 30.23 | 25.94 | 0.9803 | 0.9415 | 29.83 | 25.32 | 0.9796 | 0.9396 | 6.829 |
WZL | 04 | 30.11 | 25.77 | 0.9800 | 0.9416 | 29.79 | 25.23 | 0.9796 | 0.9400 | 6.534 |
WZL | 05 | 30.05 | 25.59 | 0.9798 | 0.9424 | 29.74 | 25.06 | 0.9792 | 0.9406 | 6.683 |
WZL | 06 | 29.61 | 24.90 | 0.9791 | 0.9412 | 29.36 | 24.51 | 0.9786 | 0.9399 | 6.465 |
WZL | 07 | 30.11 | 25.86 | 0.9802 | 0.9448 | 29.80 | 25.47 | 0.9796 | 0.9436 | 6.746 |
WZL | 08 | 29.68 | 24.69 | 0.9790 | 0.9410 | 29.44 | 24.33 | 0.9785 | 0.9398 | 6.644 |
WZL | 09 | 30.07 | 25.77 | 0.9800 | 0.9457 | 29.82 | 25.43 | 0.9795 | 0.9442 | 6.657 |
WZL | 10 | 29.66 | 26.76 | 0.9798 | 0.9567 | 29.43 | 26.43 | 0.9792 | 0.9553 | 6.899 |
WZL | 11 | 29.23 | 27.12 | 0.9784 | 0.9672 | 29.04 | 26.83 | 0.9779 | 0.9659 | 7.042 |
WZL | 12 | 29.86 | 25.59 | 0.9797 | 0.9440 | 29.60 | 25.20 | 0.9791 | 0.9425 | 6.648 |
WZL | 13 | 29.86 | 25.71 | 0.9797 | 0.9431 | 29.55 | 25.18 | 0.9791 | 0.9414 | 6.703 |
WZL | 14 | 30.08 | 26.58 | 0.9803 | 0.9561 | 29.82 | 26.19 | 0.9797 | 0.9542 | 6.859 |
WZL | 15 | 29.96 | 25.89 | 0.9796 | 0.9432 | 29.68 | 25.44 | 0.9789 | 0.9416 | 6.858 |
LDF | 01 | 28.72 | 27.03 | 0.9750 | 0.9572 | 28.47 | 26.63 | 0.9743 | 0.9561 | 6.902 |
LDF | 02 | 28.82 | 27.11 | 0.9749 | 0.9566 | 28.59 | 26.70 | 0.9744 | 0.9551 | 6.834 |
LDF | 03 | 28.78 | 27.29 | 0.9747 | 0.9566 | 28.54 | 26.85 | 0.9742 | 0.9553 | 6.852 |
LDF | 04 | 28.83 | 27.13 | 0.9747 | 0.9551 | 28.61 | 26.74 | 0.9742 | 0.9538 | 6.873 |
LDF | 05 | 28.59 | 27.00 | 0.9741 | 0.9548 | 28.35 | 26.57 | 0.9735 | 0.9536 | 7.004 |
LDF | 06 | 28.21 | 26.91 | 0.9734 | 0.9564 | 27.98 | 26.54 | 0.9729 | 0.9553 | 6.882 |
LDF | 07 | 28.72 | 26.92 | 0.9740 | 0.9551 | 28.50 | 26.56 | 0.9735 | 0.9543 | 6.888 |
LDF | 08 | 28.48 | 26.78 | 0.9731 | 0.9536 | 28.21 | 26.31 | 0.9726 | 0.9523 | 6.840 |
LDF | 09 | 28.80 | 26.96 | 0.9746 | 0.9576 | 28.58 | 26.59 | 0.9740 | 0.9559 | 7.078 |
LDF | 10 | 29.15 | 28.75 | 0.9780 | 0.9761 | 28.93 | 28.46 | 0.9773 | 0.9751 | 9.649 |
LDF | 11 | 28.88 | 28.14 | 0.9774 | 0.9710 | 28.65 | 27.82 | 0.9768 | 0.9697 | 9.112 |
LDF | 12 | 28.86 | 27.28 | 0.9745 | 0.9569 | 28.64 | 26.88 | 0.9739 | 0.9556 | 7.215 |
LDF | 13 | 28.54 | 26.98 | 0.9731 | 0.9550 | 27.37 | 25.83 | 0.9655 | 0.9456 | 6.929 |
LDF | 14 | 28.80 | 27.12 | 0.9751 | 0.9606 | 28.61 | 26.79 | 0.9744 | 0.9590 | 7.363 |
LDF | 15 | 28.12 | 26.90 | 0.9721 | 0.9547 | 27.81 | 26.42 | 0.9712 | 0.9532 | 7.198 |
ZX | 01 | 29.83 | 27.91 | 0.9799 | 0.9646 | 29.43 | 27.07 | 0.9790 | 0.9623 | 6.885 |
ZX | 02 | 29.52 | 27.56 | 0.9795 | 0.9650 | 29.18 | 26.84 | 0.9786 | 0.9625 | 6.785 |
ZX | 03 | 29.48 | 27.69 | 0.9796 | 0.9664 | 29.20 | 27.09 | 0.9791 | 0.9654 | 6.700 |
ZX | 04 | 29.51 | 27.58 | 0.9791 | 0.9653 | 29.12 | 26.83 | 0.9783 | 0.9629 | 6.633 |
ZX | 05 | 29.36 | 27.52 | 0.9788 | 0.9645 | 28.99 | 26.69 | 0.9779 | 0.9623 | 6.775 |
ZX | 06 | 28.67 | 26.79 | 0.9768 | 0.9621 | 28.37 | 26.09 | 0.9765 | 0.9604 | 6.514 |
ZX | 07 | 29.87 | 28.40 | 0.9792 | 0.9696 | 29.56 | 27.82 | 0.9786 | 0.9681 | 6.795 |
ZX | 08 | 28.84 | 27.36 | 0.9778 | 0.9644 | 28.51 | 26.58 | 0.9768 | 0.9618 | 6.768 |
ZX | 09 | 29.23 | 27.44 | 0.9796 | 0.9654 | 28.91 | 26.76 | 0.9789 | 0.9635 | 6.869 |
ZX | 10 | 29.66 | 27.91 | 0.9806 | 0.9682 | 29.34 | 27.35 | 0.9797 | 0.9663 | 6.911 |
ZX | 11 | 29.37 | 28.19 | 0.9803 | 0.9728 | 29.16 | 27.73 | 0.9795 | 0.9713 | 7.110 |
ZX | 12 | 29.52 | 27.71 | 0.9794 | 0.9653 | 29.10 | 26.85 | 0.9785 | 0.9628 | 6.825 |
ZX | 13 | 28.94 | 27.91 | 0.9796 | 0.9684 | 28.58 | 27.22 | 0.9788 | 0.9665 | 6.924 |
ZX | 14 | 29.38 | 27.69 | 0.9800 | 0.9668 | 29.10 | 27.09 | 0.9791 | 0.9645 | 6.941 |
ZX | 15 | 29.42 | 27.75 | 0.9788 | 0.9648 | 28.95 | 26.76 | 0.9776 | 0.9622 | 6.917 |
LW | 01 | 28.05 | 25.45 | 0.9799 | 0.9625 | 27.70 | 24.67 | 0.9794 | 0.9612 | 6.312 |
LW | 02 | 28.49 | 26.70 | 0.9775 | 0.9657 | 28.12 | 26.31 | 0.9770 | 0.9646 | 6.192 |
LW | 03 | 28.29 | 26.24 | 0.9768 | 0.9652 | 27.90 | 25.82 | 0.9763 | 0.9642 | 6.254 |
LW | 04 | 28.08 | 25.68 | 0.9800 | 0.9602 | 27.67 | 24.67 | 0.9791 | 0.9579 | 6.379 |
LW | 05 | 28.07 | 25.38 | 0.9798 | 0.9603 | 27.59 | 24.40 | 0.9790 | 0.9581 | 6.394 |
LW | 06 | 27.45 | 25.10 | 0.9786 | 0.9577 | 27.07 | 24.20 | 0.9780 | 0.9558 | 6.184 |
LW | 07 | 27.49 | 25.27 | 0.9798 | 0.9644 | 27.10 | 24.50 | 0.9789 | 0.9627 | 6.429 |
LW | 08 | 27.96 | 25.26 | 0.9789 | 0.9560 | 27.44 | 24.22 | 0.9779 | 0.9540 | 6.488 |
LW | 09 | 27.99 | 25.20 | 0.9806 | 0.9630 | 27.52 | 24.25 | 0.9797 | 0.9608 | 6.622 |
LW | 10 | 27.82 | 25.05 | 0.9806 | 0.9654 | 27.42 | 24.24 | 0.9798 | 0.9635 | 6.517 |
LW | 11 | 28.07 | 25.93 | 0.9800 | 0.9705 | 27.70 | 25.24 | 0.9792 | 0.9692 | 6.691 |
LW | 12 | 28.31 | 25.47 | 0.9798 | 0.9631 | 27.86 | 24.74 | 0.9788 | 0.9614 | 6.609 |
LW | 13 | 27.32 | 24.54 | 0.9796 | 0.9649 | 26.83 | 23.89 | 0.9788 | 0.9633 | 6.282 |
LW | 14 | 27.75 | 25.00 | 0.9805 | 0.9673 | 27.33 | 24.18 | 0.9797 | 0.9656 | 6.478 |
LW | 15 | 27.71 | 25.33 | 0.9790 | 0.9572 | 27.21 | 24.25 | 0.9782 | 0.9551 | 6.316 |
ZC | 01 | 29.01 | 25.78 | 0.9712 | 0.9358 | 28.61 | 25.07 | 0.9704 | 0.9344 | 6.958 |
ZC | 02 | 28.04 | 25.43 | 0.9705 | 0.9367 | 27.67 | 24.81 | 0.9698 | 0.9357 | 7.033 |
ZC | 03 | 27.79 | 25.69 | 0.9706 | 0.9378 | 27.61 | 25.14 | 0.9699 | 0.9368 | 7.038 |
ZC | 04 | 27.63 | 25.69 | 0.9704 | 0.9378 | 27.40 | 25.13 | 0.9696 | 0.9372 | 7.110 |
ZC | 05 | 27.94 | 25.19 | 0.9672 | 0.9350 | 27.55 | 24.56 | 0.9663 | 0.9342 | 6.907 |
ZC | 06 | 28.02 | 25.15 | 0.9691 | 0.9286 | 27.42 | 24.33 | 0.9640 | 0.9181 | 6.706 |
ZC | 07 | 28.16 | 25.51 | 0.9710 | 0.9371 | 27.40 | 24.62 | 0.9610 | 0.9158 | 6.404 |
ZC | 08 | 28.40 | 25.35 | 0.9698 | 0.9299 | 27.97 | 24.62 | 0.9689 | 0.9290 | 6.862 |
ZC | 09 | 28.35 | 25.47 | 0.9706 | 0.9341 | 28.06 | 24.85 | 0.9698 | 0.9333 | 7.099 |
ZC | 10 | 28.64 | 26.70 | 0.9724 | 0.9570 | 28.28 | 26.35 | 0.9718 | 0.9561 | 6.684 |
ZC | 11 | 28.23 | 25.32 | 0.9722 | 0.9415 | 27.93 | 24.78 | 0.9714 | 0.9408 | 6.973 |
ZC | 12 | 28.43 | 25.69 | 0.9710 | 0.9380 | 28.07 | 25.05 | 0.9700 | 0.9373 | 7.132 |
ZC | 13 | 27.99 | 25.38 | 0.9698 | 0.9464 | 27.52 | 24.69 | 0.9688 | 0.9448 | 6.665 |
ZC | 14 | 28.56 | 25.46 | 0.9750 | 0.9515 | 28.19 | 24.89 | 0.9740 | 0.9501 | 7.033 |
ZC | 15 | 28.16 | 25.37 | 0.9699 | 0.9356 | 27.76 | 24.72 | 0.9689 | 0.9344 | 7.013 |