High-precision virtual environments are increasingly important for various education, simulation, training, performance, and entertainment applications. We present HoloCamera, an innovative volumetric capture instrument to rapidly acquire, process, and create cinematic-quality virtual avatars and scenarios. The HoloCamera consists of a custom-designed free-standing structure with 300 high-resolution RGB cameras mounted with uniform spacing spanning the four sides and the ceiling of a room-sized studio. The light field acquired from these cameras is streamed through a distributed array of GPUs that interleave the processing and transmission of 4K resolution images. The distributed compute infrastructure that powers these RGB cameras consists of 50 Jetson AGX Xavier boards, with each processing unit dedicated to driving and processing imagery from six cameras. A high-speed Gigabit Ethernet network fabric seamlessly interconnects all computing boards. In this systems paper, we provide an in-depth description of the steps involved and lessons learned in constructing such a cutting-edge volumetric capture facility that can be generalized to other such facilities. We delve into the techniques employed to achieve precise frame synchronization and spatial calibration of cameras, careful determination of angled camera mounts, image processing from the camera sensors, and the need for a resilient and robust network infrastructure. To advance the field of volumetric capture, we are releasing a high-fidelity static light-field dataset, which will serve as a benchmark for further research and applications of cinematic-quality volumetric light fields.

Download full-text PDF

Source
http://dx.doi.org/10.1109/TVCG.2024.3372123DOI Listing

Publication Analysis

Top Keywords

volumetric capture
16
rgb cameras
8
volumetric
5
cameras
5
holocamera advanced
4
advanced volumetric
4
capture
4
capture cinematic-quality
4
cinematic-quality applications
4
applications high-precision
4

Similar Publications

Bacteria can be engineered to manufacture chemicals, but it is unclear how to optimally engineer a single cell to maximise production performance from batch cultures. Moreover, the performance of engineered production pathways is affected by competition for the host's native resources. Here, using a 'host-aware' computational framework which captures competition for both metabolic and gene expression resources, we uncover design principles for engineering the expression of host and production enzymes at the cell level which maximise volumetric productivity and yield from batch cultures.

View Article and Find Full Text PDF

Applications of Deep Neural Networks with Fractal Structure and Attention Blocks for 2D and 3D Brain Tumor Segmentation.

J Stat Theory Pract

September 2024

Statistics Online Computational Resource, University of Michigan, 426 North Ingalls Str, Ann Arbor, Michigan 48109-2003.

In this paper, we propose a novel deep neural network (DNN) architecture with fractal structure and attention blocks. The new method is tested to identify and segment 2D and 3D brain tumor masks in normal and pathological neuroimaging data. To circumvent the problem of limited 3D volumetric datasets with raw and ground truth tumor masks, we utilized data augmentation using affine transformations to significantly expand the training data prior to estimating the network model parameters.

View Article and Find Full Text PDF

Importance: Capturing high-quality images of the entire peripheral retina while minimizing the use of scleral depression could increase the quality of examinations for retinopathy of prematurity (ROP) while reducing neonatal stress.

Objective: To evaluate whether an investigational handheld ultra-widefield optical coherence tomography (UWF-OCT) device without scleral depression can be used to document high-quality images of the peripheral retina for use in ROP examinations.

Design, Setting, And Participants: This was a prospective, cross-sectional study in the neonatal intensive care unit at a single academic medical center.

View Article and Find Full Text PDF

Background: Modern radiation therapy techniques, such as intensity-modulated radiation therapy (IMRT) and volumetric-modulated arc therapy (VMAT), use complex fluence modulation strategies to achieve optimal patient dose distribution. Ensuring their accuracy necessitates rigorous patient-specific quality assurance (PSQA), traditionally done through pretreatment measurements with detector arrays. While effective, these methods are labor-intensive and time-consuming.

View Article and Find Full Text PDF

Purpose: This study aims to accurately predict the effects of hormonal therapy on prostate cancer (PC) lesions by integrating multi-modality magnetic resonance imaging (MRI) and the clinical marker prostate-specific antigen (PSA). It addresses the limitations of Convolutional Neural Networks (CNNs) in capturing long-range spatial relations and the Vision Transformer (ViT)'s deficiency in localization information due to consecutive downsampling. The research question focuses on improving PC response prediction accuracy by combining both approaches.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!